ClawHub
HTML Analysis
by @mzlzyca
Analyze the structure and content of HTML documents using MinerU. Returns structured Markdown with layout information, headings, and content hierarchy preser...
Examples
# Analyze a local HTML file (requires token)
mineru-open-api extract page.html -o ./out/
# Analyze a remote HTML file by URL (requires token)
mineru-open-api extract https://example.com/page.html -o ./out/
# Crawl a live web page (requires token)
mineru-open-api crawl https://example.com/article -o ./out/
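
The three examples above differ only in the subcommand and the input, so the choice can be sketched as a small shell helper. This is a minimal sketch, not part of the skill: `pick_subcommand` and `build_command` are hypothetical names, and the rule that a URL ending in `.html` or `.htm` is a remote HTML file while any other URL is a live page is an assumption. Token handling is left out because the page does not say how the token is supplied. The helper only prints the command so it can be reviewed before running.

```shell
# Hypothetical helper: choose a mineru-open-api subcommand for a given input.
# Assumption: URLs ending in .html/.htm are remote HTML files (extract);
# any other URL is treated as a live web page (crawl); anything else is a local file.
pick_subcommand() {
  case "$1" in
    http://*.html|https://*.html|http://*.htm|https://*.htm) echo extract ;;
    http://*|https://*) echo crawl ;;
    *) echo extract ;;
  esac
}

# Print the command instead of executing it. The output directory
# defaults to ./out/ as in the examples.
build_command() {
  printf 'mineru-open-api %s %s -o %s\n' "$(pick_subcommand "$1")" "$1" "${2:-./out/}"
}

build_command page.html
build_command https://example.com/page.html
build_command https://example.com/article
```

The three calls reproduce the three example commands. To run one instead of printing it, pass the output of `build_command` to `sh -c` after checking it.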
Tips & Best Practices
Use extract with a token rather than flash-extract.
For live web pages, use mineru-open-api crawl instead of extract.
Use -o to save to a file or directory.
clawhub install html-analysis