Interactive wizard
Run siteone-crawler without any arguments and an interactive wizard guides you through the configuration. You choose a crawl mode, enter the target URL, fine-tune settings with the arrow keys, and the crawler starts immediately โ no need to remember CLI flags.
./siteone-crawlerChoosing a crawl mode
Section titled โChoosing a crawl modeโThe wizard opens with a searchable menu of 10 preset modes plus two entries for serving previously exported sites:
? Choose a crawl mode:โฏ Quick Audit Fast site health overview โ crawls all pages and assets SEO Analysis Extract titles, descriptions, keywords, and OpenGraph tags Performance Test Measure response times with cache disabled โ find bottlenecks Security Check Check SSL/TLS, security headers, and redirects site-wide Offline Clone Download entire website with all assets for offline browsing Markdown Export Convert pages to Markdown for AI models or documentation Stress Test High-concurrency load test with cache-busting random params Single Page Deep analysis of a single URL โ SEO, security, performance Large Site Crawl High-throughput HTML-only crawl for large sites (100k+ pages) Custom Start from defaults and configure every option manually โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ Browse offline export Serve a previously exported offline site via HTTP Browse markdown export Serve a previously exported markdown site via HTTP[โโ to move, enter to select, type to filter]Preset modes
Section titled โPreset modesโ| Mode | What it does |
|---|---|
| Quick Audit | Fast site health overview โ crawls all pages and assets. |
| SEO Analysis | Extract titles, descriptions, keywords, and OpenGraph tags. |
| Performance Test | Measure response times with cache disabled โ find bottlenecks. |
| Security Check | Check SSL/TLS, security headers, and redirects site-wide. |
| Offline Clone | Download entire website with all assets for offline browsing. |
| Markdown Export | Convert pages to Markdown for AI models or documentation. |
| Stress Test | High-concurrency load test with cache-busting random params. |
| Single Page | Deep analysis of a single URL โ SEO, security, performance. |
| Large Site Crawl | High-throughput HTML-only crawl for large sites (100k+ pages). |
| Custom | Start from defaults and configure every option manually. |
Serve a previous export
Section titled โServe a previous exportโ| Entry | What it does |
|---|---|
| Browse offline export | Serve a previously exported offline site via HTTP. |
| Browse markdown export | Serve a previously exported markdown site via HTTP. |
Settings form and equivalent CLI command
Section titled โSettings form and equivalent CLI commandโAfter you select a preset and enter the URL, the wizard shows a settings form where you can adjust options such as workers, timeout, content types, export options, and more.
Before the crawl starts, the wizard displays a configuration summary with the equivalent CLI command. Copy that command to reuse the exact same run later without the wizard โ for example in a script or a CI pipeline.
Serving existing exports
Section titled โServing existing exportsโIf existing offline or markdown exports are detected in ./tmp/, the wizard also offers to serve them via the built-in HTTP server directly from the menu, so you can browse a previous export without re-crawling.
Related
Section titled โRelatedโ- Command-line options โ the full flag reference behind every wizard setting.
- Browser rendering โ for crawling JavaScript/SPA sites in a real Chromium browser.