Skip to content

SiteOne Crawler

The free Swiss-Army-knife for your website. One dependency-free binary deep-crawls any site and turns it into an actionable audit report, a full offline copy, or clean Markdown for AI — in seconds, from a friendly GUI or a powerful CLI.

…or install the command-line version:

curl -1sLf https://dl.cloudsmith.io/public/janreges/siteone-crawler/setup.deb.sh | sudo -E bash
sudo apt install siteone-crawler
siteone-crawler --url=https://your-site.com
curl -1sLf https://dl.cloudsmith.io/public/janreges/siteone-crawler/setup.rpm.sh | sudo -E bash
sudo dnf install siteone-crawler
siteone-crawler --url=https://your-site.com
curl -1sLf https://dl.cloudsmith.io/public/janreges/siteone-crawler/setup.rpm.sh | sudo -E bash
sudo zypper install siteone-crawler
siteone-crawler --url=https://your-site.com
curl -1sLf https://dl.cloudsmith.io/public/janreges/siteone-crawler/setup.alpine.sh | sudo -E bash
sudo apk add siteone-crawler
siteone-crawler --url=https://your-site.com

Prefer Homebrew? brew install janreges/tap/siteone-crawler

# Homebrew (macOS)
brew install janreges/tap/siteone-crawler
siteone-crawler --url=https://your-site.com
  1. Download siteone-crawler-*-win-x64.zip from the releases page.
  2. Unzip it to any folder.
  3. Open cmd or PowerShell in that folder and run:
siteone-crawler.exe --url=https://your-site.com

Windows install guide →

All install options & first run →

See the website crawler in action (3 videos)

Section titled “See the website crawler in action (3 videos)”

Audit report + quality score

Every crawl ends with an interactive HTML audit report and a 0–10 quality score across Performance, SEO, Security, Accessibility & Best Practices — plus a “Fix first” panel that ranks the highest-impact issues so you know exactly where to start.

💡 See a live report for astro.build, netlify.com or nextjs.org.

Block bad deploys (CI/CD gate)

Wire it into your pipeline with the built-in CI/CD quality gate (--ci): configurable thresholds, JUnit XML and GitHub annotations, with exit code 10 on failure to fail the build automatically. Doubles as a post-deploy cache warmer.

Clone any website for offline use

Generate a static offline clone (mirror) of your whole site — assets, fonts, documents and 3rd-party resources included — to browse without a server. Great for archival and disaster-recovery plans.

💡 Browse a real clone of docs.astro.build or netlify.com.

Crawl JavaScript / SPA sites

Optional --browser mode renders each page in a real Chromium (CDP), so React, Vue, Svelte, Angular & Next.js sites are crawled, exported and converted with their post-render DOM.

Run it headful (--browser-headful) to watch the browser open and click through every page live, capture a screenshot of every crawled page (viewport or full-page, as PNG/JPG/WebP) and stitch them into a GIF or MP4 video time-lapse, and collect console / JavaScript / network diagnostics for each URL.

Bring your own LLM (AI assistant)

Plug any LLM into the analysis — commercial (OpenAI, Anthropic, Google Gemini) or fully local & private (vLLM, Ollama, LM Studio, or any OpenAI-compatible endpoint), so your data never has to leave your machine.

The opt-in AI assistant adds AI-graded SEO, an executive summary across security, accessibility, performance & infrastructure, spelling/grammar checks, llms.txt and your own custom prompts — with strict cost controls and a --ai-dry-run preview.

…and that’s not all: deep crawling, SEO & OpenGraph, accessibility, security, performance, broken links & 404s, sitemaps, stress testing, email reports and more. See all features →

Grab the desktop app or CLI from the top of this page — or explore every option: ready-to-use packages (Homebrew, apt, dnf, zypper, apk), system requirements, build from source and the desktop application.

Share it

Know someone who builds or maintains websites? Send them the link.