SOL

A CLI tool that converts webpages to clean markdown. It uses a headless Chromium browser to render pages (including JavaScript-heavy ones), extracts the readable content, and outputs markdown.

Use it to:

Convert any webpage to context for an LLM
Save an article or blog post for offline reading
Archive documentation as plain text
Extract content from JS-heavy SPAs that simple scrapers can't handle

It runs in stealth mode to bypass basic bot detection, though it won't get past more advanced protections like Cloudflare Turnstile or aggressive CAPTCHAs.

Installation

Pre-built binaries

Grab the latest binary for your platform from the releases page.

Build from source

sol requires Rust (2024 edition). Build and install with:

cargo install --path .

Usage

# download chromium (one-time)
sol setup

# print markdown to stdout
sol fetch https://example.com

# save to a file
sol fetch https://example.com --output article.md

# wait longer for JS-heavy pages (default: 5000ms)
sol fetch https://example.com --wait-for 10000

# set a custom user-agent
sol fetch https://example.com --user-agent "MyBot/1.0"

Flags

Flag	Description
`--output <PATH>`	Write output to a file instead of stdout
`--wait-for <MS>`	Time to wait for the page to render, in milliseconds (default: 5000)
`--user-agent <STRING>`	Custom user-agent string for the browser

How it works

Launches headless Chromium and navigates to the URL
Waits for the page to render (handles JavaScript)
Extracts readable content using Mozilla's Readability algorithm (strips nav, ads, sidebars)
Converts the cleaned HTML to markdown

Platform support

macOS and Linux. Windows is not supported.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
src		src
tests		tests
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SOL

Installation

Pre-built binaries

Build from source

Usage

Flags

How it works

Platform support

About

Uh oh!

Releases 1

Uh oh!

Languages

License

thetinygoat/sol

Folders and files

Latest commit

History

Repository files navigation

SOL

Installation

Pre-built binaries

Build from source

Usage

Flags

How it works

Platform support

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Uh oh!

Languages