Omnivore is a high-performance, parallel web crawler and knowledge graph system built in Rust. It is open source and community-driven, and it extracts, analyzes, and graphs data from the web at scale.
Process 10,000+ pages per minute with the Tokio async runtime
Built-in robots.txt compliance and rate limiting
Build entity-relationship graphs automatically
AI-powered entity recognition and classification
Store data in RocksDB, PostgreSQL, and graph databases
Handle JavaScript-heavy sites with browser automation
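To make the robots.txt and rate-limiting feature concrete, here is a minimal sketch in Rust using only the standard library. It is not Omnivore's implementation: `is_allowed` and `HostLimiter` are hypothetical names, the matcher only understands plain `Disallow` prefixes (real robots.txt handling also covers `Allow` rules, wildcards, and user agents), and the limiter only computes a wait time rather than sleeping.

```rust
use std::collections::HashMap;
use std::time::{Duration, Instant};

/// Hypothetical helper: true if `path` is not covered by any Disallow prefix.
/// Empty prefixes are skipped, since an empty Disallow line blocks nothing.
fn is_allowed(disallowed: &[&str], path: &str) -> bool {
    !disallowed
        .iter()
        .any(|prefix| !prefix.is_empty() && path.starts_with(prefix))
}

/// Hypothetical per-host limiter enforcing a minimum gap between requests.
struct HostLimiter {
    min_gap: Duration,
    // Time at which the most recent request to each host is scheduled.
    last_hit: HashMap<String, Instant>,
}

impl HostLimiter {
    fn new(min_gap: Duration) -> Self {
        Self { min_gap, last_hit: HashMap::new() }
    }

    /// Returns how long the caller should wait before contacting `host`,
    /// given the current time `now`, and records the scheduled request time.
    fn reserve(&mut self, host: &str, now: Instant) -> Duration {
        let wait = match self.last_hit.get(host) {
            Some(&last) => (last + self.min_gap).saturating_duration_since(now),
            None => Duration::ZERO,
        };
        self.last_hit.insert(host.to_string(), now + wait);
        wait
    }
}
```

A crawler built this way would call `is_allowed` before queuing a URL and `reserve` just before fetching it, sleeping for the returned duration. Passing `now` in explicitly keeps the limiter deterministic and easy to test.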
```bash
brew tap Pranav-Karra-3301/omnivore
brew install omnivore
```
```bash
docker run --rm -it omnivore:latest omnivore --help
```
```bash
git clone https://github.com/Pranav-Karra-3301/omnivore.git
cd omnivore
cargo install --path omnivore-cli --force
```
```bash
omnivore crawl https://example.com --workers 5 --depth 3
```
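The `--depth` option in the crawl command suggests a depth-limited traversal. The sketch below shows one common way to do that, a breadth-first walk over an in-memory link map. It is an illustration only: `crawl_order` is a hypothetical function, the `links` map stands in for fetching a page and extracting its URLs, and treating the seed as depth 0 is an assumption rather than documented Omnivore behavior. Parallel workers are left out to keep the example short.

```rust
use std::collections::{HashMap, HashSet, VecDeque};

/// Breadth-first traversal from `seed`, following links at most `max_depth` hops.
/// `links` is a stand-in for fetching a page and extracting its outgoing URLs.
fn crawl_order(links: &HashMap<&str, Vec<&str>>, seed: &str, max_depth: usize) -> Vec<String> {
    let mut seen: HashSet<String> = HashSet::new();
    let mut queue: VecDeque<(String, usize)> = VecDeque::new();
    let mut order = Vec::new();

    seen.insert(seed.to_string());
    queue.push_back((seed.to_string(), 0));

    while let Some((url, depth)) = queue.pop_front() {
        order.push(url.clone());
        // Pages at the depth limit are visited, but their links are not followed.
        if depth == max_depth {
            continue;
        }
        if let Some(outgoing) = links.get(url.as_str()) {
            for &next in outgoing {
                // `insert` returns false for URLs already seen, which avoids revisits.
                if seen.insert(next.to_string()) {
                    queue.push_back((next.to_string(), depth + 1));
                }
            }
        }
    }
    order
}
```

With links `a -> [b, c]`, `b -> [d]`, and `d -> [e]`, a call with `max_depth` of 2 visits `a`, `b`, `c`, and `d` but never reaches `e`.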
```bash
omnivore graph results.json --output knowledge-graph.db
```
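As a rough picture of what an entity-relationship graph can look like in memory, here is a small Rust sketch. It makes no claim about the format of `results.json` or `knowledge-graph.db`: the `KnowledgeGraph` type, its methods, and the relation names are all hypothetical and chosen only for illustration.

```rust
use std::collections::{BTreeMap, BTreeSet};

/// Hypothetical in-memory graph: entity name -> set of (relation, target) edges.
#[derive(Default)]
struct KnowledgeGraph {
    edges: BTreeMap<String, BTreeSet<(String, String)>>,
}

impl KnowledgeGraph {
    /// Adds a directed, labeled edge; duplicate edges are ignored by the set.
    fn relate(&mut self, source: &str, relation: &str, target: &str) {
        self.edges
            .entry(source.to_string())
            .or_default()
            .insert((relation.to_string(), target.to_string()));
        // Register the target as a node even if it has no outgoing edges.
        self.edges.entry(target.to_string()).or_default();
    }

    /// Number of distinct entities seen so far.
    fn entity_count(&self) -> usize {
        self.edges.len()
    }

    /// Targets reachable from `source` through `relation`, in sorted order.
    fn targets(&self, source: &str, relation: &str) -> Vec<&str> {
        self.edges
            .get(source)
            .into_iter()
            .flatten()
            .filter(|(rel, _)| rel.as_str() == relation)
            .map(|(_, target)| target.as_str())
            .collect()
    }
}
```

Ordered maps and sets keep iteration deterministic, which makes output stable between runs; a hash-based structure would be the usual choice if lookup speed mattered more than ordering.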