-
auditable-info
High-level crate to extract the dependency trees embedded in binaries by cargo auditable
-
kreuzberg
High-performance document intelligence library for Rust. Extract text, metadata, and structured data from PDFs, Office documents, images, and 75+ formats with async/sync APIs.
-
scrape-cli
Command-line HTML extraction tool powered by scrape-rs
-
unbundle
media files - extract still frames, audio tracks, and subtitles from video files
-
serde-rename-rule
Serde RenameRule
-
rspack_plugin_extract_css
rspack extract css plugin
-
byteripper
extract the binary code from every function in a library
-
auditable-extract
Extract the dependency trees embedded in binaries by cargo auditable
-
legible
port of Mozilla's Readability.js for extracting readable content from web pages
-
select
extract useful data from HTML documents, suitable for web scraping
-
valq
macros for querying semi-structured data with JavaScript-like syntax
-
serde_evaluate
Extract single scalar field values from Serializable structs without full deserialization
-
droid-juicer
Extract firmware from Android vendor partitions
-
jsonic
Fast, small JSON parsing library for rust with no dependencies
-
src2md
Turn source code into a Markdown document with syntax highlighting, or extract it back
-
kreuzberg-cli
Command-line interface for Kreuzberg document intelligence
-
rosetree
A fast command-line tool for scanning directories, analyzing file structures, and extracting file contents with gitignore support
-
readabilityrs
port of Mozilla's Readability library for extracting article content from web pages
-
readable-rs
A native Rust port of Mozilla's Readability algorithm for extracting readable content from HTML pages
-
langextract-rust
extracting structured and grounded information from text using LLMs
-
symposium-ferris
Ferris MCP server - helpful tools for Rust development
-
daipendency
AI coding assistants with public API from dependencies
-
mkq
Query and extract data from Markdown files
-
serde-attributes
Serde Attributes
-
rustapi-core
The core engine of the RustAPI framework. Provides the hyper-based HTTP server, router, extraction logic, and foundational traits.
-
keyword_extraction
Collection of algorithms for keyword extraction from text
-
ensemblcov
human genomics
-
oak-symbols
Symbol extraction and management for the Oak framework
-
meta_oxide
Universal metadata extraction library supporting 13 formats (HTML Meta, Open Graph, Twitter Cards, JSON-LD, Microdata, Microformats, RDFa, Dublin Core, Web App Manifest, oEmbed, rel-links…
-
http-scrap
HTTP parsing methods made easier to use
-
r402-mcp
MCP (Model Context Protocol) integration for the x402 payment protocol
-
zoecss
CLI for ZoeCSS — scan, extract, cache, and output CSS
-
kalax
High-performance time series feature extraction library
-
graph-rdfa-processor
Graph RDFa processor
-
mineru-cli
Command-line interface for MinerU document extraction
-
chadselect
Unified data extraction — Regex, XPath 1.0, CSS Selectors, and JMESPath behind one query interface
-
refyne
Official Rust SDK for the Refyne API - LLM-powered web extraction
-
oss_porter_cli
Command-line interface for OSS Porter: A tool to extract and sync projects from internal to public Git repositories
-
html2json
HTML to JSON extractor
-
omniparse
toolkit for detecting and extracting metadata, text, and content from various file formats
-
rialo-aggregators-utils
Rialo Aggregators Utils
-
trek-rs
A web content extraction library that removes clutter from web pages
-
rem-repairer
Lifetime repairer for Rusty Extraction Maestro
-
yew_extra
Extract Axum request data within Yew server functions similar to how leptos_axum provides extraction helpers for Leptos
-
lol_chat_parser
A parser for League of Legends chat logs that extracts structured data into JSON
-
extract-strings
Extract ascii strings from files
-
html-query
jq, but for HTML
-
chill-json
At times JSON is enclosed in surrounding text and often created by tools like LLMs or humans with no strict adherence to formatting. JSON is often not complete or incorrect or commas are missing or braces are there…
-
email-extract
Intelligent email parsing with structured type extraction
-
exhume_body
Format-agnostic data extraction from disk images and other potential data structures
-
exhume_ntfs
Extract NT Filesystem specific artefacts from a given Partition
-
trace2power
Reads VCD and FST signal traces and extracts accumulated power activity data for use with power analysis tools
-
exhume_extfs
Extract Extended Filesystem specific artefacts from a Partition
-
ticker-sniffer
extracting multiple stock ticker symbols from a text document
-
ds-rom
extracting/building Nintendo DS ROMs
-
rawtojpg
A very fast embedded JPEG extractor from RAW files
-
line-rs
Extract lines from files without hacks!
-
gradientor
CLI tool to extract the most prominent colors in an image
-
exhume_partitions
Extract GPT and (E/M)BR partitions from a Body of data
-
c2r
A C to Rust conversion program
-
pdf2john
Extract a hash from an encrypted PDF for cracking with John the Ripper or Hashcat
-
seedance_ai_video
High-quality integration for https://supermaker.ai/video/seedance-ai-video/
-
fs_query
A Model Context Protocol server for efficient code symbol extraction using Tree-sitter
-
facebook_totem
extracting and analyzing Facebook post data
-
oss_porter_core
Core library for OSS Porter: Provides logic for Git operations, state management, extraction, and updates
-
surfing
parsing JSON objects from text streams
-
exhume_exfat
Extract exFAT Filesystem specific artefacts from a given Partition
-
bitlab
Extracting a range of bits from a binary data source
-
polyfuzzy
Fuzzy message detection with randomized and compact keys
-
netron
Extract Axum request data within Yew server functions similar to how leptos_axum provides extraction helpers for Leptos
-
mineru-sdk
Rust SDK for MinerU API - document extraction services
-
tldextract
extract domain info from a url
-
git-indexer
extracting git repository information
-
atlas-token-confidential-transfer-proof-extraction
Atlas Program Library Confidential Transfer Proof Extraction
-
download-extract-progress
downloading and extracting files with progress tracking
-
isosurface
extraction algorithms
-
keyphrases
Rapid Automatic Keyword Extraction (RAKE) implementation in Rust
-
vec_extract_if_polyfill
Polyfill for Vec::extract_if
-
extract-frontmatter
that allows a user to extract an arbitrary number of lines of 'front-matter' from the start of any string
-
linky
Extract links from Markdown files and check links for brokenness
-
rust-audit-info
Command-line tool to extract the dependency trees embedded in binaries by cargo auditable
-
enontekio
solve problems with data extraction and manipulation, like Advent of Code puzzles
-
box_into_inner
Box::into_inner
-
rem-controller
Non-local control flow repairer for Rusty Extraction Maestro
-
drug-extraction-cli
A CLI for extracting drugs from text records
-
markdown-extract
Extract sections of a markdown file
-
rem-borrower
Permission repairer for Rusty Extraction Maestro
-
klepto
a general purpose extraction and data scraping utility
-
exeico
-
rustlol
A wad files lib
-
html_simple_parser
parser for html files to extract tags, child tags, attributes, etc
-
uurl
A transformer and manipulator for Urls. Can be used via CLI or as a library.
-
exhume_artefacts
This exhume module brings together all of the community-maintained parsers for parsing and extracting artefacts in a standardized way
-
archive
A unified interface for extracting common archive formats in-memory
-
bolivar
PDF content extraction library (WIP)
-
jpeg_extractor
Extract jpeg images from any binary via command line
-
orbis-pkg-util
CLI for working with PlayStation 4 PKG files
-
mdnt-support-macros
Proc-macros for defining groups for extraction
-
colorgram
that extracts colors from image. Port of colorgram.py
-
substr_rs
Easy substring extraction
-
zim
ZIM reading and extraction
-
select-html
Extract HTML using CSS selectors in the command-line
-
halldyll-parser
HTML/CSS parsing and content extraction for halldyll scraper
-
frontmatter
A Fairly Trivial Wrapper for yaml-rust to Extract Frontmatter from a String Slice
-
webbundle-cli
WebBundle cli
-
rem-extract
Providing extract method capability for the REM toolchain
-
scid_parse
Extract chess games from SCID databases
-
meshed
Graph creation and traversal tools
-
skyrim-cell-dump
binary for parsing Skyrim plugin files and extracting CELL data
-
halldyll-media
Media extraction (images, videos, links) for halldyll scraper
-
tldextract-rs
extract domain info from a url
-
log_parser_kma
Rust-based log file parser, helping extract datetime, log levels and messages
-
eorst
offers a library aiming to simplify the writing of raster processing pipelines in rust
-
windows-icons
extract icons from files on Windows
-
cargo-shipshape
Cargo subcommand to sort Rust file items by type and name
-
wappu
fast and flexible web scraping library for Rust, designed to efficiently navigate and extract data from websites. Perfect for data mining, content aggregation, and web automation tasks.
-
titlecat
extract title from URL
-
squid_ewe
A helper tool for squid that extracts CFG metadata from C code
-
cargo-context-ranger
Quickly grab your rust context for building LLM prompts to help
-
rust-pickaxe
HTML data extraction library
-
l3_extract
extract layer 4 connection from layer 3
-
loadsmith
Mod extraction and (un)installation utilities for a range of mod loaders
-
opengraph
Parses html and extracts Open Graph protocol markup
-
tstpmove
extract TypeScript types from multiple files into a single file
-
swc-vanilla-extract-visitor
Vanilla-extract custom transform visitor for SWC
-
chinese2digits
The best tool for converting Chinese numbers to digits. A useful tool in NLP and robot projects.
-
drug-extraction-core
A core library for extracting drugs from text records
-
jpgfromraw
A very fast embedded JPEG extractor from RAW files
-
videohash
functionality for computing perceptual hashes (pHash) and difference hashes (dHash) from video files. This crate extracts frames from videos and computes these hashes for each frame.
-
swagstract
extract a subset of OpenAPI 3.* spec from yaml files
-
aoe-djin
Djin is a utility crate to extract Age of Empires II: Definitive Edition game data
-
unrpa_rs
A multithreaded CLI program and library to extract RenPy archives (RPAs)
-
toolcraft
A modular Rust toolkit
-
shank
Exposes macros to annotate Rust programs to extract solita compatible IDL in order to generate program SDKs
-
mapwords
HashMap based keyword extraction
-
rs-wikibzip2pages
Extract Wikipedia page tags from concatenated bzip2 files
-
torito-rs
extract bootable images from ISO files
-
board_game_parser
A Rust-based parser for board game data, designed for efficient data extraction and transformation
-
pluck
Extract values conveniently
-
msi-extract
extract files from MSI packages
-
repo2prompt
Extract repository content into XML, JSON, or plain text format
-
json-value
Helper method to get the json value
-
textcat
detect text categories. It can be used to detect the language of a given text
-
usb-bpm-exporter
USB Blood Pressure Monitor data extraction library and CLI tool
-
serde_extract
Enables remapping two structs using the Serde framework
-
tarrasque
zero-allocation parsing of binary formats
-
minifs-extractor
CLI tool to extract files from a minifs binary
-
bevy_subworlds
multi-world support for bevy
-
exe2swf
Extract Flash .swf files from Windows .exe files
-
keyword-tools
Rust tools for keyword extraction and similarity search
-
ad_event_log_parser
parser for analyzing ad event logs to extract insights from click report data
-
fitgirl-ddl-lib
extract DDL from fitgirl-repacks.site
-
rdmc
Run commands from your readme as if it's a Makefile
-
sp-transaction-storage-proof
Transaction storage proof primitives
-
binary-extract
Extract a value from a json string without parsing the whole thing
-
indicator-extractor
Extract indicators (IP, domain, email, hashes, etc.) from a string or a PDF file
-
books_description_parser
A Rust-based parser to extract book details from structured markdown-like text and output them in formats like JSON or Rust structs for further processing
-
wiki_corpus
Extract text from Wikipedia dumps (.bz2) and convert it to JSONLines format
-
hax-driver
The custom rustc driver used by hax
-
simple-bits
trait to extract and replace bits in integer types
-
ziplyn
A fast and lightweight file compression and extraction tool built in Rust
-
cha-rs
Extract specific characters from an input
-
cda-dl
Minimal async library for extracting video stream URLs from cda.pl
-
spl-token-confidential-transfer-proof-extraction
Solana Program Library Confidential Transfer Proof Extraction
-
csv-slice
Extract rows or columns from CSV files without loading the entire file
-
signet-extract
Logic for extracting events and other data from host chain blocks, used in Signet
-
env-extractor
Modules to extract environment variables
-
cosmetics_parser
A Rust-based parser to extract product details from cosmetics catalogs in markdown format and output them in structured formats like JSON or Rust structs
-
dockerdump
Extract any file from any Docker image
-
jars
jar extraction library
-
wiktionary-part-of-speech-extract
English Wiktionary parsed for part-of-speech info and placed into a precompiled FST
-
tld_extract
extract domain info from a url
-
extract_jsons_from_string
extract valid JSONs from a string to a vector
-
tarutil
CLI utility to extract tarballs with conflicting file paths on case-insensitive operating systems
-
markdown-extract-cli
Extract sections of a markdown file with a regular expression
-
extract-repo-url
Small CLI tool to extract repository URL from text (from clipboard by default)
-
nds
handling Nintendo DS ROM files
-
pdf-sign
extract signed date from pdf file
-
demsf-rs
A rewrite of https://github.com/phlbrz/demsf - DEMSF are bash scripts to Download and extract Microsoft Sharepoint file
-
keepass-dump-extractor
Find and collect parts of a Keepass master key to recover it in plain text from a memory dump
-
rust-vhost
extract SNI from tls handshake where you transfer traffic to any destination you want
-
simple-bitrange
manipulating bit ranges which is common when working with IC registers
-
h2s_core
A core part of h2s
-
vimeo-rs
vimeo contents for Rust
-
libsubid-dylib
Shadow compatible nsswitch module for sub?id extraction from various sources
-
token_address_extractor
extracting blockchain addresses from text
-
gpt4ocr
Extract structured text from PDFs using OpenAI's GPT4o
-
readable-readability
Really fast readability
-
literate
programming tool that extracts code written in your Markdown files
-
dotext
extract readable text from specific document formats such as Word documents (docx). Only a few formats are currently supported; others are coming soon.
-
rialo-s-spl-token-confidential-transfer-proof-extraction
Solana Program Library Confidential Transfer Proof Extraction
-
cargo-dependencies
A Cargo extension that prepares (downloads & builds) the dependencies of a specific Rust project
-
wiki_corpus_parser
Extract text from Wikipedia dumps (.bz2) and convert it to JSONLines format
-
rex-regextract
extracts key value pairs out of text
-
docker_extract
extract the filesystem from a docker image
-
recipe-scraper
parsing structured recipes from the web
-
config-parse
check and extract certain key-values from your config files
-
invoice2storage
Extract email attachments and store them in different backends such as WebDAV or a folder
-
spring-boot-layertools
Faster Spring Boot layertools extraction in Rust
-
numscan
Extract numbers from text
-
top_n_tail
A CLI Utility to extract text from files or stdin
-
openapi-parser
Extract schemas definitions tree from OpenAPI documents
-
textract
extract text from various types of files
-
rust-govhost
extract SNI from tls handshake where you transfer traffic to any destination you want
-
elvui-refresh
A program to download, extract and install the latest version of ElvUI
-
axum-service-extract
axum (v0.5) service extract
-
hub-macro
Proc macros for hub method definitions with session types
-
ioncodes/snesutils
SNES ROM extraction utilities
-
dext
A CLI tool to extract and unpack the layers of a docker image
-
opengraph-rs
Parses html and extracts Open Graph protocol markup. Fork of https://github.com/kumabook/opengraph