-
hkdf
HMAC-based Extract-and-Expand Key Derivation Function (HKDF)
-
psl
Extract root domain and suffix from a domain name
-
hmac-sha256
A small, self-contained SHA256, HMAC-SHA256, and HKDF-SHA256 implementation
-
warp
serve the web at warp speeds
-
grok
popular Java & Ruby grok library which allows easy text and log file processing with composable patterns
-
tracing-opentelemetry-instrumentation-sdk
A set of helpers to build OpenTelemetry instrumentation based on
tracingcrate -
rusp
USP toolkit
-
serde-reflection
Extract representations of Serde data formats
-
pdf-extract
extract content from pdfs
-
rexif
native Rust crate, written to extract EXIF data from JPEG and TIFF images
-
ipld-core
IPLD core types
-
auditable-info
High-level crate to extract the dependency trees embedded in binaries by
cargo auditable -
unrar
list and extract RAR archives
-
xx
A collection of useful Rust macros and small functions
-
bitter
Swiftly extracts unaligned bit-level data from a byte slice
-
mago-docblock
Analyzes PHP docblocks to extract annotations, tags, and documentation comments, aiding tools that rely on inline documentation
-
kreuzberg
High-performance document intelligence library for Rust. Extract text, metadata, and structured data from PDFs, Office documents, images, and 75+ formats with async/sync APIs.
-
html-to-markdown-rs
High-performance HTML to Markdown converter using the astral-tl parser. Part of the Kreuzberg ecosystem.
-
arx
A fast, mountable file archive based on Jubako container. Replacement of tar and zip.
-
phonelib
A comprehensive library for phone number validation, formatting, parsing, and manipulation
-
auto-palette
🎨 A Rust library that extracts prominent color palettes from images automatically
-
postgresql_archive
downloading and extracting PostgreSQL archives
-
html-to-markdown-cli
Command-line interface for html-to-markdown - high-performance HTML to Markdown converter
-
libcrux-hacl-rs
Formally verified Rust code extracted from HACL* - helper library
-
slint-tr-extractor
Tool used to update extract @tr string out of Slint files into gettext .po file
-
zip-extensions
An extension crate for zip
-
dom_smoothie
extracting relevant content from web pages
-
treeclip
A CLI tool to traverse your project files and extract them into a single file or clipboard
-
rstructor
Rust equivalent of Python's Instructor + Pydantic: Extract structured, validated data from LLMs (OpenAI, Anthropic, Grok, Gemini) using type-safe Rust structs and enums
-
transvoxel
Eric Lengyel's Transvoxel Algorithm
-
vicut
A CLI text processor that uses Vim commands to transform text and extract fields
-
document-features
Extract documentation for the feature flags from comments in Cargo.toml
-
error-toon
Compress verbose browser errors for LLM consumption. Save 70-90% tokens.
-
working-memory
Working memory for AI coding assistants
-
chewdata
Extract Transform and Load data
-
scrape-cli
Command-line HTML extraction tool powered by scrape-rs
-
zarja
Extract Protocol Buffer definitions from compiled binaries
-
nosy-cli
nosy: various contents summarization tool powered by artificial intelligence
-
merkurio
Quick k-mer-based FASTA/FASTQ sequence record extraction, and SAM/BAM record filtering plus file annotation with k-mer tags
-
inkanim
CLI tool to quickly extract infos from JSON exports of .inkwidget and .inkanim
-
telemetry-rust
Open Telemetry fox Axum and Tracing
-
rexturl
split urls in their protocol, host, port, path and query parts
-
exarch-cli
Command-line utility for secure archive extraction and creation
-
libsui
A injection tool for executable formats (ELF, PE, Mach-O) that allows you to embed files into existing binary and extract them at runtime
-
unbundle
media files - extract still frames, audio tracks, and subtitles from video files
-
simple-string-patterns
Makes it easier to match, split and extract strings in Rust without regular expressions. The parallel string-patterns crate provides extensions to work with regular expressions via the Regex library
-
turbovault-parser
Obsidian Flavored Markdown (OFM) parser
-
serde-rename-rule
Serde RenameRule
-
unidiff
Unified diff parsing/metadata extraction library for Rust
-
rspack_plugin_extract_css
rspack extract css plugin
-
askama_escape
HTML escaping, extracted from Askama
-
nameback
Rename files based on their metadata with multi-language OCR, HEIC support, and video frame extraction
-
iq
introspection with dynamic queries
-
article_scraper
Scrap article contents from the web. Powered by fivefilters full text feed configurations & mozilla readability.
-
kibank
Kilohearts banks
-
subtr-actor
Rocket League replay transformer
-
rake
Rapid Automatic Keyword Extraction (RAKE) algorithm
-
formatjs_cli
Command-line interface for FormatJS - A Rust-based CLI for internationalization
-
rsrpp
project for research paper pdf
-
exarch-core
Memory-safe archive extraction library with security validation
-
xtr
Extract strings from a rust crate to be translated with gettext
-
photo_sort
rename and sort photos/videos by its EXIF date/metadata. It tries to extract the date from the EXIF data or file name and renames the image file according to a given format string…
-
uefisettings
read/get/extract and write/change/modify BIOS/UEFI settings from Linux terminal
-
bolivar-cli
PDF text extraction CLI tools
-
rawler
extract images and metadata from camera raw formats
-
payload_dumper
A fast and efficient Android OTA payload dumper library and CLI
-
kractor
Extract reads from a FASTQ file based on taxonomic classification via Kraken2
-
byteripper
extract the binary code from every function in a library
-
urx
Extracts URLs from OSINT Archives for Security Insights
-
cargo-rail
Graph-aware testing, dependency unification, and crate extraction for Rust monorepos
-
rargz
Fast parallel tar + zstd archiver and extractor with optional chunked format
-
datasheet-cli
Extract footprint/land-pattern drawings from PDF datasheets
-
boundbook
rewrite of the Bound Book format by ef1500
-
dylex
A high-performance dyld shared cache extractor for macOS and iOS
-
ts2mp4
CLI tool for converting MPEG-TS files to MP4 format
-
legible
port of Mozilla's Readability.js for extracting readable content from web pages
-
auditable-extract
Extract the dependency trees embedded in binaries by
cargo auditable -
ai-blame
Extract provenance from AI agent execution traces - like git blame, but for AI-assisted edits
-
select
extract useful data from HTML documents, suitable for web scraping
-
sancus
open-source tool that extracts third-party license information from a deployment-ready application
-
inbq
parsing BigQuery queries and extracting schema-aware, column-level lineage
-
rem-utils
Extraction Maestro
-
snm-brightdata-client
Bright Data Wrapper Client Highly compacted Data implemented in Rust with Actix Web
-
colx
Extract the specified columns from FILES or stdin
-
markdown-harvest
designed to extract, clean, and convert web content from URLs found in text messages into clean Markdown format. Originally created as an auxiliary component for Retrieval-Augmented Generation (RAG)…
-
oxiarc-cli
Command-line interface for OxiArc archive operations
-
extract-shellcode
Small Rust toolkit for pulling shellcode out of a Windows PE and (optionally) executing it in-memory
-
haruspex
Vulnerability research assistant that extracts pseudocode from IDA Hex-Rays decompiler
-
unpdf
High-performance PDF content extraction to Markdown, text, and JSON
-
openapi-subset
extract a subset of an OpenAPI specification based on specified criteria
-
rattler_package_streaming
Extract and stream of Conda package archives
-
flintbase
Google / Firebase API key analyzer and APK secret scanner — tests keys against 20+ endpoints and extracts hardcoded credentials from Android apps
-
kurateart-com-oss
Organize, tag, and manage digital art collections with smart categorization, metadata extraction, and gallery management features
-
glmimage-app-oss
Extract and analyze metadata from AI-generated images, including model signatures, generation parameters, and provenance tracking
-
valq
macros for querying semi-structured data with the JavaScript-like syntax
-
barnacle-rs
Advanced rate limiting middleware for Axum with Redis backend, API key validation, and custom key extraction
-
serde_evaluate
Extract single scalar field values from Serializable structs without full deserialization
-
media_analyzer
Extract file-based information from photo and video files
-
droid-juicer
Extract firmware from Android vendor partitions
-
webtek-grader
Aids in the process of extracting student deliverables, and leverages GPT to generate a proposal for the student feedback
-
oas-forge
The zero-runtime OpenAPI 3.1 compiler for Rust. Extracts, links, and merges code-first documentation.
-
llmx
working with LLM outputs (e.g. fuzzy JSON extraction/parsing).
-
markdown-org-extract
CLI utility for extracting tasks from markdown files with Emacs Org-mode support
-
src2md
Turn source code into a Markdown document with syntax highlighting, or extract it back
-
codemd
CLI tool to extract code from markdown files
-
wallrust
a blazingly fast and feature-rich tool extract color palettes from images
-
kreuzberg-cli
Command-line interface for Kreuzberg document intelligence
-
dom-content-extraction
Content extraction via text density paper
-
augur
Reverse engineering assistant that extracts strings and related pseudocode from a binary file
-
dmgwiz
Extract filesystem data from DMG files
-
net-shell
A script execution and variable extraction framework with SSH remote execution and local execution support, pipeline orchestration, and flexible variable extraction via regex
-
jsonic
Fast, small JSON parsing library for rust with no dependencies
-
vimg
CLI for video images. Generates animated video contact sheets fast.
-
rosetree
A fast command-line tool for scanning directories, analyzing file structures, and extracting file contents with gitignore support
-
uvm_detect
Unity project detection and version extraction library
-
twars-url2md
A powerful CLI tool that fetches web pages and converts them to clean Markdown format using Monolith for content extraction and htmd for conversion
-
readable-rs
A native Rust port of Mozilla's Readability algorithm for extracting readable content from HTML pages
-
readabilityrs
port of Mozilla's Readability library for extracting article content from web pages
-
router_prefilter
Fast prefix-based prefiltering for router pattern matching
-
secretary
Transform natural language into structured data using large language models (LLMs) with powerful derive macros
-
symposium-ferris
Ferris MCP server - helpful tools for Rust development
-
ultra-nlp
A NLP library
-
js-source-scopes
extracting and dealing with scope information in JS code
-
svg_metadata
Extracts metadata (like the viewBox, width, and height) from SVG graphics
-
reasonkit-web
High-performance MCP server for browser automation, web capture, and content extraction. Rust-powered CDP client for AI agents.
-
spiffe-rustls-tokio
Tokio-native async accept/connect helpers for spiffe-rustls
-
mkq
Query and extract data from Markdown files
-
org-core
org-mode file operations with fuzzy search, outline extraction, ID-based lookups, and heading content retrieval
-
adk-browser
Browser automation tools for Rust Agent Development Kit (ADK-Rust) agents using WebDriver
-
auto-palette-cli
🎨 CLI tool to extract a prominent color palette from an image
-
safe_unzip
Secure zip extraction. Prevents Zip Slip and Zip Bombs.
-
langextract-rust
extracting structured and grounded information from text using LLMs
-
daipendency
AI coding assistants with public API from dependencies
-
unhwp
A high-performance library for extracting HWP/HWPX documents into structured Markdown
-
rustapi-core
The core engine of the RustAPI framework. Provides the hyper-based HTTP server, router, extraction logic, and foundational traits.
-
delharc
parsing and extracting files from LHA/LZH archives
-
keyword_extraction
Collection of algorithms for keyword extraction from text
-
jpq
A JSONPath command line tool to extract values from a JSON value
-
xsra
A performant and storage-efficient CLI tool to extract sequences from an SRA archive with support for FASTA, FASTQ, and BINSEQ outputs
-
bffextract
Extract content of BFF file (AIX Backup file format)
-
blind_watermark
Picture blind watermarking
-
lrcat-extractor
Extract data from Adobe Lightroom™ catalogs
-
sit-rs
Rust-native extraction for StuffIt Expander archive files
-
derive_from_env
Extract type safe structured data from environment variables with procedural derive macros
-
taskfinder
A terminal user interface that extracts and displays tasks from plain text files, hooking into your default terminal-based editor for editing
-
rust_info
Extracts the current rust compiler information
-
blitztext
fast keyword extraction and replacement in strings
-
serde-attributes
Serde Attributes
-
pbzx
parsing, extracting, and creating PBZX archives (Apple's payload format)
-
rawloader
extract the data from camera raw formats
-
phab-comments-to-md
Extract Phabricator review comments and format them as Markdown for analysis by LLM agents
-
remeta
extracting metadata from various audio, video, and image formats
-
undoc
High-performance Microsoft Office document extraction to Markdown
-
archive-pdf-urls
Extract all links from a PDF and archive the URLs in the Internet Archive's Wayback Machine
-
nginx-discovery
Parse, analyze, and extract information from NGINX configurations
-
eb_nordpool
Extract elspot prices from nordpool
-
injet
Inject and extract files into PNG images using LSB (Least Significant Bit) method
-
ensemblcov
human genomics
-
rar
Rust native RAR extractor based upon nom
-
kapy-exif
A minimal library that extracts and replaces EXIF for images
-
dpp
DMG + HFS+/APFS + PKG + PBZX pipeline - walk through Apple disk images to extract packages
-
tree-sitter-tags
extracting tag information
-
openapi-from-source
Generates OpenAPI document in YAML/JSON from RUST source code using Axum or Actix-Web
-
reqwest-scraper
Web scraping integration with reqwest
-
m4b-extractor
CLI tool to extract chapters, metadata and cover for M4B Audiobook
-
pnger
Cross-platform PNG steganography tool for embedding and extracting payloads
-
frontmatter-gen
generating and parsing frontmatter in various formats
-
oak-symbols
Symbol extraction and management for the Oak framework
-
pdfvec
High-performance PDF text extraction library for vectorization pipelines
-
enpow
Generating methods for user defined enums as known from Option<T> or Result<T, E>
-
media-type-version
Extract the format version from a media type string
-
meta_oxide
Universal metadata extraction library supporting 13 formats (HTML Meta, Open Graph, Twitter Cards, JSON-LD, Microdata, Microformats, RDFa, Dublin Core, Web App Manifest, oEmbed, rel-links…
-
styx-embed
Embed Styx schemas in binaries for zero-execution discovery
-
http-scrap
HTTP parsing methods made easier to use
-
r402-mcp
MCP (Model Context Protocol) integration for the x402 payment protocol
-
wascap
wasmCloud Capabilities. Library for extracting, embedding, and validating claims
-
ipware
Http Header Client Ip Extraction Utility
-
arcella-inspect
Static analysis of Rust code to extract structured metadata (functions, structs, calls) as YAML or structured data
-
cargo-extract-test
A Cargo subcommand to extract and run a single test function in isolation
-
sphinx-rustdocgen
Executable to extract rustdoc comments for Sphinx
-
bg3rustpaklib
reading and extracting Baldur's Gate 3 PAK files
-
zoecss
CLI for ZoeCSS — scan, extract, cache, and output CSS
-
structecs
A structural data access framework. Type-safe extraction from nested structures with Arc-based smart pointers.
-
pdf_oxide
The Complete PDF Toolkit: extract, create, and edit PDFs. Rust core with bindings for Python, Node, WASM, Go, and more.
-
entpxl
extract layers from pictures taken with Google Pixel phones
-
git_info
Extracts git repository information
-
ruvector-scipix
Rust OCR engine for scientific documents - extract LaTeX, MathML from math equations, research papers, and technical diagrams with ONNX GPU acceleration
-
sax
Smart archiving and extracting utility
-
rust-grib-decoder
decode GRIB2 CCSDS/AEC (template 5.0=42) payloads and extract Section 7 payloads per message
-
discord_rust_scraper
DiscordRustScraper is a powerful Discord data scraper built in Rust, designed to extract and format channel data for further analysis. It efficiently scrapes message history from specified…
-
kalax
High-performance time series feature extraction library
-
thaf
Extracts transcript sequences and gene maps from genome FASTA files using GFF3 annotations
-
apk-info-cli
A command-line tool to inspect and extract APK files
-
publicsuffix2
Extract root domain and suffix from a domain name
-
browser-url
Cross-platform (planned) library retrieving active browser URL and information
-
aimotioncontrol-net-oss
AI-Powered Motion Trajectory Analysis Library - Extract, analyze, and optimize motion control patterns from trajectory data
-
bamslice
Extract byte ranges from BAM files and convert to interleaved FASTQ format for parallel processing
-
rust-sitter-tool
The external tool for Rust Sitter that extracts grammars from Rust definitions
-
univideo-ai-oss
A comprehensive toolkit for AI-powered video processing, format conversion, and metadata extraction from video files
-
graph-rdfa-processor
Graph RDFa processor
-
mineru-cli
Command-line interface for MinerU document extraction
-
codegraph-c
C parser for CodeGraph - extracts code entities and relationships from C source files
-
chadselect
Unified data extraction — Regex, XPath 1.0, CSS Selectors, and JMESPath behind one query interface
-
refyne
Official Rust SDK for the Refyne API - LLM-powered web extraction
-
stagehand_sdk
Rust SDK for Stagehand - AI-powered browser automation with Browserbase
-
rconvolve
Fast convolution and impulse-response extraction for audio applications
-
git-tags-semver
extract SemVer Version Information from annotated git tags
-
yamlpath
Format-preserving YAML feature extraction
-
otadump
Extract partitions from Android OTA files
-
totebag
An API for extracting/archiving files and directories in multiple formats
-
oss_porter_cli
Command-line interface for OSS Porter: A tool to extract and sync projects from internal to public Git repositories
-
htmls
parsing HTML and extracting HTML elements or text
-
codegraph-swift
Swift parser for CodeGraph - extracts code entities and relationships from Swift source files
-
go-prefetch
Go module download accelerator
-
mtv-extract
Read media type strings, extract the format versions from them
-
cookie-scoop
Extract browser cookies from Chrome, Edge, Firefox, and Safari. Reads cookie databases directly with decryption support for macOS Keychain, Linux keyring, and Windows DPAPI.
-
unarc-cli
Universal Archive Extractor - CLI tool for extracting various archive formats
-
codegraph-python
Python parser plugin for CodeGraph - extracts code entities and relationships from Python source files
-
html2json
HTML to JSON extractor
-
omniparse
toolkit for detecting and extracting metadata, text, and content from various file formats
-
inf_vectorart
Extract vector art files from Halo Infinite
-
mace
Automated extration of malware configuration, focusing on C2 communication
-
cookie-scoop-cli
CLI tool for extracting browser cookies from Chrome, Edge, Firefox, and Safari
-
readability-rust
port of Mozilla's Readability library for extracting article content from web pages
-
unrar-async
List and extract .rar archives, async
-
ts-gettext-extractor
Extracts gettext strings from Javascript/TypeScript files
-
rialo-aggregators-utils
Rialo Aggregators Utils
-
rem-repairer
Lifetime repairer for Rusty Extraction Maestro
-
locksmith
Extract Postgres locks from a given SQL statement
-
reauth-sdk
Rust SDK for Reauth authentication
-
yew_extra
Extract Axum request data within Yew server functions similar to how
leptos_axumprovides extraction helpers for Leptos -
trek-rs
A web content extraction library that removes clutter from web pages
-
uninews
A universal news scraper for extracting content from various news blogs and news sites
-
image-palette
automatically extracting prominent color palettes from images
-
typed_tuple
Type-safe access, isolation and mutation of primitive tuple segments and elements
-
rasterkit
TIFF/GeoTIFF file structure analysis and manipulation tool
-
ocrs-cli
OCR CLI tool for extracting text from images
-
rust-strings
rust-stringsis a library to extract ascii strings from binary data -
lol_chat_parser
A parser for League of Legends chat logs that extracts structured data into JSON
-
doc_loader
A comprehensive toolkit for extracting and processing documentation from multiple file formats (PDF, TXT, JSON, CSV, DOCX) with Python bindings
-
systemprompt-security
Security module for systemprompt.io - authentication, authorization, JWT, and token extraction
-
union_square
A proxy/wire-tap service for making LLM calls and recording everything that happens in a session for later analysis and test-case extraction
-
thulp-browser
Browser automation tools for thulp
-
axgeom
that provides ability to extract 1d ranges out of 2d objects
-
webpage-info
Modern library to extract webpage metadata: title, description, OpenGraph, Schema.org, links, and more
-
extractous
fast and efficient way to extract content from all kind of file formats including PDF, Word, Excel CSV, Email etc... Internally it uses a natively compiled Apache Tika for formats are not supported natively by the Rust…
-
ripmap
Ultra-fast codebase cartography for LLMs
-
excelactor
Batch extracting data from many excel files based on input keyword
-
orbis-pkg
parsing and extracting PlayStation 4 PKG files
-
nodtool
CLI tool for extracting and converting GameCube and Wii disc images
-
postcode_extractor
extract and identify postcodes
-
libperl-config
Extract perl's build configs from Config.pm and others
-
extract-strings
Extract ascii strings from files
-
metastrip
Extract and strip metadata from image files (JPEG, PNG, TIFF, WebP)
-
phago-llm
LLM integration for Phago semantic intelligence
-
chill-json
At times JSON is enclosed in surrounding text and often created by tools like LLMs or humans with no strict adherence to formatting. JSON is often not complete or incorrect or commas are missing or braces are there…
-
ograph-rs
command-line utility to extract and print OpenGraph metadata from a given URL
-
html-query
jq, but for HTML
-
l4d2_addon_parser
Parse L4D2 VPK files and extract info from addoninfo.txt data and the mission data for campaigns, and other utilities
-
email-extract
Intelligent email parsing with structured type extraction
-
herzfeld
High-fidelity Epigraphic Rendering for Zonated Feature Extraction and Labelled Datasets
-
exhume_body
Format-agnostic data extraction from disk images and other potential data structures
-
linkcheck2
extracting and validating links
-
cargo-i18n
Cargo sub-command to extract and build localization resources to embed in your application/library
-
barbacane-spec-parser
OpenAPI 3.x and AsyncAPI 3.x spec parser with x-barbacane-* extension extraction
-
exhume_extfs
Extract Extended Filesystem specific artefacts from a Partition
-
ds-rom
extracting/building Nintendo DS ROMs
-
isosurface
extraction algorithms
Try searching with DuckDuckGo or on crates.io.