Skip to content
#

pdfium

Here are 66 public repositories matching this topic...

kreuzberg

A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 91+ formats. Available for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, R, C, TypeScript (Node/Bun/Wasm/Deno)- or use via CLI, REST API, or MCP server.

  • Updated Apr 13, 2026
  • Rust
.github

Kreuzberg is a fast, polyglot document intelligence engine with a Rust core. It extracts structured data from 88+ document formats using streaming parsers and built-in OCR. Designed for RAG pipelines, batch workloads, and production deployments.

  • Updated Apr 10, 2026

Improve this page

Add a description, image, and links to the pdfium topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the pdfium topic, visit your repo's landing page and select "manage topics."

Learn more