LangExtract Docling

LangExtract Docling is a lightweight wrapper around LangExtract that adds native support for processing PDF files via Docling.

Installation

pip install langextract-docling

Usage

import langextract_docling as lx

# Extract from plain text (same as LangExtract)
result = lx.extract(
    text_or_documents="Your document text here.",
    prompt_description="Extract entities",
    examples=[...]
)

# Extract from a local PDF
result = lx.extract(
    text_or_documents="path/to/document.pdf",
    prompt_description="Extract entities",
    examples=[...]
)

# Extract from a PDF URL
result = lx.extract(
    text_or_documents="https://example.com/document.pdf",
    prompt_description="Extract entities",
    examples=[...]
)

License

MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.github/workflows		.github/workflows
langextract_docling		langextract_docling
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.pylintrc		.pylintrc
LICENSE		LICENSE
README.md		README.md
autoformat.sh		autoformat.sh
pyproject.toml		pyproject.toml
tox.ini		tox.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

LangExtract Docling

Installation

Usage

License

About

Uh oh!

Releases 2

Packages

Languages

License

tijoseymathew/langextract-docling

Folders and files

Latest commit

History

Repository files navigation

LangExtract Docling

Installation

Usage

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Languages

Packages