Stars
ProxyExplainer for Graph Neural Networks
[CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation
TexTeller can convert image to latex formulas (image2latex, latex OCR) with higher accuracy and exhibits superior generalization ability, enabling it to cover most usage scenarios.
DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis
Toolkit for linearizing PDFs for LLM datasets/training
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
A lightweight LMM-based Document Parsing Model
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Multilingual Document Layout Parsing in a Single Vision-Language Model
Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website …
Python tool for converting files and office documents to Markdown.
Awesome Deep Research list! For more details, please refer to our survey paper -- A Comprehensive Survey of Deep Research: Systems, Methodologies, and Applications
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
🦛 CHONK docs with Chonkie ✨ — The no-nonsense RAG library
[ACL 2025] Towards Text-Image Interleaved Retrieval
DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding
UltraRAG 2.0: Less Code, Lower Barrier, Faster Deployment! MCP-based low-code RAG framework, enabling researchers to build complex pipelines to creative innovation.
A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).
[ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
⏰ Collaboratively track worldwide conference deadlines (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~
The hub for EleutherAI's work on interpretability and learning dynamics