Stars
Triton based sparse quantization attention kernel collection
[ICRA 2026] RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation
MedEvalKit: A Unified Medical Evaluation Framework
[ACL 2025] FineReason: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving
The official implementation of Natural Language Fine-Tuning
Official code for paper: Chain of Ideas: Revolutionizing Research via Novel Idea Development with LLM Agents
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
OCR, layout analysis, reading order, table recognition in 90+ languages
WebDesignAgent : Towards Effortless Website Creation
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents
[ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models
Source code of "Reasons to Reject? Aligning Language Models with Judgments"
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"
Source code of "PeerDA: Data Augmentation via Modeling Peer Relation for Span Identification Tasks" (ACL23)
[NeurIPS 2023] Pre-training Machine-Reader (Instead of Masked Language Model) at Scale
A tool for extracting plain text from Wikipedia dumps
code for "Exploiting Reasoning Chains for Multi-hop Science Question Answering"
Acceptance rates for the major AI conferences
Pytorch Implementation of "Contrastive Representation Learning for Exemplar-Guided Paraphrase Generation"