Starred repositories
A reinforcement learning agent built from scratch in C++, trained on the cart-pole environment.
Dermatology DDx dataset, JAX implementations of Monte Carlo conformal prediction, plausibility regions and statistical annotation aggregation from our recent work on uncertain ground truth (TMLR'23…
Synthetic Dialog Generation and Analysis with LLMs
Official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)
Share evaluation outcomes and implementation tips for using open safety models in Trust & Safety workflows
Scalable toolkit for efficient model reinforcement
OpenCE (Open Context Engineering): A community toolkit to implement, evaluate, and combine LLM context strategies (RAG, ACE, Compression). Evolved from the `ACE-open` reproduction.
An interface library for RL post-training with environments.
Open-source repository for Semantic Modeling Language (SML)
Simplifying reinforcement learning for complex game environments
An RL env with procedurally generated symbolic reasoning data
A high-throughput and memory-efficient inference and serving engine for LLMs
A framework for few-shot evaluation of language models.
Official code release for "SuperBPE: Space Travel for Language Models"
A scalable asynchronous reinforcement learning implementation with in-flight weight updates.
An open-source Text2SQL tool that transforms natural language into SQL using graph-powered schema understanding. Ask your database questions in plain English; QueryWeaver handles the weaving.
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
Multi-Agent System: Data Analysis → Optimization → Business Insights
Search-R1: An Efficient, Scalable RL Training Framework for LLMs that Interleave Reasoning and Search Engine Calling, based on veRL
Scalable RL solution for advanced reasoning of language models
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step