Stars
A framework for the evaluation of autoregressive code generation language models.
SGLang is a fast serving framework for large language models and vision language models.
A project to improve the skills of large language models
Block Diffusion for Ultra-Fast Speculative Decoding
ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows.
Utilities for decoding deep representations (like sentence embeddings) back to text
Code for NeurIPS 2024 paper - The GAN is dead; long live the GAN! A Modern Baseline GAN - by Huang et al.
Official JAX implementation of End-to-End Test-Time Training for Long Context
[ICLR 2026] Official PyTorch Implementation of RLP: Reinforcement as a Pretraining Objective
dInfer: An Efficient Inference Framework for Diffusion Language Models
GPU-optimized framework for training diffusion language models at any scale. The backend of Quokka, Super Data Learners, and OpenMoE 2 training.
CANDI: Continuous and Discrete Diffusion
🧀 PyTorch code for the Fromage optimiser.
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
Development repository for "ShogiHome", a feature-rich shogi GUI that runs on PC
[ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
MTEB: Massive Text Embedding Benchmark
LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)
The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".
Collection of Summer 2026 tech internships!
PyTorch implementation of Variational Diffusion Models.
Awesome Reasoning LLM Tutorial/Survey/Guide
Discrete Flow Matching implemented in PyTorch
DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation
Remasking Discrete Diffusion Models with Inference-Time Scaling
Minimal implementation of a D3PM in PyTorch