Stars
My learning notes for ML SYS.
A kernel library written in tilelang
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
Single-document unsupervised keyword extraction
Official implementation of Gumbel Distillation for Parallel Text Generation
[ICML 2026][Ultra Powerful Few-Step Diffusion RL] TDM-R1: Reinforcing Few-Step Diffusion Models with Non-Differentiable Reward
run openclaw in the multi-agent setup
Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepowe…
deepreinforce-ai / original_performance_takehome
Forked from anthropics/original_performance_takehomeAnthropic's original performance take-home, now open for you to try!
Optimizing diffusion for production-ready speeds
A community database for the problems on the erdosproblems.com site
[ICML 2026] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
An interface library for RL post training with environments.
Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research
List of papers about Proteins Design using Deep Learning
A complete computer science study plan to become a software engineer.
A repository with academic works about Kernel generation with LLM Agent
CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning
Official codebase for the paper: Pessimistic Verification for Open Ended Math Questions
Physics of Language Models: Part 4.2, Canon Layers at Scale where Synthetic Pretraining Resonates in Reality