Stars
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
Python Implementation of Reinforcement Learning: An Introduction
CTF framework and exploit development library
OpenChat: Advancing Open-source Language Models with Imperfect Data
A modular RL library to fine-tune language models to human preferences
Community for applying LLMs to robotics and a robot simulator with ChatGPT integration
Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"
Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks
Playing Hollow Knight with reinforcement learning.
Measuring Massive Multitask Chinese Understanding
Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" (ICLR 2025)
DataSciBench: An LLM Agent Benchmark for Data Science
KodCode-AI / code-r1
Forked from ganler/code-r1
Reproducing R1 for Code with Reliable Rewards
Source code and dataset for IJCAI 2022 paper "Rethinking the Setting of Semi-supervised Learning on Graphs"
[AAAI'2025] Official PyTorch implementation of the paper "Identity-Text Video Corpus Grounding".