🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 159,374 32,869 Updated Apr 14, 2026

lynl7130 / MoDoMoDo

Official implementation of "MoDoMoDo: Multi-Domain Data Mixtures for Multimodal LLM Reinforcement Learning"

Python 10 2 Updated Aug 23, 2025

allenai / olmix

Python 37 3 Updated Mar 26, 2026

mihail911 / modern-software-dev-assignments

Assignments for CS146S: The Modern Software Dev (Stanford University Fall 2025)

Python 3,454 824 Updated Nov 10, 2025

QwenLM / Qwen3.5

Qwen3.5 is the large language model series developed by Qwen team, Alibaba Cloud.

2,585 148 Updated Mar 2, 2026

Orchestra-Research / AI-Research-SKILLs

Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepowe…

TeX 6,837 533 Updated Apr 13, 2026

GAIR-NLP / OctoThinker

Revisiting Mid-training in the Era of Reinforcement Learning Scaling

Jupyter Notebook 187 14 Updated Jul 23, 2025

RUCBM / G-OPD

Official repository for the paper "Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation"

Python 92 8 Updated Mar 18, 2026

lasgroup / SDPO

Reinforcement Learning via Self-Distillation (SDPO)

Python 770 81 Updated Feb 18, 2026

Emilianopp / Privileged-Information-Distillation-and-Self-Distillation

13 Updated Feb 24, 2026

btzyd / Awesome-Linear-Attention-Survey

The Github repo for our survey paper: A Survey of Linear Attention: Algorithm, Theory, Application, and Infrastructure

8 Updated Feb 6, 2026

f / prompts.chat

f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.

HTML 159,706 20,914 Updated Apr 14, 2026

dreammis / social-auto-upload

Automated video uploads to social media: Douyin, Xiaohongshu, WeChat Channels, TikTok, YouTube, Bilibili

Python 9,959 1,800 Updated Apr 13, 2026

neilrathi / token-filtering

Shaping capabilities with token-level pretraining data filtering

Python 93 6 Updated Jan 28, 2026

pengr / DataMan

Our code for ICLR'25 paper "DataMan: Data Manager for Pre-training Large Language Models".

Python 122 2 Updated Feb 7, 2026

Mid-Training / RMT

Python 13 Updated Sep 30, 2025

Shaobo (Steven) Wang gszfwsb

Highlights

Lists (11)

attribution

benchmarks

DD

diffusion

foundation model

🔮 Future ideas

ICL

✨ Inspiration

🚀 My stack

optimization

pretrain

Starred repositories
