🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 160,700 33,240 Updated May 18, 2026

lynl7130 / MoDoMoDo

Official implementation of "MoDoMoDo: Multi-Domain Data Mixtures for Multimodal LLM Reinforcement Learning"

Python 10 2 Updated Aug 23, 2025

allenai / olmix

Python 40 5 Updated Mar 26, 2026

mihail911 / modern-software-dev-assignments

Assignments for CS146S: The Modern Software Dev (Stanford University Fall 2025)

Python 3,628 877 Updated Nov 10, 2025

QwenLM / Qwen3.6

Qwen3.6 is the large language model series developed by Qwen team, Alibaba Group.

3,391 220 Updated May 11, 2026

Orchestra-Research / AI-Research-SKILLs

Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepowe…

TeX 8,561 654 Updated Apr 28, 2026

GAIR-NLP / OctoThinker

Revisiting Mid-training in the Era of Reinforcement Learning Scaling

Jupyter Notebook 188 14 Updated Jul 23, 2025

RUCBM / G-OPD

Official repository for the paper "Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation"

Python 164 11 Updated Mar 18, 2026

lasgroup / SDPO

Reinforcement Learning via Self-Distillation (SDPO)

Python 876 95 Updated Feb 18, 2026

Emilianopp / Privileged-Information-Distillation-and-Self-Distillation

12 Updated Feb 24, 2026

btzyd / Awesome-Linear-Attention-Survey

The Github repo for our survey paper: A Survey of Linear Attention: Algorithm, Theory, Application, and Infrastructure

10 Updated Feb 6, 2026

f / prompts.chat

f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.

HTML 162,431 21,145 Updated May 17, 2026

dreammis / social-auto-upload

自动化上传视频到社交媒体：抖音、小红书、视频号、tiktok、youtube、bilibili

Python 11,125 1,990 Updated May 17, 2026

neilrathi / token-filtering

Shaping capabilities with token-level pretraining data filtering

Python 93 7 Updated Jan 28, 2026

Shaobo (Steven) Wang gszfwsb

Highlights

Lists (11)

attribution

benchmarks

DD

diffusion

foundation model

🔮 Future ideas

ICL

✨ Inspiration

🚀 My stack

optimzation

pretrain

Starred repositories

tiny-imagenet

Ubuntu

Jekyll

Machine learning

Python

Jupyter Notebook

Algorithm