berlino

Follow

🏊‍♂️

Drown

Bailin berlino

🏊‍♂️

Drown

Follow

Postdoc @ MIT CSAIL, working on sequence models

211 followers · 390 following

Achievements

Achievements

Stars

ChenmienTan / RL2

Python 1,117 115 Updated Jan 19, 2026

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 3,648 487 Updated Feb 3, 2026

google / grain

Library for reading and processing ML training data.

Python 678 63 Updated Feb 4, 2026

flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

Python 4,861 689 Updated Feb 4, 2026

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,853 2,411 Updated Nov 24, 2025

mit-han-lab / Block-Sparse-Attention

A sparse attention kernel supporting mix sparse patterns

C++ 450 44 Updated Jan 18, 2026

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

Python 6,110 483 Updated Feb 4, 2026

huggingface / smollm

Everything about the SmolLM and SmolVLM family of models

Python 3,593 271 Updated Jan 13, 2026

deepseek-ai / DeepSeek-V3

Python 101,432 16,506 Updated Aug 28, 2025

Tencent-Hunyuan / HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,686 1,185 Updated Nov 21, 2025

srush / awesome-o1

A bibliography and survey of the papers surrounding o1

TeX 1,213 51 Updated Nov 16, 2024

TheAlgorithms / Python

All Algorithms implemented in Python

Python 217,463 50,035 Updated Feb 2, 2026

apple / ml-sigmoid-attention

Python 307 17 Updated Apr 23, 2025

jax-ml / jax-triton

jax-triton contains integrations between JAX and OpenAI Triton

Python 437 54 Updated Dec 11, 2025

google-deepmind / nanodo

Python 289 21 Updated Jul 15, 2024

pytorch / torchtitan

A PyTorch native platform for training generative AI models

Python 5,031 691 Updated Feb 3, 2026

alshedivat / al-folio

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 15,016 12,792 Updated Feb 4, 2026

OpenBMB / MiniCPM-o

MiniCPM-o 4.5: A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Mulitmodal Live Streaminig on Your Phone

Python 22,729 1,723 Updated Feb 4, 2026

stanfordnlp / dspy

DSPy: The framework for programming—not prompting—language models

Python 31,978 2,608 Updated Feb 3, 2026

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,423 2,724 Updated Aug 12, 2024

sihyun-yu / PVDM

[CVPR'23] Video Probabilistic Diffusion Models in Projected Latent Space

Python 324 16 Updated May 14, 2024

universome / stylegan-v

[CVPR 2022] StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2

Python 388 39 Updated Apr 19, 2023

lllyasviel / ControlNet

Let us control diffusion models!

Python 33,615 2,998 Updated Feb 25, 2024

PixArt-alpha / PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 3,275 201 Updated Oct 31, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 28,499 2,881 Updated Apr 30, 2025

ChenHsing / Awesome-Video-Diffusion-Models

[CSUR] A Survey on Video Diffusion Models

2,265 113 Updated Jun 27, 2025

huggingface / pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 36,312 5,117 Updated Jan 30, 2026

HazyResearch / ThunderKittens

Tile primitives for speedy kernels

Cuda 3,120 234 Updated Feb 4, 2026

codekansas / rwkv

RWKV model implementation

Python 38 1 Updated Jul 15, 2023

shawntan / scattermoe

Triton-based implementation of Sparse Mixture of Experts.

Python 263 26 Updated Oct 3, 2025