berlino

🏊‍♂️

Drown

Bailin berlino

Postdoc @ MIT CSAIL, writing inefficient code for parsing

208 followers · 390 following

Stars

ChenmienTan / RL2

Python 962 101 Updated Dec 21, 2025

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 2,925 353 Updated Dec 21, 2025

google / grain

Library for reading and processing ML training data.

Python 632 59 Updated Dec 18, 2025

flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda 4,317 610 Updated Dec 21, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,745 2,405 Updated Nov 24, 2025

mit-han-lab / Block-Sparse-Attention

A sparse attention kernel supporting mix sparse patterns

C++ 408 39 Updated Dec 16, 2025

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

Python 5,963 452 Updated Dec 21, 2025

huggingface / smollm

Everything about the SmolLM and SmolVLM family of models

Python 3,466 244 Updated Nov 20, 2025

deepseek-ai / DeepSeek-V3

Python 100,800 16,424 Updated Aug 28, 2025

Tencent-Hunyuan / HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,485 1,154 Updated Nov 21, 2025

srush / awesome-o1

A bibliography and survey of the papers surrounding o1

TeX 1,215 51 Updated Nov 16, 2024

TheAlgorithms / Python

All Algorithms implemented in Python

Python 214,991 49,654 Updated Dec 13, 2025

apple / ml-sigmoid-attention

Python 303 18 Updated Apr 23, 2025

jax-ml / jax-triton

jax-triton contains integrations between JAX and OpenAI Triton

Python 436 54 Updated Dec 11, 2025

google-deepmind / nanodo

Python 285 20 Updated Jul 15, 2024

pytorch / torchtitan

A PyTorch native platform for training generative AI models

Python 4,861 646 Updated Dec 21, 2025

alshedivat / al-folio

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 14,759 12,625 Updated Dec 17, 2025

OpenBMB / MiniCPM-V

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,428 1,689 Updated Sep 24, 2025

stanfordnlp / dspy

DSPy: The framework for programming—not prompting—language models

Python 30,934 2,488 Updated Dec 21, 2025

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,198 2,685 Updated Aug 12, 2024

sihyun-yu / PVDM

[CVPR'23] Video Probabilistic Diffusion Models in Projected Latent Space

Python 322 16 Updated May 14, 2024

universome / stylegan-v

[CVPR 2022] StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2

Python 387 39 Updated Apr 19, 2023

lllyasviel / ControlNet

Let us control diffusion models!

Python 33,452 2,992 Updated Feb 25, 2024

PixArt-alpha / PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 3,243 200 Updated Oct 31, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 28,137 2,815 Updated Apr 30, 2025

ChenHsing / Awesome-Video-Diffusion-Models

[CSUR] A Survey on Video Diffusion Models

2,240 111 Updated Jun 27, 2025

huggingface / pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 36,049 5,089 Updated Dec 19, 2025

HazyResearch / ThunderKittens

Tile primitives for speedy kernels

Cuda 3,008 217 Updated Dec 9, 2025

codekansas / rwkv

RWKV model implementation

Python 38 1 Updated Jul 15, 2023

shawntan / scattermoe

Triton-based implementation of Sparse Mixture of Experts.

Python 257 24 Updated Oct 3, 2025