Skip to content
View berlino's full-sized avatar
🏊‍♂️
Drown
🏊‍♂️
Drown

Block or report berlino

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 1,259 130 Updated Feb 28, 2026

slime is an LLM post-training framework for RL Scaling.

Python 5,238 711 Updated Apr 9, 2026

Library for reading and processing ML training data.

Python 711 76 Updated Apr 10, 2026

FlashInfer: Kernel Library for LLM Serving

Python 5,369 887 Updated Apr 11, 2026

Fully open reproduction of DeepSeek-R1

Python 25,977 2,411 Updated Apr 2, 2026

A sparse attention kernel supporting mix sparse patterns

C++ 496 47 Updated Jan 18, 2026

Efficient Triton Kernels for LLM Training

Python 6,272 510 Updated Apr 8, 2026

Everything about the SmolLM and SmolVLM family of models

Python 3,705 285 Updated Apr 2, 2026

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,950 1,224 Updated Nov 21, 2025

A bibliography and survey of the papers surrounding o1

TeX 1,213 51 Updated Nov 16, 2024

All Algorithms implemented in Python

Python 219,510 50,320 Updated Apr 10, 2026
Python 308 18 Updated Apr 23, 2025

jax-triton contains integrations between JAX and OpenAI Triton

Python 444 57 Updated Mar 26, 2026
Python 308 23 Updated Jul 15, 2024

A PyTorch native platform for training generative AI models

Python 5,225 781 Updated Apr 11, 2026

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 15,458 12,907 Updated Apr 8, 2026

A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone

Python 24,346 1,901 Updated Apr 1, 2026

DSPy: The framework for programming—not prompting—language models

Python 33,600 2,780 Updated Apr 10, 2026

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,670 2,756 Updated Aug 12, 2024

[CVPR'23] Video Probabilistic Diffusion Models in Projected Latent Space

Python 324 16 Updated May 14, 2024

[CVPR 2022] StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2

Python 392 39 Updated Apr 19, 2023

Let us control diffusion models!

Python 33,790 3,007 Updated Feb 25, 2024

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 3,286 201 Updated Oct 31, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 28,860 2,930 Updated Apr 9, 2026

[CSUR] A Survey on Video Diffusion Models

2,288 114 Updated Mar 14, 2026

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 36,631 5,144 Updated Apr 9, 2026

Tile primitives for speedy kernels

Cuda 3,311 274 Updated Apr 8, 2026

RWKV model implementation

Python 37 1 Updated Jul 15, 2023

Triton-based implementation of Sparse Mixture of Experts.

Python 274 28 Updated Oct 3, 2025
Next