berlino

Python 1,117 115 Updated Jan 19, 2026

slime is an LLM post-training framework for RL Scaling.

Python 3,648 487 Updated Feb 3, 2026

Library for reading and processing ML training data.

Python 678 63 Updated Feb 4, 2026

FlashInfer: Kernel Library for LLM Serving

Python 4,861 689 Updated Feb 4, 2026

Fully open reproduction of DeepSeek-R1

Python 25,853 2,411 Updated Nov 24, 2025

A sparse attention kernel supporting mix sparse patterns

C++ 450 44 Updated Jan 18, 2026

Efficient Triton Kernels for LLM Training

Python 6,110 483 Updated Feb 4, 2026

Everything about the SmolLM and SmolVLM family of models

Python 3,593 271 Updated Jan 13, 2026

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,686 1,185 Updated Nov 21, 2025

A bibliography and survey of the papers surrounding o1

TeX 1,213 51 Updated Nov 16, 2024

All Algorithms implemented in Python

Python 217,463 50,035 Updated Feb 2, 2026

Python 307 17 Updated Apr 23, 2025

jax-triton contains integrations between JAX and OpenAI Triton

Python 437 54 Updated Dec 11, 2025

Python 289 21 Updated Jul 15, 2024

A PyTorch native platform for training generative AI models

Python 5,031 691 Updated Feb 3, 2026

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 15,016 12,792 Updated Feb 4, 2026

MiniCPM-o 4.5: A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone

Python 22,729 1,723 Updated Feb 4, 2026

DSPy: The framework for programming—not prompting—language models

Python 31,978 2,608 Updated Feb 3, 2026

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,423 2,724 Updated Aug 12, 2024

[CVPR'23] Video Probabilistic Diffusion Models in Projected Latent Space

Python 324 16 Updated May 14, 2024

[CVPR 2022] StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2

Python 388 39 Updated Apr 19, 2023

Let us control diffusion models!

Python 33,615 2,998 Updated Feb 25, 2024

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 3,275 201 Updated Oct 31, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 28,499 2,881 Updated Apr 30, 2025

[CSUR] A Survey on Video Diffusion Models

2,265 113 Updated Jun 27, 2025

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 36,312 5,117 Updated Jan 30, 2026

Tile primitives for speedy kernels

Cuda 3,120 234 Updated Feb 4, 2026

RWKV model implementation

Python 38 1 Updated Jul 15, 2023

Triton-based implementation of Sparse Mixture of Experts.

Python 263 26 Updated Oct 3, 2025