Python 962 101 Updated Dec 21, 2025

slime is an LLM post-training framework for RL Scaling.

Python 2,925 353 Updated Dec 21, 2025

Library for reading and processing ML training data.

Python 632 59 Updated Dec 18, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda 4,317 610 Updated Dec 21, 2025

Fully open reproduction of DeepSeek-R1

Python 25,745 2,405 Updated Nov 24, 2025

A sparse attention kernel supporting mixed sparse patterns

C++ 408 39 Updated Dec 16, 2025

Efficient Triton Kernels for LLM Training

Python 5,963 452 Updated Dec 21, 2025

Everything about the SmolLM and SmolVLM family of models

Python 3,466 244 Updated Nov 20, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,485 1,154 Updated Nov 21, 2025

A bibliography and survey of the papers surrounding o1

TeX 1,215 51 Updated Nov 16, 2024

All Algorithms implemented in Python

Python 214,991 49,654 Updated Dec 13, 2025
Python 303 18 Updated Apr 23, 2025

jax-triton contains integrations between JAX and OpenAI Triton

Python 436 54 Updated Dec 11, 2025
Python 285 20 Updated Jul 15, 2024

A PyTorch native platform for training generative AI models

Python 4,861 646 Updated Dec 21, 2025

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 14,759 12,625 Updated Dec 17, 2025

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,428 1,689 Updated Sep 24, 2025

DSPy: The framework for programming—not prompting—language models

Python 30,934 2,488 Updated Dec 21, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,198 2,685 Updated Aug 12, 2024

[CVPR'23] Video Probabilistic Diffusion Models in Projected Latent Space

Python 322 16 Updated May 14, 2024

[CVPR 2022] StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2

Python 387 39 Updated Apr 19, 2023

Let us control diffusion models!

Python 33,452 2,992 Updated Feb 25, 2024

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 3,243 200 Updated Oct 31, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 28,137 2,815 Updated Apr 30, 2025

[CSUR] A Survey on Video Diffusion Models

2,240 111 Updated Jun 27, 2025

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 36,049 5,089 Updated Dec 19, 2025

Tile primitives for speedy kernels

Cuda 3,008 217 Updated Dec 9, 2025

RWKV model implementation

Python 38 1 Updated Jul 15, 2023

Triton-based implementation of Sparse Mixture of Experts.

Python 257 24 Updated Oct 3, 2025