Skip to content
View Yuxin-CV's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@hustvl

Block or report Yuxin-CV

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"

Jupyter Notebook 612 55 Updated Oct 7, 2025

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 811 52 Updated May 17, 2026

MAGI-1: Autoregressive Video Generation at Scale

Python 3,690 237 Updated Jun 17, 2025

Drawing Bayesian networks, graphical models, tensors, technical frameworks, and illustrations in LaTeX.

TeX 2,014 188 Updated May 26, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,760 270 Updated Jul 18, 2025

VIT inference in triton because, why not?

Python 36 3 Updated May 31, 2024

Since the emergence of chatGPT in 2022, the acceleration of Large Language Model has become increasingly important. Here is a list of papers on accelerating LLMs, currently focusing mainly on infer…

283 15 Updated Mar 6, 2025

Machine Learning Engineering Open Book

Python 17,938 1,141 Updated Mar 16, 2026

UNet diffusion model in pure CUDA

Cuda 657 33 Updated Jun 28, 2024

Ring attention implementation with flash attention

Python 1,020 98 Updated Sep 10, 2025

A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.

Python 599 32 Updated May 13, 2026

LLM training in simple, raw C/CUDA

Cuda 29,923 3,594 Updated Jun 26, 2025

LL3M: Large Language and Multi-Modal Model in Jax

Python 74 4 Updated Apr 23, 2024

Elucidating the Design Space of Diffusion-Based Generative Models (EDM)

Python 1,959 199 Updated Mar 16, 2024

Karras et al. (2022) diffusion models for PyTorch

Python 2,588 401 Updated Feb 12, 2026

A curated list of recent diffusion models for video generation, editing, and various other applications.

5,643 358 Updated May 8, 2026

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 10,486 1,047 Updated Jul 1, 2024

Building blocks for foundation models.

620 27 Updated Jan 3, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 99,637 12,199 Updated Apr 15, 2026

Easy generative modeling in PyTorch

Python 436 70 Updated Sep 11, 2023

Simple, safe way to store and distribute tensors

Python 3,741 318 Updated May 15, 2026

Annotated version of the Mamba paper

Jupyter Notebook 501 20 Updated Feb 27, 2024

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Python 2,946 224 Updated Mar 8, 2024

A 2D Gaussian Splatting paper for no obvious reasons. Enjoy!

Jupyter Notebook 452 23 Updated Mar 3, 2025

OneDiff: An out-of-the-box acceleration library for diffusion models.

Jupyter Notebook 1,968 129 Updated Dec 4, 2025

Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023

Jupyter Notebook 3,151 679 Updated Mar 30, 2026

[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

Python 1,173 76 Updated Oct 21, 2024

[ECCV 2024] Tokenize Anything via Prompting

Jupyter Notebook 602 27 Updated Dec 11, 2024

Examples for MS-AMP package.

Shell 30 13 Updated Jul 17, 2025
Python 39 Updated Mar 5, 2026
Next