Skip to content
View han-cai's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Highlights

  • Pro

Block or report han-cai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space

Python 313 9 Updated Oct 5, 2025

DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder

171 7 Updated Oct 5, 2025
Python 716 47 Updated Nov 30, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,823 1,812 Updated Oct 13, 2025
Python 2,483 239 Updated Jul 16, 2025

Scaling Vision Pre-Training to 4K Resolution

Python 217 10 Updated Aug 28, 2025

[ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention

Python 599 31 Updated Dec 9, 2025

[ICLR 2025] COAT: Compressing Optimizer States and Activation for Memory-Efficient FP8 Training

Python 255 23 Updated Aug 9, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 27,817 2,570 Updated Dec 19, 2025

DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding

Python 1,309 59 Updated Jul 23, 2025

A suite of image and video neural tokenizers

Jupyter Notebook 1,692 85 Updated Feb 11, 2025

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 4,832 322 Updated Dec 20, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 18,075 2,285 Updated Dec 25, 2024

Efficient Segment Anything in Medical Images

Python 42 4 Updated Jul 27, 2024

Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Python 1,070 43 Updated Jan 21, 2025

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 15,203 1,289 Updated May 23, 2024

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,782 235 Updated Dec 19, 2025

The official Meta Llama 3 GitHub site

Python 29,143 3,501 Updated Jan 26, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 51,257 8,584 Updated Nov 12, 2025

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,562 551 Updated Nov 10, 2025

[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

Python 714 31 Updated Dec 2, 2024

MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

Jupyter Notebook 8,472 526 Updated Oct 8, 2025

A neural network training interface based on PyTorch, with a focus on flexibility

Python 63 14 Updated Jan 17, 2024

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 36,046 5,089 Updated Dec 19, 2025

Effortless data labeling with AI support from Segment Anything and other awesome models.

Python 7,494 836 Updated Dec 16, 2025

ImageBind One Embedding Space to Bind Them All

Python 8,907 835 Updated Nov 21, 2025

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,073 1,088 Updated Nov 18, 2024

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 12,091 1,144 Updated Dec 17, 2025

Efficient vision foundation models for high-resolution generation and perception.

Python 3,180 230 Updated Sep 5, 2025

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 52,963 6,181 Updated Sep 18, 2024
Next