Skip to content
View han-cai's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Highlights

  • Pro

Block or report han-cai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space

Python 364 11 Updated Oct 5, 2025

DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder

184 7 Updated Oct 5, 2025
Python 728 47 Updated Nov 30, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 27,035 1,964 Updated Jan 9, 2026
Python 2,498 243 Updated Jul 16, 2025

Scaling Vision Pre-Training to 4K Resolution

Python 224 10 Updated Jan 4, 2026

[ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention

Python 651 45 Updated Mar 6, 2026

[ICLR 2025] COAT: Compressing Optimizer States and Activation for Memory-Efficient FP8 Training

Python 261 25 Updated Aug 9, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 28,408 2,646 Updated Apr 4, 2026

DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding

Python 1,352 62 Updated Jul 23, 2025

A suite of image and video neural tokenizers

Jupyter Notebook 1,716 87 Updated Feb 11, 2025

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 5,044 338 Updated Mar 17, 2026

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 18,845 2,411 Updated Mar 20, 2026

Efficient Segment Anything in Medical Images

Python 42 3 Updated Jul 27, 2024

Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Python 1,094 44 Updated Jan 21, 2025

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 15,245 1,285 Updated May 23, 2024

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,976 252 Updated Apr 2, 2026

The official Meta Llama 3 GitHub site

Python 29,293 3,530 Updated Jan 26, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 56,040 9,547 Updated Nov 12, 2025

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,659 564 Updated Nov 10, 2025

[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

Python 726 34 Updated Dec 2, 2024

MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

Jupyter Notebook 8,774 556 Updated Feb 11, 2026

A neural network training interface based on PyTorch, with a focus on flexibility

Python 63 15 Updated Jan 17, 2024

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 36,591 5,142 Updated Apr 3, 2026

Effortless data labeling with AI support from Segment Anything and other awesome models.

Python 8,631 924 Updated Apr 2, 2026

ImageBind One Embedding Space to Bind Them All

Python 9,005 843 Updated Nov 21, 2025

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,193 1,103 Updated Nov 18, 2024

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 12,633 1,199 Updated Mar 12, 2026

Efficient vision foundation models for high-resolution generation and perception.

Python 3,276 238 Updated Sep 5, 2025

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 53,851 6,310 Updated Sep 18, 2024
Next