Skip to content
View zhaoyue-zephyrus's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Highlights

  • Pro

Block or report zhaoyue-zephyrus

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 5 Updated Dec 17, 2025

"E-RayZer: Self-supervised 3D Reconstruction as Spatial Visual Pre-training" official implementation.

Python 169 4 Updated Dec 19, 2025

Curate, Annotate, and Manage Your Data in LightlyStudio.

Python 678 15 Updated Dec 19, 2025

PyTorch media decoding and encoding

Python 878 80 Updated Dec 19, 2025

Matplotlib styles for scientific plotting

Python 8,453 781 Updated Nov 20, 2025
Python 123 5 Updated Aug 10, 2025

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 97,403 11,036 Updated Dec 19, 2025

code based for rectified flow

Python 260 18 Updated Nov 26, 2025

A python library for self-supervised learning on images.

Python 3,648 316 Updated Dec 19, 2025

Official repository for "VideoPrism: A Foundational Visual Encoder for Video Understanding" (ICML 2024)

Python 333 28 Updated Oct 2, 2025

Interactive Post-Training for Vision-Language-Action Models

Python 156 7 Updated Jun 4, 2025

official training and inference code of bitwise tokenizer

Python 58 2 Updated May 18, 2025

Normalized Transformer (nGPT)

Python 193 22 Updated Nov 19, 2024

Benchmarking Knowledge Transfer in Lifelong Robot Learning

Jupyter Notebook 1,294 256 Updated Mar 15, 2025

Code for ICCV'2025 (Best student paper honorable mention) "RayZer: A Self-supervised Large View Synthesis Model"

Python 370 11 Updated Nov 24, 2025

MAGI-1: Autoregressive Video Generation at Scale

Python 3,605 227 Updated Jun 17, 2025

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 1,916 125 Updated Dec 18, 2025

Official PyTorch implementation of One-Minute Video Generation with Test-Time Training

Python 2,319 192 Updated Jun 5, 2025

Mastering Diverse Domains through World Models

Python 2,549 427 Updated Sep 23, 2025

NVIDIA Isaac GR00T N1.6 - A Foundation Model for Generalist Robots.

Jupyter Notebook 5,644 887 Updated Dec 18, 2025

This package contains the original 2012 AlexNet code.

Cuda 2,791 360 Updated Mar 12, 2025

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 1,111 63 Updated Aug 7, 2025

JPEG XL image format reference implementation

C++ 3,248 321 Updated Dec 19, 2025

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,923 918 Updated Dec 15, 2025

[arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation

Jupyter Notebook 94 3 Updated Mar 1, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,039 4,669 Updated Dec 19, 2025

An official code release of the paper RGB no more: Minimally Decoded JPEG Vision Transformers

Shell 56 4 Updated Jul 11, 2023

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 1,085 59 Updated Mar 20, 2025
Next