Skip to content
View bytetriper's full-sized avatar

Highlights

  • Pro

Block or report bytetriper

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Predicting the generation FID of latent diffusion, with a variant of reconstruction FID of Variational Auto-encoder.

Python 57 Updated Apr 8, 2026

[CVPR2026 Highlight] Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens https://arxiv.org/abs/2603.19232

Python 53 Updated Apr 10, 2026

The first multiplayer video world model in Minecraft

Python 185 9 Updated Mar 3, 2026

Official repo for UAE

Python 192 7 Updated Apr 1, 2026

CVPR 2026 (Highlight)-Guiding a Diffusion Transformer with the Internal Dynamics of Itself (IG)

Python 66 2 Updated Apr 9, 2026
Jupyter Notebook 132 5 Updated Nov 8, 2025

Codebase for evaluation of deep generative models as presented in Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models

Jupyter Notebook 206 18 Updated Mar 3, 2025

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,849 76 Updated Feb 25, 2026

A comprehensive JAX/NNX library for diffusion and flow matching generative algorithms, featuring DiT (Diffusion Transformer) and its variants as the primary backbone with support for ImageNet train…

Python 145 12 Updated Oct 16, 2025

健康学习到150岁 - 人体系统调优不完全指南

21,661 1,509 Updated Sep 10, 2025

Code for "Scaling Language-Free Visual Representation Learning" paper (Web-SSL).

Python 203 12 Updated Mar 20, 2026

Code for MetaMorph Multimodal Understanding and Generation via Instruction Tuning

Python 236 9 Updated Jan 22, 2026

[CVPR 2026] DDT: Decoupled Diffusion Transformer

Python 383 20 Updated Aug 22, 2025

[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Python 1,441 57 Updated Dec 16, 2025

Karras et al. (2022) diffusion models for PyTorch

Python 2,580 399 Updated Feb 12, 2026

[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 1,601 85 Updated Mar 16, 2025

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 1,892 120 Updated Feb 20, 2026

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,992 137 Updated Nov 7, 2025
Jupyter Notebook 116 7 Updated Oct 7, 2025

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 6,791 392 Updated Mar 27, 2026

This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success of deep learning attributes to both network architecture and …

295 30 Updated Apr 10, 2024

Meaningful titles for tabs and PDF downloads! Also supports tab search.

JavaScript 337 26 Updated Nov 29, 2025

Taming Transformers for High-Resolution Image Synthesis

Jupyter Notebook 6,472 1,225 Updated Jul 30, 2024

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Python 17,880 3,703 Updated Nov 18, 2025

Collection of advice for prospective and current PhD students

2,054 150 Updated Jul 10, 2024

An open-source tool-augmented conversational language model from Fudan University

Python 12,085 1,133 Updated Jul 13, 2024

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

17,073 1,555 Updated Feb 13, 2023

Covert ANTLR4 book source code to Python3 version.

Python 430 94 Updated Dec 23, 2022
Next