- Shanghai
- https://thudzj.github.io/
Stars
Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference
Doodling our way to AGI ✏️ 🖼️ 🧠
Official PyTorch implementation for "Large Language Diffusion Models"
SIFT: Grounding LLM Reasoning in Contexts via Stickers
Sky-T1: Train your own O1 preview model within $450
📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.
A unified inference and post-training framework for accelerated video generation.
Official repo for 【FaceScore: Benchmarking and Enhancing Face Quality in Human Generation】
Code for "Deep Ensemble as a Gaussian Process Posterior"
Open-Sora: Democratizing Efficient Video Production for All
Code for orthogonal neural operator
Official implementation for "LOVECon: Text-driven Training-free Long Video Editing with ControlNet"
[ICML 2024] CLLMs: Consistency Large Language Models
Implementation of soft parameter sharing for neural networks
Resource-adaptive cluster scheduler for deep learning training.
Code for ''Understanding and Exploring the Network with Stochastic Architectures''
Adversarial Distributional Training (NeurIPS 2020)
Code for "BayesAdapter: Being Bayesian, Inexpensively and Robustly, via Bayeisan Fine-tuning"