DSaurus
  • Tsinghua University
  • Beijing, China


HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,918 1,220 Updated Nov 21, 2025

Animation engine for explanatory math videos

Python 85,738 7,197 Updated Mar 26, 2026

Efficient vision foundation models for high-resolution generation and perception.

Python 3,276 238 Updated Sep 5, 2025

The best OSS video generation models, created by Genmo

Python 3,633 478 Updated Nov 14, 2025

This repository is the official implementation of Human4DiT: 360-degree Human Video Generation with 4D Diffusion Transformer.

Python 108 5 Updated Nov 18, 2025

Official implementation of MagicClay: Sculpting Meshes with Generative Neural Fields (SIGGRAPH Asia 2024)

Python 59 4 Updated Oct 24, 2024

The MongoDB Database

C++ 28,209 5,765 Updated Apr 3, 2026

Code repository for the paper "Tracking People by Predicting 3D Appearance, Location & Pose". (CVPR 2022 Oral)

Python 340 63 Updated Feb 7, 2026

Code of "NIKI: Neural Inverse Kinematics with Invertible Neural Networks for 3D Human Pose and Shape Estimation", CVPR 2023

Python 277 20 Updated Jul 17, 2023

[ICCV 2021, Oral] PyMAF: 3D Human Pose and Shape Regression with Pyramidal Mesh Alignment Feedback Loop

Python 652 68 Updated Sep 29, 2024

[CVPR 2023] Official implementation of the paper "One-Stage 3D Whole-Body Mesh Recovery with Component Aware Transformer"

Python 781 65 Updated Aug 26, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 18,843 2,410 Updated Mar 20, 2026

A data generation pipeline for creating semi-realistic synthetic multi-object videos with rich annotations such as instance segmentation masks, depth maps, and optical flow.

Jupyter Notebook 2,687 271 Updated May 6, 2025

Making large AI models cheaper, faster and more accessible

Python 41,371 4,521 Updated Mar 30, 2026

ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual-Inertial and Multi-Map SLAM

C++ 8,455 3,033 Updated Jul 24, 2024

Real-Time SLAM for Monocular, Stereo and RGB-D Cameras, with Loop Detection and Relocalization Capabilities

C++ 10,137 4,755 Updated May 15, 2024

Official inference repo for FLUX.1 models

Python 25,377 1,872 Updated Jul 31, 2025

NeRF visualization library under construction

HTML 284 17 Updated Jun 28, 2023

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 9,954 1,016 Updated Aug 12, 2024

YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]

Python 11,279 1,181 Updated Mar 14, 2025

Bilibili Downloader. A command-line Bilibili downloader.

C# 13,636 1,587 Updated Jan 10, 2026

[ECCV 2024 - Oral] ACE0 is a learning-based structure-from-motion approach that estimates camera parameters of sets of images by learning a multi-view consistent, implicit scene representation.

Python 804 54 Updated Nov 10, 2025

Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer

Python 444 25 Updated Jul 5, 2024

[NeurIPS 2024] Boosting the performance of consistency models with PCM!

Python 514 19 Updated Dec 11, 2024

Official repo for consistency models.

Python 6,475 432 Updated Mar 22, 2024

[ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation

Python 417 18 Updated May 30, 2025

[NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models

Jupyter Notebook 286 11 Updated Dec 4, 2024

[CVPR 2023] The official implementation of CVPR 2023 paper "Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes"

275 4 Updated Oct 3, 2023

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 13,955 1,720 Updated Feb 29, 2024

Taming Transformers for High-Resolution Image Synthesis

Jupyter Notebook 6,462 1,227 Updated Jul 30, 2024