Skip to content
View DongJT1996's full-sized avatar

Organizations

@zju3dv

Block or report DongJT1996

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Skill package for ML/CV/NLP paper writing, curated and adapted from Prof. Peng Sida's open notes for Codex, Claude Code, and Gemini.

934 38 Updated Mar 5, 2026

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…

1,410 44 Updated Mar 27, 2026

Public repository for Agent Skills

Python 105,534 11,669 Updated Mar 25, 2026

[CVPR 2026] InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields

Python 855 27 Updated Mar 25, 2026

Litex is a simple formal language Learnable in 2 hours.

Go 657 7 Updated Mar 21, 2026

🔥🔥🔥[AAAI 2026 Oral] Official Implementation of Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding

Python 516 18 Updated Jan 20, 2026

SAM 3D Objects

Python 6,326 735 Updated Mar 12, 2026

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,774 167 Updated Mar 29, 2026

Native Multimodal Models are World Learners

Python 1,486 61 Updated Dec 30, 2025

[NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.

Python 234 7 Updated Oct 17, 2025

DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation

40 Updated Aug 3, 2025

A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.

2,825 123 Updated Mar 20, 2026
Python 48 2 Updated Aug 18, 2025
JavaScript 8 Updated May 20, 2025

Official PyTorch implementation of SIGMAN

Python 67 3 Updated Feb 3, 2026

[CVPR 2024] EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion

Python 141 10 Updated Aug 30, 2024
46 1 Updated Apr 24, 2024

The official Meta Llama 3 GitHub site

Python 29,299 3,529 Updated Jan 26, 2025

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,650 563 Updated Nov 10, 2025

Code for "SAM-guided Graph Cut for 3D Instance Segmentation" ECCV 2024

Python 127 4 Updated Dec 31, 2024

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

14,796 1,008 Updated Sep 20, 2025

[ICCV 2023, Oral] Chupa: Carving 3D Clothed Humans from Skinned Shape Priors using 2D Diffusion Probabilistic Models

Python 96 8 Updated Jan 4, 2024

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 33,196 6,879 Updated Mar 29, 2026

🚀💪Maximize your efficiency and productivity. The ultimate hub to manage, customize, and share prompts. (English/中文/Español/العربية). 让生产力加倍的 AI 快捷指令。更高效地管理提示词,在分享社区中发现适用于不同场景的灵感。

TypeScript 8,282 923 Updated Mar 26, 2026
Python 601 47 Updated Mar 26, 2026

KeypointNeRF Generalizing Image-based Volumetric Avatars using Relative Spatial Encoding of Keypoints

Python 375 27 Updated May 2, 2023

Official repository of NeuMan: Neural Human Radiance Field from a Single Video (ECCV 2022)

Python 1,286 149 Updated May 23, 2023

Hosts the Multiface dataset, which is a multi-view dataset of multiple identities performing a sequence of facial expressions.

Python 765 50 Updated Aug 1, 2023

Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!

Python 9,280 1,192 Updated Apr 2, 2024

Code of [ECCV 2022] "AvatarCap: Animatable Avatar Conditioned Monocular Human Volumetric Capture"

Python 182 22 Updated Nov 11, 2022
Next