DongJT1996

109 followers · 26 following

jtdong.com

Lists (1)

🚀 My stack

1 repository

Stars

Master-cai / Research-Paper-Writing-Skills

Skill package for ML/CV/NLP paper writing, curated and adapted from Prof. Peng Sida's open notes for Codex, Claude Code, and Gemini.

934 38 Updated Mar 5, 2026

leofan90 / Awesome-World-Models

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…

1,410 44 Updated Mar 27, 2026

anthropics / skills

Public repository for Agent Skills

Python 105,534 11,669 Updated Mar 25, 2026

zju3dv / InfiniDepth

[CVPR 2026] InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields

Python 855 27 Updated Mar 25, 2026

litexlang / golitex

Litex is a simple formal language learnable in 2 hours.

Go 657 7 Updated Mar 21, 2026

jqtangust / Robust-R1

🔥🔥🔥[AAAI 2026 Oral] Official Implementation of Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding

Python 516 18 Updated Jan 20, 2026

facebookresearch / sam-3d-objects

SAM 3D Objects

Python 6,326 735 Updated Mar 12, 2026

ByteDance-Seed / VeOmni

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,774 167 Updated Mar 29, 2026

baaivision / Emu3.5

Native Multimodal Models are World Learners

Python 1,486 61 Updated Dec 30, 2025

InternRobotics / InternScenes

[NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.

Python 234 7 Updated Oct 17, 2025

yisuanwang / DanceTog

DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation

40 Updated Aug 3, 2025

jonyzhang2023 / awesome-embodied-vla-va-vln

A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.

2,825 123 Updated Mar 20, 2026

cyjdlhy / TeleOpBench

Python 48 2 Updated Aug 18, 2025

Gorgeous2002 / TeleOpBench

JavaScript 8 Updated May 20, 2025

yyvhang / SIGMAN_release

Official PyTorch implementation of SIGMAN

Python 67 3 Updated Feb 3, 2026

huanngzh / EpiDiff

[CVPR 2024] EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion

Python 141 10 Updated Aug 30, 2024

DongJT1996 / TELA

46 1 Updated Apr 24, 2024

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 29,299 3,529 Updated Jan 26, 2025

FoundationVision / VAR

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,650 563 Updated Nov 10, 2025

zju3dv / SAM-Graph

Code for "SAM-guided Graph Cut for 3D Instance Segmentation" ECCV 2024

Python 127 4 Updated Dec 31, 2024

HumanAIGC / AnimateAnyone

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

14,796 1,008 Updated Sep 20, 2025

snuvclab / chupa

[ICCV 2023, Oral] Chupa: Carving 3D Clothed Humans from Skinned Shape Priors using 2D Diffusion Probabilistic Models

Python 96 8 Updated Jan 4, 2024

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 33,196 6,879 Updated Mar 29, 2026

rockbenben / ChatGPT-Shortcut

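🚀💪Maximize your efficiency and productivity. The ultimate hub to manage, customize, and share prompts. (English/中文/Español/العربية). AI shortcuts that double your productivity. Manage prompts more efficiently and discover inspiration for different scenarios in the sharing community.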

TypeScript 8,282 923 Updated Mar 26, 2026

SeanChenxy / Hand3DResearch

Python 601 47 Updated Mar 26, 2026

facebookresearch / KeypointNeRF

KeypointNeRF: Generalizing Image-based Volumetric Avatars using Relative Spatial Encoding of Keypoints

Python 375 27 Updated May 2, 2023

apple / ml-neuman

Official repository of NeuMan: Neural Human Radiance Field from a Single Video (ECCV 2022)

Python 1,286 149 Updated May 23, 2023

facebookresearch / multiface

Hosts the Multiface dataset, which is a multi-view dataset of multiple identities performing a sequence of facial expressions.

Python 765 50 Updated Aug 1, 2023

PeterL1n / RobustVideoMatting

Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!

Python 9,280 1,192 Updated Apr 2, 2024

lizhe00 / AvatarCap

Code of [ECCV 2022] "AvatarCap: Animatable Avatar Conditioned Monocular Human Volumetric Capture"

Python 182 22 Updated Nov 11, 2022