- Max Planck Institute for Intelligent Systems
- Germany
- https://saidwivedi.in
- @saidwivedi
- in/saidwivedi
- @saidwivedi.in
Lists (32)
2D / 3D Keypoints
3D-Avatar-NonParam
3D from Image/Video
3D from Text
3D + Language
Architecture
Curation List
Datasets
Depth Estimation
Digital Human <-> Robotics
Hand Mesh Recovery
HSI-Generation
Human Body Mesh
Human Motion
Human-Object-Interaction
Human-Object-Reconstruction-3D
Human Parsing
Human-Scene-Interaction
Image Generation
Inpainting / EditAnything
Large Scale Foundation Model
Misc
NeRF / SDF / Implicit
Object-6DOF
Object Detection
Object Tracking
Pose Embedding
Segmentation
Tools
Video + Language
Vision Embedding
Vision + Language
Stars
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
HY-Motion model for 3D human motion or 3D character animation generation.
Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepowe…
Official repository of paper "CoMoVi: Co-Generation of 3D Human Motions and Realistic Videos"
Ralph is an autonomous AI agent loop that runs repeatedly until all PRD items are complete.
The Quest for Generalizable Motion Generation: Data, Model, and Evaluation
PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation
A Claude Code plugin that shows what's happening - context usage, active tools, running agents, and todo progress
FIBO is a SOTA, first open-source, JSON-native text-to-image model built for controllable, predictable, and legally safe image generation.
A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos
InternVLA-A1: Unifying Understanding, Generation, and Action for Robotic Manipulation
[ICCV 2025 Highlight] DIMO: Diverse 3D Motion Generation for Arbitrary Objects
Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.
CARI4D: Category Agnostic 4D Reconstruction of Human-Object Interaction
Unified framework for robot learning built on NVIDIA Isaac Sim
3D Reconstruction of Indoor Scenes with a Generative Framework
Official repository of paper "WeDetect: Fast Open-Vocabulary Object Detection as Retrieval"
End-to-end pipeline converting generative videos (Veo, Sora) to humanoid robot motions
Describe Anything, Anywhere, at Any Moment (DAAAM), a novel approach to real-time, large-scale, spatio-temporal memory
MotionStream: Real-Time Video Generation with Interactive Motion Controls
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…