Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your Claude Code/Codex/Gemini agent will be an AI research agent with full horsepowe…

TeX 2,337 200 Updated Feb 3, 2026

Linum-AI / linum-v2

Linum v2 (text-to-video) models

Python 42 7 Updated Jan 22, 2026

IGL-HKUST / CoMoVi

Official repository of paper "CoMoVi: Co-Generation of 3D Human Motions and Realistic Videos"

70 Updated Jan 16, 2026

snarktank / ralph

Ralph is an autonomous AI agent loop that runs repeatedly until all PRD items are complete.

TypeScript 9,380 1,098 Updated Feb 2, 2026

saidwivedi / dotfiles

Shell 4 Updated Jan 13, 2026

MotrixLab / ViMoGen

The Quest for Generalizable Motion Generation: Data, Model, and Evaluation

Python 70 3 Updated Jan 4, 2026

huangwl18 / PointWorld

PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation

353 7 Updated Jan 8, 2026

jarrodwatts / claude-hud

A Claude Code plugin that shows what's happening - context usage, active tools, running agents, and todo progress

JavaScript 2,987 120 Updated Feb 3, 2026

Bria-AI / FIBO

FIBO is a SOTA, first open-source, JSON-native text-to-image model built for controllable, predictable, and legally safe image generation.

Python 302 15 Updated Jan 7, 2026

leofan90 / Awesome-World-Models

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…

1,184 34 Updated Feb 4, 2026

IamCreateAI / NeoVerse

NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos

341 7 Updated Jan 5, 2026

InternRobotics / InternVLA-A1

InternVLA-A1: Unifying Understanding, Generation, and Action for Robotic Manipulation

Jupyter Notebook 322 20 Updated Feb 3, 2026

Friedrich-M / DIMO

[ICCV 2025 Highlight] DIMO: Diverse 3D Motion Generation for Arbitrary Objects

Python 145 2 Updated Jan 6, 2026

Lightricks / LTX-2

Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.

Python 3,503 461 Updated Jan 29, 2026

NVlabs / CARI4D

CARI4D: Category Agnostic 4D Reconstruction of Human-Object Interaction

JavaScript 105 10 Updated Dec 24, 2025

isaac-sim / IsaacLab

Unified framework for robot learning built on NVIDIA Isaac Sim

Python 6,246 3,041 Updated Feb 4, 2026

cgtuebingen / 3D-RE-GEN

3D Reconstruction of Indoor Scenes with a Generative Framework

359 16 Updated Dec 22, 2025

WeChatCV / WeDetect

Official repository of paper "WeDetect: Fast Open-Vocabulary Object Detection as Retrieval"

Python 112 2 Updated Dec 20, 2025

amazon-far / holosoma

Python 936 120 Updated Feb 2, 2026

AIM-Intelligence / video2robot

End-to-end pipeline converting generative videos (Veo, Sora) to humanoid robot motions

Python 587 67 Updated Dec 18, 2025

MIT-SPARK / DAAAM

Describe Anything, Anywhere, at Any Moment (DAAAM), a novel approach to real-time, large-scale, spatio-temporal memory

143 3 Updated Dec 7, 2025

alex4727 / MotionStream

MotionStream: Real-Time Video Generation with Interactive Motion Controls

494 17 Updated Nov 13, 2025

Tongyi-MAI / Z-Image

Python 9,844 624 Updated Jan 30, 2026

facebookresearch / sam3

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 7,534 973 Updated Feb 3, 2026

Sai Kumar Dwivedi (saidwivedi)

Lists (32)

2D / 3D Keypoints

3D-Avatar-NonParam

3D from Image/Video

3D from Text

3D + Language

Architecture

Curation List

Datasets

Depth Estimation

Digital Human <-> Robotics

Hand Mesh Recovery

HSI-Generation

Human Body Mesh

Human Motion

Human-Object-Interaction

Human-Object-Reconstruction-3D

Human Parsing

Human-Scene-Interaction

Image Generation

Inpainting / EditAnything

Large Scale Foundation Model

Misc

NeRF / SDF / Implicit

Object-6DOF

Object Detection

Object Tracking

Pose Embedding

Segmentation

Tools

Video + Language

Vision Embedding

Vision + Language
