Stars
[ICLR 2025 Spotlight] Official implementation for "DynamicCity: Large-Scale 4D Occupancy Generation from Dynamic Scenes"
TAPIP3D: Tracking Any Point in Persistent 3D Geometry
[ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy
[CVPR 2025] Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents
Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence
⏰ AI conference deadline countdowns
TensorBoard for PyTorch (and Chainer, MXNet, NumPy, ...)
An example RLDS dataset builder for X-embodiment dataset conversion.
openvla / openvla
Forked from TRI-ML/prismatic-vlms
OpenVLA: An open-source vision-language-action model for robotic manipulation.
A glimpse into your computer's temperature, voltage, fan speed, memory usage and CPU load.
Unified framework for robot learning built on NVIDIA Isaac Sim
Rich is a Python library for rich text and beautiful formatting in the terminal.
Machine Learning Containers for NVIDIA Jetson and JetPack-L4T
Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.
Convert your (Beamer) PDF slides to (PowerPoint) PPTX