Skip to content
View azai91's full-sized avatar

Block or report azai91

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Convert nuscenes data to mcap file format

Python 48 17 Updated May 27, 2026

Official models of Franka Robotics GmbH robots

Python 123 71 Updated May 4, 2026

A curated list of awesome robot descriptions (URDF, MJCF)

1,557 142 Updated Jun 3, 2026

A collection of high-quality models for the MuJoCo physics engine, curated by Google DeepMind.

Python 3,622 498 Updated Jun 15, 2026

A light-weight, pythonic ros2 package to connect the genesis simulator and ROS2

Python 200 22 Updated Jun 10, 2026

Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge) (CoRL 2024)

Jupyter Notebook 1,093 194 Updated Dec 20, 2025

C++ library designed to provide an abstraction for different rendering engines. It offers unified APIs for creating 3D graphics applications.

C++ 78 83 Updated Jun 18, 2026

Builds on top of Qt to provide widgets which are useful when developing robotics applications, such as a 3D view, plots, dashboard, etc, and can be used together in a convenient unified interface.

C++ 102 68 Updated Jun 10, 2026
Jupyter Notebook 1,893 119 Updated Nov 5, 2025

Lift, Splat, Shoot: Encoding Images from Arbitrary Camera Rigs by Implicitly Unprojecting to 3D (ECCV 2020)

Python 1,354 256 Updated Oct 15, 2024

🔎 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...

Python 3,282 247 Updated May 31, 2026

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Python 8,125 614 Updated Jul 17, 2024

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 8,306 859 Updated Mar 24, 2026

Depth Anything 3

Python 5,592 615 Updated Mar 21, 2026

PointNet and PointNet++ implemented by pytorch (pure python) and on ModelNet, ShapeNet and S3DIS.

Python 4,893 1,019 Updated Apr 24, 2024

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation

Python 5,431 1,488 Updated Nov 30, 2023

[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)

Python 732 102 Updated Feb 29, 2024

An open-source framework for training large multimodal models.

Python 4,107 321 Updated Aug 31, 2024

A Minimalist, Batteries-included Repository for Advancing World Model Science.

Python 625 34 Updated Jun 15, 2026

A minimal PyTorch implementation of the VQ-VAE model described in "Neural Discrete Representation Learning".

Python 92 15 Updated Feb 10, 2022

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 9,729 1,370 Updated Jun 12, 2026

Gemma open-weight LLM library, from Google DeepMind

Python 5,451 963 Updated Jun 17, 2026

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Python 1,760 168 Updated Dec 8, 2023
Python 1,959 124 Updated Sep 30, 2025

ACL 2025: Synthetic data generation pipelines for text-rich images.

Python 163 28 Updated Mar 1, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 29,116 2,975 Updated Apr 9, 2026

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 19,419 1,796 Updated Jan 30, 2026

A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''

1,371 57 Updated Mar 14, 2024

An open source implementation of CLIP.

Python 13,924 1,287 Updated Jun 18, 2026
Next