Stars
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars.
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Python bindings for FFmpeg - with complex filtering support
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Turn your Python application into an Android APK
OpenMMLab's next-generation platform for general 3D object detection.
Count the MACs / FLOPs of your PyTorch model.
An easy to use PyTorch to TensorRT converter
openvla / openvla
Forked from TRI-ML/prismatic-vlmsOpenVLA: An open-source vision-language-action model for robotic manipulation.
[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
Visual localization made easy with hloc
🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
The devkit of the nuScenes dataset.
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
Convolutional Neural Networks to predict the aesthetic and technical quality of images.
[NeurIPS 2024 Datasets and Benchmarks Track] Closed-Loop E2E-AD Benchmark Enhanced by World Model RL Expert
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
Reference implementations of MLPerf® inference benchmarks
[ICLR'23 Spotlight & ECCV'24 & IJCV'24] MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction
Lift, Splat, Shoot: Encoding Images from Arbitrary Camera Rigs by Implicitly Unprojecting to 3D (ECCV 2020)
A full Python implementation for real car surround view system