Lists (3)
Sort Name ascending (A-Z)
Stars
A self-contained CLI wrapper for mihomo (Clash.Meta) proxy. Bundles the binary + geo data + web UI in the wheel. One mhctl install configures everything (shell completion, setcap, polkit, NetworkMa…
[CVPR 2025] Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera
Collect paper about ground segmentation in 3D point cloud.
pengcheng001 / RoadBEV
Forked from ztsrxh/RoadBEVCodes for RoadBEV: road surface reconstruction in Bird's Eye View
Official code for CAVIS: Context-Aware Video Instance Segmentation
Catkin package that provides lidar motion undistortion based on an external 6DoF pose estimation input.
A header-only program of 3d rotary lidar undistortion using 9-axis imu
[NeurIPS'23 Spotlight] Segment Any Point Cloud Sequences by Distilling Vision Foundation Models
Official repository for CVPR 2021 paper "Differentiable Diffusion for Dense Depth Estimation from Multi-view Images"
Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
Lightweight Armoury Crate alternative for Asus laptops with nearly the same functionality. Works with ROG Zephyrus, Flow, TUF, Strix, Scar, ProArt, Vivobook, Zenbook, Expertbook, ROG Ally, and many…
[ECCV2022] PETR: Position Embedding Transformation for Multi-View 3D Object Detection & [ICCV2023] PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images
[ICLR 2024] TopoMLP: A Simple yet Strong Pipeline for Driving Topology Reasoning
C++ implementation of the Python Numpy library
🇨🇳 GitHub中文排行榜,各语言分设「软件 | 资料」榜单,精准定位中文好项目。各取所需,高效学习。
Collection of AWESOME vision-language models for vision tasks
The FORM project for symbolic manipulation of very big expressions
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Images to inference with no labeling (use foundation models to train supervised models).
ChatLaw:A Powerful LLM Tailored for Chinese Legal. 中文法律大模型
[ICCV 2023] SurroundOcc: Multi-camera 3D Occupancy Prediction for Autonomous Driving
TensorRT deployment for CenterPoint Lidar Detection Model.
Code and documentation to train Stanford's Alpaca models, and generate the data.
Pytorch and ncnn implementation of PPYOLOE、YOLOX、PPYOLO、PPYOLOv2、PicoDet and so on.
pengcheng001 / BEVFormer
Forked from fundamentalvision/BEVFormerThis is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.