-
Max Planck Institute for Informatics
- Germany
- shishaoshuai.com
Stars
[ICLR2025 Spotlight🔥] Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
(ECCV 2024) Code for V-IRL: Grounding Virtual Intelligence in Real Life
[ECCV2024 Oral🔥] Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"
[CVPR 2024] A world model for autonomous driving.
[ICCV2023] Official Implementation of "UniTR: A Unified and Efficient Multi-Modal Transformer for Bird’s-Eye-View Representation"
We extend Segment Anything to 3D perception by combining it with VoxelNeXt.
An Efficient, Flexible, and General deep learning framework that retains minimal.
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Virtual Sparse Convolution for Multimodal 3D Object Detection
[CVPR2023] Official Implementation of "DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets"
(CVPR 2023) PLA: Language-Driven Open-Vocabulary 3D Scene Understanding & (CVPR2024) RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding
(NeurlPS 2022) Towards Efficient 3D Object Detection with Knowledge Distillation
[CVPR2022] This is the official code of "RBGNet: Ray-based Grouping for 3D Object Detection".
[NeurIPS 2022] Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline.
arxiv-sanity lite: tag arxiv papers of interest get recommendations of similar papers in a nice UI using SVMs over tfidf feature vectors based on paper abstracts.
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Cracking encrypted wechat message history from android
Pyramid R-CNN: Towards Better Performance and Adaptability for 3D Object Detection
One Million Scenes for Autonomous Driving
A project demonstrating how to use CUDA-PointPillars to deal with cloud points data from lidar.
(CVPR 2021) Back-tracing Representative Points for Voting-based 3D Object Detection in Point Clouds