Skip to content
View qiqika's full-sized avatar

Block or report qiqika

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

S-CLIP: Semi-supervised Vision-Language Pre-training using Few Specialist Captions

Python 51 6 Updated May 26, 2023

[CVPR 2021] Involution: Inverting the Inherence of Convolution for Visual Recognition, a brand new neural operator

Python 1,312 175 Updated Jul 16, 2021

Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Python 22,438 3,273 Updated Oct 17, 2025

A DETR-style framework for open-vocabulary detection (OVD). CVPR 2023

Python 201 22 Updated Apr 16, 2023

Official implementation of "Delving into CLIP latent space for Video Anomaly Recognition", CVIU 2024

Python 107 16 Updated Sep 27, 2025

[ICCV'23 Oral] Unmasking Anomalies in Road-Scene Segmentation

Python 61 10 Updated Apr 28, 2024

Official code for RbA: Segmenting Unknown Regions Rejected by All (ICCV 2023)

Python 72 11 Updated Jan 10, 2025

[GCPR 2023] UGainS: Uncertainty Guided Anomaly Instance Segmentation

Python 16 Updated Jul 31, 2024

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 6,425 608 Updated Feb 26, 2025

[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving

311 13 Updated Mar 14, 2024

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python 3,415 210 Updated Mar 5, 2024

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,676 2,888 Updated Sep 2, 2024

✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models

Python 650 29 Updated Dec 23, 2024

✨✨Latest Advances on Multimodal Large Language Models

17,900 1,129 Updated Jun 18, 2026

A curated list of awesome LLM/VLM/VLA/World Model for Autonomous Driving(LLM4AD) resources (continually updated)

1,853 108 Updated Jun 22, 2026

WEDGE: A multi-weather autonomous driving dataset built from generative vision-language models

JavaScript 37 3 Updated Mar 22, 2024

[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection

Python 817 96 Updated Jun 26, 2024

[ECCV2022] PETR: Position Embedding Transformation for Multi-View 3D Object Detection & [ICCV2023] PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images

Python 1,061 157 Updated Oct 11, 2023

[IROS 2024 Oral Presentation] WidthFormer: Toward Efficient Transformer-based BEV View Transformation

Python 166 12 Updated Apr 6, 2025

RAPiD: Rotation-Aware People Detection in Overhead Fisheye Images (CVPR 2020 Workshops)

Jupyter Notebook 223 62 Updated Nov 26, 2023

[ECAI 2023] MonoSKD: General Distillation Framework for Monocular 3D Object Detection via Spearman Correlation Coefficient

Python 32 4 Updated Dec 8, 2023

[WACV'24] ODM3D: Alleviating Foreground Sparsity for Semi-Supervised Monocular 3D Object Detection

Python 22 4 Updated Feb 4, 2024

Code base of the BEVDet series .

Python 1,790 306 Updated Jul 4, 2024

[CVPR2024] OneFormer3D: One Transformer for Unified Point Cloud Segmentation

Python 602 59 Updated Oct 23, 2024
Python 1,835 61 Updated Jun 28, 2024

[CVPR 2024] A world model for autonomous driving.

Python 435 15 Updated Dec 7, 2023

[ECCV 2024] 3D World Model for Autonomous Driving

Python 564 41 Updated Apr 12, 2024

[CVPR2024] NeuRAD: Neural Rendering for Autonomous Driving

Python 483 56 Updated Oct 27, 2025
Next