qiqika

Follow

yuanhuizhen qiqika

Follow

6 followers · 42 following

cidi
china

Starred repositories

alinlab / s-clip

S-CLIP: Semi-supervised Vision-Language Pre-training using Few Specialist Captions

Python 51 6 Updated May 26, 2023

d-li14 / involution

[CVPR 2021] Involution: Inverting the Inherence of Convolution for Visual Recognition, a brand new neural operator

Python 1,312 175 Updated Jul 16, 2021

graphdeco-inria / gaussian-splatting

Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Python 22,438 3,273 Updated Oct 17, 2025

tgxs002 / CORA

A DETR-style framework for open-vocabulary detection (OVD). CVPR 2023

Python 201 22 Updated Apr 16, 2023

lucazanella / AnomalyCLIP

Official implementation of "Delving into CLIP latent space for Video Anomaly Recognition", CVIU 2024

Python 107 16 Updated Sep 27, 2025

shyam671 / Mask2Anomaly-Unmasking-Anomalies-in-Road-Scene-Segmentation

[ICCV'23 Oral] Unmasking Anomalies in Road-Scene Segmentation

Python 61 10 Updated Apr 28, 2024

NazirNayal8 / RbA

Official code for RbA: Segmenting Unknown Regions Rejected by All (ICCV 2023)

Python 72 11 Updated Jan 10, 2025

kumuji / ugains

[GCPR 2023] UGainS: Uncertainty Guided Anomaly Instance Segmentation

Python 16 Updated Jul 31, 2024

AILab-CVC / YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 6,425 608 Updated Feb 26, 2025

IrohXu / Awesome-Multimodal-LLM-Autonomous-Driving

[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving

311 13 Updated Mar 14, 2024

EvolvingLMMs-Lab / Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python 3,415 210 Updated Mar 5, 2024

Vision-CAIR / MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,676 2,888 Updated Sep 2, 2024

VITA-MLLM / Woodpecker

✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models

Python 650 29 Updated Dec 23, 2024

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

17,900 1,129 Updated Jun 18, 2026

Thinklab-SJTU / Awesome-LLM4AD

A curated list of awesome LLM/VLM/VLA/World Model for Autonomous Driving(LLM4AD) resources (continually updated)

1,853 108 Updated Jun 22, 2026

Infernolia / WEDGE

WEDGE: A multi-weather autonomous driving dataset built from generative vision-language models

JavaScript 37 3 Updated Mar 22, 2024

arekavandi / Transformer-SOD

172 24 Updated Jul 29, 2025

exiawsh / StreamPETR

[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection

Python 817 96 Updated Jun 26, 2024

megvii-research / PETR

[ECCV2022] PETR: Position Embedding Transformation for Multi-View 3D Object Detection & [ICCV2023] PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images

Python 1,061 157 Updated Oct 11, 2023

ChenhongyiYang / WidthFormer

[IROS 2024 Oral Presentation] WidthFormer: Toward Efficient Transformer-based BEV View Transformation

Python 166 12 Updated Apr 6, 2025

duanzhiihao / RAPiD

RAPiD: Rotation-Aware People Detection in Overhead Fisheye Images (CVPR 2020 Workshops)

Jupyter Notebook 223 62 Updated Nov 26, 2023

Senwang98 / MonoSKD

[ECAI 2023] MonoSKD: General Distillation Framework for Monocular 3D Object Detection via Spearman Correlation Coefficient

Python 32 4 Updated Dec 8, 2023

arcanienz / ODM3D

[WACV'24] ODM3D: Alleviating Foreground Sparsity for Semi-Supervised Monocular 3D Object Detection

Python 22 4 Updated Feb 4, 2024

HuangJunJie2017 / BEVDet

Code base of the BEVDet series .

Python 1,790 306 Updated Jul 4, 2024

filaPro / oneformer3d

[CVPR2024] OneFormer3D: One Transformer for Unified Point Cloud Segmentation

Python 602 59 Updated Oct 23, 2024

fanq15 / Stable-SAM

73 1 Updated Dec 6, 2023

ytongbai / LVM

Python 1,835 61 Updated Jun 28, 2024

BraveGroup / Drive-WM

[CVPR 2024] A world model for autonomous driving.

Python 435 15 Updated Dec 7, 2023

wzzheng / OccWorld

[ECCV 2024] 3D World Model for Autonomous Driving

Python 564 41 Updated Apr 12, 2024

georghess / neurad-studio

[CVPR2024] NeuRAD: Neural Rendering for Autonomous Driving

Python 483 56 Updated Oct 27, 2025

Starred topics

Awesome Lists