Skip to content
View aimicm's full-sized avatar

Block or report aimicm

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

PyTorch code and models for VJEPA2 self-supervised learning from video.

Python 4,203 515 Updated Mar 23, 2026

[CVPR2026] SGDrive: Scene-to-Goal Hierarchical World Cognition for Autonomous Driving

Python 67 4 Updated Apr 15, 2026

[AAAI 2024] Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-Supervised 3D Object Detection

Python 10 1 Updated Jan 24, 2025

A Survey on Multimodal Retrieval-Augmented Generation

524 28 Updated Feb 20, 2026
Python 52 2 Updated May 6, 2025

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 13,996 1,439 Updated Oct 28, 2025

Source code of PivotNet (ICCV2023, PivotNet: Vectorized Pivot Learning for End-to-end HD Map Construction)

Python 125 11 Updated Mar 20, 2024

[ICLR 2024] TopoMLP: A Simple yet Strong Pipeline for Driving Topology Reasoning

Python 214 18 Updated Jan 22, 2026

[ICRA2023] CoAlign: Robust Collaborative 3D Object Detection in Presence of Pose Errors

Python 185 12 Updated Jul 23, 2024

[Information Fusion 2025] A Survey on Occupancy Perception for Autonomous Driving: The Information Fusion Perspective

612 36 Updated Jun 19, 2026

Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]

Python 1,854 82 Updated Jul 27, 2025

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Python 8,651 1,120 Updated Sep 14, 2024

Efficient Multimodal Large Language Models: A Survey

385 21 Updated Apr 29, 2025

[ECCV 2024] Fully Sparse 3D Occupancy Prediction & RayIoU Evaluation Metric

Python 429 32 Updated Aug 15, 2024

Code of "OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free Environments".

Python 362 19 Updated Jun 5, 2025

[ICLR2024] HEAL: An Extensible Framework for Open Heterogeneous Collaborative Perception ➡️ All You Need for Multi-Modality Collaborative Perception!

Python 232 21 Updated Jan 1, 2025

[ICLR2024] HEAL: An Extensible Framework for Open Heterogeneous Collaborative Perception ➡️ All You Need for Multi-Modality Collaborative Perception!

Python 1 Updated Mar 14, 2024

中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。

Python 1,711 194 Updated Apr 20, 2024

Masked World Models for Visual Control

Python 138 10 Updated Jun 11, 2023

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

80,254 9,352 Updated Feb 5, 2026

(IEEE TIV) A Comprehensive Framework for 3D Occupancy Estimation in Autonomous Driving

Python 221 13 Updated Dec 5, 2024

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Jupyter Notebook 5,942 550 Updated Mar 31, 2026

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 24,556 2,810 Updated May 25, 2026

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 6,223 573 Updated Aug 22, 2025

This repository is for CL3D: Unsupervised Domain Adaptation for Cross-LiDAR 3D Detection.

Python 27 6 Updated Oct 11, 2023

The first Chinese medical large vision-language model designed to integrate the analysis of textual and visual data

Python 65 10 Updated Dec 1, 2023

[CoRL 2023] Robot Parkour Learning

Python 1,072 148 Updated Oct 26, 2025

【LLMs九层妖塔】分享 LLMs在自然语言处理(ChatGLM、Chinese-LLaMA-Alpaca、小羊驼 Vicuna、LLaMA、GPT4ALL等)、信息检索(langchain)、语言合成、语言识别、多模态等领域(Stable Diffusion、MiniGPT-4、VisualGLM-6B、Ziya-Visual等)等 实战与经验。

2,165 207 Updated Mar 30, 2024

how to optimize some algorithm in cuda.

Cuda 3,092 279 Updated Jun 20, 2026

[NeurIPS Workshop 2019] Official code of the paper "Probabilistic 3D Multi-Object Tracking for Autonomous Driving." First Place of the First NuScenes Tracking Challenge in the AI Driving Olympics W…

Python 399 79 Updated Jan 29, 2024
Next