Lists (26)
Sort Name ascending (A-Z)
2D Tracking
3D lane detection
3D Object Tracking
3D Occupancy Prediction
3D Reconstruction
3D Semantic Segmentation
AI-Assistant
AI Software Engineer
Autoware
BEV
Camera-Based 3D Detection
Deployment
Dev
End-to-end
Fusion
Lidar-Based 3D Detection
LLM
Motion Forcasting
Occupancy and Flow Prediction
Planning
Portrait
Semantic Segmentaion
SLAM
Video Generation
VLM
VLM Autonomous Driving
Stars
real time face swap and one-click video deepfake with only a single image
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
We write your reusable computer vision tools. 💜
Official inference framework for 1-bit LLMs
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous …
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
A generative world for general-purpose robotics & embodied AI learning.
"CLI-Anything: Making ALL Software Agent-Native" -- CLI-Hub: https://clianything.cc/
Fast and memory-efficient exact attention
Devika is the first open-source implementation of an Agentic Software Engineer. Initially started as an open-source alternative to Devin.
Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
Resume builder for academics and engineers
Ongoing research training transformer models at scale
"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"
Your Personal AI Assistant; easy to install, deploy on your own machine or on the cloud; supports multiple chat apps with easily extensible capabilities.
Official implementation of AnimateDiff.
Fully autonomous & self-evolving research from idea to paper. Chat an Idea. Get a Paper. 🦞
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.
ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works…
[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving
"ClawTeam: Agent Swarm Intelligence" (One Command → Full Automation)
MAGI-1: Autoregressive Video Generation at Scale
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.
Trackers gives you clean, modular re-implementations of leading multi-object tracking algorithms released under the permissive Apache 2.0 license. You combine them with any detection model you alre…
[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation