Lists (3)
Sort Name ascending (A-Z)
Stars
Lightweight implementation of Robotic World Model (RWM) and Uncertainty-Aware Robotic World Model (RWM-U)
The SfM result of Metashape equirectangular (spherical) converts into COLMAP text format
This repo contains a curative list of scene change detection(SCD), including papers, videos, codes, and related websites.
This repository represents the official implementation of the paper titled "Towards Generalizable Scene Change Detection (CVPR 2025)".
Multi-Joint dynamics with Contact. A general purpose physics simulator.
Real-time Vision Language Model interaction via webcam - WebRTC-based web interface
An rclcpp-compatible true zero-copy IPC middleware that supports all ROS message types, including message structs already generated by rosidl.
A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical Character Recognition
Isaac Lab API, powered by MuJoCo-Warp, for RL and robotics research.
What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval 🔄.
Release repo for our SLAM Handbook
PINTO0309 / DEIMv2
Forked from Intellindust-AI-Lab/DEIMv2[DEIMv2] Real Time Object Detection Meets DINOv3
Tongyi Deep Research, the Leading Open-source Deep Research Agent
"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"
Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, i…
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Perplexica is an AI-powered answering engine.
[CVPR 24] Extend Your Own Correspondences: Unsupervised Distant Point Cloud Registration by Progressive Distance Extension
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
[ICLR 2026] FastVGGT: Fast Visual Geometry Transformer
GenAI Agent Framework, the Pydantic way
SigLIP-based Aesthetic Score Predictor
[CVPR 2025 Highlight] Towards Enhanced Image Inpainting: Mitigating Unwanted Object Insertion and Preserving Color Consistency
COLMAP - Structure-from-Motion and Multi-View Stereo
An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI