Stars
Real-time face swap for PC streaming or video calls
Open-source RAW photo processing and digital asset management software.
AI agents running research on single-GPU nanochat training automatically
NVIDIA Isaac Sim™ is an open-source application on NVIDIA Omniverse for developing, simulating, and testing AI-driven robots in realistic virtual environments.
The official implementation of ICCV'25 paper "FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution"
video-SALMONN 2 is a powerful audio-visual large language model (LLM) that generates high-quality audio-visual video captions, which is developed by the Department of Electronic Engineering at Tsin…
[ICCV 2025] LVBench: An Extreme Long Video Understanding Benchmark
Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos
ABC: A Big CAD Model Dataset For Geometric Deep Learning
🪐 Objaverse-XL is a Universe of 10M+ 3D Objects. Contains API Scripts for Downloading and Processing!
CADAM is the open source text-to-CAD web application
Pure TypeScript media toolkit for reading, writing, and converting video and audio files, directly in the browser.
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Universal drop-in replacement SDK for Base44 projects to migrate to self-hosted Supabase with zero code changes
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
AI Q&A Search Engine ➡️ 基于LangChain和SearXNG打造的开源AI搜索引擎
An AI-powered search engine with a generative UI
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
An open source `vercel` like deployment platform for Comfy UI
Custom prompt styler node for SDXL in ComfyUI
A powerful set of mask-related nodes for ComfyUI
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
An extensive node suite for ComfyUI with over 210 new nodes