Stars
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"
"RAG-Anything: All-in-One RAG Framework"
"AI-Trader: Can AI Beat the Market?" Live Trading Bench: https://ai4trade.ai Tech Report Link: https://arxiv.org/abs/2512.10971
"AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework"
[NeurIPS2025] "AI-Researcher: Autonomous Scientific Innovation" -- A production-ready version: https://novix.science/chat
Post-training with Tinker
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
A collection of projects designed to help developers quickly get started with building deployable applications using the Claude API
Research Agent service for the Agentic Workflow course
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with co…
This workshop teaches systematic approaches to evaluating Generative AI workloads for production use. You'll learn to build evaluation frameworks that go beyond basic metrics to ensure reliable mod…
V-Express aims to generate a talking head video under the control of a reference image, an audio clip, and a sequence of V-Kps images.
Open-Sora: Democratizing Efficient Video Production for All
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
We present StableAvatar, the first end-to-end video diffusion transformer, which synthesizes infinite-length high-quality audio-driven avatar videos without any post-processing, conditioned on a re…
[AAAI 2026] EchoMimicV3: 1.3B Parameters are All You Need for Unified Multi-Modal and Multi-Task Human Animation
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs.
A Datacenter Scale Distributed Inference Serving Framework
Open-source search and retrieval database for AI applications.