Starred repositories
Teams-first Multi-agent orchestration for Claude Code
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
A python package to analyze and compare voices with deep learning
(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
[ICLR 2026] LongLive: Real-time Interactive Long Video Generation
Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"
Utility for generating 3D Gaussian head avatars directly from monocular 2D video streams
[NeurIPS 2025] OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation,β¦
Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021
This repository contains the codes for LipGAN. LipGAN was published as a part of the paper titled "Towards Automatic Face-to-Face Translation".
This repository is a repository for the paper, "Irgun: Improved residue based gradual up-scaling network for single image super resolution"
This is the repository containing codes for our CVPR, 2020 paper titled "Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis"
This is LipNet network where model learn from Lip movement and predict text without voice.
π Make websites accessible for AI agents. Automate tasks online with ease.
Official implementation of EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars
OmniTransfer implementation for LTX-2 (work in progress)
Implicit Motion Function - (unofficial) Microsoft recreation
wip - running some training with overfitting - https://wandb.ai/snoozie/vasa-overfitting
Slimmed, cleaned and fine-tuned oh-my-opencode fork, consumes much less tokens
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
Fast, small, and fully autonomous AI personal assistant infrastructure, ANY OS, ANY PLATFORM β deploy anywhere, swap anything π¦
π« Toolkit to help you get started with Spec-Driven Development
Mandarin Chinese audio datasets aligned with Montreal Forced Aligner
π¦ The Extras bucket for Scoop.
omo; the best agent harness - previously oh-my-opencode