Lists (29)
Sort Name ascending (A-Z)
alg
architecture
audio
backend
conditioning
diffusion
disentangle
flow
frontend
infra
language
llm
lora
manifold
ml_materials
mlops
MoE
monitoring_and_operation
music
optimization
personalization
quantization
reinforcement_learning
Scala
small_model
style_transfer
video
vision
web
Starred repositories
Model compression toolkit engineered for enhanced usability, comprehensiveness, and efficiency.
A type-safe HTTP server framework for Scala.js that combines Express-style ergonomics with Scala's powerful type system
Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…
Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.
Marketing skills for Claude Code and AI agents. CRO, copywriting, SEO, analytics, and growth engineering.
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
Recursive Self-Aggregation evals on ARC-AGI
Pytorch implementation of MeanFlow on ImageNet and CIFAR10
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Code for NeurIPS 2024 paper - The GAN is dead; long live the GAN! A Modern Baseline GAN - by Huang et al.
AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model (ICLR 2025 Oral)
OLMoE: Open Mixture-of-Experts Language Models
ICLR'25 Oral: Improving Probabilistic Diffusion Models With Optimal Covariance Matching
Easy Data Preparation with latest LLMs-based Operators and Pipelines.
The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…
[ICLR 2026] rCM: SOTA JVP-Based Diffusion Distillation & Few-Step Video Generation & Scaling Up sCM/MeanFlow & Real-Time Autoregressive Video Diffusion
TurboDiffusion: 100–200× Acceleration for Video Diffusion Models
"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"
An asynchronous programming facility for Scala
A Plugin-Based Multi-Agent System for In-Editor Academic Writing, Review, and Editing
A lightweight and high-performance reverse proxy for NAT traversal, written in Rust. An alternative to frp and ngrok.
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…
NGINX and NGINX Plus Ingress Controllers for Kubernetes
Ingress NGINX Controller for Kubernetes