Stars
SGLang is a high-performance serving framework for large language models and multimodal models.
A next.js web application that integrates AI capabilities with draw.io diagrams. This app allows you to create, modify, and enhance diagrams through natural language commands and AI-assisted visual…
Tensors and Dynamic neural networks in Python with strong GPU acceleration
UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)
Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)
Supercharge Your LLM with the Fastest KV Cache Layer
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
A high-throughput and memory-efficient inference and serving engine for LLMs
verl: Volcano Engine Reinforcement Learning for LLMs
FlagGems is an operator library for large language models implemented in the Triton Language.
DLRover: An Automatic Distributed Deep Learning System
FlagScale is a large model toolkit based on open-sourced projects.
Crane is a FinOps Platform for Cloud Resource Analytics and Economics in Kubernetes clusters. The goal is not only to help users to manage cloud cost easier but also ensure the quality of applicati…
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
[NeurIPS 2024] VideoTetris: Towards Compositional Text-To-Video Generation
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Tests with different WebRTC stacks to understand them
优化wav2lip的执行步骤,将头脸分离、嘴型替换、回补背景三个步骤分离,添加gfpgan强化面部功能,实现提前解帧,流式循环处理,对接obs
Real time interactive streaming digital human
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code