Lists (13)
Sort Name ascending (A-Z)
Starred repositories
An opinionated list of awesome Python frameworks, libraries, software and resources.
FastAPI framework, high performance, easy to learn, fast to code, ready for production
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
Ansible is a radically simple IT automation platform that makes your applications and systems easier to deploy and maintain. Automate everything from code deployment to network configuration to clo…
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
A cryptocurrency trading API with more than 100 exchanges in JavaScript / TypeScript / Python / C# / PHP / Go
A generative speech model for daily dialogue.
If you live in the terminal, kitty is made for you! Cross-platform, fast, feature-rich, GPU based.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
A Conversational Speech Generation Model
✨ Agentic IM ChatBot Infrastructure — 聊天智能体基础设施 ✨ 多消息平台集成(QQ / Telegram / 企微 / 飞书 / 钉钉等),强大易用的插件系统,支持 OpenAI / Gemini / Anthropic / Dify / Coze / 阿里云百炼 / 知识库 / Agent 智能体
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
fay是一个帮助数字人(2.5d、3d、移动、pc、网页)或大语言模型(openai兼容、deepseek)连通业务系统的mcp框架。
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Kronos: A Foundation Model for the Language of Financial Markets
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.
ValueCell is a community-driven, multi-agent platform for financial applications.
Python library for processing Chinese text
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation