Lists (6)
Sort Name ascending (A-Z)
Starred repositories
🦜🔗 The platform for reliable agents.
Robust Speech Recognition via Large-Scale Weak Supervision
Python tool for converting files and office documents to Markdown.
real time face swap and one-click video deepfake with only a single image
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
Hunt down social media accounts by username across social networks
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
The world's simplest facial recognition api for Python and the command line
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫
A generative speech model for daily dialogue.
A community-maintained Python framework for creating mathematical animations.
Easily train a good VC model with voice data <= 10 mins!
Open-Sora: Democratizing Efficient Video Production for All
Xiaomi Home Integration for Home Assistant
Faster Whisper transcription with CTranslate2
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Lets make video diffusion practical!
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Wan: Open and Advanced Large-Scale Video Generative Models
Train your AI self, amplify you, bridge the world
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Python bindings for FFmpeg - with complex filtering support
Self-hosted YouTube downloader (web UI for youtube-dl / yt-dlp)