Starred repositories
Easily train a good VC model with voice data <= 10 mins!
💫 Industrial-strength Natural Language Processing (NLP) in Python
Real-time face swap for PC streaming or video calls
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep lear…
AIHawk aims to ease the job hunt by automating the job application process. Using artificial intelligence, it enables users to apply for multiple jobs in a tailored way.
Best and simplest tool for website change detection, web page monitoring, and website change alerts. Perfect for tracking content changes, price drops, restock alerts, and website defacement monito…
Open-Sora: Democratizing Efficient Video Production for All
State-of-the-art 2D and 3D Face Analysis Project
An open-source RAG-based tool for chatting with your documents.
Code for the paper "Language Models are Unsupervised Multitask Learners"
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
Universal LLM Deployment Engine with ML Compilation
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
Intelligent automation and multi-agent orchestration for Claude Code
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
📷 Instagram Bot - Tool for automated Instagram interactions
Command-line program to download image galleries and collections from several image hosting sites
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
A powerful coding agent toolkit providing semantic retrieval and editing capabilities (MCP server & other integrations)
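🚀 "Douyin_TikTok_Download_API" is an out-of-the-box, high-performance asynchronous data scraping tool for Douyin, Kuaishou, TikTok, and Bilibili, supporting API calls and online batch parsing and downloading.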
match command-line arguments to their help text
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Android in docker solution with noVNC supported and video recording
A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For an HD commercial model, please try out Sync Labs.