Stars
real time face swap and one-click video deepfake with only a single image
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
The world's simplest facial recognition api for Python and the command line
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
A TTS model capable of generating ultra-realistic dialogue in one pass.
A python library built to empower developers to build applications and systems with self-contained Computer Vision capabilities
这是一份入门AI/LLM大模型的逐步指南,包含教程和演示代码,带你从API走进本地大模型部署和微调,代码文件会提供Kaggle或Colab在线版本,即便没有显卡也可以进行学习。项目中还开设了一个小型的代码游乐场🎡,你可以尝试在里面实验一些有意思的AI脚本。同时,包含李宏毅 (HUNG-YI LEE)2024生成式人工智能导论课程的完整中文镜像作业。
Fine-Tune popular face-recognition architectures with LFW and QMUL-Survface datasets for evaluating Low Resolution Face Recognition