Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
Stable Diffusion web UI
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Command-line program to download videos from YouTube.com and other video sites
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
real time face swap and one-click video deepfake with only a single image
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
Hunt down social media accounts by username across social networks
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
No fortress, purely open ground. OpenManus is Coming.
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫
A generative speech model for daily dialogue.
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Convert PDF to markdown + JSON quickly with high accuracy
Generative Models by Stability AI
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
Official inference repo for FLUX.1 models
TradingAgents: Multi-Agents LLM Financial Trading Framework
Improve your resumes with Resume Matcher. Get insights, keyword suggestions and tune your resumes to job descriptions.
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
Intelligent automation and multi-agent orchestration for Claude Code
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Janus-Series: Unified Multimodal Understanding and Generation Models
Tongyi Deep Research, the Leading Open-source Deep Research Agent