Lists (2)
Sort Name ascending (A-Z)
Starred repositories
Stable Diffusion web UI
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
real time face swap and one-click video deepfake with only a single image
💫 Toolkit to help you get started with Spec-Driven Development
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
Get your documents ready for gen AI
Interact with your documents using the power of GPT, 100% privately, no data leaks
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…
Build, run, manage agentic software at scale.
Official inference framework for 1-bit LLMs
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
DSPy: The framework for programming—not prompting—language models
Installable GitHub library of 1,370+ agentic skills for Claude Code, Cursor, Codex CLI, Gemini CLI, Antigravity, and more. Includes installer CLI, bundles, workflows, and official/community skill c…
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Generative Models by Stability AI
An autonomous agent that conducts deep research on any data using any LLM providers
An open-source RAG-based tool for chatting with your documents.
CLI tool for configuring and monitoring Claude Code
A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone
Build and run agents you can see, understand and trust.