Lists (3)
Sort Name ascending (A-Z)
Stars
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
Glances an Eye on your system. A top/htop alternative for GNU/Linux, BSD, Mac OS and Windows operating systems.
Replace 'hub' with 'ingest' in any GitHub URL to get a prompt-friendly extract of a codebase
Open Source framework for voice and multimodal conversational AI
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …
No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents
The open source platform for AI-native application development.
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Automatic Video Generation from Scientific Papers
Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"
AutoGroq is a groundbreaking tool that revolutionizes the way users interact with Autogen™ and other AI assistants. By dynamically generating tailored teams of AI agents based on your project requi…
Cache-Augmented Generation: A Simple, Efficient Alternative to RAG
Multi-backend whisper app. Blazing fast. Mac-arm optimized. Easy install. Input a local file or url and this service will transcribe it using Whisper AI. Completely private and Free 🤯🤯🤯
Agent File (.af): An open file format for serializing stateful AI agents with persistent memory and behavior. Share, checkpoint, and version control agents across compatible frameworks.
This repository will have different projects using AutoGen and Tutorials
Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model
Sharing early versions of Ada, a personal AI Assistant built on OpenAIs Realtime API
Chat with your documents using Vision Language Models. This repo implements an End to End RAG pipeline with both local and proprietary VLMs
ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio
[EMNLP'25 findings] This is the official repo for the paper, HiRAG: Retrieval-Augmented Generation with Hierarchical Knowledge.
What if we could pack single purpose, powerful AI Agents into a single python file?
Repo of skills for autogen studio using model context protocol (mcp)