Stars
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
Perform data science on data that remains in someone else's server
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
Build effective agents using Model Context Protocol and simple workflow patterns
🚴 Call stack profiler for Python. Shows you why your code is slow!
A sample app for the Retrieval-Augmented Generation pattern running in Azure, using Azure AI Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
Embrace the APIs of the future. Hug aims to make developing APIs as simple as possible, but no simpler.
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
🎉 Repo for LaWGPT, Chinese-Llama tuned with Chinese Legal knowledge. 基于中文法律知识的大语言模型
Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.
Ready-to-use and customizable users management for FastAPI
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.
A Unified Toolkit for Deep Learning Based Document Image Analysis
Klavis AI (YC X25): MCP integration platforms that let AI agents use tools reliably at any scale
The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai
Superduper: End-to-end framework for building custom AI applications and agents.
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
A fast inference library for running LLMs locally on modern consumer-class GPUs