Lists (1)
Sort Name ascending (A-Z)
Stars
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
React app for inspecting, building and debugging with the Realtime API
🧠 Motorhead is a memory and information retrieval server for LLMs.
Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters in one simple API.
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
[arXiv 2023] Set-of-Mark Prompting for GPT-4V and LMMs
LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.
Production-ready platform for agentic workflow development.
From anywhere you can type, query and stream the output of any script (e.g. an LLM)
Building a chatbot powered with a RAG pipeline to read,summarize and quote the most relevant papers related to the user query.
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
🚀 PR Agent - The Original Open-Source PR Reviewer. This repo is not the Qodo free tier! Try the free version on our website.
Effortless data labeling with AI support from Segment Anything and other awesome models.
AI-driven database tool and SQL client, The hottest GUI client, supporting MySQL, Oracle, PostgreSQL, DB2, SQL Server, DB2, SQLite, H2, ClickHouse, and more.
There can be more than Notion and Miro. AFFiNE(pronounced [ə‘fain]) is a next-gen knowledge base that brings planning, sorting and creating all together. Privacy first, open-source, customizable an…
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
Question and Answer based on Anything.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Sample AI movies app built with ❍ Ion
Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""