Lists (3)
Sort Name ascending (A-Z)
Stars
Python tool for converting files and office documents to Markdown.
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
LlamaIndex is the leading framework for building LLM-powered agents over your data.
Get your documents ready for gen AI
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Turns Data and AI algorithms into production-ready web applications in no time.
Janus-Series: Unified Multimodal Understanding and Generation Models
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
The Open edX LMS & Studio, powering education sites around the world!
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
A sample app for the Retrieval-Augmented Generation pattern running in Azure, using Azure AI Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.
Retrieval Augmented Generation (RAG) chatbot powered by Weaviate
a script to run docker-compose.yml using podman
Full toolkit for running an AI agent service built with LangGraph, FastAPI and Streamlit
Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in…
An OpenCV based document scanner
Anomaly detection using LoOP: Local Outlier Probabilities, a local density based outlier detection method providing an outlier score in the range of [0,1].
Pushkin is a free open source tool for sending push notifications
Large Language Model (LLM) Inference API and Chatbot
Measure size of an object (height and width) using a reference object
Conversion of PDF documents to structured Markdown, optimized for Retrieval Augmented Generation (RAG) and other NLP tasks. Extract text, tables, and images with preserved formatting for enhanced i…
Extracting and Exploring Blockchain Data from Ethereum
This project, pdf2md, transforms academic paper PDF files into digestible text files. By analyzing the layout of the PDF file, the application restructures paragraphs and translates desired content…
Addons like multipages for streamlit webapp
Convert PDF to Markdown via OpenAI multi-modal text/vision model.