Stars
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.
Get your documents ready for gen AI
Langflow is a powerful tool for building and deploying AI-powered agents and workflows.
📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
📖 NEW BOOK ($0.99 launch): https://amzn.to/4cvxqSw - This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems.
LangChain4j is an open-source Java library that simplifies the integration of LLMs into Java applications through a unified API, providing access to popular LLMs and vector databases. It makes impl…
Official Repository of RefChartQA: Grounding Visual Answer on Chart Images through Instruction Tuning
The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.
This repository implements computer vision for real-time chessboard detection and piece recognition. Using OpenCV and NumPy, the system processes video feeds to track physical chess games, detect b…
Implementation of paper - TrackNetV3: Enhancing ShuttleCock Tracking with Augmentations and Trajectory Rectification
Open-source Monocular Python HawkEye for Tennis
UnLimited TRAnsfers for Efficient Multimodal Journey Planning
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
[ACL 2024] ChartAssistant is a chart-based vision-language model for universal chart comprehension and reasoning.
A curated list of recent and past chart understanding work based on our IEEE TKDE survey paper: From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Mod…
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
Official Repository of ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning
A Python library for capturing the UDP telemetry data from the F1 2018 racing game