Skip to content
View troyjcross's full-sized avatar

Highlights

  • Pro

Block or report troyjcross

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

126 stars written in Python
Clear filter

OpenMMLab Text Detection, Recognition and Understanding Toolbox

Python 4,672 776 Updated Nov 27, 2024

ACI.dev is the open source tool-calling platform that hooks up 600+ tools into any agentic IDE or custom AI agent through direct function calling or a unified MCP server. The birthplace of VibeOps.

Python 4,670 453 Updated Sep 24, 2025

RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry

Python 4,276 360 Updated Sep 1, 2025

Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

Python 3,754 260 Updated May 17, 2025

[NeurIPS2025] "AI-Researcher: Autonomous Scientific Innovation" -- A production-ready version: https://novix.science/chat

Python 3,523 404 Updated Oct 16, 2025

The most accurate document search and store for building AI apps

Python 3,352 276 Updated Nov 7, 2025

Run all your local AI together in one package - Ollama, Supabase, n8n, Open WebUI, and more!

Python 3,214 1,306 Updated Oct 27, 2025

Improved file parsing for LLM’s

Python 3,127 138 Updated Nov 13, 2024

⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)

Python 3,114 266 Updated Nov 6, 2025

A system for agentic LLM-powered data processing and ETL

Python 3,034 318 Updated Nov 7, 2025

Run Claude Code on OpenAI models

Python 2,436 332 Updated Aug 22, 2025

OCRFlux is a lightweight yet powerful multimodal toolkit that significantly advances PDF-to-Markdown conversion, excelling in complex layout handling, complicated table parsing and cross-page conte…

Python 2,371 146 Updated Aug 4, 2025

The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.

Python 2,289 210 Updated Nov 1, 2025

Python library for Agentic Document Extraction from LandingAI

Python 2,146 228 Updated Oct 22, 2025

PipesHub is a fully extensible and explainable workplace AI platform for enterprise search and workflow automation

Python 1,960 290 Updated Nov 7, 2025

An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)

Python 1,795 134 Updated Aug 25, 2025

UltraRAG 2.0: Less Code, Lower Barrier, Faster Deployment! MCP-based low-code RAG framework, enabling researchers to build complex pipelines to creative innovation.

Python 1,791 153 Updated Nov 7, 2025

Efficient Retrieval Augmentation and Generation Framework

Python 1,742 164 Updated Jan 9, 2025

DeepAnalyze is the first agentic LLM for autonomous data science.

Python 1,666 207 Updated Nov 5, 2025

"MiniRAG: Making RAG Simpler with Small and Open-Sourced Language Models"

Python 1,537 206 Updated Oct 16, 2025

DATAGEN: AI-driven multi-agent research assistant automating hypothesis generation, data analysis, and report writing. Now expanding into crypto market intelligence. Learn more: https://datagen.dig…

Python 1,498 211 Updated Oct 31, 2025

📄 🤖 AI for medical and scientific papers

Python 1,483 114 Updated Jul 9, 2025

MAESTRO is an AI-powered research application designed to streamline complex research tasks.

Python 1,341 123 Updated Oct 12, 2025

OpenAgents - AI Agent Networks for Open Collaboration

Python 1,117 144 Updated Nov 7, 2025

On-premises conversational RAG with configurable containers

Python 1,024 95 Updated Aug 13, 2025

Access OpenAI models programmatically through your ChatGPT subscription.

Python 988 132 Updated Oct 20, 2025

A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The service allows for the segmentation and classification of differen…

Python 731 90 Updated Oct 21, 2025

Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON)

Python 449 88 Updated Apr 11, 2024

Python PDF parser for scientific publications: content and figures

Python 439 68 Updated Mar 21, 2024

LLM Chain querying a scientific Zotero library, with citations

Python 437 8 Updated Aug 4, 2023