- The University of Sydney
- Sydney
- https://www.sydney.edu.au/medicine-health/about/our-people/academic-staff/troy-cross.html
- https://orcid.org/0000-0003-2902-7787
Starred repositories
OpenMMLab Text Detection, Recognition and Understanding Toolbox
ACI.dev is the open source tool-calling platform that hooks up 600+ tools into any agentic IDE or custom AI agent through direct function calling or a unified MCP server. The birthplace of VibeOps.
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
[NeurIPS2025] "AI-Researcher: Autonomous Scientific Innovation" -- A production-ready version: https://novix.science/chat
The most accurate document search and store for building AI apps
Run all your local AI together in one package - Ollama, Supabase, n8n, Open WebUI, and more!
⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)
A system for agentic LLM-powered data processing and ETL
OCRFlux is a lightweight yet powerful multimodal toolkit that significantly advances PDF-to-Markdown conversion, excelling in complex layout handling, complicated table parsing and cross-page conte…
The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
Python library for Agentic Document Extraction from LandingAI
PipesHub is a fully extensible and explainable workplace AI platform for enterprise search and workflow automation
An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)
UltraRAG 2.0: Less Code, Lower Barrier, Faster Deployment! An MCP-based low-code RAG framework that enables researchers to build complex pipelines for creative innovation.
Efficient Retrieval Augmentation and Generation Framework
DeepAnalyze is the first agentic LLM for autonomous data science.
"MiniRAG: Making RAG Simpler with Small and Open-Sourced Language Models"
DATAGEN: AI-driven multi-agent research assistant automating hypothesis generation, data analysis, and report writing. Now expanding into crypto market intelligence. Learn more: https://datagen.dig…
MAESTRO is an AI-powered research application designed to streamline complex research tasks.
OpenAgents - AI Agent Networks for Open Collaboration
On-premises conversational RAG with configurable containers
Access OpenAI models programmatically through your ChatGPT subscription.
A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The service allows for the segmentation and classification of differen…
Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON)
Python PDF parser for scientific publications: content and figures
LLM Chain querying a scientific Zotero library, with citations