Open-source libraries and APIs to build custom preprocessing pipelines
Parse files for optimal RAG
Central interface to connect your LLMs with external data
Framework that is dedicated to making neural data processing
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine
Extract schema, statistics and entities from datasets
A modular graph-based Retrieval-Augmented Generation (RAG) system
AI-data warehouse to enrich, transform and analyze unstructured data
Airweave lets agents search any app
Training data (data labeling, annotation, workflow) for all data types
Superlinked is a Python framework for AI Engineers
A Python library for extracting structured information
Deterministic LLM Outputs for AI Applications and AI Agents
Synthetic data generators for structured and unstructured text
A web privacy measurement framework
The data structure for multimodal data
Open-source choice to scale, assess and maintain natural language data
Lightweight library for scraping websites with LLMs
Obsei is a low-code, AI-powered automation tool
Python module for parsing semi-structured text into Python tables
Handles all kinds of unstructured data, for tasks such as reverse image search
Making Enterprise Data Intelligent and Responsive for AI
Integrating LLMs into structured NLP pipelines
An open-source toolkit for monitoring Large Language Models (LLMs)
Extensible AGI Framework