Topic Modelling for Humans
Updated Nov 1, 2025 - Python
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
Apache Lucene and Solr open-source search software
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.
Apache Lucene open-source search software
Weaviate is an open-source vector database that stores both objects and vectors, combining vector search with structured filtering while offering the fault tolerance and scalability of a cloud-native database.
Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise-grade Platform product for production-grade workflows, partitioning, enrichments, chunking and embedding.
Retrieval and Retrieval-augmented LLMs
Apache Solr open-source search software
💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows
Telegram group scraper tool. Fetches all information about group members.
Anserini is a Lucene toolkit for reproducible information retrieval research
MTEB: Massive Text Embedding Benchmark
Track any IP address with IP-Tracer. IP-Tracer is developed for Linux and Termux. You can retrieve information about any IP address using IP-Tracer.
Learning to Rank in TensorFlow
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
Fetches system/theme information in terminal for Linux desktop screenshots.
Deep neural network to extract intelligent information from invoice documents.