Stars
FastAPI framework, high performance, easy to learn, fast to code, ready for production
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
DSPy: The framework for programming—not prompting—language models
We have made you a wrapper you can't refuse
🤗 smolagents: a barebones library for agents that think in code.
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
🚀 The fast, Pythonic way to build MCP servers and clients
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
An open-source NLP research library, built on PyTorch.
Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
This is a python API which allows you to get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles and it does not require an API key nor a headles…
Proxy [Finder | Checker | Server]. HTTP(S) & SOCKS 🎭
Implementation of my RAG system that won all categories in Enterprise RAG Challenge 2
Another benchmark for some python frameworks
Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).
Rule-based token, sentence segmentation for Russian language