- Madagascar
- https://marolai.github.io/
- @Massa_Be
Stars
A feature-rich command-line audio/video downloader
🦜🔗 The platform for reliable agents.
Robust Speech Recognition via Large-Scale Weak Supervision
scikit-learn: machine learning in Python
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
A high-throughput and memory-efficient inference and serving engine for LLMs
Clone a voice in 5 seconds to generate arbitrary speech in real-time
A collection of learning resources for curious software engineers
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
Convert PDF to markdown + JSON quickly with high accuracy
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Data Apps & Dashboards for Python. No JavaScript Required.
Build resilient language agents as graphs.
SGLang is a fast serving framework for large language models and vision language models.
Faster Whisper transcription with CTranslate2
OCR, layout analysis, reading order, table recognition in 90+ languages
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
Convert Machine Learning Code Between Frameworks
PaddleFormers is an easy-to-use library of pre-trained large language model zoo based on PaddlePaddle.
Python APIs for web automation, testing, and bypassing bot-detection with ease.