- Quebec City, Canada
- http://themlbook.com
Stars
High performance and CommonMark compliant HTML to Markdown converter. Maintained by the Kreuzberg team. Kreuzberg is a fast, polyglot document intelligence engine with a Rust core. It extracts stru…
Benchmarks of approximate nearest neighbor libraries in Python
⚡ TabPFN: Foundation Model for Tabular Data ⚡
A Simplified PyTorch Version of the Dreamer Algorithm
A completely customizable framework for building rich text editors. (Currently in beta.)
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
Collect posts from the Bluesky firehose and save them to a JSONL file
🔥 The open-source no-code platform for web scraping, crawling, search and AI data extraction • Turn websites into structured APIs in minutes 🔥
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
🔥 The Web Data API for AI - Power AI agents with clean web data
Real-time face swap and one-click video deepfake with only a single image
A Bulletproof Way to Generate Structured JSON from Language Models
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
An implementation of Shazam's song recognition algorithm.
A vector search SQLite extension that runs anywhere!
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Apps Script samples for Google Workspace products.
Use Large Language Models (LLM) in Google Sheets
🔥 Highlighting the top ML papers every week.
Data validation using Python type hints
Query Engine for AI Analytics: Build self-reasoning agents across all your live data
📰 Newspaper4k, a fork of the beloved Newspaper3k. Extraction of articles, titles, and metadata from news websites.
⚡ FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)