Stars
🔊 High-precision web player for multi-device audio playback and spatial audio.
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. Each technique has a detailed notebook tutorial.
Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website …
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Pyvigate: A Python framework that combines headless browsing with LLMs to assist you with data solutions, product tours, building RAG applications, web automation, functional testing, and man…
1️⃣🐝🏎️ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java
Paisa – Personal Finance Manager. https://paisa.fyi demo: https://demo.paisa.fyi
TypeChat is a library that makes it easy to build natural language interfaces using types.
Chat language model that can use tools and interpret the results
Run any open-source LLM, such as DeepSeek or Llama, as an OpenAI-compatible API endpoint in the cloud.
OpenAI ChatGPT, GPT-5, GPT-Image-1, Whisper API clients for Go
An AMQP 0-9-1 Go client maintained by the RabbitMQ team. Originally by @streadway: `streadway/amqp`
A Bulletproof Way to Generate Structured JSON from Language Models
Structured and typehinted GPT responses in Python
Django-ORM-Standalone Template - Use the power of Django's database functionality in regular Python scripts.
One Dark theme for JetBrains.
This project uses basic concepts of data analysis (cleaning, exploration, and visualization) to describe simple data from the Google Fit application.
This tool converts JSON/SQL to a Go type definition.
Awesome illustrated guides or children's books on technical topics.
A Homebrew tap for QEMU with support for 3D-accelerated guests
Index your Gmail Inbox with Elasticsearch
Utilities for archiving JPEGs for long term storage.
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Electron desktop application replacement for G14ControlR3.
An ongoing list of pandas quirks
This is a guide to PySpark code style presenting common situations and the associated best practices based on the most frequent recurring topics across the PySpark repos we've encountered.