Stars
Distributed DuckDB - dual execution and differential storage
A fast, helpful, and open-source document parser
PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.
Visualize and share your data. All in SQL. Powered by DuckDB.
AI coding assistant skill (Claude Code, Codex, OpenCode, Cursor, Gemini CLI, GitHub Copilot CLI, OpenClaw, Factory Droid, Trae, Google Antigravity). Turn any folder of code, docs, papers, images, o…
AI-powered job search system built on Claude Code. 14 skill modes, Go dashboard, PDF generation, batch processing.
Entire CLI hooks into your Git workflow to capture AI agent sessions as you work. Sessions are indexed alongside commits, creating a searchable record of how code was written in your repo.
Data transformation framework for AI. Ultra performant, with incremental processing. 🌟 Star if you like it!
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
MCP Toolbox for Databases is an open source MCP server for databases.
Join Discord: https://discord.gg/5TUQKqFWd / claw-code Rust port parity work - it is temporary work while claw-code repo is doing migration
The repo is finally unlocked. enjoy the party! The fastest repo in history to surpass 100K stars ⭐. Join Discord: https://discord.gg/5TUQKqFWd Built in Rust using oh-my-codex.
Visual testing tool for MCP servers
A extension for DuckDB, which captures lineage events for executed queries
csnayak / Genesis
Forked from xeeva/GenesisBootstrap fully-equipped Claude Code projects in under two minutes. Agents, skills, hooks, memory, MCP configs — all scaffolded from a single conversation.
Bootstrap fully-equipped Claude Code projects in under two minutes. Agents, skills, hooks, memory, MCP configs — all scaffolded from a single conversation.
LLM Finetuning with peft
LinkLite - Scalable URL Shortening Service (FastAPI, PostgreSQL, Redis)
A framework-agnostic, git-native standard for defining AI agents
High-performance, allocation-free text scanning and Arrow ingestion engine for Go.
Create normal and sensitive variables across multiple workspaces using a simple YAML configuration (yes—YAML, who doesn’t like it?) 😃
Demystify RAG by building it from scratch. Local LLMs, no black boxes - real understanding of embeddings, vector search, retrieval, and context-augmented generation.
Open-source text-to-SQL and text-to-chart GenBI agent with a semantic layer. Ask your database questions in natural language — get accurate SQL, charts, and BI insights. Supports 12+ data sources (…
Apache Iggy: Hyper-Efficient Message Streaming at Laser Speed