Stars
Scalable and efficient data transformation framework - backwards compatible with dbt.
Event streaming platform for agentic AI. Continuously ingest, transform, and serve event streams in real time, at scale.
Kilo is the all-in-one agentic engineering platform. Build, ship, and iterate faster with the most popular open source coding agent.
Type less, code more: Cody is an AI code assistant that uses advanced search and codebase context to help you write and fix code.
DuckLake is an integrated data lake and catalog format
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
DataGrip IDE — JetBrains SQL environment for PostgreSQL, MySQL, and Oracle databases with smart code completion and version control.
Apache Polaris, the interoperable, open source catalog for Apache Iceberg
Experimenting with scripting an architecture for testing purposes
Symphony turns project work into isolated, autonomous implementation runs, allowing teams to manage work instead of supervising coding agents.
A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy's observations on LLM coding pitfalls.
Skybridge is a full-stack TypeScript framework for MCP Apps and ChatGPT Apps. Type-safe. React-powered. Platform-agnostic.
A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. Stored as pure Python. All in a modern, AI-native editor.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Composable AI Reference Architectures (CAIRA)
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
A deployment of a secure, extensible and integrated environment for running AI Foundry workloads in Production. It simplifies the process of including essential Azure services necessary to run miss…
Get your documents ready for gen AI
Local, open-source AI app builder for power users ✨ v0 / Lovable / Replit / Bolt alternative 🌟 Star if you like it!
Sparsify transformers with SAEs and transcoders
Data engineering with dbt, published by Packt
Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.
The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data 📊
A Python to Typescript Interface Generator
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Virtual whiteboard for sketching diagrams with a hand-drawn look
Free, simple, and intuitive online database diagram editor and SQL generator.