Stars
A drop-in Apache Spark replacement written in Rust, unifying batch processing, stream processing, and compute-intensive AI workloads.
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…
This package provides materializations for creating and managing BigQuery-specific resources.
CLI to help users refactor dbt projects by automatically fixing deprecations
Apache Doris is an easy-to-use, high performance and unified analytics database.
A lightweight Python-based tool for extracting and analyzing column lineage for dbt projects
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …
An MCP (Model Context Protocol) server for interacting with dbt.
LLM Zoomcamp - a free online course about real-life applications of LLMs. In 10 weeks you will learn how to build an AI system that answers questions about your knowledge base.
A games launcher for GOG, Amazon and Epic Games for Linux, Windows and macOS.
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
Compatibility tool for Steam Play based on Wine and additional components
Open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for designing complex, interactive environments where agents can act,…
A CLI to convert SQL models across database dialects in your dbt projects.
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
Microservices Starter Project - Spring Boot, Java 11, Log4j2, TestContainer, JUnit5, Code Coverage, Checkstyle, Kotlin DSL, Postgres, Vault Secrets, Gatling Load Testing, Sonar
OLake - Fastest Databases, Kafka & S3 Replication to Apache Iceberg with Table optimization (Called OLake Fusion). ⚡ Efficient, quick and scalable data ingestion for real-time analytics. Supported …
Android application for running Windows applications with Wine and Box86/Box64
A framework for managing and maintaining multi-language pre-commit hooks.
MessagePack serialization library for Python derived from orjson, written in Rust using PyO3
Distributed Task Queue (development branch)
Binary releases of VS Code without MS branding/telemetry/licensing
Online Karaoke game with pitch detection in your browser
Nginx proxy server in a Docker container to authenticate and proxy requests to Ollama from the public internet via Cloudflare Tunnel