Stars
💫 Toolkit to help you get started with Spec-Driven Development
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query processing
freeCodeCamp.org's open-source codebase and curriculum. Learn math, programming, and computer science for free.
Kafka Docker for development. Kafka, Zookeeper, Schema Registry, Kafka-Connect, , 20+ connectors
Confluent Schema Registry for Kafka
Efficient reliable UDP unicast, UDP multicast, and IPC message transport
😱 从源码层面,剖析挖掘互联网行业主流技术的底层实现原理,为广大开发者 “提升技术深度” 提供便利。目前开放 Spring 全家桶,Mybatis、Netty、Dubbo 框架,及 Redis、Tomcat 中间件等
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualiz…
Using trino to sync data from sharding jdbc table
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …
Apache Doris is an easy-to-use, high performance and unified analytics database.
ChatGPT 中文指南🔥,ChatGPT 中文调教指南,指令指南,应用开发指南,精选资源清单,更好的使用 chatGPT 让你的生产力 up up up! 🚀
An elegant lightweight and efficient SQL Query Builder with fluid interface SQL syntax supporting bindings and complicated query generation.
Flink CDC is a streaming data integration tool
SQL optimizer and rewriter(assisted SQL tuning). - SQL 优化器和重写器(辅助 SQL 调优)。
A Spark plugin for reading and writing Excel files
Taier is a big data development platform for submission, scheduling, operation and maintenance, and indicator information display
Platform to build admin panels, internal tools, and dashboards. Integrates with 25+ databases and any API.
Pentaho Data Integration ( ETL ) a.k.a Kettle