- Hangzhou
Stars
A better compressed bitset in Java: used by Apache Spark, Netflix Atlas, Apache Pinot, Tablesaw, and many others
Kafka GUI for Apache Kafka to manage topics, topics data, consumers group, schema registry, connect and more...
Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.
Examples demonstrating usage of Spring AI & Spring AI Alibaba 📜
Do not send pull requests! Automated Git clone of various OpenJDK branches
Apache Fluss is a streaming storage built for real-time analytics.
BitSail is a distributed high-performance data integration engine which supports batch, streaming and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data ever…
The next generation of cloud-native big data management expert , Aims to help users rapidly build stable, efficient, and scalable cloud-native platforms for big data.
LangChat: Java LLMs/AI Project, Supports Multi AI Providers( Gitee AI/ 智谱清言 / 阿里通义 / 百度千帆 / DeepSeek / 抖音豆包 / 零一万物 / 讯飞星火 / OpenAI / Gemini / Ollama / Azure / Claude 等大模型), Java生态下AI大模型产品解决方案,快速构建企…
Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.
Apache Amoro(incubating) is a Lakehouse management system built on open data lake formats.
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
Apache Ranger - To enable, monitor and manage comprehensive data security across the Hadoop platform and beyond
Tencent Kona is a no-cost, production-ready distribution of the Open Java Development Kit (OpenJDK), Long-term support(LTS) with quarterly updates. Tencent Kona serves as the default JDK internally…
Know your data better!Datavines is Next-gen Data Observability Platform, support metadata manage and data quality.
A simple Java library for interacting with Ollama server.
Uniffle is a high performance, general purpose Remote Shuffle Service.
Benchmarks for queries over continuous data streams.
Flink Agents is an Agentic AI framework based on Apache Flink
HoloInsight is a cloud-native observability platform with a special focus on real-time log analysis and AI integration.
分享一些在工作中的大数据实战案例,包括flink、kafka、hadoop、presto等等。欢迎大家关注我的公众号【Hello大数据】,一起成长。