Stars
Design patterns implemented in Java
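😮 Core Interview Questions & Answers For Experienced Java (Backend) Developers | A complete primer on advanced knowledge for Internet Java engineers, covering high concurrency, distributed systems, high availability, microservices, massive data processing, and related areas
A powerful enhanced toolkit for MyBatis that simplifies development
Apache Doris is an easy-to-use, high-performance, unified analytics database.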
Apache Pulsar - distributed pub-sub messaging system
Apache DolphinScheduler is a modern data orchestration platform for agile, low-code creation of high-performance workflows.
SeaTunnel is a multimodal, high-performance, distributed, massive data integration tool.
Algorithms, 4th edition textbook code and libraries
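A visual web UI integrated with DataX: select a data source to generate a data synchronization task in one click. Supports RDBMS, Hive, HBase, ClickHouse, MongoDB, and other data sources; batch creation of RDBMS synchronization tasks; integration with an open-source scheduling system; distributed execution; incremental data synchronization; real-time viewing of run logs; monitoring of executor resources; killing running processes; encryption of data source information; and more.
🚁🚀 A real-time product recommendation system built on Flink. Flink computes product popularity and stores it in a Redis cache, analyzes log data, and writes profile tags and real-time records to HBase. When a user issues a recommendation request, the system re-ranks the popularity list according to the user profile, uses the collaborative-filtering and tag-based recommendation modules to attach related products to each product in the newly generated list, and finally returns the new list to the user.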
Maxwell's daemon, a MySQL-to-JSON Kafka producer
A tiny IoC container modeled on Spring.
Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.
An AI-driven, distributed, high-performance monitoring system for comprehensive monitoring and management of Kafka clusters.
An extension of real-time SQL built on open-source Flink; it mainly implements joins between streams and dimension tables, and supports all native Flink SQL syntax.
A Flexible, Fast, Federated (3F) SQL Analysis Middleware for Multiple Data Sources
NettyRPC is a high-performance Java RPC server based on Netty, using Kryo, Hessian, and Protostuff for message serialization.
MapReduce, Spark, Java, and Scala for Data Algorithms Book
https://blog.csdn.net/QXC1281/article/details/89070285