Stars
Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.
World's most advanced database DevSecOps solution for Developer, Security, DBA and Platform Engineering teams. The GitHub/GitLab for database DevSecOps.
The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Apache Fluss is a streaming storage built for real-time analytics.
【2025最新版】 大数据 数据分析 电商系统 实时数仓 离线数仓 数据湖 建设方案及实战代码,涉及组件 #flink #paimon #doris #seatunnel #dolphinscheduler #datart #dinky #hudi #iceberg。
A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset
This repository serves as a comprehensive guide to effective data modeling and robust data quality assurance using popular open-source tools
AI-driven database tool and SQL client, The hottest GUI client, supporting MySQL, Oracle, PostgreSQL, DB2, SQL Server, DB2, SQLite, H2, ClickHouse, and more.
The Lineage Analysis system for FlinkSQL supports advanced syntax such as Watermark, UDTF, CEP, Windowing TVFs, and CTAS.
A blazingly fast multi-language serialization framework powered by JIT and zero-copy.
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Support agile DataOps Based on Flink, DataX and Flink-CDC, Chunjun with Web-UI
一站式云原生实时流数据平台,通过0侵入、插件化构建企业级Kafka服务,极大降低操作、存储和管理实时流数据门槛
Make stream processing easier! Easy-to-use streaming application development framework and operation platform.
🌍 针对小白的算法训练 | 包括四部分:①.大厂面经 ②.力扣图解 ③.千本开源电子书 ④.百张技术思维导图(项目花了上百小时,希望可以点 star 支持,🌹感谢~)推荐免费ChatGPT使用网站
Essential Spark extensions and helper methods ✨😲
Repo for counting stars and contributing. Press F to pay respect to glorious developers.
flink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去…
A demo combining Kafka Streams and Drools to create a lightweight real-time rules engine.
Complex Event Processing on top of Kafka Streams
A Java utility is designed to FLATTEN nested JSON objects and even more to UNFLATTEN them back