- Hangzhou China
- http://wuchong.me
- @jarkwu
- in/jarkwu
Lists (1)
Sort Name ascending (A-Z)
Stars
email for agents. Built for AI agents that need to send, receive, and understand emails programmatically
Video processing (webcam) in real time using Kafka and Spark.
Logstash output plugin for pulsar
Apache Fluss is a streaming storage built for real-time analytics.
Playground for Flink Table Store with use cases and performance features
Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, a…
Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.
A Data Streaming Library for Efficient Neural Network Training
.asf.yaml documentation and schema
A lightweight data processing framework built on DuckDB and 3FS.
A cloud native embedded storage engine built on object storage.
Apache Polaris, the interoperable, open source catalog for Apache Iceberg
A modern, lambda-friendly, 120 character Java formatter.
AutoMQ is a diskless Kafka® on S3. 10x Cost-Effective. No Cross-AZ Traffic Cost. Autoscale in seconds. Single-digit ms latency. Multi-AZ Availability.
1️⃣🐝🏎️ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java
Restate is the platform for building resilient applications that tolerate all infrastructure faults w/o the need for a PhD.
📚 Tech blogs & talks by companies that run Apache Flink in production
Facebook's branch of the Oracle MySQL database. This includes MyRocks.
A RocksDB compliant high performance scalable embedded key-value store
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
A Cloud Native Batch System (Project under CNCF)