Amazon Web Services (AWS)
- Tokyo
- @_Bassari
- in/hsotaro
- https://bering.hatenadiary.com/
Stars
Collection of code examples for Amazon Managed Service for Apache Flink
This toolkit provides a quickstart for working with OpenSearch and ML models, especially LLMs for vector embeddings to power semantic and semantic sparse search.
Apache Druid: a high performance real-time analytics database.
Olympia is a storage-only open catalog format for big data analytics, ML & AI.
Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, a…
⚡ Fastest SQL ETL pipeline in a single C++ binary, built for stream processing, observability, analytics and AI/ML
Spark Accelerator framework. It enables secondary indices to remote data stores.
DuckDB-powered data lake analytics from Postgres
Lakekeeper is an Apache-Licensed, secure, fast, and easy-to-use Apache Iceberg REST Catalog written in Rust.
Apache Polaris, the interoperable, open source catalog for Apache Iceberg
Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.
Repository for the book "Crafting Interpreters"
Specification for storing geospatial vector data (point, line, polygon) in Parquet
Fancy stream processing made operationally mundane
Awaitility is a small Java DSL for synchronizing asynchronous operations