-
Databricks, Inc.
- San Francisco
Stars
An open protocol for secure data sharing
A native Rust library for Delta Lake, with bindings into Python
The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while control…
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter
Apache Spark - A unified analytics engine for large-scale data processing
Scala 2 compiler and standard library. Scala 2 bugs at https://github.com/scala/bug; Scala 3 at https://github.com/scala/scala3
Guice (pronounced 'juice') is a lightweight dependency injection framework for Java 11 and above, brought to you by Google.
Protocol Buffers - Google's data interchange format
The official home of the Presto distributed SQL query engine for big data
The Official Couchbase Spark Connector
Log analyser / visualiser for Java HotSpot JIT compiler. Inspect inlining decisions, hot methods, bytecode, and assembly. View results in the JavaFX user interface.
Interactive and Reactive Data Science using Scala and Spark.
A curated list of awesome big data frameworks, ressources and other awesomeness.
RxScala – Reactive Extensions for Scala – a library for composing asynchronous and event-based programs using observable sequences
Reactive Streams Specification for the JVM
An advanced, composable, functional reactive model-view-viewmodel framework for all .NET platforms that is inspired by functional reactive programming. ReactiveUI allows you to abstract mutable sta…
Streaming MapReduce with Scalding and Storm
A chrome extension for editing custom request(GET or POST) to web server.
A fault tolerant, protocol-agnostic RPC system