Skip to content
View MiguelPeralvo's full-sized avatar

Highlights

  • Pro

Block or report MiguelPeralvo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
37 stars written in Java
Clear filter

Google core libraries for Java

Java 51,242 11,086 Updated Nov 6, 2025

Apache Flink

Java 25,449 13,755 Updated Nov 6, 2025

High Performance Inter-Thread Messaging Library

Java 18,073 3,958 Updated Apr 2, 2025

Graphs for Everyone

Java 15,339 2,530 Updated Nov 5, 2025

Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running mat…

Java 14,138 3,858 Updated Oct 26, 2025

Alluxio, data orchestration for analytics and machine learning in the cloud

Java 7,099 2,954 Updated Apr 29, 2025

Tutorials for using RabbitMQ in various ways

Java 6,825 3,576 Updated Nov 4, 2025

A Flexible and Powerful Parameter Server for large-scale machine learning

Java 6,781 1,589 Updated Oct 13, 2025

Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.

Java 6,576 2,832 Updated Nov 3, 2025

OrientDB is the most versatile DBMS supporting Graph, Document, Reactive, Full-Text and Geospatial models in one Multi-Model product. OrientDB can run distributed (Multi-Master), supports SQL, ACID…

Java 4,899 875 Updated Nov 4, 2025

CrateDB is a distributed and scalable SQL database for storing and analyzing massive amounts of data in near real-time, even with complex queries. It is PostgreSQL-compatible, and based on Lucene.

Java 4,319 581 Updated Nov 6, 2025

Example code from Learning Spark book

Java 3,895 2,422 Updated Jul 12, 2025

Please visit https://github.com/h2oai/h2o-3 for latest H2O

Java 2,259 556 Updated Oct 24, 2024

A cluster computing framework for processing large-scale geospatial data

Java 2,230 739 Updated Nov 6, 2025

Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning

Java 1,784 404 Updated Aug 16, 2021

Open Source ML Model Versioning, Metadata, and Experiment Management

Java 1,740 288 Updated Jul 23, 2024

Maven plugin which includes build-time git repository information into an POJO / *.properties). Make your apps tell you which version exactly they were built from! Priceless in large distributed de…

Java 1,693 304 Updated Oct 27, 2025

MapReduce, Spark, Java, and Scala for Data Algorithms Book

Java 1,081 659 Updated Oct 14, 2024

An open source ML system for the end-to-end data science lifecycle

Java 1,068 501 Updated Nov 1, 2025

Client library for Amazon Kinesis

Java 654 480 Updated Nov 5, 2025

Snippets and small examples demonstrating kafka features and configs

Java 652 384 Updated Jul 1, 2022

Building Microservices with Spring Boot

Java 640 526 Updated Oct 13, 2022

AWS libraries/modules for working with Kinesis aggregated record data

Java 377 147 Updated Oct 4, 2024

Source code for Big Data: Principles and best practices of scalable realtime data systems

Java 332 164 Updated Jun 8, 2024

Code From Learning Akka

Java 261 164 Updated Oct 20, 2017

Code repository for O'Reilly Hadoop Application Architectures book

Java 163 100 Updated May 26, 2015

Hadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive

Java 140 48 Updated Sep 29, 2025

A practical Storm Trident tutorial

Java 122 56 Updated Dec 16, 2023

Supporting material (code, schemas etc) for Unified Log Processing (Manning Publications)

Java 98 25 Updated Jul 22, 2022
Next