Apache Spark - A unified analytics engine for large-scale data processing

Scala 43,027 29,133 Updated Mar 24, 2026

Go configuration with fangs

Go 30,169 2,097 Updated Jan 12, 2026

Oh my tmux! My self-contained, pretty & versatile tmux configuration made with 💛🩷💙🖤❤️🤍

Shell 24,535 3,551 Updated Feb 21, 2026

Mirror of the official PostgreSQL GIT repository. Note that this is just a *mirror* - we don't work with pull requests on github. To contribute, please see https://wiki.postgresql.org/wiki/Submitti…

C 20,371 5,518 Updated Mar 24, 2026

Stack trace visualizer

Perl 19,378 2,091 Updated Oct 20, 2024

A platform to build and run apps that are elastic, agile, and resilient. SDK, libraries, and hosted environments.

Scala 13,269 3,567 Updated Mar 23, 2026

Java Native Access

Java 8,906 1,683 Updated Jan 1, 2026

StarCraft II Learning Environment

Python 8,264 1,164 Updated Jul 23, 2024

A Flexible and Powerful Parameter Server for large-scale machine learning

Java 6,785 1,591 Updated Oct 13, 2025

Notes talking about the design and implementation of Apache Spark

5,363 1,831 Updated Apr 2, 2024

A small utility to modify the dynamic linker and RPATH of ELF executables

C 4,171 522 Updated Dec 15, 2025

A high performance and generic framework for distributed DNN training

Python 3,715 494 Updated Oct 3, 2023

LinDB is a scalable, high performance, high availability distributed time series database.

Go 3,058 281 Updated Mar 6, 2026

Stream summarizer and cardinality estimator.

Java 2,266 556 Updated Nov 28, 2019

A new data structure for accurate on-line accumulation of rank-based statistics such as quantiles and trimmed means

Java 2,147 229 Updated Feb 17, 2025

Deep Learning Pipelines for Apache Spark

Python 1,993 492 Updated Mar 30, 2023

An analysis of the principles behind Spark ML algorithms, with a walkthrough of their source-code implementations

1,960 821 Updated Mar 25, 2019

Tonbo is an embedded database for serverless and edge runtimes.

Rust 1,508 97 Updated Mar 24, 2026

Extracts method sampling data from a Java Flight Recorder dump and converts it to a FlameGraph-compatible format.

Java 269 63 Updated Oct 25, 2023

An end-to-end machine learning and data mining framework on Hadoop

Java 256 111 Updated May 13, 2024

Type-safe data migration tool for Slick, Git and beyond.

Scala 190 32 Updated Aug 19, 2024

Classical RecSys algorithms implemented by using TensorFlow Estimators

Python 184 72 Updated Nov 1, 2018

Scalable NameNode RPC Proxy for HDFS Federation

Java 87 16 Updated Apr 19, 2016

An iterative computing framework for both Hadoop MapReduce and Hadoop YARN.

Java 75 41 Updated May 20, 2022

Minimalistic dark Vim color schemes

Vim Script 22 2 Updated Aug 14, 2025

Algorithm implementation for filling a polygon

3 Updated Oct 9, 2013

An end-to-end machine learning and data mining framework on Hadoop

Java 1 Updated Apr 1, 2021

Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data"

Java 1 Updated Dec 22, 2014

Aims to create distributed inverted indexes of an English Wikipedia dump using Hadoop.

Java 1 Updated Dec 28, 2013
PHP 1 Updated Dec 13, 2013