Skip to content
View guoyuepeng's full-sized avatar

Block or report guoyuepeng

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

SPARK

20 repositories

Capture the logical plan from Spark (SQL)

Scala 22 9 Updated Mar 6, 2021

This project is used for tracking lineage when using spark. Our team is aimed at enhancing the ability of column relation during logical plan analysis.

Scala 20 10 Updated Jan 7, 2022

挖坑与填坑

GCC Machine Description 688 272 Updated Aug 18, 2016

Notes talking about the design and implementation of Apache Spark

5,348 1,837 Updated Apr 2, 2024

English SDK for Apache Spark

Python 879 136 Updated Jun 12, 2024

Qubole Sparklens tool for performance tuning Apache Spark

Scala 586 143 Updated Jun 26, 2024

Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark

Java 1,367 848 Updated Aug 22, 2023

Spark Knowledge Base

333 135 Updated Oct 1, 2020

Spark reference applications

Scala 652 338 Updated Oct 3, 2024

Development in Shark has been ended.

Scala 994 324 Updated Aug 11, 2015

SQL parser written using Scala's parser combinator library

Scala 103 53 Updated Mar 20, 2016

A Macro library for working with Spark SQL in a typesafe way.

Scala 10 2 Updated Nov 5, 2014

Apache Spark docker image

Shell 2,061 704 Updated Apr 21, 2023

Extensible Rules Engine for custom Dataframe / Dataset validation

Scala 137 31 Updated May 7, 2024

Official Dockerfile for Apache Spark

Dockerfile 160 52 Updated Dec 18, 2025

Bitnami container images

Shell 4,308 6,577 Updated Dec 21, 2025

Compass is a task diagnosis platform for bigdata

Java 403 150 Updated Nov 23, 2024

Examples for High Performance Spark

Scala 525 240 Updated Nov 28, 2025

Magic to help Spark pipelines upgrade

Python 34 18 Updated Sep 29, 2024