Skip to content
View divithraju's full-sized avatar

Block or report divithraju

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 1 Updated Jan 5, 2026
Python 1 Updated Jan 5, 2026

I built an AI‑powered Data Analytics SaaS that replaces core responsibilities of a Data Analyst. It uses open‑source LLMs to interpret natural language queries and automatically perform descriptive…

Python 1 Updated Jan 4, 2026

Large-Scale Data Pipeline Migration from Mainframe to Hadoop | Hadoop | Spark | Hive | Sqoop | Oozie | MySQL Migrated a legacy mainframe data warehouse to a modern Hadoop-based big data ecosystem, …

Python 1 Updated Jan 5, 2026
Python 1 Updated Aug 11, 2025

Implementing PostgresSQL best practices for Data Engineer

2 Updated Jan 5, 2026

Implementing best practices for PySpark ETL jobs and applications.

Python 1 1 Updated Sep 9, 2024

Pandas, Polars, and Spark DataFrame comparison for humans and more!

Python 1 Updated Oct 5, 2025

Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!

Python 638 157 Updated Apr 9, 2026

Implementing best practices for PySpark ETL jobs and applications.

Python 2,094 801 Updated Jan 1, 2023

A curated list of awesome Apache Spark packages and resources.

Python 1 Updated Sep 9, 2024

A curated list of awesome Apache Spark packages and resources.

Shell 1,870 344 Updated Feb 27, 2026

Cookbook to install Hadoop 2.0+ using Chef

Ruby 81 74 Updated Feb 16, 2023

The goal of this project is to build a docker cluster that gives access to Hadoop, HDFS, Hive, PySpark, Sqoop, Airflow, Kafka, Flume, Postgres, Cassandra, Hue, Zeppelin, Kadmin, Kafka Control Cente…

Shell 79 23 Updated Feb 27, 2023

PySpark-Tutorial provides basic algorithms using PySpark

Jupyter Notebook 1,277 473 Updated May 26, 2025

Pyspark RDD, DataFrame and Dataset Examples in Python language

Python 1 Updated Sep 8, 2024

Pyspark RDD, DataFrame and Dataset Examples in Python language

Python 1,350 978 Updated Dec 7, 2025
Python 1 Updated Jan 21, 2025
Next