Skip to content
View subratrout's full-sized avatar

Block or report subratrout

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Pyspark RDD, DataFrame and Dataset Examples in Python language

Python 1,316 956 Updated Mar 28, 2024
Python 194 134 Updated Nov 3, 2025

Roadmap to becoming a data engineer in 2021

12,712 1,362 Updated Jan 25, 2022

A list of useful resources to learn Data Engineering from scratch

3,905 560 Updated Jun 19, 2024

Data Engineering Practice Problems

Python 2,381 745 Updated Jan 8, 2025

The best place to learn data engineering. Built and maintained by the data engineering community.

CSS 1,811 221 Updated Sep 30, 2025

Implementing best practices for PySpark ETL jobs and applications.

Python 2,016 775 Updated Jan 1, 2023

An Awesome List of Open-Source Data Engineering Projects

2,858 502 Updated Oct 4, 2024

datacamp Data Engineer with Python course. 73 hours/ 19 Courses /2 Skill Assessments

Python 134 44 Updated Nov 29, 2022

This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science topics.

136 29 Updated Jan 18, 2025

Data Engineering Project with Hadoop HDFS and Kafka

Python 118 29 Updated Nov 4, 2023

A curated list of data engineering tools for software developers

7,975 1,394 Updated Oct 31, 2025

💎 A curated list of awesome Competitive Programming, Algorithm and Data Structure resources

13,544 2,611 Updated Dec 8, 2024

AWS Glue code samples

Python 1,522 834 Updated Nov 5, 2025

More than 2000+ Data engineer interview questions.

1,453 506 Updated Sep 2, 2025

This is a repo with links to everything you'd ever want to learn about data engineering

Jupyter Notebook 38,479 7,396 Updated Oct 29, 2025

Advanced Python Mastery (course by @dabeaz)

Python 12,221 2,062 Updated Oct 22, 2025

An intuitive spreadsheet-like interface that lets users of all technical skill levels view, edit, query, and collaborate on Postgres data directly—100% open source and self hosted, with native Post…

Svelte 4,604 410 Updated Nov 5, 2025

Animation engine for explanatory math videos

Python 81,634 6,922 Updated Oct 20, 2025

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

28,487 3,823 Updated Jul 18, 2024

A complete daily plan for studying to become a machine learning engineer.

28,636 6,217 Updated Jun 11, 2024

Auto-generate models, views, controllers, and routes in a Rails app based on database structure

Ruby 371 9 Updated Aug 20, 2025

Rails app used in book 📚 "High Performance PostgreSQL for Rails"

Ruby 238 90 Updated Oct 10, 2025

📈 Download filings from the SEC EDGAR database using Python

Python 630 155 Updated Sep 8, 2025

Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.

77,752 8,439 Updated Apr 4, 2025

Replace Splunk in your small company with this one weird trick!

Python 412 35 Updated Feb 27, 2025

Resolve production issues, fast. An open source observability platform unifying session replays, logs, metrics, traces and errors powered by Clickhouse and OpenTelemetry.

TypeScript 9,026 323 Updated Nov 5, 2025

Be great at emacs in one year

6,430 884 Updated Oct 17, 2022

An authentication system generator for Rails applications.

Ruby 1,835 63 Updated Dec 5, 2024

Neural Networks: Zero to Hero

Jupyter Notebook 18,409 2,548 Updated Aug 18, 2024
Next