Skip to content
View subratrout's full-sized avatar

Block or report subratrout

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Pyspark RDD, DataFrame and Dataset Examples in Python language

Python 1,316 956 Updated Mar 28, 2024
Python 194 135 Updated Nov 3, 2025

Roadmap to becoming a data engineer in 2021

12,714 1,363 Updated Jan 25, 2022

A list of useful resources to learn Data Engineering from scratch

3,906 561 Updated Jun 19, 2024

Data Engineering Practice Problems

Python 2,383 746 Updated Jan 8, 2025

The best place to learn data engineering. Built and maintained by the data engineering community.

CSS 1,810 222 Updated Sep 30, 2025

Implementing best practices for PySpark ETL jobs and applications.

Python 2,019 776 Updated Jan 1, 2023

An Awesome List of Open-Source Data Engineering Projects

2,862 503 Updated Oct 4, 2024

datacamp Data Engineer with Python course. 73 hours/ 19 Courses /2 Skill Assessments

Python 134 44 Updated Nov 29, 2022

This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science topics.

136 29 Updated Jan 18, 2025

Data Engineering Project with Hadoop HDFS and Kafka

Python 118 29 Updated Nov 4, 2023

A curated list of data engineering tools for software developers

7,983 1,394 Updated Oct 31, 2025

💎 A curated list of awesome Competitive Programming, Algorithm and Data Structure resources

13,546 2,611 Updated Dec 8, 2024

AWS Glue code samples

Python 1,522 834 Updated Nov 5, 2025

More than 2000+ Data engineer interview questions.

1,453 507 Updated Sep 2, 2025

This is a repo with links to everything you'd ever want to learn about data engineering

Jupyter Notebook 38,499 7,405 Updated Oct 29, 2025

Advanced Python Mastery (course by @dabeaz)

Python 12,226 2,062 Updated Oct 22, 2025

An intuitive spreadsheet-like interface that lets users of all technical skill levels view, edit, query, and collaborate on Postgres data directly—100% open source and self hosted, with native Post…

Svelte 4,606 410 Updated Nov 6, 2025

Animation engine for explanatory math videos

Python 81,665 6,922 Updated Oct 20, 2025

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

28,491 3,823 Updated Jul 18, 2024

A complete daily plan for studying to become a machine learning engineer.

28,636 6,216 Updated Jun 11, 2024

Auto-generate models, views, controllers, and routes in a Rails app based on database structure

Ruby 371 9 Updated Aug 20, 2025

Rails app used in book 📚 "High Performance PostgreSQL for Rails"

Ruby 238 91 Updated Oct 10, 2025

📈 Download filings from the SEC EDGAR database using Python

Python 632 155 Updated Sep 8, 2025

Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.

77,771 8,441 Updated Apr 4, 2025

Replace Splunk in your small company with this one weird trick!

Python 412 35 Updated Feb 27, 2025

Resolve production issues, fast. An open source observability platform unifying session replays, logs, metrics, traces and errors powered by Clickhouse and OpenTelemetry.

TypeScript 9,031 323 Updated Nov 6, 2025

Be great at emacs in one year

6,430 884 Updated Oct 17, 2022

An authentication system generator for Rails applications.

Ruby 1,837 63 Updated Dec 5, 2024

Neural Networks: Zero to Hero

Jupyter Notebook 18,445 2,555 Updated Aug 18, 2024
Next