Aspiring Data Engineer building production-ready data pipelines
I'm transitioning into Data Engineering with a systematic, depth-first approach. I believe in understanding the why before the how, and building systems that last rather than scripts that work once.
- ๐ฑ Currently mastering: SQL for Data Engineering
- ๐ญ Next: Python ETL pipelines, Apache Airflow
- ๐ผ Seeking: Remote Junior Data Engineer positions
- ๐ Location: India ๐ฎ๐ณ
- ๐ก Philosophy: "Work in silence, let results make the noise"
Core:
- SQL (PostgreSQL, DuckDB, SQL Server)
- Python (Pandas, SQLAlchemy)
- Linux (Fedora KDE)
- Git & GitHub
Learning:
- Apache Airflow
- dbt (Data Build Tool)
- Docker/Podman
Future:
- Apache Spark
- AWS/GCP
- Apache Kafka
Analyzing data engineering job market trends using advanced SQL
- Stack: PostgreSQL, DuckDB
- Focus: Complex queries, CTEs, window functions
- Status: ๐ง In Progress
๐ Python ETL Pipeline
End-to-end data pipeline with error handling and logging
- Stack: Python, Pandas, PostgreSQL
- Focus: Data extraction, transformation, loading
- Status: ๐ Coming Soon (Month 3)
๐ ๏ธ Airflow Data Orchestration
Automated data pipeline orchestration
- Stack: Apache Airflow, Docker
- Focus: Scheduling, monitoring, retries
- Status: ๐ Coming Soon (Month 4)
Currently reading:
- ๐ Learning SQL โ Alan Beaulieu
- ๐ Fundamentals of Data Engineering โ Reis & Housley
Courses completed:
- TBD
Month 1-2 โ SQL Mastery & Projects
Month 3 โ Python for Data Engineering
Month 4 โ Apache Airflow
Month 5 โ dbt (Data Build Tool)
Month 6 โ Portfolio Polish & Job Hunt ๐ฏ
Building data systems that matter. One pipeline at a time. ๐