darktheDE/darktheDE

About Me

I am a Data Engineering student at Ho Chi Minh City University of Technology and Engineering (HCM-UTE). I have a deep passion for exploring how data moves and transforms to create value. I enjoy building data pipelines, working with backend systems, and constantly learning new ways to build better software.

  • Academics: Data Engineering Major (GPA: 8.41/10.0).
  • Mentorship: Teaching Assistant for Database Systems.
  • Leadership: Co-founder at HCMUTE RTIC, sharing the love for tech and innovation with my peers.
  • Professional: Freelance Business Analyst.
  • Learning: Currently diving into Data Lakehouse technologies (Apache Iceberg, Trino) and Distributed Systems.

Tech Stack & Arsenal

Languages & Frameworks

Data Engineering & Cloud

*(+ Apache Spark, Apache Airflow, Delta Lake, Apache Iceberg, Trino, MinIO, SSIS, SSAS)*

Business Intelligence & Tools

*(+ Power BI, Scrum/Agile)*

Highlighted Projects

Modern E-commerce Platform with BFF Architecture

  • Description: A modular monolithic platform for mobile commerce with a decoupled frontend (Next.js) and backend (Spring Boot). Integrates VNPay/MoMo payments and AI chatbots.
  • Core Tech: Spring Boot 3.5, Next.js 16, PostgreSQL, Redis, Docker, Zustand.

Comprehensive Multidimensional Analytics for Flight Performance

  • Description: End-to-end data warehouse (DWH) analyzing 2015 U.S. domestic flights correlated with FAA registry data. Implemented the Kimball methodology with full OLAP capabilities.
  • Core Tech: MS SQL Server, SSIS (ETL), SSAS (OLAP Cube), Power BI, SCD Type 2.
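
To illustrate the SCD Type 2 idea listed above, here is a minimal sketch in plain Python. It is not the project's SSIS implementation; the `DimRow` fields (`tail_number`, `owner`) and the `apply_scd2` helper are hypothetical names chosen for illustration. The point is only that a changed attribute closes the current row and appends a new versioned row instead of overwriting history.

```python
from dataclasses import dataclass, replace
from datetime import date
from typing import List, Optional


@dataclass(frozen=True)
class DimRow:
    surrogate_key: int
    tail_number: str            # business key (hypothetical)
    owner: str                  # tracked attribute (hypothetical)
    valid_from: date
    valid_to: Optional[date]    # None means the row is still open
    is_current: bool


def apply_scd2(dim: List[DimRow], tail_number: str, owner: str,
               load_date: date) -> List[DimRow]:
    """Return a new dimension table with one incoming record applied."""
    current = next(
        (r for r in dim if r.tail_number == tail_number and r.is_current),
        None,
    )
    # No change in the tracked attribute: keep the table as it is.
    if current is not None and current.owner == owner:
        return list(dim)

    next_key = max((r.surrogate_key for r in dim), default=0) + 1
    result = []
    for row in dim:
        if row is current:
            # Close the old version instead of overwriting it.
            result.append(replace(row, valid_to=load_date, is_current=False))
        else:
            result.append(row)
    # Append the new version as the current row.
    result.append(DimRow(next_key, tail_number, owner, load_date, None, True))
    return result
```

Calling `apply_scd2` twice for the same `tail_number` with different owners leaves two rows: the first closed with a `valid_to` date, the second marked current.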

End-to-End Medallion Architecture on Local Infrastructure

  • Description: Built a robust Data Lakehouse using the Medallion architecture (Bronze/Silver/Gold), running in a local Docker environment that mimics enterprise cloud data platforms.
  • Core Tech: PySpark, Delta Lake, Apache Airflow, MinIO, Hive Metastore.
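
The Bronze/Silver/Gold layering can be sketched with the standard library alone. This is an illustrative stand-in, not the project's PySpark and Delta Lake code; the record layout (`order_id,country,amount`) and the function names are assumptions made for the example.

```python
from collections import defaultdict


def to_bronze(raw_lines):
    """Bronze: keep raw records untouched, only tagging the source line."""
    return [{"line_no": i, "raw": line}
            for i, line in enumerate(raw_lines, start=1)]


def to_silver(bronze):
    """Silver: parse, type-cast, drop malformed rows, deduplicate by id."""
    seen, silver = set(), []
    for rec in bronze:
        parts = rec["raw"].split(",")
        if len(parts) != 3:
            continue  # malformed row
        order_id, country, amount = (p.strip() for p in parts)
        try:
            amount_value = float(amount)
        except ValueError:
            continue  # non-numeric amount
        if order_id in seen:
            continue  # duplicate order
        seen.add(order_id)
        silver.append({"order_id": order_id,
                       "country": country.upper(),
                       "amount": amount_value})
    return silver


def to_gold(silver):
    """Gold: business-level aggregate, here total amount per country."""
    totals = defaultdict(float)
    for row in silver:
        totals[row["country"]] += row["amount"]
    return dict(totals)


raw = ["1,vn,10.5", "2,us,20", "2,us,20", "bad row", "3,vn,abc", "4,VN,4.5"]
gold = to_gold(to_silver(to_bronze(raw)))
```

Each layer only reads from the one before it, so a cleaning rule can change in Silver and Gold can be rebuilt without touching the raw Bronze data.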

Weighted SCENA Ensemble Learning for Cancer Classification

  • Description: ML architecture combining K-Means++, Hierarchical, and DBSCAN clustering using adaptive weighting to classify cancer subtypes from high-dimensional RNA-Seq data.
  • Core Tech: Python, Scikit-learn, PCA, Streamlit, Plotly.
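
One common way to combine several clusterings is a weighted co-association matrix, sketched below in plain Python. This is an assumption about the general technique, not the project's actual SCENA weighting scheme: the weights here are fixed inputs rather than adaptive, and `weighted_coassociation` and `consensus_clusters` are hypothetical helper names. Label `-1` is treated as noise, following the DBSCAN convention in scikit-learn.

```python
def weighted_coassociation(labelings, weights):
    """Entry [i][j] is the weighted share of clusterings that put
    samples i and j in the same (non-noise) cluster."""
    n = len(labelings[0])
    total = sum(weights)
    matrix = [[0.0] * n for _ in range(n)]
    for labels, weight in zip(labelings, weights):
        for i in range(n):
            for j in range(n):
                if labels[i] == labels[j] and labels[i] != -1:
                    matrix[i][j] += weight / total
    return matrix


def consensus_clusters(matrix, threshold=0.5):
    """Connected components of the graph linking samples whose
    co-association exceeds the threshold."""
    n = len(matrix)
    labels = [-1] * n
    cluster = 0
    for start in range(n):
        if labels[start] != -1:
            continue
        labels[start] = cluster
        stack = [start]
        while stack:
            i = stack.pop()
            for j in range(n):
                if labels[j] == -1 and matrix[i][j] > threshold:
                    labels[j] = cluster
                    stack.append(j)
        cluster += 1
    return labels


# Three base clusterings of four samples (e.g. from three algorithms).
labelings = [[0, 0, 1, 1], [1, 1, 0, 0], [0, 0, 0, -1]]
weights = [0.5, 0.3, 0.2]  # hypothetical, e.g. from a quality score
matrix = weighted_coassociation(labelings, weights)
consensus = consensus_clusters(matrix)
```

Because the matrix compares pairs of samples rather than raw label values, it does not matter that the base clusterings number their clusters differently.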

"Learn by building. Grow by doing."
