darktheDE/README.md

About Me

I am a Data Engineering student at Ho Chi Minh City University of Technology and Engineering (HCM-UTE). I have a deep passion for exploring how data moves and transforms to create value. I enjoy building data pipelines, working with backend systems, and constantly learning new ways to build better software.

  • Academics: Data Engineering Major (GPA: 8.41/10.0).
  • Mentorship: Teaching Assistant for Database Systems.
  • Leadership: Co-founder at HCMUTE RTIC, sharing the love for tech and innovation with my peers.
  • Professional: Freelance Business Analyst.
  • Learning: Currently diving into Data Lakehouse (Apache Iceberg, Trino) and Distributed Systems.

Tech Stack & Arsenal

Languages & Frameworks

Data Engineering & Cloud

*(+ Apache Spark, Apache Airflow, Delta Lake, Apache Iceberg, Trino, MinIO, SSIS, SSAS)*

Business Intelligence & Tools

*(+ Power BI, Scrum/Agile)*

Highlighted Projects

Modern E-commerce Platform with BFF Architecture

  • Description: A modular monolithic platform for mobile commerce with a decoupled frontend (Next.js) and backend (Spring Boot). Integrates VNPay/Momo payments and AI chatbots.
  • Core Tech: Spring Boot 3.5, Next.js 16, PostgreSQL, Redis, Docker, Zustand.

Comprehensive Multidimensional Analytics for Flight Performance

  • Description: End-to-end DWH system analyzing 2015 U.S. domestic flights correlated with FAA registry data. Implemented Kimball methodology with full OLAP capabilities.
  • Core Tech: MS SQL Server, SSIS (ETL), SSAS (OLAP Cube), Power BI, SCD Type 2.
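
For readers unfamiliar with SCD Type 2, the idea can be sketched in a few lines of plain Python. This is a minimal illustration, not the project's SSIS implementation: the `DimRow` fields, the `apply_scd2` helper, and the sample tail number are all hypothetical. When a tracked attribute changes, the current row is closed and a new version is appended, so history is preserved.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass
class DimRow:
    surrogate_key: int
    business_key: str                  # hypothetical natural key, e.g. an aircraft tail number
    attributes: dict
    valid_from: date
    valid_to: Optional[date] = None    # None marks the open-ended current version
    is_current: bool = True


def apply_scd2(dim: list, business_key: str, attributes: dict, load_date: date) -> list:
    """Apply one incoming source record to the dimension (modified in place)."""
    current = next(
        (r for r in dim if r.business_key == business_key and r.is_current), None
    )
    if current is not None:
        if current.attributes == attributes:
            return dim                 # no change: keep the current version as-is
        # Change detected: close the old version instead of overwriting it.
        current.valid_to = load_date
        current.is_current = False
    next_key = max((r.surrogate_key for r in dim), default=0) + 1
    dim.append(DimRow(next_key, business_key, dict(attributes), load_date))
    return dim


dim = []
apply_scd2(dim, "N123AA", {"owner": "A"}, date(2015, 1, 1))
apply_scd2(dim, "N123AA", {"owner": "B"}, date(2015, 3, 1))
```

After the two calls above, the dimension holds two rows for the same business key: the first closed on the load date of the change, the second marked as current.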

End-to-End Medallion Architecture on Local Infrastructure

  • Description: A Data Lakehouse built on the Medallion architecture (Bronze/Silver/Gold). It runs in a local Docker environment that mimics enterprise cloud data platforms.
  • Core Tech: PySpark, Delta Lake, Apache Airflow, MinIO, Hive Metastore.
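
The Bronze/Silver/Gold layering can be illustrated without Spark. The sketch below uses plain Python lists in place of Delta tables. The function names, the three-column order format, and the sample rows are assumptions made for illustration and are not taken from the project.

```python
from collections import defaultdict


def to_bronze(raw_lines):
    """Bronze: store records exactly as ingested, adding only a line number."""
    return [{"_line_no": i, "raw": line} for i, line in enumerate(raw_lines, start=1)]


def to_silver(bronze):
    """Silver: parse fields, enforce types, drop malformed rows and duplicates."""
    seen, silver = set(), []
    for rec in bronze:
        parts = [p.strip() for p in rec["raw"].split(",")]
        if len(parts) != 3:
            continue                       # malformed row
        order_id, category, amount = parts
        try:
            amount = float(amount)
        except ValueError:
            continue                       # non-numeric amount
        if order_id in seen:
            continue                       # duplicate order
        seen.add(order_id)
        silver.append(
            {"order_id": order_id, "category": category.lower(), "amount": amount}
        )
    return silver


def to_gold(silver):
    """Gold: business-level aggregate, here total amount per category."""
    totals = defaultdict(float)
    for row in silver:
        totals[row["category"]] += row["amount"]
    return dict(totals)


raw = ["1,Books,10.5", "1,Books,10.5", "2,books,4.5", "3,Toys,abc", "bad line", "4,Toys,20"]
gold = to_gold(to_silver(to_bronze(raw)))
```

Each layer only reads from the one before it, which is the property that lets a real pipeline rebuild Silver and Gold from Bronze when cleaning rules change.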

Weighted SCENA Ensemble Learning for Cancer Classification

  • Description: ML architecture combining K-Means++, Hierarchical, and DBSCAN clustering using adaptive weighting to classify cancer subtypes from high-dimensional RNA-Seq data.
  • Core Tech: Python, Scikit-learn, PCA, Streamlit, Plotly.
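
One common way to combine several clusterings with weights is a weighted co-association matrix. The sketch below shows that generic technique in plain Python. It is not the SCENA algorithm or the project's code, and the weights, threshold, and label vectors are made-up values. Each base clustering votes on whether two samples belong together, votes are scaled by the clustering's weight, and the consensus groups samples whose weighted agreement exceeds a threshold.

```python
def weighted_coassociation(labelings, weights):
    """Return an n x n matrix of weighted agreement between base clusterings."""
    n = len(labelings[0])
    total = sum(weights)
    co = [[0.0] * n for _ in range(n)]
    for labels, w in zip(labelings, weights):
        for i in range(n):
            for j in range(n):
                # A DBSCAN-style noise label (-1) never counts as co-clustered.
                if labels[i] == labels[j] and labels[i] != -1:
                    co[i][j] += w / total
    return co


def consensus_labels(co, threshold=0.5):
    """Group samples linked by agreement above the threshold (connected components)."""
    n = len(co)
    labels = [-1] * n                      # -1 means "not yet assigned"
    cluster = 0
    for start in range(n):
        if labels[start] != -1:
            continue
        labels[start] = cluster
        stack = [start]
        while stack:
            i = stack.pop()
            for j in range(n):
                if labels[j] == -1 and co[i][j] > threshold:
                    labels[j] = cluster
                    stack.append(j)
        cluster += 1
    return labels


# Hypothetical label vectors from three base clusterers on four samples.
kmeans_labels = [0, 0, 1, 1]
hierarchical_labels = [0, 0, 1, 1]
dbscan_labels = [0, 0, 0, -1]
co = weighted_coassociation(
    [kmeans_labels, hierarchical_labels, dbscan_labels], weights=[0.4, 0.4, 0.2]
)
consensus = consensus_labels(co, threshold=0.5)
```

In an adaptive scheme the weights would be derived from a quality score for each base clustering rather than fixed by hand as they are here.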

"Learn by building. Grow by doing."

Pinned

  1. gene-expression-ensemble-clustering (Public)

    Topic: Clustering gene expression data with Ensemble Learning (Weighted SCENA-based Approach). Final project for the Machine Learning course | Team of 3 | HCM-UTE.

    Jupyter Notebook

  2. ute-phonehub (Public)

    Final project for the Software Engineering course | Team of 10 | HCM-UTE. UTE Phone Hub is an e-commerce platform specializing in mobile phones and accessories, built with …

    TypeScript

  3. healthcare-lakehouse-covid19 (Public)

    Final project for the Big Data Analysis course | Team of 4 | HCM-UTE. This project implements a professional End-to-End Data Lakehouse solution designed to process and analyze large-scale healthcar…

    HTML

  4. Olist-E-Commerce-Lakehouse (Public)

    A solo Data Lakehouse project that builds a complete, end-to-end pipeline to process, manage, and analyze large volumes of e-commerce data.

    Jupyter Notebook

  5. airline-dwh (Public)

    Final project for the Data Warehouse course | Team of 3 | HCM-UTE. This project implements a comprehensive Data Warehouse (DWH) system designed to analyze U.S. domestic flight performance integrate…

    TSQL

  6. QuangDuyReal/nyc-taxi-trip-analysis (Public)

    Jupyter Notebook