I'm Bin (Smars) Hu, 27 years old, and I was born and raised in China and relocated to Ontario, Canada🇨🇦. A developer also loves hitting the gym, hiking, videomaking, hiphop dancing, and minimalism
God help those who help themselves
自救者,人恒救之
🔨 Data Engineer | 3 Years of Experience in big data development
💼 Manulife | Canadian Global Insurance Cooperation | HKCAS IT Delivery Team | 1 Year 7 Months
💼 G7 | Leading IoT & Big Data Company in China | Data Product Team, Infrastructure R&D | 1 Year
🎓 Master of Science in Big Data Analytics @ Trent University, Ontario, Canada 🇨🇦
📝 My Bio https://www.smars.online/ (resume, project and tech blogs)
📞 Reach me via smarshu@trentu.ca
Simulated an enterprise-level on-premise self-managed big data distributed cluster using Docker containers. Integrated components include Hadoop, Zookeeper, Spark, Hive, MySQL, Airflow, Prometheus, ClickHouse, and Power BI. Developed a data warehouse for an e-commerce backend based on dimensional modeling theory and built a BI analytics system for reporting and data analysis.
Reproduced a modern enterprise-grade Azure cloud data engineering architecture widely adopted in North America. Leveraged technologies such as Databricks, PySpark, ADLS Gen2, Unity Catalog, Delta Lake, Power BI, and Azure Data Factory (ADF) to develop cloud-native data pipelines on Azure and perform exploratory data analysis (EDA).
☘️ Languages
☘️ Distributed Computation & Data Warehouse
☘️ Streaming & Lakehouse Architecture
☘️ Data Engineering Practices
☘️ Databases: OLAP, OLTP & NoSQL
☘️ Cloud-Native Data Engineering, Containerization & Platform Tools
(Synapse, ADLS Gen2, Databricks, Data Factory)
☘️ DevOps & Monitoring:
☘️ Basic Tools
(OS, Version Control, API, Dev environment)