Mohith-akash/README.md

Mohith Akash

Data Engineer · AI Engineer

LinkedIn · Email · Portfolio

Building Production Data Platforms & Agentic AI Systems

3 Live Projects · 20M+ Records Processed


About Me

Analytics Engineer focused on data pipelines, SQL analytics, and LLM integration. Building production-grade data systems with Databricks, dbt, and Python. Based in Leipzig, Germany, as a Chancenkarte (Opportunity Card) holder: eligible to work in Germany and available immediately for data engineering and analytics roles.


🚀 Featured Projects

Live Demo · Code

Real-time streaming platform for AI-powered cart recovery

  • ⚡ Azure Event Hub → <500ms end-to-end latency
  • 🧠 Agentic AI with Cerebras Llama 3.1 + 5 customer archetypes
  • 🏗️ Delta Live Tables: Bronze → Silver → Gold
  • 🔍 Semantic search with Voyage AI embeddings
  • 📊 A/B testing with z-score + 95% confidence intervals
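
The A/B testing bullet above can be illustrated with a two-proportion z-test. This is a minimal sketch using only the standard library, not the project's actual code: the function name, argument names, and sample numbers are hypothetical, and it assumes the compared metric is a conversion rate (for example, recovered carts per message variant).

```python
import math
from statistics import NormalDist


def two_proportion_z_test(conv_a, n_a, conv_b, n_b, confidence=0.95):
    """Compare conversion rates of variant B against variant A.

    Hypothetical helper: returns the z-score, a two-sided p-value, and a
    confidence interval for the difference in rates (B minus A).
    """
    p_a = conv_a / n_a
    p_b = conv_b / n_b
    diff = p_b - p_a

    # Pooled standard error: used for the test statistic under H0 (no difference).
    p_pool = (conv_a + conv_b) / (n_a + n_b)
    se_pool = math.sqrt(p_pool * (1 - p_pool) * (1 / n_a + 1 / n_b))
    z = diff / se_pool if se_pool > 0 else 0.0
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))

    # Unpooled standard error: used for the interval around the observed difference.
    se = math.sqrt(p_a * (1 - p_a) / n_a + p_b * (1 - p_b) / n_b)
    z_crit = NormalDist().inv_cdf(0.5 + confidence / 2)  # about 1.96 for 95%
    ci = (diff - z_crit * se, diff + z_crit * se)

    return {"z": z, "p_value": p_value, "diff": diff, "ci": ci}


# Made-up numbers: 100/1000 conversions for A, 130/1000 for B.
result = two_proportion_z_test(100, 1000, 130, 1000)
print(result)
```

If the interval excludes zero, the difference is significant at the chosen confidence level.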

Databricks · Azure Event Hub · DLT · Cerebras · dbt · Streamlit

Live Demo · Code

Hybrid RAG system for querying 20M+ geopolitical news events

  • 📊 20M+ events from GDELT + GKG feeds
  • 🔄 100K+ daily ingestion with 15-min refresh cycles
  • 🤖 Dual AI agents: Vector Search + Text-to-SQL
  • ⚡ Polars engine, ~10x faster than pandas
  • 💰 $0/month fully serverless on MotherDuck
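
The 15-minute refresh cycle listed above implies that each run has to work out which intervals are still unloaded. The following is a rough sketch of that bookkeeping, assuming timezone-aware UTC timestamps. The names `floor_to_interval` and `pending_windows` are hypothetical and do not come from the project.

```python
from datetime import datetime, timedelta, timezone

REFRESH = timedelta(minutes=15)  # assumed cycle length, taken from the bullet above
EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def floor_to_interval(ts: datetime, interval: timedelta = REFRESH) -> datetime:
    """Round a timezone-aware timestamp down to the start of its interval."""
    return EPOCH + ((ts - EPOCH) // interval) * interval


def pending_windows(last_loaded: datetime, now: datetime,
                    interval: timedelta = REFRESH) -> list[datetime]:
    """Interval boundaries after `last_loaded`, up to the latest one not after `now`."""
    start = floor_to_interval(last_loaded, interval) + interval
    end = floor_to_interval(now, interval)
    windows = []
    while start <= end:
        windows.append(start)
        start += interval
    return windows


# Illustrative timestamps only.
last = datetime(2024, 1, 1, 12, 0, tzinfo=timezone.utc)
now = datetime(2024, 1, 1, 12, 47, tzinfo=timezone.utc)
for w in pending_windows(last, now):
    print(w.isoformat())
```

An orchestrator could call `pending_windows` on every run, which also backfills automatically after downtime.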

Python · Polars · dbt · MotherDuck · Dagster · LlamaIndex

Live Demo · Code

End-to-end Databricks Lakehouse for e-commerce analytics

  • 🏗️ Medallion Architecture: Bronze → Silver → Gold
  • 📦 100K+ orders processed with Delta Lake
  • ⭐ Kimball star schema: fct_orders + 3 dimension tables
  • 🔐 Unity Catalog with row-level governance
  • 📊 Power BI interactive dashboards
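
To make the star-schema bullet concrete, here is a small self-contained sketch using SQLite in memory. Only `fct_orders` is named in this README. The three dimension tables (`dim_customer`, `dim_product`, `dim_date`), their columns, and the sample rows are assumptions for illustration, and the real project runs on Databricks rather than SQLite.

```python
import sqlite3


def build_star_schema() -> sqlite3.Connection:
    """Create a toy star schema: one fact table keyed to three dimensions."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        -- Dimension tables (hypothetical names and columns)
        CREATE TABLE dim_customer (customer_key INTEGER PRIMARY KEY, state TEXT);
        CREATE TABLE dim_product  (product_key  INTEGER PRIMARY KEY, category TEXT);
        CREATE TABLE dim_date     (date_key     INTEGER PRIMARY KEY, year INTEGER, month INTEGER);

        -- Fact table: one row per order, foreign keys into each dimension
        CREATE TABLE fct_orders (
            order_id     TEXT PRIMARY KEY,
            customer_key INTEGER REFERENCES dim_customer(customer_key),
            product_key  INTEGER REFERENCES dim_product(product_key),
            date_key     INTEGER REFERENCES dim_date(date_key),
            revenue      REAL
        );

        -- Made-up sample rows
        INSERT INTO dim_customer VALUES (1, 'SP'), (2, 'RJ');
        INSERT INTO dim_product  VALUES (1, 'toys'), (2, 'books');
        INSERT INTO dim_date     VALUES (20180101, 2018, 1), (20180201, 2018, 2);
        INSERT INTO fct_orders   VALUES
            ('o1', 1, 1, 20180101, 50.0),
            ('o2', 2, 1, 20180201, 25.0),
            ('o3', 1, 2, 20180101, 10.0);
    """)
    return conn


def revenue_by_category(conn: sqlite3.Connection) -> list[tuple[str, float]]:
    """Typical star-schema query: aggregate the fact table by a dimension attribute."""
    return conn.execute("""
        SELECT p.category, SUM(f.revenue)
        FROM fct_orders AS f
        JOIN dim_product AS p ON p.product_key = f.product_key
        GROUP BY p.category
        ORDER BY p.category
    """).fetchall()


conn = build_star_schema()
print(revenue_by_category(conn))
```

Keeping measures in the fact table and descriptive attributes in the dimensions means a BI tool can slice the same fact by any dimension with a single join.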

Databricks · Delta Lake · SQL · Power BI · Unity Catalog

Code

Interactive Excel dashboard analyzing 13K+ data job postings

  • 🧹 12,894 postings cleaned via 70+ step Power Query ETL
  • 🧮 228K+ rows modeled in Power Pivot with custom DAX
  • 🏷️ 4,000+ messy job titles standardized into role categories
  • 📊 Slicers for skill demand, seniority and geography
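
The title-standardization step above was done in Power Query. As a language-neutral illustration of the same idea, here is a small Python sketch that maps messy titles to role categories with ordered regex rules. The categories, patterns, and sample titles are hypothetical and are not taken from the workbook.

```python
import re

# Ordered rules: the first matching pattern wins (hypothetical categories).
ROLE_PATTERNS = [
    ("Data Engineer", re.compile(r"\bdata\s+eng", re.IGNORECASE)),
    ("Data Scientist", re.compile(r"\bdata\s+scien", re.IGNORECASE)),
    ("Data Analyst", re.compile(r"\b(data|business|bi)\s+analy", re.IGNORECASE)),
]


def standardize_title(raw: str) -> str:
    """Map a free-text job title to a role category, or 'Other' if nothing matches."""
    # Replace punctuation and digits with spaces, then collapse repeated whitespace.
    cleaned = re.sub(r"[^A-Za-z ]+", " ", raw)
    cleaned = re.sub(r"\s+", " ", cleaned).strip()
    for role, pattern in ROLE_PATTERNS:
        if pattern.search(cleaned):
            return role
    return "Other"


for title in ["Sr. Data Engineer (m/f/d) - Remote", "DATA  ANALYST II", "Office Manager"]:
    print(title, "->", standardize_title(title))
```

Rule order matters: more specific patterns should come before broader ones so that a title is not captured by the wrong category.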

Excel · Power Query · Power Pivot · DAX


🛠️ Tech Stack

Data Engineering: Python · SQL · Databricks · Azure · Delta Lake · DuckDB

Pipelines & Transforms: dbt · Dagster · Polars · MotherDuck

AI & LLM: RAG · LlamaIndex · Cerebras · Voyage AI · NLP

Visualization & DevOps: Streamlit · Power BI · GitHub Actions


🎓 Certifications

Google Advanced Data Analytics · AI for Data Professionals · Google Cybersecurity


💼 Open to Opportunities — Based in Germany

Data Engineer · AI Engineer · Analytics Engineer

Available immediately · Work authorization in Germany (Chancenkarte) · Fluent in English

🎓 B.Tech, Electrical & Electronics Engineering · SRM University, Chennai (2020)

🇩🇪 German: A2 (ongoing)

Let's Connect

Pinned

  1. Global-News-Intel-Platform (Public)

    AI-powered geopolitical news intelligence platform. Ingests 100K+ daily events from GDELT, stores in MotherDuck (DuckDB), orchestrates with Dagster, and features an AI chat interface with Text-to-S…

    Python · 18 stars · 2 forks

  2. Vortex-The-Revenue-Recovery-Engine (Public)

    Real-Time Lakehouse on Azure & Databricks. Ingests clickstream events via Event Hubs & Delta Live Tables (DLT) with <500ms latency to trigger Agentic AI recovery workflows.

    Python

  3. olist-analytics-platform (Public)

    End-to-end analytics platform: CSV → Databricks → Delta Lake → Streamlit Dashboard | 100K+ Brazilian e-commerce orders

    Python

  4. german-frequency-deck (Public)

    5,009-word German Anki deck (A1 to C1) with neural audio, images, full verb conjugations, Goethe level badges and word families

    Python · 1 star