ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ βββ ββββββββββββββ βββ βββββββ
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ βββ ββββββββββββββ βββ βββββββββ
βββββββββββββββββββββββββββββββββββ`. βββββββββββββββ ββββββββββββββ βββ βββ βββ βββ
βββββββββββββββββββββββββββββββββ ΒΏβββββββββ/ "ββββββββββββ ββββββββββββββ βββ βββ βββ βββ
βββββββββββββββββββββββββββββββ. ββββββββββββ βββββββββββ βββ βββββββββββββββββββββββββββββββββββββββ
ββββββββββββββββββββββββββββββ ` ββββββββββββ ` ββββββββββ βββ βββββββββββββββββββββββββββ βββββββ βββ
ββββββββββββββββββββββββββββββ ` ββββββββββββ βββββββββββ
ββββββββββββββββββββββββββββββββ ββββββββββββ. ββββββββββββ ββββ ββββββ βββββββββββββββ βββββββββ βββββββ
ββββββββββββββββββββββββββββββββββ β©β¦ββββββ β£`,βββββββββββββ βββββ ββββββββββββββββββββββ ββββββββββββββββββ
βββββββββββββββββββββββββββββββββββββ .... ,ββββββββββββββββ ββββββ βββββββββ ββββββ βββ βββ βββ
βββββββββββββββββββββββββ βͺβ’%β¦ββ~β,β ββββ ββ|,ββββ€H βββββ ββββββββββββββββ ββββββ βββ βββ βββ
ββββ-ββββββββββββββββββ ββββββββββββ’β¦ β -β£ββββββββββ’β ββ βββ βββββββββββββββββββββββββ βββ βββββββββ
ββββββββββββββββββββββββ βββββββββββββββ ββββ βββ ββββββββ βββββββββββββββ βββ βββββββ
ββββββββββββ βββββββββ β β©βββββββββΓ βββ
βββββββΓββ βββββββββββ`ββββ£ β ββm βββ’ββββ© -β£β β βββ ββββ ββββ βββββββββββββββββββββββββββββ βββ βββ βββββββ βββ βββ
ββββ -"" ββ,β.ββββββββββ βββ ββ Γ - S. βββββ β ββββββββ βββββ ββββββββββββββββββββββββββββββ ββββ ββββββββββββββββ βββ
βββββββββ -Β« ββ.β",ββ βββ ββ£ββ `Γβ ββ βββ βββ βββββββββββββββββ ββββββ βββ βββββββ βββ ββββββ βββ
ββββββββββ ΒΊ β€β£ββ£β©^",ββββββ βββ£" ''''''' ββ `ββ βββββββββββββββββ ββββββ βββ βββββ βββ ββββββ βββ
βββββββββ β βββββββββββ --------- L'βββ ββ βββ βββ βββββββββββββββββββ βββ βββ ββββββββββββββββββ
βββββββββββββ- ββββββββββ 'β§β§β§β§β§β§β§β§β§` β β§β§β§- β βββ βββββββββββββββββββ βββ βββ βββββββ βββββββ
AI Systems Builder β’ Agentic AI β’ LLMs β’ Multimodal Intelligence β’ Edge AI
I design, build, and deploy production-grade AI systems that combine LLMs, agentic reasoning, retrieval pipelines, multimodal perception, and efficient deployment. My work sits at the intersection of research depth and engineering rigor.
Iβm an AI/ML Engineer and system builder focused on transforming cutting-edge research into robust, scalable, real-world AI systems.
My experience spans Generative AI, agentic workflows, RAG architectures, multimodal pipelines, computer vision, and edge optimization, with strong emphasis on clean system design, reproducibility, and performance under constraints.
I enjoy breaking complex problems into modular, testable, production-ready AI pipelinesβfrom data ingestion to deployment.
- π€ Autonomous Agentic Systems (tool-augmented, multi-agent, reasoning-driven workflows)
- π§ Advanced RAG Architectures for long documents, scientific data, and domain-heavy knowledge
- ποΈ Voice-based AI Systems (speech β reasoning β action pipelines)
- βοΈ Efficient AI for Edge & Resource-Constrained Devices
- π End-to-end AI products integrating cloud, APIs, and real-time inference
- π€ Agentic AI Systems β multi-tool orchestration, research agents, decision pipelines
- π Retrieval-Augmented Generation (RAG) β embeddings, vector stores, structured retrieval
- ποΈ Vision & Multimodal Pipelines β VLMs, OCR-free document intelligence, scene understanding
- π± Edge & Embedded AI β INT8 quantization, pruning, Raspberry Pi & Jetson deployments
- π Full-Stack AI Applications β APIs, dashboards, cloud-native pipelines
- π³ MLOps & Deployment β Dockerized, reproducible, scalable AI workflows
- Large Language Models (LLMs), Transformers
- Retrieval-Augmented Generation (RAG), Embeddings
- Multimodal AI, Vision-Language Models
- Prompt Engineering, Structured Reasoning
- LangChain, n8n
- Tool-Augmented LLMs
- Multi-Agent Workflows
- INT8 Quantization, Structured Pruning
- Efficient Inference
- Resource-Constrained Deployment
- Docker, GCP, Firebase
- FastAPI, Flask, REST APIs
- SQL, NoSQL, MongoDB
- Pinecone, Firebase Realtime DB
- Smart Contracts
- Token-Based Systems
- Digital Signatures
- Operating Systems
- DBMS, Computer Networks
- System Design, DSA
Iβve worked on many applied AI systems across domainsβbelow are representative projects showcasing depth, scale, and system design.
LangChain β’ LLMs β’ HuggingFace β’ Web APIs
- Designed an agentic research assistant orchestrating multiple LLM-powered tools
- Implemented agents for literature retrieval, summarization, and structured insight generation
- Integrated academic sources (ArXiv) and web search APIs
- Reduced manual literature review by auto-generating consolidated research notes
n8n β’ Pinecone β’ Gemini Embeddings β’ Docker
- Built a full RAG pipeline to extract structured medical insights from unstructured PDFs
- Automated ingestion, chunking, embedding, retrieval, and summarization using n8n
- Enabled contextual Q&A over medical documents
- Fully containerized for reproducible experimentation
Donut β’ BART β’ FLAN-T5 β’ INT8 Quantization
- Designed a modular multimodal pipeline for long, layout-complex scientific PDFs
- OCR-free layout understanding using Donut
- Fine-tuned summarization and QA models on large scholarly datasets
- Applied INT8 quantization across pipeline
- π Results:
- ~70% model size reduction
- ~40% faster inference
- +15% ROUGE-L, +28% METEOR
- 85.3% QA accuracy
- Conducted ablation studies for system analysis
YOLOv12 β’ Raspberry Pi β’ Jetson Nano
- Optimized object detection models using pruning + INT8 quantization
- Reduced model size by 65% with accuracy preservation
- Designed an embedded ADAS pipeline integrating:
- Object detection
- Lane detection
- Ultrasonic sensing
- GPIO-based control
- Achieved ~210 ms end-to-end latency under CPU-only constraints
LangChain β’ Gemini APIs β’ yFinance
- Built a multi-agent financial analysis system
- Implemented data analysis, market reasoning, and decision-support agents
- Generated natural language insights from time-series data
- Designed for automated decision-support simulation
BLIP β’ HuggingFace
- Implemented a vision-language pipeline for structured scene descriptions
- Fine-tuned BLIP on custom datasets
- Integrated detection, tracking, and counting for real-time context-aware captions
- Enabled downstream reasoning by LLMs
- NFC / QR Token Payment Prototype β secure peer-to-peer transfers
- Blockchain Certificate Issuance Platform β tamper-resistant verification via smart contracts