kautilyaa/README.md

👋 Hi, I'm Arunbh Yashaswi

LinkedIn | Portfolio | Email

🎯 About Me

Data Scientist and ML Engineer building production-ready AI systems that actually work under pressure.

  • 🏢 3+ years at UnitedHealth Group building document intelligence & NLP systems for healthcare
  • 🎓 MS in Data Science @ University of Maryland (GPA: 3.95) | Graduating May 2026
  • 🔬 Currently: Graduate Research Assistant @ UMD | Former Data Science Intern @ VITG
  • 💡 Focused on: Healthcare AI, Document Intelligence, Production MLOps, Agentic Systems

"The best ML work isn't polished demos—it's systems that hold up when real users depend on them."


🛠️ Tech Stack

Languages & Core
Python, SQL, C++, JavaScript

ML/DL Frameworks
PyTorch, TensorFlow, scikit-learn, Hugging Face

MLOps & Cloud
AWS, Azure, Docker, Databricks

Specializations
NLP, Computer Vision, Document Intelligence, Time-Series Forecasting, A/B Testing, Statistical Modeling, Agentic AI


🚀 Featured Projects

Neuro-symbolic AI for healthcare communication
Making clinical text accessible without sacrificing accuracy through neural LMs + deterministic verification.
PyTorch, Transformers, Medical NER, Entity Preservation

Orchestrating 7 specialized services via Claude + MCP
Production-grade agentic system coordinating flights, hotels, weather, and finance in real time.
Claude API, Model Context Protocol, FastAPI, Distributed Systems

End-to-end document intelligence for visual media
Automated OCR → Translation → Visual restoration pipeline reducing manual effort by 80%.
Computer Vision, OCR, Sequence Models, Image Processing

Interpretable ML for financial decisioning
Temporal feature engineering + explainable models for regulatory-ready credit scoring.
Feature Engineering, Interpretable ML, Risk Modeling

Terminal-based MCP agent with modular reasoning
Experimentation platform for structured reasoning with clear action boundaries.
Agentic AI, MCP, Python, Local LLM Integration

Real-time market signals + news sentiment → price movements
Backtested framework combining time-series models with external sentiment signals.
Time-Series Analysis, Sentiment Analysis, Backtesting


📊 What I'm Working On

  • 🔬 Research on document layout understanding for medical forms
  • 🏗️ Building production MLOps pipelines for UMD projects
  • 📝 Contributing to open-source ML tools (watch this space!)
  • 🎓 Graduating May 2026 and exploring Data Scientist / ML Engineer roles

🎯 Open to Opportunities

Currently seeking Data Scientist, ML Engineer, or Computer Vision Engineer roles where:

  • Production reliability matters as much as model performance
  • Healthcare, GovTech, or regulated domains are in focus
  • End-to-end ownership (research → deployment) is valued
  • Mentorship and collaboration drive team culture

📧 Reach out: arunbh.y@gmail.com | 🔗 LinkedIn: arunbh-yashaswi


💬 Let's Connect

I'm always excited to discuss:

  • 👁️ Computer Vision for document intelligence and OCR pipelines
  • 🏥 Healthcare AI and clinical NLP challenges
  • 🤖 Agentic systems and reasoning frameworks
  • 📊 Production ML war stories and lessons learned

"Some data science work lives in notebooks. The work that matters shows up in production."


⭐ If you find my projects useful, consider starring them!

Pinned

  1. TERMINUS Public

    A terminal-based MCP for accessing your system

    Python 1

  2. ticket-creation-customer-support Public

    Automatic ticket creation for Customer Support.

    Python 2 1

  3. LiveTranslatorScreen Public

    A Python application to translate real-time data

    Jupyter Notebook 1

  4. TRUST Public

    TRUST: Targeted Risk Understanding & Scoring Technology aims to predict loan defaults using the Home Credit Default Risk dataset. By employing data science methodologies, we identify key risk facto…

    Jupyter Notebook 2 2

  5. AMSP Public

    Antimicrobial stewardship system

    Jupyter Notebook

  6. TravelGenie Public

    TravelGenie is an AI-powered travel planner that helps you coordinate flights, hotels, events, weather, and basically everything else you need for a complete trip itinerary. It's built on Model Con…

    Python 2 2