Skip to content
View mauro-nievoff's full-sized avatar

Block or report mauro-nievoff

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
mauro-nievoff/README.md

Hi, I'm Mauro 👋

Medical Doctor & Healthcare Data Scientist

Working at the intersection of medicine and artificial intelligence

LinkedIn PyPI Location


🧬 About Me

I'm a Medical Doctor who taught himself data science — and that combination shapes everything I build. With a PhD in Medical Imaging and an MSc in Epidemiology & Health Policy, I bring academic depth to real-world AI problems in healthcare.

Over 7+ years in the industry I've worked across medical NLP, computer vision, RAG pipelines, CDSS, and clinical automations — as individual contributor, team lead, and consultant.


🔬 What I Work On

  • 🏥 RAG & CDSS — multi-source and graph RAG pipelines for clinical decision support
  • 🧠 Medical NLP — LLM fine-tuning, prompt engineering, medical text analysis
  • 🖼️ Medical Imaging — computer vision, classification, multimodal data integration
  • ⚙️ Healthcare Automations — AI-driven clinical workflow automation
  • 📐 ML Research — complex classification frameworks, ontology engineering

📦 Featured Projects

An open-source multimodal clinical case dataset built from PubMed Central case reports.

  • 98,000+ clinical cases · 139,000+ medical images · 140+ image classes
  • Published in MDPI Data

Python library to create customized subsets of the MultiCaRe Dataset.

pip install multiversity

A theoretical framework for complex ML classification with any number of classes and logical relations.


📄 Selected Publications

Journal Paper
JAMIA Distilling LLMs for patient-to-clinical trial matching
npj Digital Health PRISM: clinical trial matching with real-world EHRs
MDPI Data MultiCaRe: multimodal clinical case dataset
Applied Ontology Multiplex Classification Framework

🛠️ Tech Stack

Python PyTorch HuggingFace LangChain Google Cloud Pandas OpenCV Docker


🤝 Let's Connect

I'm available for freelance projects and consulting in healthcare AI.


If you find my open-source work useful, consider giving it a ⭐

Pinned Loading

  1. MultiCaRe_Dataset MultiCaRe_Dataset Public

    Open-source multimodal dataset: 98K+ clinical cases & 139K+ medical images from PubMed Central

    Jupyter Notebook 79 16

  2. Multiplex_Classification Multiplex_Classification Public

    A framework for complex ML classification with any number of classes and logical relations.

    Jupyter Notebook

  3. multiversity multiversity Public

    Python library to create customized subsets of the MultiCaRe clinical case dataset

    Python