Skip to content
View JorgePulgar's full-sized avatar

Block or report JorgePulgar

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
JorgePulgar/README.md

English · Español

Hi, I'm Jorge 👋

Junior AI Engineer · Building LLM-powered applications on Azure
Madrid, Spain · Open to Junior AI Engineer roles

LinkedIn Email Location Languages


About me

I build production-grade LLM applications — RAG systems, fine-tuned models, and AI-powered backends — with a strong bias toward retrieval-first architectures that don't hallucinate.

I'm completing a Professional Master's in AI and Data Engineering at Tajamar (Madrid) and hold four Azure certifications: AI-102, DP-100, DP-300, and DP-900. My most recent role was an AI Application Developer internship at Datarmony, where I built a financial statement ingestion app on Google Cloud + Gemini.

Most of what I build sits on the same stack: Azure AI Foundry + Azure AI Search + FastAPI + React, orchestrated with Python and the OpenAI SDK.


Featured projects

Full-stack multi-tenant RAG platform with structural index isolation (one Azure AI Search index per assistant), conversational memory, LLM-based query rewriting, and verifiable citations. Hard "I don't know" fallback when retrieval is empty — no hallucinations by design. ~2.6k LOC Python + ~1.5k LOC TypeScript, 56 unit tests, built in 7 days.

Azure AI Foundry · Azure AI Search · FastAPI · React · TypeScript · Hybrid Search · Semantic Reranking

Real-time facial-emotion analyzer for sales calls. Two CNN architectures benchmarked end-to-end (custom 4-block CNN vs fine-tuned MobileNetV2, 63 % test accuracy) feeding a rolling receptivity index as a coaching signal. Live in the browser via TensorFlow.js — no install, runs on phone or laptop. FastAPI service + Streamlit demo share one inference module. 4 pedagogical notebooks documenting every training failure and fix.

Deep Learning · Computer Vision · TensorFlow · TensorFlow.js · CNN · Transfer Learning · FastAPI · Streamlit

Full-stack SaaS for Spanish freelancers and SMEs. Uploads invoice PDFs, runs structured GPT-4o extraction via Azure AI Foundry, validates the JSON, and surfaces a dashboard with KPIs, monthly trends, top clients/suppliers, and quarterly VAT. Built in one week with a teammate. Deliberate single-call extraction (no agents) — more reliable, cheaper, and easier to debug for structured invoice data.

Azure AI Foundry · GPT-4o · Structured Extraction · React · TypeScript · Node.js · Express · SQLite

Fine-tuned gpt-4o-mini to act as a financial education assistant with consistent format, tone, and legal disclaimers. Custom JSONL dataset, overfitting analysis, multi-client behavior debugging.

Fine-tuning · Azure AI Foundry · gpt-4o-mini · JSONL · Evaluation

Full-stack web app combining Azure Computer Vision OCR to extract menu data from photos with a scikit-learn classification model to predict daily menus. FastAPI backend, Next.js frontend, Supabase persistence.

Computer Vision · OCR · scikit-learn · FastAPI · Next.js · Supabase

Full ML pipeline (EDA → k-fold CV → hyperparameter tuning → Flask REST API) comparing a neural network against a linear regression baseline. Finding: linear regression wins on small datasets. A case study in choosing the right tool, not the fanciest one.

Deep Learning · Regression · TensorFlow · Keras · scikit-learn · Flask · K-Fold CV


Tech stack

AI / ML Python OpenAI Anthropic Gemini LangChain TensorFlow scikit-learn OpenCV

Azure Azure Azure AI Foundry Azure AI Search

Backend FastAPI SQLAlchemy Pydantic PostgreSQL SQLite

Frontend React TypeScript Tailwind Next.js Vite

Cloud & Tooling Google Cloud Docker Git Claude Code


Certifications & education


Currently

TFM @ Integra — Public tender analysis system (team of 3, private repo) Building an AI-powered system that parses public tender documents (licitaciones), extracts structured data, summarizes them in plain language, and helps the company draft stronger bid proposals. My role: backend + AI implementation on Azure AI Foundry with a RAG pipeline over tender corpora.

RAG · Azure AI Foundry · Document Analysis · FastAPI · Team project

Also:

  • Designing a multi-agent system for automated job application management
  • Open to Junior AI Engineer roles in Madrid or remote — let's talk

Let's talk

Hiring a Junior AI Engineer? I'm available for full-time roles in Madrid or remote, building production LLM and ML systems on Azure that solve real business problems — not just demos. Reach me by email or LinkedIn — I reply within 24h.


GitHub stats Top languages

Pinned Loading

  1. Rag-Assistants-Platform Rag-Assistants-Platform Public

    Multi-tenant full-stack RAG platform with structural index isolation, hybrid search, and verifiable citations. Built on Azure AI Foundry + AI Search + FastAPI + React.

    Python

  2. sales-receptivity-cnn sales-receptivity-cnn Public

    Real-time facial emotion analyzer for sales calls. CNN benchmark (custom vs MobileNetV2, 63% acc) feeding a rolling receptivity index. Live web demo.

    Jupyter Notebook

  3. ai-invoice-analyzer ai-invoice-analyzer Public

    Full-stack SaaS MVP for Spanish freelancers and SMEs. Extracts fiscal data from invoice PDFs via Azure AI Foundry (GPT-4o) and surfaces a dashboard with KPIs, monthly trends, top clients/suppliers,…

    TypeScript

  4. FinBot-Fine-Tuning-with-Azure-AI-Foundry FinBot-Fine-Tuning-with-Azure-AI-Foundry Public

    Fine-tuned gpt-4o-mini for financial education with consistent tone, format, and legal disclaimers. Custom JSONL dataset, overfitting analysis, multi-client debugging.

    Jupyter Notebook

  5. OCR-and-ML-Pipeline-for-Menu-Predictions OCR-and-ML-Pipeline-for-Menu-Predictions Public

    Full-stack web app for restaurants: Azure Computer Vision OCR pipeline to extract menu data from photos + scikit-learn classification model to predict daily menus. Built with FastAPI, Next.js and S…

    Python

  6. housing-price-neural-network housing-price-neural-network Public

    Neural network vs linear regression for housing price prediction. Full ML pipeline (EDA, k-fold CV, hyperparameter tuning), Flask REST API, and analysis of when simpler models outperform deep learn…

    Jupyter Notebook