Skip to content
View pankajbaid567's full-sized avatar

Highlights

  • Pro

Block or report pankajbaid567

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
pankajbaid567/README.md

Typing SVG

     

 


> whoami

pankaj = {
    "role":        "AI + Backend Engineer",
    "building":    "Manthan AI — 13-project production AI ecosystem",
    "stack":       ["LangGraph", "FastAPI", "Node.js", "Docker", "PostgreSQL", "Redis"],
    "expertise":   ["RAG Systems", "Agentic AI", "LLM Fine-Tuning", "AI Evaluation"],
    "location":    "Delhi NCR, India",
    "open_to":     "AI / GenAI Internships 2027 · Remote · OSS Collaboration",
    "philosophy":  "If it can't be deployed, monitored, evaluated, and scaled — it isn't finished.",
}

Most AI engineers understand models. The rare ones can build the complete system around the model — retrieval, orchestration, evaluation, infrastructure, and deployment. That's where I operate.


> ls -la ~/projects   ⚡ Featured Work

🔬 ScholarForge AI   —   Production RAG System

AI-powered research and knowledge intelligence platform with enterprise-grade retrieval architecture.

Metric Result
RAG Faithfulness (Ragas) 87%+
Answer Relevance 85%+
P95 Retrieval Latency < 1.5s
Documents Indexed 10K+
Architecture: Hybrid BM25 + Dense Retrieval → Cross-Encoder Reranking → LLM Generation → Ragas Eval Pipeline

Stack: LangChain LangGraph FastAPI PostgreSQL Docker Python

Key decisions:

  • Hybrid BM25 + dense retrieval over naive vector search — keyword recall matters for domain-specific corpora
  • Ragas evaluation pipeline integrated into CI/CD — faithfulness gates every deployment
  • FastAPI with SSE streaming, JWT auth, rate limiting — not Streamlit

🤖 Multi-Agent Research Assistant   —   LangGraph Orchestration

Autonomous research system with stateful multi-step reasoning, tool orchestration, and memory-aware workflows.

Agent Loop: Plan → Tool Call → Error Recovery → State Update → Memory Write → Next Step

Capabilities:

  • Multi-step reasoning with persistent state graphs (LangGraph)
  • Tool registry: web search, code execution, retrieval, structured output
  • Short-term context + long-term vector memory
  • Streaming FastAPI backend with real-time agent thought visibility
  • Full Docker + GitHub Actions CI/CD pipeline

Stack: LangGraph FastAPI Docker GitHub Actions Redis


🧠 Domain-Adapted Fine-Tuned LLM   —   LoRA on Phi-3-mini

End-to-end fine-tuning pipeline: data curation → LoRA training → quantization → production serving.

Stage Detail
Base Model Phi-3-mini-4k (Microsoft)
Fine-Tuning LoRA via PEFT — 1% compute, 90% performance
Dataset 1000+ curated domain examples
Quantization GGUF via llama.cpp
Serving Ollama + FastAPI endpoint
Hosting HuggingFace Hub

Stack: HuggingFace PyTorch FastAPI Docker


🔁 MLOps Pipeline with Drift Detection

Production ML monitoring system with automated drift detection, retraining triggers, and experiment tracking.

Data → FastAPI Endpoint → MLflow Tracking → Evidently AI Drift → GitHub Actions Retrain → Model Registry

Stack: MLflow FastAPI Docker GitHub Actions


> cat tech_stack.json

🤖 AI / GenAI Engineering

LangChain LangGraph HuggingFace PyTorch OpenAI Ollama FAISS ChromaDB Ragas LoRA MLflow

⚙️ Backend Engineering

FastAPI Node.js Express PostgreSQL MongoDB Redis Prisma Python REST WebSockets

🏗️ Infrastructure & DevOps

Docker GitHub Actions Linux Git CI/CD AWS


> git log --oneline   GitHub Stats

 

> cat achievements.log

Signal Detail
🏆 IIT Roorkee Cognizance AI Hackathon — Runner-Up (AI Track, competing vs IIT/NIT teams)
🚀 Founding Team @ in.culcate — EdTech startup, backend architecture, REST APIs, 10K+ users
🌍 Open Source Contributor — PRs merged to major AI frameworks
💻 500+ DSA Problems solved (LeetCode, patterns-focused)
🎓 B.Tech CS + AI @ Rishihood University (NST) — CGPA: 9.043
Manthan AI — 13-project production AI ecosystem (3 ultra-elite)

> ls manthan_ai/   🚀 The Manthan AI Ecosystem

A unified platform of 13 interconnected AI projects — not a portfolio of demos, but a production-grade AI engineering ecosystem. Each project exposes a FastAPI layer, is containerized with Docker, and feeds signal into the next.

manthan-ai/
├── scholarforge-ai/          ← Production RAG + Evaluation  [SHIPPED]
├── multi-agent-research/     ← LangGraph Agentic System     [SHIPPED]
├── domain-llm-pipeline/      ← LoRA Fine-Tune + Serving     [SHIPPED]
├── mlops-drift-pipeline/     ← MLflow + Evidently AI        [SHIPPED]
├── ai-saas-platform/         ← Real users, full-stack AI    [IN PROGRESS]
├── llm-eval-framework/       ← Custom DeepEval harness      [PLANNED]
├── code-review-agent/        ← LangGraph + GitHub API       [PLANNED]
└── ...6 more                 ← Evolution Engine             [ROADMAP]

The evolution engine: Each project feeds a critic-agent feedback loop — prompts evolve, architectures refine, and evaluation scores compound across the ecosystem.


> cat /etc/experience

Founding Team Member — in.culcate   Sep 2024 – Mar 2026

EdTech startup · Delhi · Full-time

  • Joined as one of the earliest engineering members; shaped both product and backend architecture from the ground up
  • Designed and built REST API infrastructure with Node.js + Express; integrated MongoDB for content management at scale
  • Built and shipped end-to-end features: user authentication, content delivery pipelines, internal CMS
  • Developed the initial React Native mobile application MVP
  • Contributed to system design decisions targeting 10K+ student users
  • Worked directly with founders — product thinking, cross-functional execution, zero-to-one delivery

Node.js Express MongoDB React Native REST APIs System Design


> ping connections

   



Open to: AI/GenAI internship opportunities · OSS collaboration · Referrals · Technical conversations


"If it can't be deployed, monitored, evaluated, and scaled — it isn't finished."

Pinned Loading

  1. ScholarForge_AI ScholarForge_AI Public

    Production-grade Research Intelligence Platform powered by Hybrid RAG, BM25 + Dense Retrieval, Cross-Encoder Reranking, RAGAS Evaluation, FastAPI, and LLMs.

    Python

  2. Multi_Step_AI_Agent Multi_Step_AI_Agent Public

    Python