Skip to content
View bakzalitdootvala's full-sized avatar
🚀
Coding the future
🚀
Coding the future

Block or report bakzalitdootvala

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
bakzalitdootvala/README.md

Hi, I'm Nik - AI/LLM Engineer

I architect and ship end-to-end AI systems - from RAG pipelines and multi-agent orchestration to production backends and infrastructure.

My focus is on building things that work reliably at scale: retrieval systems, LLM integrations, intelligent assistants, backend machinery, and production deployment.


🔥 What I Do

  • LLM Engineering - designing and building multi-agent systems, orchestrating tool calls, and optimizing context for production workloads.
  • RAG & Retrieval - architecting end-to-end search pipelines with hybrid retrieval, reranking, and vector DB optimization.
  • Backend Engineering - building async Python backends with clean architecture, auth, and integrations.
  • Bots & Automation - shipping production-grade AI assistants for Telegram and WhatsApp with multilingual support and knowledge bases.
  • DevOps - containerizing, deploying, and monitoring services with CI/CD and observability.

🧩 Current Focus

  • Building enterprise-level AI assistants with RAG, memory, and multi-step reasoning.
  • Architecture of modular toolstores, agent routers, prompt stores, and model abstractions.
  • Designing LLM-first backends with clean boundaries and predictable tool execution.
  • Adaptive pipelines for production: ingestion, validation, observability, evaluation.

🛠️ My Tech Stack

Languages & Core

  • Primary: Python 3.12 (async, typing, Pydantic-first)
  • Data & Queries: SQL, JSON, YAML, TOML
  • Web & Scripting: JavaScript / TypeScript, HTML, CSS
  • Systems & Automation: Bash, PowerShell, Dockerfile, Makefile
  • Exploring: Go, Rust, C, C++ (reading-level)

LLM & AI

  • OpenAI API, Anthropic API, Google Gemini, Grok, DeepSeek
  • Open-source LLMs
  • Ollama - local model deployment
  • Embedding models: text-embedding-3-large, SentenceTransformers, multilingual
  • LangChain, LangGraph
  • Qdrant, ChromaDB, FAISS
  • HuggingFace, tiktoken
  • Cross-encoder reranking (BAAI/bge-reranker-v2-m3, Jina AI, .etc)
  • Langfuse - LLM observability
  • Prompt engineering (structured output, function calling, tool calling)
  • Multi-provider architecture (OpenAI, Anthropic, Gemini, DeepSeek, Groq)
  • SSE Streaming - real-time response streaming
  • Document parsing: PDF (pypdf), DOCX (python-docx), TXT, JSON, .etc

Backend

  • FastAPI, Uvicorn (ASGI)
  • Pydantic v2 - data validation, settings management
  • Starlette - middleware (rate limiting, security)
  • AsyncIO, SQLAlchemy 2.0
  • Alembic (migrations)
  • PostgreSQL, psycopg 3 + psycopg-pool
  • MongoDB
  • Redis
  • httpx, aiohttp - async HTTP clients
  • Multiprocessing & multithreading
  • JWT auth (python-jose) - access + refresh tokens
  • OAuth 2.0 (Google)
  • Webhook systems (Telegram, WhatsApp Cloud API)
  • OpenAPI / Swagger UI / ReDoc

Telegram & Bots

  • aiogram - Telegram Bot API framework
  • aiogram FSM - finite state machines for multi-step dialogs

Search & Retrieval

  • RAG (Retrieval-Augmented Generation), Hierarchical RAG
  • Hybrid Search - Dense + Sparse (BM25)
  • PostgreSQL Full-Text Search (tsvector, GIN indexes)
  • Reciprocal Rank Fusion (RRF)
  • Cross-Encoder Reranking

Testing

  • Apache Bench, JMeter - load testing
  • pytest, pytest-cov, pytest-asyncio, pytest-mock
  • FastAPI TestClient / httpx

Infra & DevOps

  • Docker / Compose (dev, prod configs)
  • GitHub Actions
  • Ubuntu Server, systemd
  • Health Checks (/health, /ready, /metrics)

Monitoring

  • Structured logging (JSON/text, rotation)
  • Custom metrics (latency, cache hit/miss, error rates)
  • Psutil - CPU/RAM monitoring
  • SMTP/webhook alerts

AI Security

  • Prompt Injection Protection - custom multilingual guardrails (regex filters, input sanitization, context boundary enforcement)
  • LLM output validation, content filtering, safe tool execution
  • Rate Limiting (per-user/IP via Redis)
  • Retry with Exponential Backoff

Code Quality

  • Ruff - linter & formatter

Platforms & Integrations

  • Flowise - no-code LLM orchestration
  • Chroma Cloud
  • n8n

📬 Contact

Feel free to reach out:


"AI systems are built, not summoned. Good engineering beats magic every time."

Popular repositories Loading

  1. rag-beir-scifact rag-beir-scifact Public

    RAG evaluation on BEIR SciFact: BM25, dense and hybrid retrieval with LLM answers.

    Jupyter Notebook 1

  2. bakzalitdootvala bakzalitdootvala Public

  3. doc-qa-ollama doc-qa-ollama Public

    RAG API over your HTML docs - ChromaDB + sentence-transformers + Ollama. Ask questions, get answers only from your data.

    Python

  4. k6-llm-harness k6-llm-harness Public

    Production-ready k6 harness for load-testing Flowise / Langflow / LLM platforms. Closed-model VU, 5-phase campaign, LLM-aware thresholds.

    JavaScript