Bijaykars/README.md

Hi, I'm Bijay Karki 👋

Master's-trained engineer turned AI/ML researcher. I bridge the gap between rigorous engineering methodology and modern AI systems. Based in Kathmandu, Nepal 🇳🇵

I hold an M.S. in Civil Engineering (Earthquake Engineering) from Tribhuvan University, where my thesis combined Python, Finite Element Methods, and a modified Particle Swarm Optimization algorithm to reduce structural computation costs by 51.7%. That project sparked a deeper interest in machine learning, LLM evaluation, and agentic AI, which is where my focus lies today.


🔬 What I Work On

  • LLM Evaluation: benchmarking reasoning quality, designing error taxonomies (hallucinations, reasoning gaps, instruction failures), and building human-in-the-loop workflows
  • AI/ML Research: experiment design, model evaluation, feature engineering, and statistical analysis
  • Agentic Systems: building document research agents with retrieval, prompt orchestration, and multi-step QA
  • Optimization: metaheuristic methods (PSO), FEM-based frameworks, computational efficiency

🛠️ Tech Stack

Languages & Data: Python, NumPy, Pandas, SciPy, Matplotlib

ML & AI: scikit-learn, TensorFlow, Keras, PyTorch, supervised learning, regression, cross-validation

LLM & Evaluation: prompt design, LLM benchmarking, error analysis, structured annotation, quality assurance

Tools: Jupyter Notebook, Git/GitHub, VS Code, Docker


💼 Experience

Advanced AI Video Trainer @ Verity Labs (2025 – Present, Remote)
Evaluate AI-generated technical content for factual accuracy, reasoning quality, and clarity. Provide structured feedback on model errors and contribute to human-in-the-loop evaluation pipelines.

Data Specialist @ CloudFactory (Apr 2022 – Jun 2024, Remote)
Processed and validated large structured datasets with a focus on quality, consistency, and specification adherence.


🚀 Projects

Document Research Agent for Multi-Step QA (2026)
LLM-based research assistant that retrieves information from documents and answers multi-step questions. Implements document chunking, retrieval, and prompt orchestration; evaluated on factual accuracy and reasoning reliability.

LLM Reasoning Benchmark & Error Analysis (2025)
Built a benchmark of multi-step reasoning tasks (logical reasoning, quantitative problem-solving, instruction-following). Designed an error taxonomy for systematic model evaluation.

AI Output Evaluation Pipeline (2025)
Structured workflow to assess AI responses for factual accuracy, reasoning quality, and instruction adherence. Includes evaluation templates, scoring rubrics, and error analysis reports.

Structural Optimization via FEM + PSO (2025, M.S. Thesis)
Python-based framework combining Finite Element Methods with modified Particle Swarm Optimization for planar truss size optimization. Reduced computational cost by 51.7%.


📜 Certifications

  • Machine Learning Specialization, DeepLearning.AI (2025)
  • ML for Engineering Applications, Skill Shiksha (2024)
  • Python Programming, Skill Shiksha (2024)

🌱 Currently Exploring

  • Advanced LLM reasoning evaluation and benchmark design
  • Agentic AI systems and retrieval-augmented generation (RAG)
  • SQL for data analysis workflows


📫 Let's Connect

LinkedIn · GitHub · Email

Open to collaborations in LLM evaluation, AI research, and applied ML. Feel free to reach out!
