Skip to content

Asimawad/Asimawad

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

4 Commits
Β 
Β 

Repository files navigation

Hi, I'm Asim

AI Research Engineer based in Cape Town, South Africa πŸ‡ΏπŸ‡¦

I specialize in Multi-Agent Reinforcement Learning and LLM Agents & Engineering

Currently at InstaDeep working on MARL research, I'm currently focused on combining Contrastive Goal Conditioned Reinforcement Learnining and Unsupervised Environment Design (UED) in Multi Agent settings.

πŸŽ“ MSc in AI from University of Cape Town & AIMS South Africa (Google DeepMind Scholar)


πŸ”¬ What I Work On

  • Multi-Agent RL β€” Contrastive learning, goal-conditioned RL, and curriculum strategies in JAX
  • LLM Agents β€” Autonomous agents for ML engineering, scientific discovery, and code generation
  • Inference-Time Scaling β€” Making open-source LLMs competitive with proprietary models
  • LLM Engineering β€” Fine-tuning, RLHF (PPO/GRPO/DPO), vLLM serving, distributed training

Skills

I'm good with Python JAX/Flax PyTorch vLLM HuggingFace TRL Unsloth LangGraph/LangSmith TPU/GPU

Website LinkedIn Email

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors