Hi, I'm Asim

AI Research Engineer based in Cape Town, South Africa 🇿🇦

I specialize in Multi-Agent Reinforcement Learning and LLM Agents & Engineering

Currently at InstaDeep working on MARL research, I'm currently focused on combining Contrastive Goal Conditioned Reinforcement Learnining and Unsupervised Environment Design (UED) in Multi Agent settings.

🎓 MSc in AI from University of Cape Town & AIMS South Africa (Google DeepMind Scholar)

🔬 What I Work On

Multi-Agent RL — Contrastive learning, goal-conditioned RL, and curriculum strategies in JAX
LLM Agents — Autonomous agents for ML engineering, scientific discovery, and code generation
Inference-Time Scaling — Making open-source LLMs competitive with proprietary models
LLM Engineering — Fine-tuning, RLHF (PPO/GRPO/DPO), vLLM serving, distributed training

Skills

I'm good with Python JAX/Flax PyTorch vLLM HuggingFace TRL Unsloth LangGraph/LangSmith TPU/GPU

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hi, I'm Asim

🔬 What I Work On

Skills

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Hi, I'm Asim

🔬 What I Work On

Skills

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages