hmishfaq

Haque Ishfaq hmishfaq

ML PhD at McGill, Mila • Stanford BS, MS

67 followers · 133 following

McGill University, MILA
Montreal, QC
https://hmishfaq.github.io/

Achievements

Highlights

Stars

Ryan-Rhys / EB1A

EB1A Green Card Template for Self-Petition

TeX 71 30 Updated Nov 17, 2025

romkatv / powerlevel10k

A Zsh theme

Shell 52,060 2,380 Updated Apr 29, 2025

ohmyzsh / ohmyzsh

🙃 A delightful community-driven (with 2,400+ contributors) framework for managing your zsh configuration. Includes 300+ optional plugins (rails, git, macOS, hub, docker, homebrew, node, php, python…

Shell 183,522 26,308 Updated Dec 22, 2025

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,313 331 Updated Dec 24, 2025

ChenmienTan / RL2

Python 966 101 Updated Dec 23, 2025

PabloHendriks / AdamLMCDQN

Jax implementation of LMC-LSVI and Adam LMCDQN .

Python 1 Updated Jun 24, 2025

NVlabs / QeRL

QeRL enables RL for 32B LLMs on a single H100 GPU.

Python 470 46 Updated Nov 27, 2025

VsonicV / es-fine-tuning-paper

This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"

Python 277 27 Updated Nov 24, 2025

ServiceNow / PipelineRL

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 338 34 Updated Dec 23, 2025

HyperPotatoNeo / RSA

Python 79 8 Updated Sep 29, 2025

google-research / tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

29,597 2,411 Updated Jun 18, 2024

sail-sg / oat

🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

Python 584 50 Updated Dec 23, 2025

OpenHands / OpenHands

🙌 OpenHands: AI-Driven Development

Python 65,899 8,110 Updated Dec 24, 2025

Infini-AI-Lab / Kinetics

Kinetics: Rethinking Test-Time Scaling Laws

Python 84 2 Updated Jul 11, 2025

spiral-rl / spiral

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Python 173 20 Updated Sep 18, 2025

databricks / compose-rl

Python 58 17 Updated Sep 18, 2025

THuix / fg_mcmc

Python 2 Updated Apr 19, 2023

djsutherland / arxiv-collector

A little Python script to collect LaTeX sources for upload to the arXiv.

Python 372 27 Updated Jul 5, 2025

shreyashankar / create-ml-app

Template Makefile for ML projects in Python.

Python 525 46 Updated Nov 24, 2020

ageron / handson-ml3

A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.

Jupyter Notebook 11,946 4,650 Updated Oct 28, 2025

srush / Tensor-Puzzles

Solve puzzles. Improve your pytorch.

Jupyter Notebook 3,854 348 Updated Jul 15, 2024

hkust-nlp / simpleRL-reason

Simple RL training for reasoning

Python 3,812 281 Updated Dec 23, 2025

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python 12,515 1,536 Updated Apr 24, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,763 2,888 Updated Dec 24, 2025

McGill-NLP / VinePPO

Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"

Python 183 22 Updated May 25, 2025

RLHFlow / RLHF-Reward-Modeling

Recipes to train reward model for RLHF.

Python 1,490 107 Updated Apr 24, 2025

deepseek-ai / DeepSeek-V3

Python 100,827 16,424 Updated Aug 28, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 66,101 12,165 Updated Dec 24, 2025

huggingface / transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 154,206 31,532 Updated Dec 24, 2025

jonkrohn / ML-foundations

Machine Learning Foundations: Linear Algebra, Calculus, Statistics & Computer Science

Jupyter Notebook 4,471 2,150 Updated Nov 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Haque Ishfaq hmishfaq

Achievements

Achievements

Highlights

Block or report hmishfaq

Stars

Ryan-Rhys / EB1A

romkatv / powerlevel10k

ohmyzsh / ohmyzsh

hiyouga / EasyR1

ChenmienTan / RL2

PabloHendriks / AdamLMCDQN

NVlabs / QeRL

VsonicV / es-fine-tuning-paper

ServiceNow / PipelineRL

HyperPotatoNeo / RSA

google-research / tuning_playbook

sail-sg / oat

OpenHands / OpenHands

Infini-AI-Lab / Kinetics

spiral-rl / spiral

databricks / compose-rl

THuix / fg_mcmc

djsutherland / arxiv-collector

shreyashankar / create-ml-app

ageron / handson-ml3

srush / Tensor-Puzzles

hkust-nlp / simpleRL-reason

Jiayi-Pan / TinyZero

volcengine / verl

McGill-NLP / VinePPO

RLHFlow / RLHF-Reward-Modeling

deepseek-ai / DeepSeek-V3

vllm-project / vllm

huggingface / transformers

jonkrohn / ML-foundations