Skip to content
View hmishfaq's full-sized avatar

Highlights

  • Pro

Block or report hmishfaq

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

EB1A Green Card Template for Self-Petition

TeX 71 30 Updated Nov 17, 2025

A Zsh theme

Shell 52,060 2,380 Updated Apr 29, 2025

🙃 A delightful community-driven (with 2,400+ contributors) framework for managing your zsh configuration. Includes 300+ optional plugins (rails, git, macOS, hub, docker, homebrew, node, php, python…

Shell 183,522 26,308 Updated Dec 22, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,313 331 Updated Dec 24, 2025
Python 966 101 Updated Dec 23, 2025

Jax implementation of LMC-LSVI and Adam LMCDQN .

Python 1 Updated Jun 24, 2025

QeRL enables RL for 32B LLMs on a single H100 GPU.

Python 470 46 Updated Nov 27, 2025

This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"

Python 277 27 Updated Nov 24, 2025

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 338 34 Updated Dec 23, 2025
Python 79 8 Updated Sep 29, 2025

A playbook for systematically maximizing the performance of deep learning models.

29,597 2,411 Updated Jun 18, 2024

🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

Python 584 50 Updated Dec 23, 2025

🙌 OpenHands: AI-Driven Development

Python 65,899 8,110 Updated Dec 24, 2025

Kinetics: Rethinking Test-Time Scaling Laws

Python 84 2 Updated Jul 11, 2025

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Python 173 20 Updated Sep 18, 2025
Python 58 17 Updated Sep 18, 2025
Python 2 Updated Apr 19, 2023

A little Python script to collect LaTeX sources for upload to the arXiv.

Python 372 27 Updated Jul 5, 2025

Template Makefile for ML projects in Python.

Python 525 46 Updated Nov 24, 2020

A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.

Jupyter Notebook 11,946 4,650 Updated Oct 28, 2025

Solve puzzles. Improve your pytorch.

Jupyter Notebook 3,854 348 Updated Jul 15, 2024

Simple RL training for reasoning

Python 3,812 281 Updated Dec 23, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,515 1,536 Updated Apr 24, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,763 2,888 Updated Dec 24, 2025

Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"

Python 183 22 Updated May 25, 2025

Recipes to train reward model for RLHF.

Python 1,490 107 Updated Apr 24, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 66,101 12,165 Updated Dec 24, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 154,206 31,532 Updated Dec 24, 2025

Machine Learning Foundations: Linear Algebra, Calculus, Statistics & Computer Science

Jupyter Notebook 4,471 2,150 Updated Nov 20, 2024
Next