Skip to content
View mukhal's full-sized avatar

Block or report mukhal

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

generate coding exercises from any github repo

Python 2 Updated Oct 28, 2025

Provide with pre-build flash-attention package wheels on Linux and Windows platforms using GitHub Actions

Python 605 45 Updated Dec 21, 2025

Processed / Cleaned Data for Paper Copilot

Python 786 36 Updated Dec 4, 2025
Python 961 101 Updated Dec 16, 2025

Eliciting Long CoT from a Short CoT Model

Python 6 Updated May 16, 2025

Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"

Jupyter Notebook 567 54 Updated Oct 7, 2025

A comprehensive collection of process reward models.

129 3 Updated Oct 4, 2025

Process Reward Models That Think

Python 66 5 Updated Nov 29, 2025
Python 8 8 Updated Nov 14, 2025

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

Jupyter Notebook 860 48 Updated Aug 12, 2024

s1: Simple test-time scaling

Python 6,615 764 Updated Jun 25, 2025

Our library for RL environments + evals

Python 3,651 453 Updated Dec 20, 2025

LLM-Merging: Building LLMs Efficiently through Merging

Jupyter Notebook 207 44 Updated Sep 24, 2024

pipreqs - Generate pip requirements.txt file based on imports of any project. Looking for maintainers to move this project forward.

Python 7,400 419 Updated Dec 17, 2025
JavaScript 3,823 1,661 Updated Jun 21, 2024

Official repository for ACL 2025 paper "ProcessBench: Identifying Process Errors in Mathematical Reasoning"

Python 181 16 Updated May 20, 2025

Recipes to scale inference-time compute of open models

Python 1,121 131 Updated May 22, 2025

Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.

623 35 Updated Dec 18, 2025

Curate High Quality Datasets, Train, Evaluate and Ship! πŸš€

Python 676 46 Updated Dec 20, 2025

A framework for the evaluation of autoregressive code generation language models.

Python 1,008 252 Updated Jul 22, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 21,831 3,815 Updated Dec 21, 2025

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 2,081 122 Updated Jun 1, 2023

A library for advanced large language model reasoning

Python 2,318 204 Updated Jun 10, 2025

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,831 134 Updated Jan 17, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 πŸ“ and reasoning techniques.

6,869 371 Updated Dec 17, 2025
Python 76 3 Updated Nov 19, 2024

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 20,710 2,214 Updated Mar 11, 2025

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. β€€ πŸ€–πŸ’€

Python 1,083 55 Updated Feb 2, 2025

A benchmark to evaluate language models on questions I've previously asked them to solve.

Python 1,036 77 Updated Apr 27, 2025
Next