-
University of Michigan
- Ann Arbor
- http://mukhal.github.io
- @mkhalifaaaa
Stars
generate coding exercises from any github repo
Provide with pre-build flash-attention package wheels on Linux and Windows platforms using GitHub Actions
Processed / Cleaned Data for Paper Copilot
yunx-z / ThinkLogit
Forked from alisawuffles/proxy-tuningEliciting Long CoT from a Short CoT Model
Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"
A comprehensive collection of process reward models.
[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.
Our library for RL environments + evals
LLM-Merging: Building LLMs Efficiently through Merging
pipreqs - Generate pip requirements.txt file based on imports of any project. Looking for maintainers to move this project forward.
Official repository for ACL 2025 paper "ProcessBench: Identifying Process Errors in Mathematical Reasoning"
Recipes to scale inference-time compute of open models
Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.
Curate High Quality Datasets, Train, Evaluate and Ship! π
A framework for the evaluation of autoregressive code generation language models.
SGLang is a fast serving framework for large language models and vision language models.
800,000 step-level correctness labels on LLM solutions to MATH problems
A library for advanced large language model reasoning
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 π and reasoning techniques.
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. β π€π€
A benchmark to evaluate language models on questions I've previously asked them to solve.