Skip to content
View ehsk's full-sized avatar

Organizations

@castorini @beir-cellar @project-miracl

Block or report ehsk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning

Python 311 106 Updated Nov 3, 2025

Drive OSS standards and tools for data curation and evaluation creation for state of the art AI agents

Python 54 8 Updated Jun 29, 2026

Standardize benchmark wrapping so the community can wrap various otherwise-incompatible benchmarks uniformly and use them everywhere.

Python 52 6 Updated Jun 19, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 85,203 18,847 Updated Jul 3, 2026
Python 16 3 Updated Jul 10, 2025

Code for paper "The Markovian Thinker: Architecture-Agnostic Linear Scaling of Reasoning"

Python 349 27 Updated Mar 16, 2026

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,458 121 Updated Apr 17, 2026

The personal finance app for everyone

Ruby 54,297 5,637 Updated Jul 24, 2025

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 424 45 Updated Jul 2, 2026

Recipes to scale inference-time compute of open models

Python 1,133 131 Updated May 26, 2026

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 22,265 4,164 Updated Jul 3, 2026

Simple RL training for reasoning

Python 3,867 286 Updated Dec 23, 2025

Fully open reproduction of DeepSeek-R1

Python 26,356 2,441 Updated Apr 2, 2026
Python 1,159 56 Updated Jan 10, 2026

Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models

Python 68 11 Updated Mar 5, 2026

A bibliography and survey of the papers surrounding o1

TeX 1,213 51 Updated Nov 16, 2024

The MATH Dataset (NeurIPS 2021)

Python 1,371 115 Updated Sep 6, 2025

TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle

Python 319 40 Updated Dec 16, 2025

🙃 A delightful community-driven (with 2,500+ contributors) framework for managing your zsh configuration. Includes 300+ optional plugins (rails, git, macOS, hub, docker, homebrew, node, php, python…

Shell 188,330 26,376 Updated Jul 1, 2026

The official Meta Llama 3 GitHub site

Python 29,281 3,533 Updated Jan 26, 2025

A blazing fast inference solution for text embeddings models

Rust 4,909 409 Updated Jun 22, 2026

WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?

Python 257 36 Updated Apr 25, 2026

🌎💪 BrowserGym, a Gym environment for web task automation

Python 1,263 178 Updated Mar 17, 2026

Code for Contrastive Preference Learning (CPL)

Python 183 15 Updated Nov 22, 2024

Firefly III: a personal finances manager

PHP 23,887 2,205 Updated Jul 2, 2026

Home of StarCoder2!

Python 2,075 197 Updated Mar 21, 2024

Easy and Efficient Quantization for Transformers

C++ 205 17 Updated Mar 25, 2026

A Comprehensive Assessment of Trustworthiness in GPT Models

Python 314 61 Updated Sep 16, 2024
Next