ehsk

Ehsan ehsk

27 followers · 20 following

ServiceNow AI Research
Canada
https://ehsk.github.io
@ehsk0

Lists (2)

xACL

12 repositories

xML

5 repositories

Stars

xxzcc / Awesome-Credit-Assignment-in-LLM-RL

Python 83 Updated Jul 2, 2026

deepreinforce-ai / CUDA-L1

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning

Python 311 106 Updated Nov 3, 2025

The-AI-Alliance / cube-harness

Drive OSS standards and tools for data curation and evaluation creation for state of the art AI agents

Python 54 8 Updated Jun 29, 2026

The-AI-Alliance / cube-standard

Standardize benchmark wrapping so the community can wrap various otherwise-incompatible benchmarks uniformly and use them everywhere.

Python 52 6 Updated Jun 19, 2026

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 85,203 18,847 Updated Jul 3, 2026

ServiceNow / sec

Python 16 3 Updated Jul 10, 2025

McGill-NLP / the-markovian-thinker

Code for paper "The Markovian Thinker: Architecture-Agnostic Linear Scaling of Reasoning"

Python 349 27 Updated Mar 16, 2026

open-thought / reasoning-gym

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,458 121 Updated Apr 17, 2026

maybe-finance / maybe

The personal finance app for everyone

Ruby 54,297 5,637 Updated Jul 24, 2025

ServiceNow / PipelineRL

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 424 45 Updated Jul 2, 2026

IssamLaradji / awesome-deep-research

3 4 Updated May 21, 2025

huggingface / search-and-learn

Recipes to scale inference-time compute of open models

Python 1,133 131 Updated May 26, 2026

verl-project / verl

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 22,265 4,164 Updated Jul 3, 2026

hkust-nlp / simpleRL-reason

Simple RL training for reasoning

Python 3,867 286 Updated Dec 23, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 26,356 2,441 Updated Apr 2, 2026

huggingface / Math-Verify

Python 1,159 56 Updated Jan 10, 2026

mnoukhov / async_rlhf

Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models

Python 68 11 Updated Mar 5, 2026

srush / awesome-o1

A bibliography and survey of the papers surrounding o1

TeX 1,213 51 Updated Nov 16, 2024

hendrycks / math

The MATH Dataset (NeurIPS 2021)

Python 1,371 115 Updated Sep 6, 2025

ServiceNow / TapeAgents

TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle

Python 319 40 Updated Dec 16, 2025

ohmyzsh / ohmyzsh

🙃 A delightful community-driven (with 2,500+ contributors) framework for managing your zsh configuration. Includes 300+ optional plugins (rails, git, macOS, hub, docker, homebrew, node, php, python…

Shell 188,330 26,376 Updated Jul 1, 2026

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 29,281 3,533 Updated Jan 26, 2025

huggingface / text-embeddings-inference

A blazing fast inference solution for text embeddings models

Rust 4,909 409 Updated Jun 22, 2026

ServiceNow / WorkArena

WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?

Python 257 36 Updated Apr 25, 2026

ServiceNow / BrowserGym

🌎💪 BrowserGym, a Gym environment for web task automation

Python 1,263 178 Updated Mar 17, 2026

jhejna / cpl

Code for Contrastive Preference Learning (CPL)

Python 183 15 Updated Nov 22, 2024

firefly-iii / firefly-iii

Firefly III: a personal finances manager

PHP 23,887 2,205 Updated Jul 2, 2026

bigcode-project / starcoder2

Home of StarCoder2!

Python 2,075 197 Updated Mar 21, 2024

NetEase-FuXi / EETQ

Easy and Efficient Quantization for Transformers

C++ 205 17 Updated Mar 25, 2026

AI-secure / DecodingTrust

A Comprehensive Assessment of Trustworthiness in GPT Models

Python 314 61 Updated Sep 16, 2024