Skip to content
View wshi83's full-sized avatar
🐈
Pawsitive
🐈
Pawsitive

Highlights

  • Pro

Block or report wshi83

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Agents' Last Exam

Python 720 29 Updated Jun 22, 2026
Python 17 5 Updated Jun 7, 2026

The original nirholas/claude-code before DMCA and take down. Once everything is cleared, it will return. Working with Anthropic and Github to get everything back.

6,279 10 Updated Apr 16, 2026
Jupyter Notebook 25 2 Updated Mar 17, 2026

This is the code repo for the paper AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play (NeurIPS 2025 Spotlight).

Python 25 Updated Sep 29, 2025

Large model-assisted paper review

554 47 Updated Mar 19, 2026
Python 1 Updated Oct 7, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 20,182 2,100 Updated Jun 9, 2026
Python 158 19 Updated Nov 13, 2025

Collection of latest papers and materials in the area of RLVR!

Python 121 6 Updated Jun 22, 2026

Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving

Python 443 30 Updated Aug 19, 2025

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,466 131 Updated Nov 9, 2025

Official Code Repository for paper "Towards Better Instruction Following Retrieval Models"

Python 8 Updated May 16, 2025

[ICLR'26] MedAgentGYM: Training LLM Agents for Code-Based Medical Reasoning at Scale

Python 116 5 Updated Apr 12, 2026

Official Code Repository for WorkForceAgent-R1

Python 7 Updated Jun 1, 2025

[Patterns] MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning

Jupyter Notebook 81 9 Updated Mar 10, 2026
Python 30 4 Updated Apr 8, 2025

GeoAI: Artificial Intelligence for Geospatial Data

Python 3,135 450 Updated Jun 22, 2026

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Python 1,490 102 Updated Jun 15, 2026

[EMNLP 2024] MIMIR: A Streamlined Platform for Personalized Agent Tuning in Domain Expertise https://arxiv.org/abs/2404.04285

Python 3 Updated Nov 10, 2024

Code and data for TrialGPT.

Python 160 72 Updated Jan 24, 2025
Python 14 6 Updated May 15, 2024

QBRC Somatic Mutation Calling Pipeline

C 16 7 Updated Feb 8, 2022

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 36,853 5,196 Updated Jun 21, 2026
Python 16 Updated Jan 26, 2024

[EMNLP'24] MedAdapter: Efficient Test-Time Adaptation of Large Language Models Towards Medical Reasoning

Python 37 3 Updated Dec 26, 2024

Code for paper Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding

Python 94 18 Updated Jun 18, 2024
Next