Skip to content
View ozyyshr's full-sized avatar

Highlights

  • Pro

Block or report ozyyshr

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The official implementation of the paper "Mem-α: Learning Memory Construction via Reinforcement Learning"

Python 165 13 Updated Dec 25, 2025

Scaling Long-Horizon LLM Agent via Context-Folding

Python 112 8 Updated Jan 26, 2026

AgentFlow: In-the-Flow Agentic System Optimization

Python 1,559 198 Updated Feb 8, 2026

SWE-Exp: Experience-Driven Software Issue Resolution

Python 35 2 Updated Oct 17, 2025

Building Open LLM Web Agents with Self-Evolving Online Curriculum RL

Python 502 31 Updated Jun 6, 2025

A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.

Python 884 58 Updated Jul 31, 2025

AWM: Agent Workflow Memory

Python 394 39 Updated Dec 22, 2025

[EMNLP 2025] WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning

Python 71 4 Updated Nov 4, 2025

Setup scripts for the WebArena benchmark

Shell 19 9 Updated Jun 19, 2025

[ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.

Python 19 1 Updated Sep 24, 2025

[EMNLP'25] s3 - ⚡ Efficient & Effective Search Agent Training via RL for RAG (RLVR for Search with Minimal Data)

Python 817 137 Updated Nov 5, 2025
Python 72 4 Updated Jun 10, 2025

Two Heads are Better Than One: Test-time Scaling of Multi-agent Collaborative Reasoning (NeurIPS2025-SEA)

Python 81 6 Updated Apr 19, 2025

[NeurIPS'25] The official code implementation for paper "R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing"

Python 77 10 Updated Feb 7, 2026

[ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples

PDDL 120 10 Updated Jan 31, 2026

Reasoning Activation in LLMs via Small Model Transfer (NeurIPS 2025)

Python 21 Updated Oct 16, 2025

Some special ebooks,一些个人喜欢同时也比较特别的电子书

1,452 338 Updated Jan 28, 2026

Structured Chemistry Reasoning with Large Language Models

Python 39 5 Updated May 4, 2024

A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery (EMNLP'24)

638 34 Updated Jun 21, 2025

The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions (EMNLP 2023))

Python 13 1 Updated Dec 21, 2023

Daily updated LLM papers. 每日更新 LLM 相关的论文,欢迎订阅 👏 喜欢的话动动你的小手 🌟 一个

1,223 53 Updated Jul 31, 2024

Codes for our paper "Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation" (EMNLP 2023 Findings)

Python 46 1 Updated Dec 9, 2023

A curated list for Efficient Large Language Models

Python 1,950 152 Updated Jun 17, 2025

Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"

Python 33 1 Updated May 9, 2024

Code and dataset for the emnlp paper titled Instruct and Extract: Instruction Tuning for On-Demand Information Extraction

Python 54 6 Updated Jan 2, 2024

[ICLR 2024] Lemur: Open Foundation Models for Language Agents

Python 556 34 Updated Oct 28, 2023

Code associated with the paper **Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding**

Jupyter Notebook 214 17 Updated Feb 13, 2025

A curated list of papers and resources based on "Large Language Models on Graphs: A Comprehensive Survey" (TKDE)

981 68 Updated Mar 2, 2025

contrastive decoding

Python 207 14 Updated Nov 14, 2022
Next