Skip to content
View ozyyshr's full-sized avatar

Highlights

  • Pro

Block or report ozyyshr

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The official implementation of the paper "Mem-α: Learning Memory Construction via Reinforcement Learning"

Python 122 7 Updated Dec 7, 2025
Python 79 6 Updated Oct 28, 2025

AgentFlow: In-the-Flow Agentic System Optimization

Python 1,425 184 Updated Dec 17, 2025

SWE-Exp: Experience-Driven Software Issue Resolution

Python 36 2 Updated Oct 17, 2025

Building Open LLM Web Agents with Self-Evolving Online Curriculum RL

Python 486 31 Updated Jun 6, 2025

A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.

Python 835 59 Updated Jul 31, 2025

AWM: Agent Workflow Memory

Python 372 34 Updated Jan 31, 2025

[EMNLP 2025] WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning

Python 63 4 Updated Nov 4, 2025

Setup scripts for the WebArena benchmark

Shell 17 8 Updated Jun 19, 2025

Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.

Python 16 1 Updated Sep 24, 2025

[EMNLP'25] s3 - ⚡ Efficient & Effective Search Agent Training via RL for RAG (RLVR for Search with Minimal Data)

Python 792 132 Updated Nov 5, 2025
Python 68 3 Updated Jun 10, 2025

Two Heads are Better Than One: Test-time Scaling of Multi-agent Collaborative Reasoning (NeurIPS2025-SEA)

Python 78 6 Updated Apr 19, 2025

[NeurIPS'25] The official code implementation for paper "R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing"

Python 69 8 Updated Dec 20, 2025

[ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples

PDDL 112 9 Updated Jul 26, 2025

Reasoning Activation in LLMs via Small Model Transfer (NeurIPS 2025)

Python 20 Updated Oct 16, 2025

Some special ebooks,一些个人喜欢同时也比较特别的电子书

1,359 323 Updated Dec 18, 2025

Structured Chemistry Reasoning with Large Language Models

Python 39 5 Updated May 4, 2024

A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery (EMNLP'24)

632 36 Updated Jun 21, 2025

The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions (EMNLP 2023))

Python 13 1 Updated Dec 21, 2023

Daily updated LLM papers. 每日更新 LLM 相关的论文,欢迎订阅 👏 喜欢的话动动你的小手 🌟 一个

1,210 53 Updated Jul 31, 2024

Codes for our paper "Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation" (EMNLP 2023 Findings)

Python 46 1 Updated Dec 9, 2023

A curated list for Efficient Large Language Models

Python 1,916 146 Updated Jun 17, 2025

Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"

Python 33 1 Updated May 9, 2024

Code and dataset for the emnlp paper titled Instruct and Extract: Instruction Tuning for On-Demand Information Extraction

Python 54 6 Updated Jan 2, 2024

[ICLR 2024] Lemur: Open Foundation Models for Language Agents

Python 556 34 Updated Oct 28, 2023

Code associated with the paper **Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding**

Jupyter Notebook 215 15 Updated Feb 13, 2025

A curated list of papers and resources based on "Large Language Models on Graphs: A Comprehensive Survey" (TKDE)

974 67 Updated Mar 2, 2025

contrastive decoding

Python 205 14 Updated Nov 14, 2022
Next