Skip to content
View Neo-Zhangjiajie's full-sized avatar

Highlights

  • Pro

Organizations

@THU-KEG @THUDM

Block or report Neo-Zhangjiajie

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Define your problem and evaluation criteria — EurekAgent coordinates off-the-shelf CLI agents to propose diverse approaches, implement them, run experiments, and iterate. Human intervention is opti…

Python 31 3 Updated Jun 12, 2026

LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards

Python 37 Updated Jun 1, 2026

Deep-dive notes and source code analysis of Claude Code and AI agent harnesses. Exploring memory mechanics and internal architectures.

22 2 Updated Apr 3, 2026

💖🧸 Self hosted, you-owned Grok Companion, a container of souls of waifu, cyber livings to bring them into our worlds, wishing to achieve Neuro-sama's altitude. Capable of realtime voice chat, Minec…

TypeScript 40,981 4,130 Updated Jun 14, 2026

Composable HLS library for rapid development of LLM accelerators. FlexLLM enables spatial-temporal hybrid architectures, with parameterized modulet templates customized for the prefill and decode s…

C++ 22 Updated May 31, 2026

This repository contains the code and data for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards".

Python 68 8 Updated Apr 8, 2026

Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"

Python 591 50 Updated Nov 4, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 19,389 1,487 Updated Feb 27, 2026

DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL

Python 322 33 Updated Oct 2, 2025

TradingAgents: Multi-Agents LLM Financial Trading Framework

Python 85,938 16,602 Updated Jun 14, 2026

slime is an LLM post-training framework for RL Scaling.

Python 6,114 894 Updated Jun 13, 2026

Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"

Python 447 52 Updated Jan 26, 2026

🙌 OpenHands: AI-Driven Development

Python 76,961 9,774 Updated Jun 14, 2026

[NIPS 2025 DB Spotlight] AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios

Python 37 1 Updated Dec 1, 2025

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,999 192 Updated Jun 9, 2026
Python 184 16 Updated Dec 5, 2025

Model Context Protocol Servers

TypeScript 87,190 10,999 Updated Jun 7, 2026

Pioneering Automated GUI Interaction with Native Agents

Python 10,946 821 Updated Jan 27, 2026

A live stream development of RL tunning for LLM agents

Python 4,101 574 Updated May 5, 2026

Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.

Python 801 113 Updated May 30, 2026

Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains papers, codes, datasets, evaluations, and analyses.

260 10 Updated Mar 7, 2026

An invisible desktop application to help you pass your technical interviews.

4,440 714 Updated Sep 23, 2025
Python 47 1 Updated Apr 12, 2026

[ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style

Python 82 3 Updated Jul 18, 2025
Python 318 24 Updated Aug 18, 2025
Python 62 5 Updated Oct 29, 2024

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Python 4,178 359 Updated Dec 30, 2025

o1 Chain of Thought Examples

33 1 Updated Oct 4, 2024

LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA

Python 519 31 Updated Dec 31, 2024

An awesome repository & A comprehensive survey on interpretability of LLM attention heads.

TeX 410 12 Updated Mar 2, 2025
Next