Skip to content
View gxy-gxy's full-sized avatar
  • UCAS
  • Beijing, China

Block or report gxy-gxy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

The evaluation code for MultiIF multi-turn and multi-lingual instruction following

Python 61 10 Updated Oct 29, 2024

Z-library,官方Z-lib镜像网址及入口(2026/6/7)

HTML 809 55 Updated Dec 21, 2025

A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.

HTML 936 238 Updated Mar 26, 2026

Machine Learning Engineering Open Book

Python 17,543 1,113 Updated Mar 16, 2026

The Art of Debugging Open Book

Python 1,331 67 Updated Mar 16, 2026

This repository contains a Freebase dump parser that extracts links to Wikipedia.

Python 27 3 Updated May 8, 2018

🍦 Never use print() to debug again.

Python 10,034 217 Updated Jan 21, 2026

Language Savant. If your repository's language is being reported incorrectly, send us a pull request!

Ruby 13,388 5,083 Updated Mar 18, 2026

Collection of LLM completions for reasoning-gym task datasets

Python 31 7 Updated Jul 4, 2025

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,373 114 Updated Mar 25, 2026

FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones

Python 65 7 Updated Jan 26, 2026

[Up-to-date] Awesome Agentic Deep Research Resources

684 56 Updated Jan 17, 2026

🥢像老乡鸡🐔那样做饭。主要部分于2024年完工,非老乡鸡官方仓库。文字来自《老乡鸡菜品溯源报告》,并做归纳、编辑与整理。CookLikeHOC.

JavaScript 23,329 2,335 Updated Oct 17, 2025

Agentic Learning Powered by AWorld

Python 94 8 Updated Mar 21, 2026

Bridge Megatron-Core to Hugging Face/Reinforcement Learning

Python 204 61 Updated Mar 25, 2026

Best practices for training DeepSeek, Mixtral, Qwen and other MoE models using Megatron Core.

Python 181 30 Updated Mar 17, 2026
Python 63 6 Updated Aug 19, 2025

[EMNLP 2025] CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward

Jupyter Notebook 65 2 Updated Aug 10, 2025

Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains papers, codes, datasets, evaluations, and analyses.

259 10 Updated Mar 7, 2026

AI-Powered Python & Python-Powered AI (Python-Use)

HTML 4,085 396 Updated Feb 15, 2026

[EMNLP 2025] Awesome RAG Reasoning Resources

413 35 Updated Jan 25, 2026

open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality

Python 236 25 Updated Aug 2, 2024

(best/better) practices of megatron on veRL and tuning guide

Shell 132 10 Updated Sep 26, 2025
Python 356 20 Updated Jul 29, 2025

Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]

TypeScript 88 13 Updated Jan 18, 2025

A python module to repair invalid JSON from LLMs

Python 4,614 176 Updated Mar 24, 2026

Paper list for Efficient Reasoning.

859 37 Updated Mar 16, 2026

slime is an LLM post-training framework for RL Scaling.

Python 4,978 667 Updated Mar 26, 2026

A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enabling layer-wise analysis of hidden states and predictions.

Jupyter Notebook 159 16 Updated Aug 14, 2025
Next