shiningliang
🎯
Focusing
  • Microsoft
  • Beijing, China

A curated list of awesome Claude Skills, resources, and tools for customizing Claude AI workflows

Python 59,994 6,524 Updated May 7, 2026

My Python scripts to make high-quality figures for publications in top AI conferences and journals.

Python 1,932 130 Updated May 11, 2026
TypeScript 7 1 Updated May 15, 2026

Diffbot LLM Inference Server

Python 236 27 Updated Aug 21, 2025

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN

Python 65,618 6,715 Updated May 13, 2026

Code and dataset for paper: DeepPlanner: Scaling Planning Capability for Deep Research Agents via Advantage Shaping

Python 37 1 Updated Dec 9, 2025

A version of verl to support diverse tool use

Python 978 80 Updated Mar 2, 2026

slime is an LLM post-training framework for RL Scaling.

Python 5,697 794 Updated May 14, 2026

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,157 281 Updated May 15, 2026

Recommends new arXiv papers of interest to you daily, based on your Zotero library.

Python 5,300 4,683 Updated May 15, 2026

Self-Adapting Language Models

Python 1,765 308 Updated Aug 1, 2025

[NeurIPS 2025] 🌐 WebThinker: Empowering Large Reasoning Models with Deep Research Capability

Python 1,443 137 Updated Dec 8, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 4,706 423 Updated Nov 13, 2025

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.

C++ 4,515 421 Updated May 15, 2026
Python 4,487 488 Updated Apr 22, 2026
Python 184 16 Updated Dec 5, 2025

[EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"

Python 69 2 Updated Apr 11, 2025

Custom reward development on top of verl

Python 174 7 Updated May 2, 2026

🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025]

Python 1,221 106 Updated Nov 17, 2025

SWE-bench: Can Language Models Resolve Real-world GitHub Issues?

Python 4,947 860 Updated Apr 1, 2026

Rigorous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024

Python 1,744 198 Updated Oct 2, 2025

A framework for the evaluation of autoregressive code generation language models.

Python 1,041 263 Updated Jul 22, 2025

This repository mainly collects interview questions for large language model (LLM) algorithm engineers.

2,546 171 Updated Dec 26, 2024

A comprehensive collection of process reward models.

154 4 Updated Oct 4, 2025

Competitive programming template library by 灵茶山艾府 💭💡🎈

Go 8,399 798 Updated May 15, 2026

Scalable RL solution for advanced reasoning of language models

Python 1,857 112 Updated Mar 18, 2025

Curated, opinionated index of post-R1 LLM × Reinforcement Learning. Many deep-dive blog posts cross-linked to many papers — GRPO, DAPO, DPO, PPO, RLHF, GSPO, CISPO, VAPO, Reward Modeling, MoE RL st…

68 5 Updated Apr 25, 2026

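Walkthrough of the reinforcement learning material in Chapter 6 of 《大规模语言模型:从理论到实践》 (Large Language Models: From Theory to Practice)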

Jupyter Notebook 38 1 Updated Mar 13, 2026