shiningliang

Follow

🎯

Focusing

Liang Shining shiningliang

🎯

Focusing

Follow

CS Ph.D @ JLU; Current @microsoft STCA

32 followers · 4 following

Microsoft
Beijing, China

Achievements

Achievements

Stars

ComposioHQ / awesome-claude-skills

A curated list of awesome Claude Skills, resources, and tools for customizing Claude AI workflows

Python 59,994 6,524 Updated May 7, 2026

ChenLiu-1996 / figures4papers

My Python scripts to make high-quality figures for publications in top AI conferences and journals.

Python 1,932 130 Updated May 11, 2026

corelli18512 / kraki

TypeScript 7 1 Updated May 15, 2026

diffbot / diffbot-llm-inference

DIffbot LLM Inference Server

Python 236 27 Updated Aug 21, 2025

unclecode / crawl4ai

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN

Python 65,618 6,715 Updated May 13, 2026

AlexFanw / DeepPlanner

Code and dataset for paper: DeepPlanner: Scaling Planning Capability for Deep Research Agents via Advantage Shaping

Python 37 1 Updated Dec 9, 2025

YunjiaXi / Awesome-Search-Agent-Papers

135 5 Updated Apr 29, 2026

TIGER-AI-Lab / verl-tool

A version of verl to support diverse tool use

Python 978 80 Updated Mar 2, 2026

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 5,697 794 Updated May 14, 2026

alibaba / ROLL

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,157 281 Updated May 15, 2026

xhyumiracle / Awesome-AgenticLLM-RL-Papers

1,766 78 Updated Jan 20, 2026

TideDra / zotero-arxiv-daily

Recommend new arxiv papers of your interest daily according to your Zotero libarary.

Python 5,300 4,683 Updated May 15, 2026

Continual-Intelligence / SEAL

Self-Adapting Language Models

Python 1,765 308 Updated Aug 1, 2025

RUC-NLPIR / WebThinker

[NeurIPS 2025] 🌐 WebThinker: Empowering Large Reasoning Models with Deep Research Capability

Python 1,443 137 Updated Dec 8, 2025

PeterGriffinJin / Search-R1

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 4,706 423 Updated Nov 13, 2025

infiniflow / infinity

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.

C++ 4,515 421 Updated May 15, 2026

openai / simple-evals

Python 4,487 488 Updated Apr 22, 2026

THU-KEG / AdaptThink

Python 184 16 Updated Dec 5, 2025

efficientscaling / Z1

[EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"

Python 69 2 Updated Apr 11, 2025

yuanzhoulvpi2017 / nano_rl

Forked from verl-project/verl

在verl上做reward的定制开发

Python 174 7 Updated May 2, 2026

RUC-NLPIR / Search-o1

🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025]

Python 1,221 106 Updated Nov 17, 2025

SWE-bench / SWE-bench

SWE-bench: Can Language Models Resolve Real-world Github Issues?

Python 4,947 860 Updated Apr 1, 2026

evalplus / evalplus

Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024

Python 1,744 198 Updated Oct 2, 2025

bigcode-project / bigcode-evaluation-harness

A framework for the evaluation of autoregressive code generation language models.

Python 1,041 263 Updated Jul 22, 2025

km1994 / LLMs_interview_notes

该仓库主要记录大模型（LLMs）算法工程师相关的面试题

2,546 171 Updated Dec 26, 2024

RyanLiu112 / Awesome-Process-Reward-Models

A comprehensive collection of process reward models.

154 4 Updated Oct 4, 2025

EndlessCheng / codeforces-go

算法竞赛模板库 by 灵茶山艾府 💭💡🎈

Go 8,399 798 Updated May 15, 2026

PRIME-RL / PRIME

Scalable RL solution for advanced reasoning of language models

Python 1,857 112 Updated Mar 18, 2025

hscspring / rl-llm-nlp

Curated, opinionated index of post-R1 LLM × Reinforcement Learning. Many deep-dive blog posts cross-linked to many papers — GRPO, DAPO, DPO, PPO, RLHF, GSPO, CISPO, VAPO, Reward Modeling, MoE RL st…

68 5 Updated Apr 25, 2026

wenzhaoabc / llm-tap-rl

《大规模语言模型：从理论到实践》第六章强化学习部分内容讲解

Jupyter Notebook 38 1 Updated Mar 13, 2026