Piplup0924

🎯

Focusing

Chen Yang Piplup0924

🎯

Focusing

3 followers · 5 following

Shanghai

Organizations

Stars

songmzhang / DSKD

Repo for the EMNLP'24 Paper "Dual-Space Knowledge Distillation for Large Language Models". A general white-box KD framework for both same-tokenizer and cross-tokenizer LLM distillation.

Python 63 12 Updated Mar 21, 2026

Leey21 / awesome-ai-research-writing

Elevate your AI research writing, no more tedious polishing ✨

17,775 1,435 Updated Mar 25, 2026

alibaba / ROLL

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,082 268 Updated Apr 15, 2026

stanfordnlp / dspy

DSPy: The framework for programming—not prompting—language models

Python 33,726 2,792 Updated Apr 15, 2026

verl-project / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,706 3,662 Updated Apr 15, 2026

Unakar / Logic-RL

Reproduce R1 Zero on Logic Puzzle

Python 2,446 165 Updated Mar 20, 2025

GAIR-NLP / LIMO

[COLM 2025] LIMO: Less is More for Reasoning

Python 1,072 55 Updated Jul 30, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,988 2,415 Updated Apr 2, 2026

dengc2023 / LongDocURL

Python 40 4 Updated Apr 6, 2026

yunlong10 / Awesome-LLMs-for-Video-Understanding

🔥🔥🔥 [IEEE TCSVT] Latest Papers, Codes and Datasets on Vid-LLMs.

3,149 142 Updated Mar 28, 2026

tencent-ailab / persona-hub

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Python 1,538 126 Updated Feb 19, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,354 917 Updated Apr 15, 2026

chatanywhere / GPT_API_free

Free ChatGPT&DeepSeek API Key，免费ChatGPT&DeepSeek API。免费接入DeepSeek API和GPT4 API，支持 gpt | deepseek | claude | gemini | grok 等排名靠前的常用大模型。

Python 37,335 2,600 Updated Apr 13, 2026

openreasoner / openr

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,842 131 Updated Jan 17, 2025

arcee-ai / mergekit

Tools for merging pretrained large language models.

Python 6,979 690 Updated Mar 15, 2026

hydy100 / R3nzSkin

Forked from R3nzTheCodeGOD/R3nzSkin

Skin changer for League of Legends (LOL)

C++ 1,655 117 Updated Mar 18, 2026

JavaScript 1 Updated Aug 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Chen Yang Piplup0924

Organizations

Block or report Piplup0924

Stars

songmzhang / DSKD

Leey21 / awesome-ai-research-writing

alibaba / ROLL

stanfordnlp / dspy

verl-project / verl

Unakar / Logic-RL

GAIR-NLP / LIMO

huggingface / open-r1

dengc2023 / LongDocURL

yunlong10 / Awesome-LLMs-for-Video-Understanding

tencent-ailab / persona-hub

OpenRLHF / OpenRLHF

chatanywhere / GPT_API_free

openreasoner / openr

arcee-ai / mergekit

hydy100 / R3nzSkin

zhentingqi / rStar

meta-llama / llama3

bigscience-workshop / Megatron-DeepSpeed

NVIDIA / Megatron-LM

cubenlp / PGCL

Mokuroh0924 / one-api

X-PLUG / mPLUG-Owl

NVIDIA / RULER

llm-merging / LLM-Merging

bilibili / Index-1.9B

RexWzh / JupyterNotebook

ShineChen1024 / MiaoBi

LawsonAbs / pun

eric-mitchell / direct-preference-optimization