Skip to content
View Piplup0924's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@cubenlp

Block or report Piplup0924

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Repo for the EMNLP'24 Paper "Dual-Space Knowledge Distillation for Large Language Models". A general white-box KD framework for both same-tokenizer and cross-tokenizer LLM distillation.

Python 63 12 Updated Mar 21, 2026

Elevate your AI research writing, no more tedious polishing ✨

17,775 1,435 Updated Mar 25, 2026

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,082 268 Updated Apr 15, 2026

DSPy: The framework for programming—not prompting—language models

Python 33,726 2,792 Updated Apr 15, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,706 3,662 Updated Apr 15, 2026

Reproduce R1 Zero on Logic Puzzle

Python 2,446 165 Updated Mar 20, 2025

[COLM 2025] LIMO: Less is More for Reasoning

Python 1,072 55 Updated Jul 30, 2025

Fully open reproduction of DeepSeek-R1

Python 25,988 2,415 Updated Apr 2, 2026
Python 40 4 Updated Apr 6, 2026

🔥🔥🔥 [IEEE TCSVT] Latest Papers, Codes and Datasets on Vid-LLMs.

3,149 142 Updated Mar 28, 2026

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Python 1,538 126 Updated Feb 19, 2025

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,354 917 Updated Apr 15, 2026

Free ChatGPT&DeepSeek API Key,免费ChatGPT&DeepSeek API。免费接入DeepSeek API和GPT4 API,支持 gpt | deepseek | claude | gemini | grok 等排名靠前的常用大模型。

Python 37,335 2,600 Updated Apr 13, 2026

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,842 131 Updated Jan 17, 2025

Tools for merging pretrained large language models.

Python 6,979 690 Updated Mar 15, 2026

Skin changer for League of Legends (LOL)

C++ 1,655 117 Updated Mar 18, 2026
Python 971 110 Updated Jan 23, 2025

The official Meta Llama 3 GitHub site

Python 29,287 3,529 Updated Jan 26, 2025

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 1,437 226 Updated Mar 20, 2024

Ongoing research training transformer models at scale

Python 16,053 3,829 Updated Apr 15, 2026

[ACL2024] Are U a Joke Master? Pun Generation via Multi-Stage Curriculum Learning towards a Humor LLM

Python 3 Updated Aug 20, 2024

OpenAI 接口管理 & 分发系统,支持 Azure、Anthropic Claude、Google PaLM 2 & Gemini、智谱 ChatGLM、百度文心一言、讯飞星火认知、阿里通义千问、360 智脑以及腾讯混元,可用于二次分发管理 key,仅单可执行文件,已打包好 Docker 镜像,一键部署,开箱即用. OpenAI key management & redistributi…

JavaScript 1 Updated Aug 19, 2024

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Python 2,538 190 Updated Apr 2, 2025

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Python 1,509 125 Updated Nov 13, 2025

LLM-Merging: Building LLMs Efficiently through Merging

Jupyter Notebook 209 44 Updated Sep 24, 2024

A lightweight multilingual LLM

Python 1,017 49 Updated Aug 8, 2025

服务,项目,实验 Jupyter Notebook

1 Updated Apr 19, 2024

Chinese Stable Diffusion, zh SD,中文文生图,中文SD,中文Stable Diffusion

Python 49 4 Updated Mar 11, 2024

pun word location

Python 5 2 Updated Oct 22, 2022

Reference implementation for DPO (Direct Preference Optimization)

Python 2,882 234 Updated Aug 11, 2024
Next