Stars
Repo for the EMNLP'24 Paper "Dual-Space Knowledge Distillation for Large Language Models". A general white-box KD framework for both same-tokenizer and cross-tokenizer LLM distillation.
Elevate your AI research writing, no more tedious polishing ✨
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
DSPy: The framework for programming—not prompting—language models
verl: Volcano Engine Reinforcement Learning for LLMs
Fully open reproduction of DeepSeek-R1
🔥🔥🔥 [IEEE TCSVT] Latest Papers, Codes and Datasets on Vid-LLMs.
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)
Free ChatGPT&DeepSeek API Key,免费ChatGPT&DeepSeek API。免费接入DeepSeek API和GPT4 API,支持 gpt | deepseek | claude | gemini | grok 等排名靠前的常用大模型。
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
Tools for merging pretrained large language models.
hydy100 / R3nzSkin
Forked from R3nzTheCodeGOD/R3nzSkinSkin changer for League of Legends (LOL)
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Ongoing research training transformer models at scale
[ACL2024] Are U a Joke Master? Pun Generation via Multi-Stage Curriculum Learning towards a Humor LLM
Mokuroh0924 / one-api
Forked from songquanpeng/one-apiOpenAI 接口管理 & 分发系统,支持 Azure、Anthropic Claude、Google PaLM 2 & Gemini、智谱 ChatGLM、百度文心一言、讯飞星火认知、阿里通义千问、360 智脑以及腾讯混元,可用于二次分发管理 key,仅单可执行文件,已打包好 Docker 镜像,一键部署,开箱即用. OpenAI key management & redistributi…
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
LLM-Merging: Building LLMs Efficiently through Merging
Chinese Stable Diffusion, zh SD,中文文生图,中文SD,中文Stable Diffusion
Reference implementation for DPO (Direct Preference Optimization)