Stars
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Programmer's guide to cooking at home (Simplified Chinese only).
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
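A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are cheaper to train, including base models, domain-specific fine-tuning and applications, datasets, and tutorials.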
verl: Volcano Engine Reinforcement Learning for LLMs
Build Conversational AI in minutes ⚡️
Notes on knowledge and interview questions for large language model (LLM) algorithm and application engineers.
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
slime is an LLM post-training framework for RL Scaling.
[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.
Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.
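A project for training a large language model from scratch, covering pretraining, fine-tuning, and direct preference optimization (DPO). The model has 1B parameters and supports Chinese and English.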
RewardBench: the first evaluation tool for reward models.
[NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6) (https://arxiv.org/abs/2501.06425)
pyKT: A Python Library to Benchmark Deep Learning based Knowledge Tracing Models
siiRL: Shanghai Innovation Institute RL Framework for Advanced LLMs and Multi-Agent Systems
Official implementation of "TimeBridge: Non-Stationarity Matters for Long-term Time Series Forecasting" (ICML 2025)
EDM 2025, Using Large Multimodal Models to Extract Knowledge Components for Knowledge Tracing from Multimedia Question Information
Simple (slightly optimized) implementation of Tensor Product Attention from the T6 paper with a KV cache