Lists (1)
Sort Name ascending (A-Z)
Stars
Build Conversational AI in minutes ⚡️
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
verl: Volcano Engine Reinforcement Learning for LLMs
slime is an LLM post-training framework for RL Scaling.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).
siiRL: Shanghai Innovation Institute RL Framework for Advanced LLMs and Multi-Agent Systems
[NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6) (https://arxiv.org/abs/2501.06425)
[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.
Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.
pyKT: A Python Library to Benchmark Deep Learning based Knowledge Tracing Models
RewardBench: the first evaluation tool for reward models.
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
Official implementation of "TimeBridge: Non-Stationarity Matters for Long-term Time Series Forecasting" (ICML 2025)
主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题
这是一个从头训练大语言模型的项目,包括预训练、微调和直接偏好优化,模型拥有1B参数,支持中英文。
Simple (slightly optimized) implementation of Tensor Product Attention from the T6 paper with a KV cache
EDM 2025, Using Large Multimodal Models to Extract Knowledge Components for Knowledge Tracing from Multimedia Question Information