Stars
《Build a Large Language Model (From Scratch)》是一本深入探讨大语言模型原理与实现的电子书,适合希望深入了解 GPT 等大模型架构、训练过程及应用开发的学习者。为了让更多中文读者能够接触到这本极具价值的教材,我决定将其翻译成中文,并通过 GitHub 进行开源共享。
LLMs-from-scratch项目中文翻译
It is said that, Ilya Sutskever gave John Carmack this reading list of ~ 30 research papers on deep learning.
A course on aligning smol models.
🧑🚀 全世界最好的LLM资料总结(多模态生成、Agent、辅助编程、AI审稿、数据处理、模型训练、模型推理、o1 模型、MCP、小语言模型、视觉语言模型) | Summary of the world's best LLM resources.
Code from the CMU LM inference fall 2025 edition.
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
[Pytorch] The repo contains the code for "FORGE: Forming Semantic Identifiers for Generative Retrieval in Industrial Datasets"
📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程
📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉
GRID: Generative Recommendation with Semantic IDs
[Pytorch] Generative retrieval model using semantic IDs from "Recommender Systems with Generative Retrieval"
Awesome Generative Recommendation papers primarily focused on industry-level applications.
Awesome things about generative recommendation models.
A book for Learning the Foundations of LLMs
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Papers and resources about POI recommendation. | 兴趣点推荐相关论文、模型和资源。
推荐/广告/搜索领域工业界经典以及最前沿论文集合。A collection of industry classics and cutting-edge papers in the field of recommendation/advertising/search.
Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).
Tutorial on Generative Modeling: Interacting with Deep Generative Models for Content Creation
Large Language Model-enhanced Recommender System Papers