
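《Build a Large Language Model (From Scratch)》 is an e-book that explores the principles and implementation of large language models in depth. It suits learners who want a thorough understanding of the architecture, training process, and application development of large models such as GPT. To make this valuable textbook accessible to more Chinese readers, I decided to translate it into Chinese and share it as open source on GitHub.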

HTML 3,754 635 Updated Sep 7, 2025

Chinese translation of the LLMs-from-scratch project

Jupyter Notebook 2,690 444 Updated Apr 19, 2026

It is said that Ilya Sutskever gave John Carmack this reading list of ~30 research papers on deep learning.

1,570 171 Updated Jun 4, 2024

A course on aligning smol models.

Jupyter Notebook 6,661 2,280 Updated May 26, 2026

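🧑‍🚀 Summary of the world's best LLM resources (multimodal generation, agents, coding assistance, AI paper review, data processing, model training, model inference, o1 models, MCP, small language models, vision-language models).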

8,562 899 Updated Jun 20, 2026

Code from the CMU LM inference fall 2025 edition.

Python 44 12 Updated Dec 7, 2025

Minimal reproduction of OneRec

Python 1,655 236 Updated May 14, 2026

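《开源大模型食用指南》 (A Guide to Using Open-Source Large Models): a tutorial tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large models (MLLMs) in a Linux environment.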

Jupyter Notebook 31,002 3,033 Updated Jun 17, 2026

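A curated collection of open-source Chinese large language models, focusing on models that are relatively small, can be deployed privately, and are inexpensive to train. It covers base models, vertical-domain fine-tunes and applications, datasets, and tutorials.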

22,634 2,135 Updated May 10, 2026

🧠 "Large Models": train a 64M-parameter LLM completely from scratch in just 2 hours!

Python 52,037 6,691 Updated Jun 1, 2026

The best ChatGPT that $100 can buy.

Python 55,303 7,591 Updated May 5, 2026

[KDD2026] The repo contains the code for "FORGE: Forming Semantic Identifiers for Generative Retrieval in Industrial Datasets"

Python 232 22 Updated Feb 9, 2026

📚 《从零开始构建智能体》 (Building Agents from Scratch): a from-scratch tutorial on agent principles and practice

Python 60,789 7,492 Updated Jun 18, 2026

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 5,350 410 Updated Apr 20, 2026

GRID: Generative Recommendation with Semantic IDs

Python 698 122 Updated Oct 15, 2025

[PyTorch] Generative retrieval model using semantic IDs from "Recommender Systems with Generative Retrieval"

Python 814 114 Updated Jun 20, 2026

Awesome Generative Recommendation papers primarily focused on industry-level applications.

223 13 Updated Jun 1, 2026

Nano vLLM

Python 14,126 2,239 Updated Apr 26, 2026

Awesome things about generative recommendation models.

112 6 Updated Apr 28, 2025

A book for Learning the Foundations of LLMs

16,372 1,558 Updated Dec 12, 2025

Foundations of Large Models: learn the fundamentals of large models in one article

7,390 613 Updated Jun 22, 2026

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 97,514 14,940 Updated Jun 2, 2026

Papers, models, and resources about point-of-interest (POI) recommendation.

250 35 Updated Feb 18, 2024

A collection of classic and cutting-edge industry papers in recommendation, advertising, and search.

Python 2,156 266 Updated Jun 11, 2026

Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).

Python 1,928 394 Updated Jun 18, 2026

Commonly used Chinese stopword lists and a comparison of them

77 45 Updated Feb 20, 2019

Tutorial on Generative Modeling: Interacting with Deep Generative Models for Content Creation

21 3 Updated Oct 29, 2020

Large Language Model-enhanced Recommender System Papers

766 65 Updated Mar 15, 2026