Skip to content
View wanghailiang's full-sized avatar

Block or report wanghailiang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

《Build a Large Language Model (From Scratch)》是一本深入探讨大语言模型原理与实现的电子书,适合希望深入了解 GPT 等大模型架构、训练过程及应用开发的学习者。为了让更多中文读者能够接触到这本极具价值的教材,我决定将其翻译成中文,并通过 GitHub 进行开源共享。

HTML 3,239 563 Updated Sep 7, 2025

LLMs-from-scratch项目中文翻译

Jupyter Notebook 2,335 382 Updated Oct 15, 2025

It is said that, Ilya Sutskever gave John Carmack this reading list of ~ 30 research papers on deep learning.

1,385 152 Updated Jun 4, 2024

A course on aligning smol models.

Jupyter Notebook 6,580 2,295 Updated Feb 6, 2026

🧑‍🚀 全世界最好的LLM资料总结(多模态生成、Agent、辅助编程、AI审稿、数据处理、模型训练、模型推理、o1 模型、MCP、小语言模型、视觉语言模型) | Summary of the world's best LLM resources.

7,538 729 Updated Feb 15, 2026

Code from the CMU LM inference fall 2025 edition.

Python 34 8 Updated Dec 7, 2025

Minimal reproduction of OneRec

Python 1,054 150 Updated Feb 1, 2026

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程

Jupyter Notebook 28,265 2,820 Updated Feb 10, 2026

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

22,230 2,104 Updated May 19, 2025

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 39,622 4,772 Updated Feb 6, 2026

The best ChatGPT that $100 can buy.

Python 43,592 5,681 Updated Feb 18, 2026

[Pytorch] The repo contains the code for "FORGE: Forming Semantic Identifiers for Generative Retrieval in Industrial Datasets"

Python 183 17 Updated Feb 9, 2026

📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程

Python 21,065 2,414 Updated Feb 10, 2026

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 5,000 342 Updated Jan 18, 2026

GRID: Generative Recommendation with Semantic IDs

Python 582 105 Updated Oct 15, 2025

[Pytorch] Generative retrieval model using semantic IDs from "Recommender Systems with Generative Retrieval"

Python 727 106 Updated Sep 22, 2025

Awesome Generative Recommendation papers primarily focused on industry-level applications.

204 11 Updated Jan 5, 2026

Nano vLLM

Python 11,728 1,587 Updated Nov 3, 2025

Awesome things about generative recommendation models.

104 4 Updated Apr 28, 2025

A book for Learning the Foundations of LLMs

15,762 1,487 Updated Dec 12, 2025

大模型基础: 一文了解大模型基础知识

6,756 569 Updated Dec 18, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 85,465 12,929 Updated Feb 18, 2026

Papers and resources about POI recommendation. | 兴趣点推荐相关论文、模型和资源。

242 35 Updated Feb 18, 2024

推荐/广告/搜索领域工业界经典以及最前沿论文集合。A collection of industry classics and cutting-edge papers in the field of recommendation/advertising/search.

Python 2,063 262 Updated Dec 4, 2025

Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).

Python 1,739 353 Updated Feb 18, 2026

常用中文停用词表及对比

77 45 Updated Feb 20, 2019

Tutorial on Generative Modeling: Interacting with Deep Generative Models for Content Creation

20 3 Updated Oct 29, 2020

Large Language Model-enhanced Recommender System Papers

743 63 Updated Jan 16, 2026
Next