Use Garry Tan's exact Claude Code setup: 23 opinionated tools that serve as CEO, Designer, Eng Manager, Release Manager, Doc Engineer, and QA

TypeScript 59,616 7,827 Updated Mar 31, 2026

An Open Foundation Model and Benchmark to Accelerate Generative Recommendation

Python 704 107 Updated Mar 18, 2026

Minimal reproduction of OneRec

Python 1,361 187 Updated Mar 31, 2026

[PyTorch] The repo contains the code for "FORGE: Forming Semantic Identifiers for Generative Retrieval in Industrial Datasets"

Python 198 20 Updated Feb 9, 2026

Hand-code a large language model in three hours for three yuan

Python 326 22 Updated Mar 12, 2026

Eedi - Mining Misconceptions in Mathematics 5th place solution

Python 29 6 Updated Dec 14, 2024
Jupyter Notebook 10 5 Updated Mar 11, 2025

Solution of Kaggle competition: MAP - Charting Student Math Misunderstandings

Python 27 5 Updated Oct 25, 2025

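A collection of industry practice articles on search, recommendation, advertising, user growth, and related topics (sources: Zhihu, Datafuntalk, technical WeChat official accounts)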

HTML 4,339 469 Updated Mar 31, 2026

A hands-on, step-by-step Huggingface Transformers course; the course videos are updated in sync on Bilibili and YouTube

Jupyter Notebook 3,901 508 Updated Jul 15, 2024

Train your own BitBrain (a mini LLM) with as little as a single RTX 3090 🧠 (work in progress).

Python 39 2 Updated Jun 29, 2025

🚀🚀 "Large Models": Train a 64M-parameter GPT completely from scratch in just 2 hours! 🌏

Python 45,133 5,458 Updated Mar 31, 2026

From nobody to large language model (LLM) hero. Stay tuned for more!

Jupyter Notebook 2,111 145 Updated Nov 22, 2025

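"A Guide to Consuming Open-Source Large Models": a tutorial, tailored for Chinese beginners, on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large models (MLLMs) in a Linux environment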

Jupyter Notebook 29,410 2,898 Updated Mar 27, 2026

Interpretation, extension, and reproduction of the DeepSeek series of work.

Python 713 59 Updated Mar 9, 2026

A review of the top solutions from all NLP competitions; focused only on NLP competitions and continuously updated!

2,798 420 Updated Mar 15, 2026

Reproduce R1 Zero on Logic Puzzle

Python 2,443 163 Updated Mar 20, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 89,647 13,691 Updated Apr 1, 2026

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,950 4,771 Updated Mar 31, 2026
Python 82 32 Updated Aug 19, 2024

We want to create a repo to illustrate the usage of Transformers in Chinese

Shell 3,170 501 Updated Aug 18, 2024

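PyTorch model training code covering single precision, half precision, mixed precision, single GPU, multi-GPU (DP / DDP), FSDP, and DeepSpeed, with a comparison of training speed and GPU memory usage across the methods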

Python 131 16 Updated Mar 16, 2024

Chinese NLP solutions (large models, data, models, training, inference)

Jupyter Notebook 3,798 444 Updated Aug 5, 2025

Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.

Python 58,771 4,984 Updated Mar 31, 2026

Reproduction of the supervised and unsupervised SimCSE experiments

Python 152 26 Updated Feb 22, 2024

Training code for a 2023 Kaggle LECR gold-medal, Top 3 solution

Jupyter Notebook 28 8 Updated Mar 16, 2023

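Fengshenbang-LM (封神榜大模型) is an open-source large model ecosystem led by the Center for Cognitive Computing and Natural Language Research at the IDEA Research Institute, serving as infrastructure for Chinese AIGC and cognitive intelligence.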

Python 4,148 381 Updated Aug 13, 2024

A quick-start tutorial for the Transformers library

Python 1,857 224 Updated Feb 24, 2026

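AISystem refers mainly to AI systems, covering full-stack, low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks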

Jupyter Notebook 16,544 2,344 Updated Sep 3, 2025

1st Place Solution for LLM - Detect AI Generated Text Kaggle Competition

Python 208 33 Updated May 20, 2024