Use Garry Tan's exact Claude Code setup: 23 opinionated tools that serve as CEO, Designer, Eng Manager, Release Manager, Doc Engineer, and QA

TypeScript 68,534 9,534 Updated Apr 9, 2026

An Open Foundation Model and Benchmark to Accelerate Generative Recommendation

Python 723 109 Updated Mar 18, 2026

Minimal reproduction of OneRec

Python 1,408 197 Updated Mar 31, 2026

[PyTorch] This repo contains the code for "FORGE: Forming Semantic Identifiers for Generative Retrieval in Industrial Datasets"

Python 203 20 Updated Feb 9, 2026

Hand-code a large language model in three hours for three yuan

Python 360 24 Updated Mar 12, 2026

Eedi - Mining Misconceptions in Mathematics 5th place solution

Python 29 6 Updated Dec 14, 2024
Jupyter Notebook 10 5 Updated Mar 11, 2025

Solution of Kaggle competition: MAP - Charting Student Math Misunderstandings

Python 27 5 Updated Oct 25, 2025

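A collection of industry practice articles on search, recommendation, advertising, user growth, and more (sources: Zhihu, Datafuntalk, technical WeChat official accounts)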

HTML 4,353 470 Updated Apr 9, 2026

A hands-on, step-by-step Huggingface Transformers course; the course videos are updated in sync on Bilibili and YouTube

Jupyter Notebook 3,911 509 Updated Jul 15, 2024

Train your own BitBrain (a mini LLM) 🧠 with an RTX 3090 as the minimum requirement (work in progress).

Python 40 2 Updated Jun 29, 2025

🚀🚀 "Large models": train a 64M-parameter GPT entirely from scratch in just 2h! 🌏

Python 46,308 5,708 Updated Apr 10, 2026

From a nobody to a large language model (LLM) hero~ Stay tuned for more!!!

Jupyter Notebook 2,128 146 Updated Nov 22, 2025

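"A Guide to Consuming Open-Source Large Models": a tutorial tailored for beginners in China on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large models (MLLMs) in a Linux environment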

Jupyter Notebook 29,599 2,924 Updated Mar 27, 2026

Interpretation, extension, and reproduction of the DeepSeek series of work.

Python 722 59 Updated Mar 9, 2026

A review of the top solutions from all NLP competitions; focused solely on NLP competitions and continuously updated!

2,800 419 Updated Apr 4, 2026

Reproduce R1 Zero on Logic Puzzle

Python 2,447 164 Updated Mar 20, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 90,405 13,846 Updated Apr 6, 2026

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 42,029 4,777 Updated Apr 9, 2026
Python 82 32 Updated Aug 19, 2024

We want to create a repo to illustrate the usage of Transformers in Chinese

Shell 3,185 503 Updated Aug 18, 2024

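PyTorch model training code for single precision, half precision, mixed precision, single GPU, multi-GPU (DP / DDP), FSDP, and DeepSpeed, with a comparison of training speed and GPU memory usage across the methods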

Python 132 16 Updated Mar 16, 2024

Chinese NLP solutions (large models, data, models, training, inference)

Jupyter Notebook 3,800 443 Updated Aug 5, 2025

Unsloth Studio is a web UI for training and running open models like Qwen3.5, Gemma 4, DeepSeek, gpt-oss locally.

Python 60,717 5,219 Updated Apr 9, 2026

Reproduction of the supervised and unsupervised SimCSE experiments

Python 152 26 Updated Feb 22, 2024

Training code for a Top 3 gold-medal solution in the 2023 Kaggle LECR competition

Jupyter Notebook 28 8 Updated Mar 16, 2023

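Fengshenbang-LM is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center at the IDEA Research Institute, intended to serve as infrastructure for Chinese AIGC and cognitive intelligence.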

Python 4,148 381 Updated Aug 13, 2024

A quick-start tutorial for the Transformers library

Python 1,860 223 Updated Feb 24, 2026

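AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks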

Jupyter Notebook 16,583 2,350 Updated Sep 3, 2025

1st Place Solution for LLM - Detect AI Generated Text Kaggle Competition

Python 208 33 Updated May 20, 2024