Starred repositories
分享 GitHub 上有趣、入门级的开源项目。Share interesting, entry-level open source projects on GitHub.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
A topic-centric list of HQ open datasets.
List of Computer Science courses with video lectures.
A collection of design patterns/idioms in Python
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
本文原文由知名 Hacker Eric S. Raymond 所撰寫,教你如何正確的提出技術問題並獲得你滿意的答案。
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
PyTorch Tutorial for Deep Learning Researchers
Generative Models by Stability AI
解决Cursor在免费订阅期间出现以下提示的问题: Your request has been blocked as our system has detected suspicious activity / You've reached your trial request limit. / Too many free trial accounts used on this machine.
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
verl: Volcano Engine Reinforcement Learning for LLMs
✔(已完结)最全面的 深度学习 笔记【土堆 Pytorch】【李沐 动手学深度学习】【吴恩达 深度学习】
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题
Enjoy the magic of Diffusion models!
《动手学大模型Dive into LLMs》系列编程实践教程
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
NanoDet-Plus⚡Super fast and lightweight anchor-free object detection model. 🔥Only 980 KB(int8) / 1.8MB (fp16) and run 97FPS on cellphone🔥
Perceptual video quality assessment based on multi-method fusion.
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer