Starred repositories
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
A latent text-to-image diffusion model
12 Lessons to Get Started Building AI Agents
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
面向开发者的 LLM 入门教程,吴恩达大模型系列课程中文版
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
This repository contains the source code for the paper First Order Motion Model for Image Animation
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
QLoRA: Efficient Finetuning of Quantized LLMs
『ゼロから作る Deep Learning』(O'Reilly Japan, 2016)
Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation
手把手带你实战 Huggingface Transformers 课程视频同步更新在B站与YouTube
『ゼロから作る Deep Learning ❹』(O'Reilly Japan, 2022)
📚 🐣 软件实践文集。主题不限,思考讨论有趣有料就好,包含如 系统的模型分析/量化分析、开源漫游者指南、软件可靠性设计实践、平台产品的逻辑与执行… 🥤
Multi-Layer Key-Value sharing experiments on Pythia models