Skip to content
View mssssss123's full-sized avatar

Block or report mssssss123

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
26 stars written in Jupyter Notebook
Clear filter

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 78,081 11,527 Updated Nov 6, 2025

Examples and guides for using the OpenAI API

Jupyter Notebook 69,062 11,547 Updated Nov 5, 2025

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 52,396 6,132 Updated Sep 18, 2024

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Jupyter Notebook 27,148 2,721 Updated Nov 5, 2025

Anthropic's Interactive Prompt Engineering Tutorial

Jupyter Notebook 25,961 2,366 Updated Jul 11, 2024

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程

Jupyter Notebook 25,759 2,589 Updated Nov 4, 2025

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 18,009 2,640 Updated Nov 3, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 16,047 1,274 Updated Oct 27, 2025

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,736 865 Updated Jun 10, 2024

MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

Jupyter Notebook 8,408 520 Updated Oct 8, 2025

A series of large language models trained from scratch by developers @01-ai

Jupyter Notebook 7,846 488 Updated Nov 27, 2024

Taming Transformers for High-Resolution Image Synthesis

Jupyter Notebook 6,337 1,216 Updated Jul 30, 2024

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 4,683 440 Updated Nov 4, 2025
Jupyter Notebook 4,159 549 Updated May 2, 2025

The hub for EleutherAI's work on interpretability and learning dynamics

Jupyter Notebook 2,661 195 Updated Jun 9, 2025

This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov

Jupyter Notebook 1,975 328 Updated May 21, 2025

Simple implementation of OpenAI CLIP model in PyTorch.

Jupyter Notebook 714 104 Updated Oct 18, 2025

LaVIT: Empower the Large Language Model to Understand and Generate Visual Content

Jupyter Notebook 596 32 Updated Oct 6, 2024

欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。

Jupyter Notebook 349 39 Updated Jul 21, 2024

The GPT-based Universal Web Scraper MVP is a solution that leverages GPT models and web scraping libraries to generate scraper code based on user input and website analysis, simplifying the web scr…

Jupyter Notebook 270 46 Updated Mar 7, 2024

RAG兴趣小组,全手写的一个RAG应用。Langchain的大部分库会很方便,但是你不一定理解其中原理,所以代码尽可能展现基本算法,主打理解RAG的原理

Jupyter Notebook 240 13 Updated Sep 25, 2024

Comprehensive benchmark for RAG

Jupyter Notebook 232 27 Updated Jun 14, 2025
Jupyter Notebook 227 30 Updated Dec 18, 2023

SciGraphQA: Large-Scale Synthetic Multi-Turn Question-Answering Dataset for Scientific Graphs

Jupyter Notebook 43 2 Updated Oct 25, 2024
Jupyter Notebook 34 3 Updated Mar 6, 2025