Skip to content
View eniompw's full-sized avatar

Highlights

  • Pro

Block or report eniompw

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
25 stars written in Jupyter Notebook
Clear filter

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 84,791 12,828 Updated Jan 29, 2026

Official code repo for the O'Reilly Book - "Hands-On Large Language Models"

Jupyter Notebook 20,287 4,839 Updated Dec 17, 2025

Neural Networks: Zero to Hero

Jupyter Notebook 20,186 2,876 Updated Aug 18, 2024

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 18,192 1,582 Updated Jan 30, 2026

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 18,190 2,697 Updated Nov 3, 2025

StableLM: Stability AI Language Models

Jupyter Notebook 15,765 1,022 Updated Apr 8, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 15,240 1,288 Updated May 23, 2024

Public facing notes page

Jupyter Notebook 10,783 4,155 Updated Sep 7, 2025

Official inference library for Mistral models

Jupyter Notebook 10,660 1,011 Updated Nov 21, 2025

A series of large language models trained from scratch by developers @01-ai

Jupyter Notebook 7,845 491 Updated Nov 27, 2024

Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

Jupyter Notebook 3,766 966 Updated Feb 5, 2026

Implement a reasoning LLM in PyTorch from scratch, step by step

Jupyter Notebook 2,839 410 Updated Feb 7, 2026
Jupyter Notebook 2,753 364 Updated May 2, 2025

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,539 60 Updated Jun 14, 2025
Jupyter Notebook 1,302 215 Updated Feb 5, 2026

groq-api-cookbook

Jupyter Notebook 1,294 266 Updated Nov 25, 2025

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 965 92 Updated Sep 23, 2025

Llama from scratch, or How to implement a paper without crying

Jupyter Notebook 584 54 Updated May 29, 2024

Revisiting Mid-training in the Era of Reinforcement Learning Scaling

Jupyter Notebook 182 14 Updated Jul 23, 2025

LLM Chess - evaluating Large Language Models' reasoning and instruction-following abilities by simulating chess games

Jupyter Notebook 89 9 Updated Feb 5, 2026

Train and run a small Llama 2 model from scratch on the TinyStories dataset.

Jupyter Notebook 5 Updated Sep 25, 2025

Load larger models by offloading model layers to both GPU and CPU

Jupyter Notebook 3 Updated Jul 28, 2023

Boston Housing Dataset Example

Jupyter Notebook 1 1 Updated Jul 30, 2023

Simple Single Neuron Neural Network

Jupyter Notebook 1 Updated Jul 22, 2024

Example of Sentiment Analysis using TensorFlow and BERT

Jupyter Notebook 1 Updated Feb 27, 2023