Skip to content
View vpj's full-sized avatar
😜
😜

Organizations

@labmlai

Block or report vpj

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

AI agents running research on single-GPU nanochat training automatically

Python 87,537 12,663 Updated Mar 26, 2026

95% token savings. 155x faster queries. 16 languages. LLMs can't read your entire codebase. TLDR extracts structure, traces dependencies, and gives them exactly what they need.

Python 1,171 113 Updated Jan 17, 2026

cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

Python 2,080 140 Updated Jun 17, 2026

A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).

Python 369 45 Updated Jun 10, 2026

LLM Council works together to answer your hardest questions

Python 21,025 3,915 Updated Nov 22, 2025

Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs

HTML 1,143 166 Updated Jun 11, 2026

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 6,516 606 Updated Jun 18, 2026

Fast CUDA matrix multiplication from scratch

Cuda 1,221 196 Updated Sep 2, 2025
TypeScript 8 Updated Feb 19, 2026

πŸš€πŸ€– Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN

Python 68,868 7,022 Updated Jun 18, 2026

Prompts for our Grok chat assistant and the `@grok` bot on X.

Jinja 4,162 462 Updated Nov 17, 2025

Implementation of the sparse attention pattern proposed by the Deepseek team in their "Native Sparse Attention" paper

Python 808 52 Updated Aug 15, 2025

Fully open reproduction of DeepSeek-R1

Python 26,329 2,444 Updated Apr 2, 2026

πŸ™ Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

MDX 75,725 8,229 Updated Mar 11, 2026

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,759 273 Updated Jul 18, 2025

Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044

Python 36 5 Updated Oct 3, 2024

Deep learning for dummies. All the practical details and useful utilities that go into working with real models.

Python 845 44 Updated Mar 15, 2026

A comprehensive repository of reasoning tasks for LLMs (and beyond)

JavaScript 494 70 Updated Sep 27, 2024

LLM101n: Let's build a Storyteller

37,337 2,050 Updated Aug 1, 2024
Jupyter Notebook 2,269 517 Updated Jun 11, 2026

LLM Analytics

TypeScript 714 34 Updated Oct 19, 2024

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 20,842 1,491 Updated Jun 13, 2026

Convert PDF to markdown + JSON quickly with high accuracy

Python 36,201 2,498 Updated Jun 6, 2026

πŸ”Ž Monitor deep learning model training and hardware usage from your mobile phone πŸ“±

Python 2,322 151 Updated Apr 10, 2025

Curate better data for LLMs

Python 1,071 105 Updated Mar 19, 2024

Code for Quiet-STaR

Python 739 92 Updated Aug 21, 2024

Grok open release

Python 51,689 8,477 Updated Aug 30, 2024

A multi-programming language benchmark for LLMs

Python 307 56 Updated Apr 12, 2026

MLX: An array framework for Apple silicon

C++ 27,137 1,924 Updated Jun 17, 2026

DeepSeek LLM: Let there be answers

Makefile 7,078 1,097 Updated Feb 4, 2024
Next