Skip to content
View ax7e's full-sized avatar

Block or report ax7e

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 4,092 293 Updated Jan 14, 2026

Nano vLLM

Python 12,437 1,781 Updated Nov 3, 2025

Post-training with Tinker

Python 2,978 359 Updated Mar 25, 2026

诺亚盘古大模型研发背后的真正的心酸与黑暗的故事。

11,385 1,325 Updated Jul 9, 2025

A tiny yet powerful LLM inference system tailored for researching purpose. vLLM-equivalent performance with only 2k lines of code (2% of vLLM).

Python 321 36 Updated Jun 10, 2025

Code for reproducing our paper "Are Sparse Autoencoders Useful? A Case Study in Sparse Probing"

Jupyter Notebook 33 7 Updated Mar 31, 2025

This repository contains the replication of the iGSM dataset generation process from the Physics of LLM paper by Zeyuan Zhu.

Python 17 1 Updated Sep 13, 2024

TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles.

Jupyter Notebook 163 16 Updated Oct 16, 2024

lightweight, standalone C++ inference engine for Google's Gemma models.

C++ 6,759 602 Updated Mar 25, 2026

SPLADE: sparse neural search (SIGIR21, SIGIR22)

Python 985 94 Updated May 3, 2024

DeepSeek LLM: Let there be answers

Makefile 6,787 1,060 Updated Feb 4, 2024

Pond: CXL-Based Memory Pooling Systems for Cloud Platforms (ASPLOS'23)

HTML 221 45 Updated Oct 13, 2024

DeepSeek Coder: Let the Code Write Itself

Python 22,950 2,747 Updated Nov 11, 2025

Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024

Python 1,702 192 Updated Oct 2, 2025

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 66,141 6,647 Updated Jan 22, 2026

Extracting spatial and temporal world models from LLMs

Jupyter Notebook 260 25 Updated Oct 17, 2023

Your API ⇒ Paid MCP. Instantly.

TypeScript 18,130 2,239 Updated Feb 11, 2026

A library that provides an embeddable, persistent key-value store for fast storage.

C++ 31,669 6,766 Updated Mar 25, 2026

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 9,878 622 Updated Feb 21, 2025

Inference code for CodeLlama models

Python 16,334 1,937 Updated Aug 12, 2024

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

Python 5,593 351 Updated Sep 12, 2025

Unofficial Pytorch implementation of Dom-LM paper.

Python 33 12 Updated Mar 6, 2023

Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022)

Python 105 11 Updated Mar 16, 2023

All-in-one text de-duplication

Python 749 75 Updated Mar 9, 2026

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 10,391 775 Updated Mar 25, 2026

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 16,517 2,343 Updated Sep 3, 2025

AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)

Python 94 17 Updated Jul 14, 2023

The Legion Parallel Programming System

C++ 754 152 Updated Dec 17, 2025
Next