Self-hosted AI assistant with tool use, multi-agent orchestration, coding copilot and a lightweight Flask + vanilla JS stack.

Python 121 14 Updated Jun 11, 2026

An introduction to ODEs and their applications in vision and language

HTML 15 1 Updated Feb 26, 2026

dLLM: Simple Diffusion Language Modeling

Python 2,564 267 Updated Apr 15, 2026

The code for "Is ChatGPT Fair for Recommendation? Evaluating Fairness in Large Language Model Recommendation"

Jupyter Notebook 43 7 Updated Jun 15, 2023
Python 3 8 Updated May 20, 2026

A conda-forge distribution.

Shell 9,880 511 Updated Jun 3, 2026

Minimalistic large language model 3D-parallelism training

Python 2,716 317 Updated May 26, 2026

A repository for pretraining a discrete diffusion model (LLaDA), with all components built on the Hugging Face ecosystem.

Python 13 1 Updated Aug 27, 2025
Python 31 5 Updated Aug 18, 2025
Python 110 20 Updated Jan 24, 2026

🔥 DailyHot (今日热榜) API: an API that aggregates trending data, with support for RSS mode and Vercel deployment | Frontend: https://github.com/imsyy/DailyHot

TypeScript 3,864 1,287 Updated Mar 11, 2026

GUI for LLaDA Diffusion LLM with Quantization for low end GPU and CPU options.

Python 25 2 Updated Mar 7, 2025

Kimi K2 is the large language model series developed by the Moonshot AI team

10,840 850 Updated Jan 21, 2026

Fast and memory-efficient exact attention

Python 24,117 2,823 Updated Jun 10, 2026

Nano vLLM

Python 13,993 2,201 Updated Apr 26, 2026

The true story of the hardship and darkness behind the development of the Noah Pangu large model.

11,421 1,312 Updated Jul 9, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 82,590 17,956 Updated Jun 12, 2026

A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.

Python 2,924 402 Updated Jun 11, 2026

ParamMute: Suppressing Knowledge-Critical FFNs for Faithful Retrieval-Augmented Generation

Python 58 5 Updated Feb 2, 2026

A lightweight suffix-sorting library

C 409 92 Updated Mar 25, 2020

Reverse Engineering the Abstraction and Reasoning Corpus

Jupyter Notebook 351 53 Updated Feb 24, 2025

Infinity is a high-throughput, low-latency serving engine for text embeddings, reranking models, CLIP, CLAP and ColPali

Python 2,830 191 Updated Mar 24, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 28,915 6,471 Updated Jun 12, 2026

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 3,249 243 Updated Jun 8, 2026

Tools for merging pretrained large language models.

Python 7,137 730 Updated May 6, 2026

Static memory-efficient Trie-like structures for Python based on the marisa-trie C++ library.

Cython 1,124 98 Updated Apr 8, 2026

Python package for string algorithms ➰

Python 51 6 Updated Feb 3, 2026

The libsais library provides fast linear-time construction of suffix array (SA), generalized suffix array (GSA), longest common prefix (LCP) array, permuted LCP (PLCP) array, Burrows-Wheeler transf…

C 240 30 Updated Sep 10, 2025

NumPy and SciPy on Multi-Node Multi-GPU systems

Python 978 87 Updated Jun 12, 2026

An unofficial implementation of the Infini-gram model proposed by Liu et al. (2024)

Go 33 6 Updated Jun 19, 2024