Skip to content
View ztxz16's full-sized avatar

Block or report ztxz16

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

fastllm是后端无依赖的高性能大模型推理库。同时支持张量并行推理稠密模型和混合模式推理MOE模型,任意10G以上显卡即可推理满血DeepSeek。双路9004/9005服务器+单显卡部署DeepSeek满血满精度原版模型,单并发20tps;INT4量化模型单并发30tps,多并发可达60+。

C++ 4,788 471 Updated Jun 14, 2026

REAP: Router-weighted Expert Activation Pruning for SMoE compression

Python 403 74 Updated Apr 17, 2026

High Performance LLM Inference Operator Library

C++ 947 97 Updated Jun 11, 2026

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程

Jupyter Notebook 30,970 3,026 Updated Jun 17, 2026

CodeGeeX2: A More Powerful Multilingual Code Generation Model

Python 7,548 536 Updated Jul 10, 2024

中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理

Python 36,422 10,928 Updated Nov 15, 2025

A simple css3 animation page

CSS 33 12 Updated Jun 1, 2016

An alternative to original alert, confirm and prompt.

JavaScript 177 16 Updated Mar 16, 2017