Skip to content
View LinPoly's full-sized avatar
😅
😅

Block or report LinPoly

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,941 453 Updated Mar 27, 2026

cloc counts blank lines, comment lines, and physical lines of source code in many programming languages.

Perl 22,751 1,102 Updated Mar 13, 2026

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Python 96,646 8,951 Updated Mar 26, 2026

OCaml - Oxidized!

OCaml 684 137 Updated Mar 27, 2026

Modern scientific computing for OCaml

OCaml 364 49 Updated Mar 24, 2026

💥💻💥 A data-parallel functional programming language

Haskell 2,688 198 Updated Mar 27, 2026

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 12,833 2,331 Updated Mar 25, 2026

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,781 1,017 Updated Mar 27, 2026

A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.

Python 2,936 318 Updated Jan 14, 2026

DeepEP: an efficient expert-parallel communication library

Cuda 9,074 1,130 Updated Feb 9, 2026

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,540 1,006 Updated Feb 6, 2026

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 6,289 842 Updated Mar 22, 2026

Mirror of the Glasgow Haskell Compiler. Please submit issues and patches to GHC's Gitlab instance (https://gitlab.haskell.org/ghc/ghc). First time contributors are encouraged to get started with th…

Haskell 3,225 730 Updated Mar 27, 2026

The Python programming language

Python 72,119 34,319 Updated Mar 27, 2026

Universal LLM Deployment Engine with ML Compilation

Python 22,281 1,977 Updated Mar 27, 2026

eBPF Developer Tutorial: Learning eBPF Step by Step with Examples

C 4,009 570 Updated Mar 11, 2026

A debugging and profiling tool that can trace and visualize python code execution

Python 7,598 469 Updated Feb 16, 2026

AlphaFold 3 inference pipeline.

Python 7,772 1,161 Updated Mar 10, 2026

A language server for Standard ML.

Rust 240 12 Updated Mar 25, 2026

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …

Python 2,250 318 Updated Mar 27, 2026

Neural Code Intelligence Survey 2024-25; Reading lists and resources

282 15 Updated Jul 24, 2025

Fast and memory-efficient exact attention

Python 23,011 2,558 Updated Mar 26, 2026

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 13,200 2,218 Updated Mar 27, 2026

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 7,726 674 Updated Mar 27, 2026

Ongoing research training transformer models at scale

Python 15,826 3,763 Updated Mar 27, 2026

科技爱好者周刊,每周五发布

86,816 3,921 Updated Mar 27, 2026

This is the Rust course used by the Android team at Google. It provides you the material to quickly teach Rust.

Rust 32,789 1,991 Updated Mar 22, 2026

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,498 1,752 Updated Mar 24, 2026

LeetCode 101:力扣刷题指南

10,023 1,259 Updated Feb 12, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 74,521 14,853 Updated Mar 27, 2026
Next