Skip to content
View LinPoly's full-sized avatar
😅
😅

Block or report LinPoly

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,957 460 Updated Mar 31, 2026

cloc counts blank lines, comment lines, and physical lines of source code in many programming languages.

Perl 22,760 1,100 Updated Mar 13, 2026

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Python 96,720 8,978 Updated Mar 30, 2026

OCaml - Oxidized!

OCaml 684 137 Updated Mar 31, 2026

Modern scientific computing for OCaml

OCaml 365 49 Updated Mar 24, 2026

💥💻💥 A data-parallel functional programming language

Haskell 2,689 198 Updated Mar 31, 2026

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 12,847 2,334 Updated Mar 25, 2026

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,788 1,024 Updated Mar 30, 2026

A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.

Python 2,938 319 Updated Jan 14, 2026

DeepEP: an efficient expert-parallel communication library

Cuda 9,091 1,137 Updated Mar 31, 2026

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,547 1,005 Updated Mar 31, 2026

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 6,299 847 Updated Mar 22, 2026

Mirror of the Glasgow Haskell Compiler. Please submit issues and patches to GHC's Gitlab instance (https://gitlab.haskell.org/ghc/ghc). First time contributors are encouraged to get started with th…

Haskell 3,228 729 Updated Mar 31, 2026

The Python programming language

Python 72,135 34,340 Updated Mar 31, 2026

Universal LLM Deployment Engine with ML Compilation

Python 22,295 1,979 Updated Mar 29, 2026

eBPF Developer Tutorial: Learning eBPF Step by Step with Examples

C 4,013 571 Updated Mar 11, 2026

A debugging and profiling tool that can trace and visualize python code execution

Python 7,598 470 Updated Feb 16, 2026

AlphaFold 3 inference pipeline.

Python 7,785 1,165 Updated Mar 31, 2026

A language server for Standard ML.

Rust 240 12 Updated Mar 25, 2026

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …

Python 2,290 324 Updated Mar 31, 2026

Neural Code Intelligence Survey 2024-25; Reading lists and resources

282 15 Updated Jul 24, 2025

Fast and memory-efficient exact attention

Python 23,064 2,569 Updated Mar 31, 2026

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 13,232 2,234 Updated Mar 31, 2026

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 7,739 675 Updated Mar 30, 2026

Ongoing research training transformer models at scale

Python 15,869 3,771 Updated Mar 31, 2026

科技爱好者周刊,每周五发布

87,070 3,926 Updated Mar 27, 2026

This is the Rust course used by the Android team at Google. It provides you the material to quickly teach Rust.

Rust 32,796 1,993 Updated Mar 22, 2026

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,511 1,759 Updated Mar 30, 2026

LeetCode 101:力扣刷题指南

10,026 1,258 Updated Feb 12, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 74,829 15,001 Updated Mar 31, 2026
Next