LuYanFCP
Starred repositories

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 80,269 16,877 Updated May 17, 2026

Learning TileLang with 10 puzzles!

Python 273 32 Updated Apr 28, 2026

FlashInfer: Kernel Library for LLM Serving

Python 5,626 977 Updated May 17, 2026
Python 90 13 Updated May 17, 2026

Pure Rust Inference Engine

Rust 335 38 Updated May 17, 2026

TokenSpeed is a speed-of-light LLM inference engine.

Python 1,036 94 Updated May 17, 2026

high-performance linear attention kernel library built on TileLang

Python 488 37 Updated May 7, 2026

A kernel library written in TileLang

Python 1,524 126 Updated Apr 23, 2026

Lucebox: LLM inference server built for speed for specific consumer hardware.

C++ 2,138 200 Updated May 17, 2026

FlashKDA: high-performance Kimi Delta Attention kernels

Cuda 425 34 Updated Apr 22, 2026

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 160,694 33,238 Updated May 17, 2026

Open-source, low-cost 10.5 GHz PLFM phased array RADAR system

PLSQL 20,278 4,819 Updated May 15, 2026

The Native Terminal Emulator with a builtin AI Harness

Rust 431 26 Updated May 13, 2026
C 82 12 Updated Apr 14, 2024

Information collection for the Happy Horse AI video generator model. Official demo and updates at happyhorses.io.

630 62 Updated May 12, 2026

Intelligence for Kubernetes. World's most promising Kubernetes Visualization Tool for Developer and Platform Engineering teams.

Go 1,714 110 Updated Apr 25, 2026

A Flash Player emulator written in Rust

Rust 18,080 1,026 Updated May 17, 2026

A dedicated effort to make an optimized, bleeding edge vLLM image using Docker to support DGX comprehensively

Cuda 113 21 Updated Feb 22, 2026

LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar

Python 14,403 1,214 Updated May 14, 2026
Python 15 2 Updated Mar 22, 2026

Bub it. Build it. A hook-first runtime for agents that live alongside people.

Python 1,332 127 Updated May 17, 2026

Python interface to PROJ (cartographic projections and coordinate transformations library)

Python 1,207 233 Updated Apr 20, 2026

Visual Skills Pack for Obsidian: generate Canvas, Excalidraw, and Mermaid diagrams from text with Claude Code

2,781 251 Updated Feb 12, 2026

Make Any Website & Tool Your CLI. A universal CLI Hub and AI-native runtime. Transform any website, Electron app, or local binary into a standardized command-line interface. Built for AI Agents to …

JavaScript 21,503 2,175 Updated May 16, 2026

Turn any project into a tmux-powered terminal IDE with a simple ide.yml

TypeScript 464 26 Updated May 15, 2026

⭐️ A cross-platform CLI All-in-One assistant tool for Claude Code, Codex & Gemini CLI.

Rust 2,756 168 Updated May 17, 2026

SGLang Omni: High-Performance Multi-Stage Pipeline Framework for Omni Models

Python 283 118 Updated May 17, 2026

Distributed Compiler based on Triton for Parallel Systems

Python 1,426 142 Updated Apr 22, 2026

Terminal UI for NVIDIA Nsight Systems profiles — timeline viewer, kernel navigator, NVTX hierarchy

Python 55 11 Updated May 14, 2026

A KeePass/Password Safe Client for iOS and OS X

Objective-C 1,435 126 Updated Nov 5, 2025