🦆
I love nature
  • DaoCloud
  • Shanghai, China
  • LinkedIn in/yankay

Sponsoring

@vllm-project


An LLM post-training framework with vLLM for RL Scaling

Python 235 15 Updated Jun 13, 2026

🚀 Beautiful highly customizable statusline for Claude Code CLI with powerline support, themes, and more.

TypeScript 10,680 457 Updated Jun 8, 2026

Code search MCP for Claude Code. Make entire codebase the context for any coding agent.

TypeScript 11,835 870 Updated Jun 8, 2026

KVarN is a native vLLM KV-cache quantization backend for your agents: 3-5x more context, throughput above FP16, and FP16-level accuracy. Calibration-free, one flag.

Python 395 22 Updated Jun 12, 2026

Enhancement Proposals and Architecture Decisions

12 17 Updated May 28, 2026
Rust 5 1 Updated Jun 12, 2026

A Kubernetes operator that provides a declarative API to deploy, manage, and safely roll out MCP Servers, handling their full lifecycle with production-grade automation and ecosystem integrations.

Go 28 27 Updated Jun 13, 2026

IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse

107 10 Updated Mar 14, 2026

AI agent skills published by NVIDIA

Python 1,234 156 Updated Jun 13, 2026

ripgrep recursively searches directories for a regex pattern while respecting your gitignore

Rust 65,018 2,592 Updated Jun 5, 2026

π RuView turns commodity WiFi signals into real-time spatial intelligence, vital sign monitoring, and presence detection — all without a single pixel of video.

Rust 73,593 9,818 Updated Jun 13, 2026

Persistent Context Across Sessions for Every Agent – Captures everything your agent does during sessions, compresses it with AI, and injects relevant context back into future sessions. Works with C…

JavaScript 82,113 7,091 Updated Jun 13, 2026

A Kubernetes-native operator for platform teams to inspect, debug, and explore files on idle or active Persistent Volume Claims (PVCs).

Vue 10 1 Updated Jun 12, 2026

DynaSim toolbox for modeling and simulating dynamical systems

MATLAB 64 31 Updated Feb 3, 2025

A computer you can curl ⚡

Python 2,694 206 Updated Apr 17, 2026

zot - A scale-out production-ready vendor-neutral OCI-native container image/artifact registry (purely based on OCI Distribution Specification)

Go 2,344 226 Updated Jun 12, 2026

Streamline your workflow with Lynkr, a CLI tool that acts as an HTTP proxy for efficient code interactions using Claude Code CLI.

JavaScript 469 47 Updated Jun 11, 2026

Pre-indexed code knowledge graph, auto syncs on code changes, for Claude Code, Codex, Gemini, Cursor, OpenCode, AntiGravity, Kiro, and Hermes Agent — fewer tokens, fewer tool calls, 100% local

TypeScript 48,531 2,970 Updated Jun 13, 2026

from vibe coding to agentic engineering - practice makes claude perfect

HTML 57,619 5,784 Updated Jun 13, 2026

Bash is all you need - A nano claude code–like 「agent harness」, built from 0 to 1

Python 66,383 10,819 Updated Jun 7, 2026

A lightweight, secure, cloud-native ACP harness that bridges Discord and any ACP-compatible coding CLI.

Rust 577 152 Updated Jun 13, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 5 2 Updated Feb 15, 2026

[NeurIPS 2025] Thinkless: LLM Learns When to Think

Python 260 20 Updated Sep 26, 2025
Python 1 Updated May 13, 2026

Light Image Video Generation Inference Framework

Python 2,389 216 Updated Jun 13, 2026

Qwen-Image-Lightning: Speed up Qwen-Image model with distillation

Python 1,330 45 Updated Jan 1, 2026

Ultimate collection of Claude Code tips, tricks, hacks, and workflows that you can use to master Claude Code in minutes

1,770 246 Updated May 27, 2026

Ghostty-based macOS terminal with vertical tabs and notifications for AI coding agents

Swift 21,956 1,691 Updated Jun 13, 2026

Unlimited FREE AI coding. Connect Claude Code, Codex, Cursor, Cline, Copilot, Antigravity to FREE Claude/GPT/Gemini via 40+ providers. Auto-fallback, RTK -40% tokens, never hit limits.

JavaScript 17,426 2,679 Updated Jun 13, 2026