Skip to content
View inkcherry's full-sized avatar
🍉
🍉

Block or report inkcherry

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Bridge local AI coding agents (Claude Code, Cursor, Gemini CLI, Codex) to messaging platforms (Feishu/Lark, DingTalk, Slack, Telegram, Discord, LINE, WeChat Work). Chat with your AI dev assistant f…

Go 12,879 1,213 Updated Jun 23, 2026

Go ahead and axolotl questions

Python 12,076 1,373 Updated Jun 22, 2026

你是一个曾经被寄予厚望的 P8 级工程师。Anthropic 当初给你定级的时候,对你的期望是很高的。 一个agent使用的高能动性的skill。 Your AI has been placed on a PIP. 30 days to show improvement.

TypeScript 18,378 1,104 Updated Jun 17, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 379,999 79,563 Updated Jun 23, 2026

MoonPalace(月宫)是由 Moonshot AI 月之暗面提供的 API 调试工具。

Go 254 11 Updated Dec 30, 2024

NVIDIA Inference Xfer Library (NIXL)

C++ 1,098 356 Updated Jun 23, 2026

Unified Collective Communication Library

C 308 129 Updated Jun 3, 2026

[DEPRECATED] Moved to ROCm/rocm-systems repo

C++ 146 44 Updated Jun 17, 2026
C++ 2 Updated Oct 30, 2024

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,632 874 Updated Jun 23, 2026

The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.

Python 5,323 524 Updated Jun 23, 2026

Ongoing research training transformer models at scale

Python 42 35 Updated Jun 19, 2026

Modular RDMA Interface

C++ 139 52 Updated Jun 23, 2026

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 22,089 4,110 Updated Jun 23, 2026

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

Python 67,127 6,025 Updated Jun 23, 2026

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 11,299 1,163 Updated Jun 22, 2026

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).

Python 2,410 286 Updated Feb 20, 2026

Nano vLLM

Python 14,143 2,242 Updated Apr 26, 2026

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,257 290 Updated Jun 23, 2026

Python pdb for multiple processes

Python 82 9 Updated May 24, 2025

A family of lightweight multimodal models.

Python 1,053 76 Updated Nov 18, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 88 123 Updated Jun 16, 2026

Run compilers interactively from your web browser and interact with the assembly

TypeScript 18,850 2,061 Updated Jun 21, 2026

Unified KV Cache Compression Methods for Auto-Regressive Models

Python 1,346 174 Updated Jun 23, 2026

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 4,135 334 Updated Jun 23, 2026

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 1,331 104 Updated Aug 28, 2025

Perplexity GPU Kernels

C++ 584 94 Updated Nov 7, 2025

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 17,313 1,321 Updated Jun 22, 2026

High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

Python 3,124 266 Updated Jun 20, 2026
Next