Skip to content
View withlin's full-sized avatar
🧸
🧸
  • GuangZhou,China

Block or report withlin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Use Codex from Claude Code to review code or delegate tasks.

JavaScript 5,779 280 Updated Mar 31, 2026
TypeScript 27,387 1,225 Updated Mar 31, 2026

Harness Engineering 学习指南 — 从概念理解到独立实践的深度学习档案

318 32 Updated Mar 31, 2026

Run Anthropic's Claude Code CLI with OpenAI models such as GPT-5-Codex, GPT-5.1, and others via a local LiteLLM proxy.

Python 220 23 Updated Jan 4, 2026

🔥 LeetCode for PyTorch — practice implementing softmax, attention, GPT-2 and more from scratch with instant auto-grading. Jupyter-based, self-hosted or try online.

Jupyter Notebook 3,311 268 Updated Mar 27, 2026

Supercharge Your LLM with the Fastest KV Cache Layer

Python 7,819 1,052 Updated Apr 1, 2026

LLAMA Turboquant implementation with CUDA support

C++ 298 21 Updated Mar 29, 2026

Memory Sparse Attention - 亿级(100M)token 上下文的端到端可训练记忆框架

2,395 133 Updated Mar 29, 2026

Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization

JavaScript 7,016 748 Updated Mar 29, 2026

An incremental parsing system for programming tools

Rust 24,418 2,524 Updated Mar 31, 2026

State-of-the-Art Text Embeddings

Python 18,478 2,767 Updated Mar 25, 2026

NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments

Go 242 59 Updated Mar 31, 2026

A Claude Code skill to generate images with Nano Banana

235 29 Updated Feb 19, 2026

Tooling for optimized, validated, and reproducible GPU-accelerated AI runtime in Kubernetes

Go 235 22 Updated Mar 31, 2026

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 10,502 1,743 Updated Mar 31, 2026

AI agents running research on single-GPU nanochat training automatically

Python 62,754 8,780 Updated Mar 26, 2026

🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval 🔄.

Python 23,177 2,276 Updated Feb 2, 2026

Lightweight coding agent that runs in your terminal

Rust 70,060 9,630 Updated Apr 1, 2026

An open-source, code-first Go toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.

Go 7,311 597 Updated Mar 31, 2026

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,015 639 Updated Mar 31, 2026

A security-focused library OS supporting kernel- and user-mode execution

Rust 2,544 112 Updated Mar 31, 2026

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

Python 882 65 Updated Mar 4, 2026

Safe rust wrapper around CUDA toolkit

Rust 1,091 145 Updated Mar 25, 2026

Tiny, Fast, and Deployable anywhere — automate the mundane, unleash your creativity

Go 26,924 3,765 Updated Apr 1, 2026

Distributed KV cache scheduling & offloading libraries

Go 122 105 Updated Mar 31, 2026

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 4,195 304 Updated Jan 14, 2026

OpenClaw-RL: Train any agent simply by talking

Python 4,478 447 Updated Mar 31, 2026

happy happy happyclaw~

TypeScript 561 91 Updated Mar 31, 2026

The awesome collection of OpenClaw skills. 5,400+ skills filtered and categorized from the official OpenClaw Skills Registry.🦞

43,430 4,147 Updated Mar 26, 2026
Next