Skip to content
View withlin's full-sized avatar
🧸
🧸
  • GuangZhou,China

Block or report withlin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Use Codex from Claude Code to review code or delegate tasks.

JavaScript 6,095 300 Updated Mar 31, 2026
TypeScript 28,035 1,296 Updated Mar 31, 2026

Harness Engineering 学习指南 — 从概念理解到独立实践的深度学习档案

322 32 Updated Mar 31, 2026

Run Anthropic's Claude Code CLI with OpenAI models such as GPT-5-Codex, GPT-5.1, and others via a local LiteLLM proxy.

Python 220 23 Updated Jan 4, 2026

🔥 LeetCode for PyTorch — practice implementing softmax, attention, GPT-2 and more from scratch with instant auto-grading. Jupyter-based, self-hosted or try online.

Jupyter Notebook 3,313 268 Updated Mar 27, 2026

Supercharge Your LLM with the Fastest KV Cache Layer

Python 7,819 1,052 Updated Apr 1, 2026

LLAMA Turboquant implementation with CUDA support

C++ 299 21 Updated Mar 29, 2026

Memory Sparse Attention - 亿级(100M)token 上下文的端到端可训练记忆框架

2,401 133 Updated Apr 1, 2026

Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization

JavaScript 7,018 748 Updated Mar 29, 2026

An incremental parsing system for programming tools

Rust 24,420 2,525 Updated Mar 31, 2026

State-of-the-Art Text Embeddings

Python 18,479 2,767 Updated Mar 25, 2026

NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments

Go 242 59 Updated Mar 31, 2026

A Claude Code skill to generate images with Nano Banana

235 29 Updated Feb 19, 2026

Tooling for optimized, validated, and reproducible GPU-accelerated AI runtime in Kubernetes

Go 235 22 Updated Mar 31, 2026

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 10,502 1,742 Updated Mar 31, 2026

AI agents running research on single-GPU nanochat training automatically

Python 62,853 8,801 Updated Mar 26, 2026

🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval 🔄.

Python 23,177 2,278 Updated Feb 2, 2026

Lightweight coding agent that runs in your terminal

Rust 70,463 9,750 Updated Apr 1, 2026

An open-source, code-first Go toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.

Go 7,315 597 Updated Mar 31, 2026

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,017 641 Updated Apr 1, 2026

A security-focused library OS supporting kernel- and user-mode execution

Rust 2,544 112 Updated Apr 1, 2026

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

Python 884 65 Updated Mar 4, 2026

Safe rust wrapper around CUDA toolkit

Rust 1,091 145 Updated Mar 25, 2026

Tiny, Fast, and Deployable anywhere — automate the mundane, unleash your creativity

Go 26,934 3,768 Updated Apr 1, 2026

Distributed KV cache scheduling & offloading libraries

Go 122 106 Updated Mar 31, 2026

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 4,195 305 Updated Jan 14, 2026

OpenClaw-RL: Train any agent simply by talking

Python 4,484 448 Updated Mar 31, 2026

happy happy happyclaw~

TypeScript 561 91 Updated Apr 1, 2026

The awesome collection of OpenClaw skills. 5,400+ skills filtered and categorized from the official OpenClaw Skills Registry.🦞

43,467 4,151 Updated Mar 26, 2026
Next