Skip to content
View withlin's full-sized avatar
🧸
🧸
  • GuangZhou,China

Block or report withlin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Use Codex from Claude Code to review code or delegate tasks.

JavaScript 4,402 175 Updated Mar 31, 2026
TypeScript 25,257 1,081 Updated Mar 31, 2026

Harness Engineering 学习指南 — 从概念理解到独立实践的深度学习档案

311 31 Updated Mar 25, 2026

Run Anthropic's Claude Code CLI with OpenAI models such as GPT-5-Codex, GPT-5.1, and others via a local LiteLLM proxy.

Python 220 23 Updated Jan 4, 2026

🔥 LeetCode for PyTorch — practice implementing softmax, attention, GPT-2 and more from scratch with instant auto-grading. Jupyter-based, self-hosted or try online.

Jupyter Notebook 3,303 266 Updated Mar 27, 2026

Supercharge Your LLM with the Fastest KV Cache Layer

Python 7,793 1,050 Updated Mar 31, 2026

LLAMA Turboquant implementation with CUDA support

C++ 290 21 Updated Mar 29, 2026

Memory Sparse Attention - 亿级(100M)token 上下文的端到端可训练记忆框架

2,383 132 Updated Mar 29, 2026

Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization

JavaScript 7,014 748 Updated Mar 29, 2026

An incremental parsing system for programming tools

Rust 24,416 2,523 Updated Mar 31, 2026

State-of-the-Art Text Embeddings

Python 18,475 2,768 Updated Mar 25, 2026

NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments

Go 241 59 Updated Mar 31, 2026

A Claude Code skill to generate images with Nano Banana

233 29 Updated Feb 19, 2026

Tooling for optimized, validated, and reproducible GPU-accelerated AI runtime in Kubernetes

Go 233 22 Updated Mar 31, 2026

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 10,501 1,743 Updated Mar 31, 2026

AI agents running research on single-GPU nanochat training automatically

Python 62,348 8,715 Updated Mar 26, 2026

🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval 🔄.

Python 23,176 2,276 Updated Feb 2, 2026

Lightweight coding agent that runs in your terminal

Rust 68,705 9,207 Updated Mar 31, 2026

An open-source, code-first Go toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.

Go 7,302 597 Updated Mar 31, 2026

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,011 638 Updated Mar 31, 2026

A security-focused library OS supporting kernel- and user-mode execution

Rust 2,543 112 Updated Mar 31, 2026

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

Python 882 65 Updated Mar 4, 2026

Safe rust wrapper around CUDA toolkit

Rust 1,091 145 Updated Mar 25, 2026

Tiny, Fast, and Deployable anywhere — automate the mundane, unleash your creativity

Go 26,874 3,759 Updated Mar 31, 2026

Distributed KV cache scheduling & offloading libraries

Go 122 105 Updated Mar 31, 2026

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 4,194 304 Updated Jan 14, 2026

OpenClaw-RL: Train any agent simply by talking

Python 4,455 445 Updated Mar 30, 2026

happy happy happyclaw~

TypeScript 558 91 Updated Mar 30, 2026

The awesome collection of OpenClaw skills. 5,400+ skills filtered and categorized from the official OpenClaw Skills Registry.🦞

43,353 4,133 Updated Mar 26, 2026
Next