withlin · GuangZhou, China

Starred repositories


Use Codex from Claude Code to review code or delegate tasks.

JavaScript 2,866 107 Updated Mar 31, 2026
TypeScript 22,903 926 Updated Mar 30, 2026

Harness Engineering study guide: an in-depth learning archive, from conceptual understanding to independent practice

292 28 Updated Mar 25, 2026

Run Anthropic's Claude Code CLI with OpenAI models such as GPT-5-Codex, GPT-5.1, and others via a local LiteLLM proxy.

Python 220 23 Updated Jan 4, 2026

🔥 LeetCode for PyTorch — practice implementing softmax, attention, GPT-2 and more from scratch with instant auto-grading. Jupyter-based, self-hosted or try online.

Jupyter Notebook 3,291 265 Updated Mar 27, 2026

Supercharge Your LLM with the Fastest KV Cache Layer

Python 7,781 1,048 Updated Mar 31, 2026

LLAMA Turboquant implementation with CUDA support

C++ 279 20 Updated Mar 29, 2026

Memory Sparse Attention - an end-to-end trainable memory framework for 100M-token-scale contexts

2,377 131 Updated Mar 29, 2026

Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization

JavaScript 7,014 748 Updated Mar 29, 2026

An incremental parsing system for programming tools

Rust 24,410 2,522 Updated Mar 31, 2026

State-of-the-Art Text Embeddings

Python 18,473 2,768 Updated Mar 25, 2026

NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments

Go 241 59 Updated Mar 31, 2026

A Claude Code skill to generate images with Nano Banana

233 29 Updated Feb 19, 2026

Tooling for optimized, validated, and reproducible GPU-accelerated AI runtime in Kubernetes

Go 233 22 Updated Mar 31, 2026

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 10,498 1,743 Updated Mar 31, 2026

AI agents running research on single-GPU nanochat training automatically

Python 62,002 8,657 Updated Mar 26, 2026

🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval 🔄.

Python 23,169 2,275 Updated Feb 2, 2026

Lightweight coding agent that runs in your terminal

Rust 68,503 9,184 Updated Mar 31, 2026

An open-source, code-first Go toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.

Go 7,298 597 Updated Mar 26, 2026

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,010 636 Updated Mar 31, 2026

A security-focused library OS supporting kernel- and user-mode execution

Rust 2,542 112 Updated Mar 31, 2026

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

Python 881 65 Updated Mar 4, 2026

Safe Rust wrapper around the CUDA toolkit

Rust 1,091 144 Updated Mar 25, 2026

Tiny, Fast, and Deployable anywhere — automate the mundane, unleash your creativity

Go 26,819 3,755 Updated Mar 31, 2026

Distributed KV cache scheduling & offloading libraries

Go 122 105 Updated Mar 31, 2026

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 4,189 304 Updated Jan 14, 2026

OpenClaw-RL: Train any agent simply by talking

Python 4,436 442 Updated Mar 30, 2026

happy happy happyclaw~

TypeScript 553 88 Updated Mar 30, 2026

The awesome collection of OpenClaw skills. 5,400+ skills filtered and categorized from the official OpenClaw Skills Registry.🦞

43,258 4,123 Updated Mar 26, 2026