Ldpe2G

Follow

I may be slow to respond.

Liang Depeng Ldpe2G

I may be slow to respond.

Follow

384 followers · 491 following

Sun Yat-sen University
Guang Zhou, China
https://www.zhihu.com/people/liang-de-peng/posts

Achievements

Achievements

Stars

NVIDIA / cutile-python

cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

Python 1,659 85 Updated Dec 20, 2025

tile-ai / TileRT

Tile-Based Runtime for Ultra-Low-Latency LLM Inference

Python 462 20 Updated Dec 23, 2025

deepreinforce-ai / CUDA-L2

CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning

Cuda 249 19 Updated Dec 15, 2025

hyx1999 / SAM-Decoding

Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automaton

Python 39 1 Updated Feb 13, 2025

jcrist / msgspec

A fast serialization and validation library, with builtin support for JSON, MessagePack, YAML, and TOML

Python 3,429 124 Updated Nov 27, 2025

ijl / orjson

Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy

Python 7,708 279 Updated Dec 18, 2025

OpenHands / OpenHands

🙌 OpenHands: AI-Driven Development

Python 65,869 8,105 Updated Dec 23, 2025

SWE-bench / SWE-bench

SWE-bench: Can Language Models Resolve Real-world Github Issues?

Python 4,003 719 Updated Dec 18, 2025

hemingkx / Spec-Bench

Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)

Python 345 45 Updated Apr 22, 2025

snowflakedb / ArcticInference

ArcticInference: vLLM plugin for high-throughput, low-latency inference

Python 354 40 Updated Dec 16, 2025

gaogaotiantian / viztracer

A debugging and profiling tool that can trace and visualize python code execution

Python 7,460 467 Updated Dec 21, 2025

dsl-learn / triton-tutorial

Getting Started with Triton: A Tutorial for Python Beginners

HTML 27 2 Updated Oct 21, 2025

toon-format / toon

🎒 Token-Oriented Object Notation (TOON) – Compact, human-readable, schema-aware JSON for LLM prompts. Spec, benchmarks, TypeScript SDK.

TypeScript 21,066 928 Updated Dec 15, 2025

QwenLM / Qwen3Guard

Qwen3Guard is a multilingual guardrail model series developed by the Qwen team at Alibaba Cloud.

Python 388 26 Updated Oct 21, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,731 2,877 Updated Dec 23, 2025

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 2,958 358 Updated Dec 23, 2025

meta-pytorch / monarch

PyTorch Single Controller

Rust 932 120 Updated Dec 23, 2025

MoonshotAI / kimi-cli

Kimi CLI is your next CLI agent.

Python 3,669 361 Updated Dec 23, 2025

ovg-project / kvcached

Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond

Python 726 73 Updated Nov 30, 2025

deepseek-ai / DeepSeek-OCR

Contexts Optical Compression

Python 21,554 1,926 Updated Oct 25, 2025

shinezyy / deepseek_model

Python 38 4 Updated Oct 12, 2025

tile-ai / tilelang-ascend

Ascend TileLang adapter

C++ 167 47 Updated Dec 23, 2025

tile-ai / tilelang

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 4,293 355 Updated Dec 23, 2025

MoonshotAI / K2-Vendor-Verifier

Verify Precision of all Kimi K2 API Vendor

Python 488 26 Updated Nov 19, 2025

jd-opensource / xllm

A high-performance inference engine for LLMs, optimized for diverse AI accelerators.

C++ 833 103 Updated Dec 23, 2025

QwenLM / Qwen3-Omni

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,157 193 Updated Oct 9, 2025

guidance-ai / llguidance

Super-fast Structured Outputs

Rust 643 43 Updated Dec 1, 2025

jannismoeller / pytorch-nanobind-cuda-example

Forked from lgarrison/cupy-nanobind-example

A tiny demo of interfacing CUDA via nanobind with a pytorch tensor

Cuda 7 Updated Dec 24, 2024

Gar-b-age / CookLikeHOC

🥢像老乡鸡🐔那样做饭。主要部分于2024年完工，非老乡鸡官方仓库。文字来自《老乡鸡菜品溯源报告》，并做归纳、编辑与整理。CookLikeHOC.

JavaScript 22,593 2,285 Updated Oct 17, 2025

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,713 1,357 Updated Dec 17, 2025