Skip to content
View tangxin-hn's full-sized avatar

Block or report tangxin-hn

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Ralph is an autonomous AI agent loop that runs repeatedly until all PRD items are complete.

TypeScript 15,043 1,538 Updated Feb 2, 2026

Tutorials for NVIDIA CUPTI samples

C++ 61 13 Updated Nov 3, 2025

A library to analyze PyTorch traces.

Python 499 85 Updated Apr 1, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 75,993 15,411 Updated Apr 10, 2026

CUPTI based GPU profiling library exposing usdt hooks

C 28 1 Updated Apr 9, 2026

整理和收集来自不同项目的Cursor规则文件,提供多种编程语言和框架的规则支持。

1,771 351 Updated Jan 25, 2026

A fast compressor/decompressor

C++ 6,556 1,032 Updated Mar 6, 2026

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 90,418 13,853 Updated Apr 6, 2026

The official repository for ERNIE 4.5 and ERNIEKit – its industrial-grade development toolkit based on PaddlePaddle.

Python 7,699 1,450 Updated Jan 4, 2026

CUDA/Metal accelerated language model inference

C 632 31 Updated May 29, 2025

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,610 5,269 Updated Apr 10, 2026

A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.

HTML 942 246 Updated Apr 10, 2026

eBPF based always-on CPU/GPU profiler auto-discovering targets in Kubernetes and systemd, zero code changes or restarts needed!

Go 717 88 Updated Apr 10, 2026

Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals

JavaScript 13,359 433 Updated Apr 6, 2026

Dynolog is a telemetry daemon for performance monitoring and tracing. It exports metrics from different components in the system like the linux kernel, CPU, disks, Intel PT, GPUs etc. Dynolog also …

C++ 370 82 Updated Apr 9, 2026

eBPF Observability - Distributed Tracing and Profiling

Go 4,002 443 Updated Apr 10, 2026

Distributed tracing without code changes. 🚀 Instantly monitor any application using OpenTelemetry and eBPF

Go 3,649 246 Updated Apr 10, 2026

High-level tracing language for Linux

C++ 10,049 1,450 Updated Apr 6, 2026

🔥 horizontally-scalable, highly-available, multi-tenant continuous profiling aggregation system

Go 2,032 70 Updated Jul 19, 2023

Continuous Profiling Platform. Debug performance issues down to a single line of code

Go 11,343 737 Updated Apr 9, 2026

Continuous profiling for analysis of CPU and memory usage, down to the line number and throughout time. Saving infrastructure cost, improving performance, and increasing reliability.

TypeScript 4,838 249 Updated Apr 10, 2026

ebpf-go is a pure-Go library to read, modify and load eBPF programs and attach them to various hooks in the Linux kernel.

Go 7,650 848 Updated Apr 2, 2026

The production-scale datacenter profiler (C/C++, Go, Rust, Python, Java, NodeJS, .NET, PHP, Ruby, Perl, ...)

Go 3,080 392 Updated Apr 10, 2026

Hooked CUDA-related dynamic libraries by using automated code generation tools.

C 172 47 Updated Dec 12, 2023

NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to fa…

Python 283 49 Updated Apr 10, 2026

Sampling profiler for Python programs

Rust 15,092 507 Updated Apr 9, 2026

Trace your python process line by line with eBPF!

Python 261 5 Updated Feb 19, 2023

The best way to write secure and reliable applications. Write nothing; deploy nowhere.

Dockerfile 65,159 4,829 Updated Aug 7, 2024

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 98,995 27,456 Updated Apr 10, 2026

DLRover: An Automatic Distributed Deep Learning System

Python 1,641 211 Updated Apr 2, 2026
Next