crcrpar

Masaki crcrpar

209 followers · 243 following

NVIDIA
Tokyo

Achievements

x4 x2 x3

Stars

sgl-project / mini-sglang

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 2,230 189 Updated Dec 23, 2025

apache / tvm-ffi

Open ABI and FFI for Machine Learning Systems

C++ 258 43 Updated Dec 23, 2025

yifuwang / symm-mem-recipes

Python 152 14 Updated Dec 27, 2024

Aleph-Alpha / Alpha-MoE

Cuda 43 10 Updated Dec 10, 2025

pocc / pre-commit-hooks

C/C++ hooks to integrate with pre-commit

Python 376 82 Updated Mar 20, 2024

NVlabs / parrot

Parrot is a C++ library for fused array operations using CUDA/Thrust. It provides efficient GPU-accelerated operations with lazy evaluation semantics, allowing for chaining of operations without un…

Cuda 240 14 Updated Dec 18, 2025

NVlabs / Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 4,841 321 Updated Dec 21, 2025

fanshiqing / grouped_gemm

Forked from tgale96/grouped_gemm

PyTorch bindings for CUTLASS grouped GEMM.

Cuda 175 46 Updated Dec 16, 2025

xdslproject / xdsl

A Python compiler design toolkit.

Python 459 133 Updated Dec 17, 2025

meta-pytorch / kraken

Triton-based Symmetric Memory operators and examples

Python 67 11 Updated Oct 17, 2025

meta-pytorch / autoparallel

An experimental implementation of compiler-driven automatic sharding of models across a given device mesh.

Python 48 13 Updated Dec 23, 2025

NVIDIA / nvshmem

NVIDIA NVSHMEM is a parallel programming interface for NVIDIA GPUs based on OpenSHMEM. NVSHMEM can significantly reduce multi-process communication and coordination overheads by allowing programmer…

C++ 424 48 Updated Dec 20, 2025

perplexityai / pplx-kernels

Perplexity GPU Kernels

C++ 542 74 Updated Nov 7, 2025

meta-pytorch / BackendBench

Ship correct and fast LLM kernels to PyTorch

Python 127 15 Updated Dec 18, 2025

openxla / tokamax

Tokamax: A GPU and TPU kernel library.

Python 142 6 Updated Dec 23, 2025

pypa / hatch

Modern, extensible Python project management

Python 7,042 358 Updated Dec 17, 2025

huggingface / picotron

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 1,927 149 Updated Aug 26, 2025

flashinfer-ai / cutlass-viz

Python 65 3 Updated Apr 26, 2025

meta-pytorch / compile-graph-break-site

This repository contains the source code for a static website that provides documentation for each "Graph Break" identified by a Graph Break ID (GBID).

Python 4 3 Updated Dec 22, 2025

ByteDance-Seed / Triton-distributed

Distributed Compiler based on Triton for Parallel Systems

Python 1,288 114 Updated Dec 16, 2025

databricks / megablocks

Python 1,512 219 Updated Jun 26, 2025

envoyproxy / ai-gateway

Manages Unified Access to Generative AI Services built on Envoy Gateway

Go 1,280 141 Updated Dec 23, 2025

Dao-AILab / quack

A Quirky Assortment of CuTe Kernels

Python 714 64 Updated Dec 23, 2025

facebookincubator / dynolog

Dynolog is a telemetry daemon for performance monitoring and tracing. It exports metrics from different components in the system like the linux kernel, CPU, disks, Intel PT, GPUs etc. Dynolog also …

C++ 356 76 Updated Dec 15, 2025

dandavison / delta

A syntax-highlighting pager for git, diff, grep, and blame output

Rust 28,453 464 Updated Dec 11, 2025

meta-pytorch / tritonparse

TritonParse: A Compiler Tracer, Visualizer, and Reproducer for Triton Kernels

Python 178 15 Updated Dec 23, 2025

meta-pytorch / tlparse

TORCH_LOGS parser for PT2

Rust 70 22 Updated Nov 10, 2025

facebook / pyrefly

A fast type checker and language server for Python

Rust 5,094 231 Updated Dec 23, 2025

pytorch / helion

A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.

Python 694 89 Updated Dec 23, 2025

mlc-ai / mlc-llm

Universal LLM Deployment Engine with ML Compilation

Python 21,777 1,893 Updated Dec 11, 2025
