Stars
Agentless🐱: an agentless approach to automatically solve software development problems
TritonParse: A Compiler Tracer, Visualizer, and Reproducer for Triton Kernels
Efficient Triton Kernels for LLM Training
Development repository for the Triton language and compiler
GitHub mirror of the triton-lang/triton repo.
blt / cernan
Forked from postmates/cernan
telemetry aggregation and shipping, last up the ladder
A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.
A C++ standalone library for machine learning
TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
A tool to monitor network traffic over Zoom and Workplace Chat
functorch is JAX-like composable function transforms for PyTorch.
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
HugeCTR is a high-efficiency GPU framework designed for Click-Through-Rate (CTR) estimation training
Synchronous and asynchronous event-based C++ executor library
A Chrome extension for suspending all tabs to free up memory