dfyz

Ivan Komarov dfyz

60 followers · 4 following

Belgrade
13:18 (UTC +02:00)
http://dfyz.info/
@i_komarov

Achievements

x2 x2

Achievements

x2 x2

Stars

microsoft / MMA-Sim

Python 2 2 Updated Jan 29, 2026

redplait / denvdis

NVidia sass disassembler/inline patcher

C++ 66 13 Updated Apr 14, 2026

Aleph-Alpha / Alpha-MoE

Cuda 55 11 Updated Dec 10, 2025

freedomofpress / santa-pwn-dangerzone

Pwning Santa before the bad guys do 🎅

Python 3 1 Updated Dec 10, 2025

facebookresearch / CUTracer

A dynamic binary instrumentation tool for tracing and analyzing CUDA kernel instructions.

Python 60 6 Updated Apr 10, 2026

Noumena-Network / nmoe

MoE training for Me and You and maybe other people

Python 381 32 Updated Mar 15, 2026

north-numerical-computing / MATLAB-tensor-core

The MATLAB Tensor Core: a set of models of tensor cores written in MATLAB

MATLAB 17 2 Updated Apr 7, 2026

purplesyringa / mod2k

Fast arithmetic modulo `2^k`, `2^k - 1`, and `2^k - d`.

Rust 19 2 Updated Dec 22, 2025

abcdabcd987 / libfabric-efa-demo

C++ 81 11 Updated Jan 5, 2025

0xD0GF00D / DocumentSASS

Unofficial description of the CUDA assembly (SASS) instruction sets.

Python 209 20 Updated Jul 18, 2025

dicksites / KUtrace

Low-overhead tracing of all Linux kernel-user transitions, for serious performance analysis. Includes kernel patches, loadable module, and post-processing software. Output is HTML/SVG per-CPU-core …

HTML 686 70 Updated Sep 1, 2024

dckc / awesome-ocap

Awesome Object Capabilities and Capability Security

JavaScript 396 27 Updated Apr 1, 2026

mikaku / Fiwix

A UNIX-like kernel for the i386 architecture

C 641 45 Updated Apr 8, 2026

yosefk / funtrace

A fast, small C/C++ function call tracer for x86-64/Linux, supports clang & gcc, ftrace, threads, exceptions & shared libraries

C++ 196 3 Updated Mar 25, 2025

sarah-quinones / small-gemm

Rust 4 Updated Jan 25, 2024

busytex / busytex

TexLive programs bundled into a single static binary for x86_64-linux / WASM

Makefile 66 8 Updated Mar 24, 2025

kevin-lesenechal / elf-info

Inspect and dissect an ELF file with pretty formatting.

Rust 119 10 Updated Feb 25, 2024

mephi42 / ctf

My solutions for CTF challenges

C 74 14 Updated Dec 16, 2025

ggml-org / llama.cpp

LLM inference in C/C++

C++ 103,600 16,822 Updated Apr 14, 2026

google / XNNPACK

High-efficiency floating-point neural network inference operators for mobile, server, and Web

C 2,307 481 Updated Apr 14, 2026

google / ruy

C++ 323 93 Updated Feb 17, 2026

pytorch / FBGEMM

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

C++ 1,552 731 Updated Apr 14, 2026

NVIDIA / FasterTransformer

Transformer related optimization, including BERT, GPT

C++ 6,411 935 Updated Mar 27, 2024

NVIDIA / cutlass

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,572 1,785 Updated Apr 9, 2026

onnx / onnx

Open standard for machine learning interoperability

Python 20,653 3,916 Updated Apr 14, 2026

microsoft / onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 19,864 3,825 Updated Apr 14, 2026

pytorch / pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 99,114 27,482 Updated Apr 14, 2026

flame / blis

BLAS-like Library Instantiation Software Framework

C 2,626 417 Updated Nov 11, 2025

uxlfoundation / oneDNN

oneAPI Deep Neural Network Library (oneDNN)

C++ 3,979 1,119 Updated Apr 14, 2026

OpenMathLib / OpenBLAS

OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.

C 7,373 1,661 Updated Apr 14, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ivan Komarov dfyz

Achievements

Achievements

Block or report dfyz

Stars

microsoft / MMA-Sim

redplait / denvdis

Aleph-Alpha / Alpha-MoE

freedomofpress / santa-pwn-dangerzone

facebookresearch / CUTracer

Noumena-Network / nmoe

north-numerical-computing / MATLAB-tensor-core

purplesyringa / mod2k

abcdabcd987 / libfabric-efa-demo

0xD0GF00D / DocumentSASS

dicksites / KUtrace

dckc / awesome-ocap

mikaku / Fiwix

yosefk / funtrace

sarah-quinones / small-gemm

busytex / busytex

kevin-lesenechal / elf-info

mephi42 / ctf

ggml-org / llama.cpp

google / XNNPACK

google / ruy

pytorch / FBGEMM

NVIDIA / FasterTransformer

NVIDIA / cutlass

onnx / onnx

microsoft / onnxruntime

pytorch / pytorch

flame / blis

uxlfoundation / oneDNN

OpenMathLib / OpenBLAS