- Belgrade
-
15:20
(UTC +01:00) - http://dfyz.info/
- @i_komarov
-
cutlass Public
Forked from NVIDIA/cutlassCUDA Templates for Linear Algebra Subroutines
C++ BSD 3-Clause "New" or "Revised" License UpdatedDec 12, 2025 -
ctf-writeups Public
CTF writeups from GatorSheavesMutably (https://ctftime.org/team/109518) and More Smoked Leet Chicken (https://ctftime.org/team/1005)
-
-
nix-binary-ninja Public
Forked from jchv/nix-binary-ninjaUnofficial Nix flake for using Binary Ninja on NixOS.
Nix The Unlicense UpdatedNov 30, 2025 -
-
pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
Python Other UpdatedAug 27, 2025 -
verl Public
Forked from yaof20/verlverl: Volcano Engine Reinforcement Learning for LLMs
Python Apache License 2.0 UpdatedAug 17, 2025 -
ThunderKittens Public
Forked from HazyResearch/ThunderKittensTile primitives for speedy kernels
Cuda MIT License UpdatedAug 6, 2025 -
stal-ix.github.io Public
Forked from stal-ix/stal-ix.github.iolanding page
MIT License UpdatedJun 18, 2025 -
osm-renderer Public
OpenStreetMap raster tile renderer written in Rust
-
cutlass_grouped_gemm Public
Forked from imoneoi/cutlass_grouped_gemmPyTorch bindings for CUTLASS grouped GEMM.
Cuda Apache License 2.0 UpdatedOct 18, 2024 -
gpu-burn Public
Forked from wilicc/gpu-burnMulti-GPU CUDA stress test
C++ BSD 2-Clause "Simplified" License UpdatedOct 9, 2024 -
nccl Public
Forked from NVIDIA/ncclOptimized primitives for collective multi-GPU communication
C++ Other UpdatedJul 9, 2024 -
-
triton Public
Forked from triton-lang/tritonDevelopment repository for the Triton language and compiler
C++ MIT License UpdatedApr 8, 2024 -
-
flash-attention Public
Forked from Dao-AILab/flash-attentionFast and memory-efficient exact attention
Python BSD 3-Clause "New" or "Revised" License UpdatedApr 5, 2024 -
llvm-project Public
Forked from llvm/llvm-projectLLVM fork with restored Alpha backend
3 UpdatedMar 11, 2024 -
Solving mysteries from Dick Sites' Understanding Software Dynamics (https://www.informit.com/title/9780137589739)
-
cosmopolitan Public
Forked from jart/cosmopolitanbuild-once run-anywhere c library
C ISC License UpdatedMar 3, 2024 -
llama-avx-512 Public
Expermenting with quantized AVX-512 dot product for llama.cpp
-
adventofcode Public
My solutions for the Advent of Code challenge (http://adventofcode.com/). Includes a solution to the Synacor Challenge (https://challenge.synacor.com/) as a bonus.
-
-
llama.cpp Public
Forked from ggml-org/llama.cppPort of Facebook's LLaMA model in C/C++
C MIT License UpdatedMay 9, 2023 -
onnxruntime Public
Forked from microsoft/onnxruntimeONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
C++ MIT License UpdatedFeb 22, 2023 -
pwntools Public
Forked from Gallopsled/pwntoolsCTF framework and exploit development library
Python Other UpdatedFeb 6, 2023 -
nanoGPT Public
Forked from karpathy/nanoGPTThe simplest, fastest repository for training/finetuning medium-sized GPTs.
-
cublaslt-bias-epilogue Public
A reproducer to show that cuBLASLt appears to apply bias epilogue incorrectly when multiplying a float32 vector by a float32 matrix and the output matrix has ld > rows
-
ctf-writeups-1 Public
Forked from cscosu/ctf-writeupsWrite-ups for the Buckeye Bureau of BOF
Python UpdatedDec 20, 2021 -
DeepSpeed Public
Forked from deepspeedai/DeepSpeedDeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
Python MIT License UpdatedJul 28, 2021