nzw0301

Kento Nozawa nzw0301

193 followers · 132 following

Preferred Networks, Inc.
Japan
08:12 (UTC +09:00)
nzw0301.github.io

Achievements

x4 x3 x3 x2

Achievements

x4 x3 x3 x2

Organizations

Lists (4)

Sort

Stars

9 stars written in Cuda

Clear filter

luanfujun / deep-painterly-harmonization

Code and data for paper "Deep Painterly Harmonization": https://arxiv.org/abs/1804.03189

Cuda 6,053 613 Updated Aug 2, 2021

CannyLab / tsne-cuda

GPU Accelerated t-SNE for CUDA with Python bindings

Cuda 1,926 137 Updated Oct 2, 2024

rapidsai / raft

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing …

Cuda 994 228 Updated Apr 10, 2026

Dao-AILab / causal-conv1d

Causal depthwise conv1d in CUDA, with a PyTorch interface

Cuda 825 171 Updated Mar 10, 2026

NVIDIA / nv-wavenet

Reference implementation of real-time autoregressive wavenet inference

Cuda 745 125 Updated Jan 19, 2021

efeslab / Atom

[MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

Cuda 337 31 Updated Jul 2, 2024

facebookarchive / fbcuda

Facebook's CUDA extensions.

Cuda 284 57 Updated Mar 27, 2019

AlibabaResearch / flash-llm

Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity

Cuda 239 22 Updated Sep 24, 2023

LeviViana / torch_sampling

Efficient reservoir sampling implementation for PyTorch

Cuda 106 5 Updated Sep 28, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Kento Nozawa nzw0301

Achievements

Achievements

Organizations

Block or report nzw0301

Lists (4)

datasets

resources

self-sup

tools

Stars

luanfujun / deep-painterly-harmonization

CannyLab / tsne-cuda

rapidsai / raft

Dao-AILab / causal-conv1d

NVIDIA / nv-wavenet

efeslab / Atom

facebookarchive / fbcuda

AlibabaResearch / flash-llm

LeviViana / torch_sampling