12 stars written in Cuda

The project is an official implementation of our CVPR2019 paper "Deep High-Resolution Representation Learning for Human Pose Estimation"

Cuda · 4,446 stars · 930 forks · Updated Aug 30, 2024

Squeeze-and-Excitation Networks

Cuda · 3,580 stars · 851 forks · Updated Feb 25, 2019

A GPU implementation of Convolutional Neural Nets in C++

Cuda · 504 stars · 229 forks · Updated Oct 1, 2020

Code for KiloNeRF: Speeding up Neural Radiance Fields with Thousands of Tiny MLPs

Cuda · 492 stars · 53 forks · Updated Jun 16, 2021

Flash Attention tutorial written in Python, Triton, CUDA, and CUTLASS

Cuda · 442 stars · 47 forks · Updated May 14, 2025

Unsupervised Learning of Video Representations using LSTMs

Cuda · 362 stars · 112 forks · Updated Mar 6, 2018

GPU implementation of a fast generalized ANS (asymmetric numeral system) entropy encoder and decoder, with extensions for lossless compression of numerical and other data types in HPC/ML applications.

Cuda · 357 stars · 31 forks · Updated Oct 27, 2025

CGBN: CUDA Accelerated Multiple Precision Arithmetic (Big Num) using Cooperative Groups

Cuda · 229 stars · 68 forks · Updated Feb 27, 2025

Code for Dynamic Convolutions: Exploiting Spatial Sparsity for Faster Inference (CVPR2020)

Cuda · 128 stars · 14 forks · Updated Jan 17, 2022

CUDA implementation of data clustering using expectation maximization with a Gaussian mixture model. Supports multiple GPUs on a single node.

Cuda · 1 star · Updated Mar 11, 2012