- Ho Chi Minh, Viet Nam
- https://viblo.asia/u/Giahuy
- https://medium.com/@giahuy04
- in/cismine
Stars
CUDA Python: Performance meets Productivity
NVIDIA Linux open GPU kernel module source
NVIDIA curated collection of educational resources related to general purpose GPU programming.
A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…
CUDA Templates and Python DSLs for High-Performance Linear Algebra
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
how to optimize some algorithm in cuda.
A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
Autoresearch for GPU kernels. Give it any PyTorch model, go to sleep, wake up to optimized Triton kernels.
NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.
GPU programming related news and material links
Examples from Programming in Parallel with CUDA
My study notes and hands-on projects for CUDA-based GPU programming
collection of benchmarks to measure basic GPU capabilities
A collection of code snippets from the publication Daily Dose of Data Science on Substack: http://www.dailydoseofds.com/
A 120-day CUDA learning plan covering daily concepts, exercises, pitfalls, and references (including “Programming Massively Parallel Processors”). Features six capstone projects to solidify GPU par…