Starred repositories
📚 Modern C++ Tutorial: C++11/14/17/20 On the Fly | https://changkun.de/modern-cpp/
中文的C++ Template的教学指南。与知名书籍C++ Templates不同,该系列教程将C++ Templates作为一门图灵完备的语言来讲授,以求帮助读者对Meta-Programming融会贯通。(正在施工中)
Lecture notes, projects and other materials for Course 'CS205 C/C++ Program Design' at Southern University of Science and Technology.
DeepRec is a high-performance recommendation deep learning framework based on TensorFlow. It is hosted in incubation in LF AI & Data Foundation.
A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.
A library of GPU kernels for sparse matrix operations.
GPU-scheduler-for-deep-learning
[MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration
Code base and slides for ECE408:Applied Parallel Programming On GPU.
A GPU accelerated error-bounded lossy compression for scientific data.
Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS