Skip to content
View Ewenwan's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Ewenwan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 5 Updated Feb 7, 2025

Tutorials to GPU programming. Reading notes.

18 Updated Apr 27, 2023

High-speed GEMV kernels, at most 2.7x speedup compared to pytorch baseline.

Cuda 120 7 Updated Jul 13, 2024

An unofficial cuda assembler, for all generations of SASS, hopefully :)

Python 553 95 Updated Apr 20, 2023

Applied AI experiments and examples for PyTorch

Python 302 29 Updated Aug 22, 2025

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python 2,257 540 Updated Jul 27, 2024

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 4,897 1,165 Updated Nov 3, 2025

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,738 865 Updated Jun 10, 2024

Docker Image for Ubuntu Desktop which support HW GPU accelerated GUI apps. you can access the Container with ssh or remote desktop, just like Cloud VM.

Dockerfile 466 112 Updated Sep 20, 2025

Development repository for the Triton language and compiler

MLIR 17,517 2,375 Updated Nov 10, 2025

Development repository for the Triton language and compiler

C++ 1 Updated Oct 9, 2024
Python 1,850 100 Updated Nov 20, 2024

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程

Jupyter Notebook 25,825 2,596 Updated Nov 10, 2025

This is a cross-chip platform collection of operators and a unified neural network library.

Python 18 1 Updated Nov 3, 2023

What are learned in tiktoken?

Python 73 4 Updated May 14, 2024

清华大学材料学院本科学习资料 - PPT、图书、作业、实验报告等

Jupyter Notebook 34 4 Updated Sep 15, 2021

AI学习之路

Python 2 Updated Jun 28, 2023

AI学习之路

Python 132 32 Updated Jun 28, 2023

Ascend PyTorch adapter (torch_npu). Mirror of https://gitee.com/ascend/pytorch

Python 451 32 Updated Nov 10, 2025
Python 1 1 Updated Dec 16, 2023

参加华为AscendC算子编程比赛的代码

5 2 Updated Apr 23, 2024
C++ 8 1 Updated Jul 26, 2022

The CUDA version of the RWKV language model ( https://github.com/BlinkDL/RWKV-LM )

Cuda 224 35 Updated Dec 14, 2024

Several simple examples for popular neural network toolkits calling custom CUDA operators.

Python 1,521 201 Updated Apr 29, 2021

Optimizing SGEMM kernel functions on NVIDIA GPUs to a close-to-cuBLAS performance.

Cuda 388 52 Updated Jan 2, 2025

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, s…

Cuda 1,185 172 Updated Jul 29, 2023
Jupyter Notebook 16 5 Updated Jan 6, 2024

学习安全运营的记录 | The knowledge base of security operation

HTML 858 176 Updated Aug 27, 2023

C++开发的视频行为分析系统v4版本

C++ 207 49 Updated Nov 10, 2025

ESJZone 的小說備份

Python 496 240 Updated May 26, 2023
Next