-
Tongji University
- Shanghai
- https://blog.csdn.net/xiaoxiaowenqiang
- https://gitee.com/EwenWan
Stars
- All languages
- ANTLR
- ActionScript
- Assembly
- Batchfile
- C
- C#
- C++
- CMake
- CSS
- Cuda
- Cython
- DIGITAL Command Language
- Dart
- Dockerfile
- Fortran
- GCC Machine Description
- Go
- Groovy
- HTML
- Haskell
- Java
- JavaScript
- Julia
- Jupyter Notebook
- Kotlin
- LLVM
- Limbo
- Lua
- MATLAB
- MLIR
- Makefile
- Markdown
- Mathematica
- OCaml
- Objective-C
- PHP
- PLSQL
- Perl
- Python
- RPC
- Racket
- Rich Text Format
- Roff
- Ruby
- Rust
- SCSS
- Scala
- Scilab
- Shell
- Smali
- Swift
- SystemVerilog
- TeX
- TypeScript
- Verilog
- Vim Script
- Vue
- Yacc
High-speed GEMV kernels, at most 2.7x speedup compared to pytorch baseline.
An unofficial cuda assembler, for all generations of SASS, hopefully :)
Applied AI experiments and examples for PyTorch
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Production First and Production Ready End-to-End Speech Recognition Toolkit
QLoRA: Efficient Finetuning of Quantized LLMs
Docker Image for Ubuntu Desktop which support HW GPU accelerated GUI apps. you can access the Container with ssh or remote desktop, just like Cloud VM.
Development repository for the Triton language and compiler
Ewenwan / triton
Forked from triton-lang/tritonDevelopment repository for the Triton language and compiler
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
This is a cross-chip platform collection of operators and a unified neural network library.
清华大学材料学院本科学习资料 - PPT、图书、作业、实验报告等
Ascend PyTorch adapter (torch_npu). Mirror of https://gitee.com/ascend/pytorch
The CUDA version of the RWKV language model ( https://github.com/BlinkDL/RWKV-LM )
Several simple examples for popular neural network toolkits calling custom CUDA operators.
Optimizing SGEMM kernel functions on NVIDIA GPUs to a close-to-cuBLAS performance.
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, s…
学习安全运营的记录 | The knowledge base of security operation