- Santa Clara
- UTC -07:00
- kunwu.me
- https://orcid.org/0000-0002-0149-1409
- in/kun-wu-069a14105
- https://go.kunwu.me/wakatime
- taxes-2018 Public
  Forked from pyTaxPrep/taxes-2018
  Fills out forms for 2018 tax returns.
  Python · GNU Lesser General Public License v3.0 · Updated Mar 24, 2026
- Qidian_Webnovel_DataCollection Public
  Forked from GOLEM-lab/Qidian_Webnovel_DataCollection
  Jupyter Notebook · Other · Updated Jan 22, 2026
- jjwxc-crawler Public
  Forked from dev-chenxing/jjwxc-crawler
  A simple tool, built with Python and Scrapy, that scrapes the non-V chapters of any novel on jjwxc.net (Jinjiang) by book ID and saves them as editable .docx (Word) documents.
  Python · Updated Jan 21, 2026
- intrasm_engine Public
  Enhancing CUDA Intra-Streaming-Multiprocessor Parallelism for Large Language Models via Fine-Grained Task Graph
  Jupyter Notebook · Other · Updated Jul 6, 2025
- K-Wu.github.io Public
  Forked from alshedivat/al-folio
  A beautiful, simple, clean, and responsive Jekyll theme for academics
- FlashTrain Public
  An Activation Offloading Framework to SSDs for Faster Large Language Model Training
- douban_movie_review Public
  Forked from 3inchtime/douban_movie_review
  Crawler for Douban Top 250 movie reviews (used as a corpus for sentiment analysis)
  Python · Updated Mar 2, 2025
- llm-analysis Public
  Forked from cli99/llm-analysis
  Latency and Memory Analysis of Transformer Models for Training and Inference
  Python · Apache License 2.0 · Updated Nov 12, 2024
- IGB-Datasets Public
  Forked from IllinoisGraphBenchmark/IGB-Datasets
  Largest real-world open-source graph dataset. Work done under the IBM-Illinois Discovery Accelerator Institute and Amazon Research Awards, in collaboration with NVIDIA Research.
  Python · Other · Updated Oct 31, 2024
- HET Public
  HET: The HET Hetero-GNN Kernel Optimization and Code Generation Project
- Megatron-DeepSpeed Public
  Forked from deepspeedai/Megatron-DeepSpeed
  Ongoing research training transformer language models at scale, including: BERT & GPT-2
- QidianCrawler Public
  A novel crawler built on the DrissionPage library for scraping novel content from Qidian (起点中文网); it uses the Rich library to provide rich output.
- private clone of https://github.com/twjiang/graphSAGE-pytorch
  Python · Updated Jul 25, 2024
- PiPPy Public archive
  Forked from pytorch/PiPPy
  Pipeline Parallelism for PyTorch
  Python · BSD 3-Clause "New" or "Revised" License · Updated Apr 29, 2024
- mlir-standalone-template Public template
  Forked from jmgorius/mlir-standalone-template
  An out-of-tree MLIR dialect template with a CI flow to keep it up to date.
  CMake · Other · Updated Apr 12, 2024
- CUDALibrarySamples Public
  Forked from NVIDIA/CUDALibrarySamples
  CUDA Library Samples
- Timing code backed up at the 1.1.2.timing branch
  Python · Apache License 2.0 · Updated Feb 25, 2024
- flash-attention Public
  Forked from Dao-AILab/flash-attention
  Fast and memory-efficient exact attention
  Python · BSD 3-Clause "New" or "Revised" License · Updated Feb 10, 2024
- cutlass Public
  Forked from NVIDIA/cutlass
  CUDA Templates for Linear Algebra Subroutines
  C++ · Other · Updated Jan 19, 2024
- triton_autotuning Public
  Clone of https://github.com/tensorflow/tensorflow/tree/master/tensorflow/compiler/xla/experiments/triton_autotuning
  Python · Updated Jan 18, 2024
- graphiler_experimental Public
  private clone of https://github.com/xiezhq-hermann/graphiler
  Cuda · Apache License 2.0 · Updated Jan 12, 2024
- sputnik_experimental Public
  Custom version of the sputnik repo; the original is https://github.com/google-research/sputnik/
  C++ · Apache License 2.0 · Updated Jan 7, 2024