Stars
SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs
Topics in Machine Learning Accelerator Design
VeriSilicon / triton-shared
Forked from microsoft/triton-shared. Shared Middle-Layer for Triton Compilation
FlagGems is an operator library for large language models implemented in the Triton Language.
A PyTorch native platform for training generative AI models
PrIM (Processing-In-Memory benchmarks) is the first benchmark suite for a real-world processing-in-memory (PIM) architecture. PrIM is developed to evaluate, analyze, and characterize the first publ…
Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It provides support for agile implementation and evaluation of new memory system designs (e.g., new DRAM stan…
Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.
📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉
The PJRT plugin implementation for VeriSilicon NPU IP, for the TensorFlow, PyTorch, and other ecosystems.
A high-throughput and memory-efficient inference and serving engine for LLMs
Empower VeriSilicon's NPU on the Android platform via NNAPI
zjd1988 / TIM-VX-python
Forked from VeriSilicon/TIM-VX. Verisilicon Tensor Interface Module
Shared Middle-Layer for Triton Compilation
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
The Torch-MLIR project aims to provide first-class support from the PyTorch ecosystem to the MLIR ecosystem.
Acuitylite is an end-to-end neural network deployment tool
CUDA Templates and Python DSLs for High-Performance Linear Algebra
Visually explore, understand, and present your data.
Lean Algorithmic Trading Engine by QuantConnect (Python, C#)
A GUI client for Windows, Linux, and macOS, supporting Xray, sing-box, and others