Roger-Fpeng

Follow

Fpeng Roger-Fpeng

Follow

Stay Hungry Stay Foolish.

5 followers · 18 following

Hangzhou City, China

Achievements

Achievements

Highlights

Pro

Popular repositories Loading

ampere_flash_attention_from_scratch ampere_flash_attention_from_scratch Public

This is an implementation of flash attention from scratch, without importing any external libraries.

Cuda 22 2
Computer-networking-TCP Computer-networking-TCP Public

Implementation of css144 lab.

C++
mahimahi-ge mahimahi-ge Public

Add ge channel loss mode for mahimahi.

Shell
xquic xquic Public

Forked from alibaba/xquic

XQUIC Library released by Alibaba is a cross-platform implementation of QUIC and HTTP/3 protocol.

C
flash-attention flash-attention Public

Forked from Dao-AILab/flash-attention

Fast and memory-efficient exact attention

Python
ktransformers ktransformers Public

Forked from kvcache-ai/ktransformers

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python