scv119

⌨️

to become a better human being

Chen Shen scv119

⌨️

to become a better human being

Building Ray@Anyscale, formerly@Facebook, co-creator of Delos

142 followers · 84 following

Anyscale
United States

Achievements

x3 x3

Achievements

x3 x3

DeepEP Public
Forked from deepseek-ai/DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda MIT License Updated Apr 29, 2025
mini-redis Public
Forked from tokio-rs/mini-redis

Incomplete Redis client and server implementation using Tokio - for learning purposes only

Rust MIT License Updated Apr 29, 2024
learn-rust Public

Rust Updated Apr 25, 2024
scv119 Public

Updated Apr 6, 2024
openmlsys-zh Public
Forked from openmlsys/openmlsys-zh

《Machine Learning Systems: Design and Implementation》- Chinese Version

TeX Updated Mar 6, 2024
r4cppp Public
Forked from nrc/r4cppp

Rust for C++ programmers

Rust Other Updated Feb 21, 2024
flashinfer Public
Forked from flashinfer-ai/flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda Apache License 2.0 Updated Feb 8, 2024
vllm Public
Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python Apache License 2.0 Updated Jan 17, 2024
lightllm Public
Forked from ModelTC/LightLLM

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python Apache License 2.0 Updated Dec 27, 2023
how-to-optim-algorithm-in-cuda Public
Forked from BBuf/how-to-optim-algorithm-in-cuda

how to optimize some algorithm in cuda.

Cuda Updated Dec 24, 2023
cutlass-kernels Public
Forked from ColfaxResearch/cutlass-kernels

Cuda MIT License Updated Dec 20, 2023
ray Public
Forked from ray-project/ray

An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyp…

Python 2 Apache License 2.0 Updated Dec 14, 2023
TensorRT-LLM Public
Forked from NVIDIA/TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ Apache License 2.0 Updated Dec 11, 2023
megablocks Public
Forked from databricks/megablocks

Python Apache License 2.0 Updated Dec 11, 2023
flash-attention Public
Forked from Dao-AILab/flash-attention

Fast and memory-efficient exact attention

Python 1 BSD 3-Clause "New" or "Revised" License Updated Dec 5, 2023
The-Art-of-Linear-Algebra Public
Forked from kenjihiranabe/The-Art-of-Linear-Algebra

Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"

PostScript Creative Commons Zero v1.0 Universal Updated Nov 30, 2023
ScaleLLM Public
Forked from vectorch-ai/ScaleLLM

A high-performance inference system for large language models, designed for production environments.

C++ Apache License 2.0 Updated Nov 23, 2023
grouped_gemm Public
Forked from tgale96/grouped_gemm

PyTorch bindings for CUTLASS grouped GEMM.

Cuda Apache License 2.0 Updated Nov 17, 2023
awesome-tensor-compilers Public
Forked from merrymercy/awesome-tensor-compilers

A list of awesome compiler projects and papers for tensor computation and deep learning.

1 Updated Oct 19, 2023
lmdeploy Public
Forked from InternLM/lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

C++ Apache License 2.0 Updated Oct 5, 2023
punica Public
Forked from punica-ai/punica

Cuda 1 Updated Sep 16, 2023
FasterTransformer Public
Forked from NVIDIA/FasterTransformer

Transformer related optimization, including BERT, GPT

C++ Apache License 2.0 Updated Sep 8, 2023
CUDA-PPT Public
Forked from MARD1NO/CUDA-PPT

Apache License 2.0 Updated Jun 26, 2023
learning-triton Public

Python Updated Jun 4, 2023
open_llama Public
Forked from openlm-research/open_llama

Apache License 2.0 Updated May 2, 2023
learning-nn Public

Jupyter Notebook Updated May 2, 2023
Lightrails Public

Yet another distributed training/inferencing framework.

Apache License 2.0 Updated Apr 11, 2023
nanoGPT Public
Forked from karpathy/nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python MIT License Updated Mar 25, 2023
Megatron-LM Public
Forked from NVIDIA/Megatron-LM

Ongoing research training transformer models at scale

Python Other Updated Mar 25, 2023
og-equity-compensation Public
Forked from jlevy/og-equity-compensation

Stock options, RSUs, taxes — read the latest edition: www.holloway.com/ec

Updated Oct 15, 2021

Chen Shen scv119

Achievements

Achievements

DeepEP Public

Uh oh!

mini-redis Public

Uh oh!

learn-rust Public

Uh oh!

scv119 Public

Uh oh!

openmlsys-zh Public

Uh oh!

r4cppp Public

Uh oh!

flashinfer Public

Uh oh!

vllm Public

Uh oh!

lightllm Public

Uh oh!

how-to-optim-algorithm-in-cuda Public

Uh oh!

cutlass-kernels Public

Uh oh!

ray Public

Uh oh!

TensorRT-LLM Public

Uh oh!

megablocks Public

Uh oh!

flash-attention Public

Uh oh!

The-Art-of-Linear-Algebra Public

Uh oh!

ScaleLLM Public

Uh oh!

grouped_gemm Public

Uh oh!

awesome-tensor-compilers Public

Uh oh!

lmdeploy Public

Uh oh!

punica Public

Uh oh!

FasterTransformer Public

Uh oh!

CUDA-PPT Public

Uh oh!

learning-triton Public

Uh oh!

open_llama Public

Uh oh!

learning-nn Public

Uh oh!

Lightrails Public

Uh oh!

nanoGPT Public

Uh oh!

Megatron-LM Public

Uh oh!

og-equity-compensation Public

Uh oh!