JayFzh

🏠

Working from home

Zihao JayFzh

🏠

Working from home

4 followers · 30 following

SJTU & Alibaba Cloud
Hangzhou, China
13:06 (UTC -12:00)

Highlights

Stars

AmberLJC / LLMSys-PaperList

Large Language Model (LLM) Systems Paper List

1,705 90 Updated Dec 22, 2025

kaihaoma / APT

Python 5 Updated May 10, 2025

openai / gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,465 1,999 Updated Nov 1, 2025

openai / harmony

Renderer for the harmony response format to be used with gpt-oss

Rust 4,090 240 Updated Dec 15, 2025

vnpy / vnpy

基于Python的开源量化交易平台开发框架

Python 34,875 10,544 Updated Dec 24, 2025

horovod / horovod

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Python 14,645 2,256 Updated Dec 1, 2025

stepfun-ai / Step3

441 10 Updated Aug 10, 2025

MoonshotAI / Kimi-K2

Kimi K2 is the large language model series developed by Moonshot AI team

9,760 709 Updated Nov 7, 2025

bytedance / flux

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 1,212 85 Updated Aug 28, 2025

torvalds / linux

Linux kernel source tree

C 211,660 59,558 Updated Dec 24, 2025

microsoft / vscode-copilot-chat

Copilot Chat extension for VS Code

TypeScript 9,166 1,519 Updated Dec 24, 2025

dmlc / dgl

Python package built to ease deep learning on graph, on top of existing DL frameworks.

Python 14,192 3,058 Updated Jul 31, 2025

Infrawaves / DeepEP_ibrc_dual-ports_multiQP

Aims to implement dual-port and multi-qp solutions in deepEP ibrc transport

Cuda 73 3 Updated May 9, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,868 1,815 Updated Oct 13, 2025

Terabit-Ethernet / hostCC

hostCC is a congestion control architecture which handles host congestion, along with in-network congestion

Shell 58 13 Updated Aug 10, 2024

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 21,943 3,861 Updated Dec 25, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 66,116 12,169 Updated Dec 25, 2025

facebookresearch / dlrm_datasets

Set of datasets for the deep learning recommendation model (DLRM).

48 17 Updated Dec 21, 2022

AlibabaPAI / llumnix

Efficient and easy multi-instance LLM serving

Python 518 44 Updated Sep 3, 2025

ROCm / rocSHMEM

rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.

C++ 137 42 Updated Dec 22, 2025

ByteDance-Seed / Seed-Thinking-v1.5

818 17 Updated Jun 9, 2025

QwenLM / Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 20,015 1,673 Updated Nov 26, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,767 2,890 Updated Dec 24, 2025

ByteDance-Seed / VeOmni

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,454 124 Updated Dec 24, 2025

ByteDance-Seed / Triton-distributed

Distributed Compiler based on Triton for Parallel Systems

Python 1,289 114 Updated Dec 16, 2025

zartbot / shallowsim

DeepSeek-V3/R1 inference performance simulator

Jupyter Notebook 174 26 Updated Mar 27, 2025

ai-dynamo / dynamo

A Datacenter Scale Distributed Inference Serving Framework

Rust 5,679 753 Updated Dec 25, 2025

shenh10 / DeepSeek_Simulator

Python 92 12 Updated Apr 2, 2025

openai / openai-agents-python

A lightweight, powerful framework for multi-agent workflows

Python 17,955 3,011 Updated Dec 24, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 64,433 7,814 Updated Dec 24, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly