whybeyoung

💭

I may be slow to respond.

ybyang whybeyoung

💭

I may be slow to respond.

71 followers · 78 following

NIVIC
HeFei

Achievements

x3 x2 x2

Achievements

x3 x2 x2

Lists (3)

Sort

Ds

🔮 Future ideas

1 repository

讯飞

1 repository

Stars

lightseekorg / smg

Engine-agnostic LLM gateway in Rust. Full OpenAI & Anthropic API compatibility across SGLang, vLLM, TRT-LLM, OpenAI, Gemini & more. Industry-first gRPC pipeline, KV cache-aware routing, chat histor…

Rust 173 49 Updated Apr 19, 2026

iflytek / skillhub

Self-hosted, open-source agent skill registry for enterprises. Publish & version skill packages, govern with RBAC and audit logs, deploy on-premise with Docker or Kubernetes.

Java 2,670 332 Updated Apr 17, 2026

leeguandong / Awesome-Chinese-Stable-Diffusion

中文文生图stable diffsion模型集合

418 24 Updated Feb 11, 2026

sgl-project / mini-sglang

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,024 585 Updated Mar 13, 2026

iflytek / rdma_demo

rdma_demo

Python 1 Updated Apr 9, 2025

whybeyoung / sglang

Forked from sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 1 Updated Apr 17, 2026

iflytek / astron-xmod-shim

Astron-xmod-shim — Lightweight, declarative middleware for reliably converging AI service workloads.

Go 101 16 Updated Nov 3, 2025

antgroup / sglang

Forked from sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 30 5 Updated Apr 18, 2026

iflytek / agentbridge

Cross-platform AI workflow DSL converter supporting iFlytek Spark, Dify, and Coze platforms with unified intermediate representation and bidirectional transformation capabilities.

Go 23 3 Updated Mar 3, 2026

kvcache-ai / TrEnv-X

Go 81 6 Updated Sep 15, 2025

sgl-project / rbg

A workload for deploying LLM inference services on Kubernetes

Go 206 54 Updated Apr 14, 2026

whybeyoung / go-openai

Forked from sashabaranov/go-openai

OpenAI ChatGPT, GPT-3, GPT-4, DALL·E, Whisper API wrapper for Go

Go 2 Updated Sep 22, 2025

sgl-project / ome

Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, TensorRT-LLM, and Triton

Go 424 74 Updated Apr 18, 2026

fzyzcjy / sglang

Forked from sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 7 1 Updated Apr 6, 2026

iflytek / ifly-workflow-mcp-server

This a simple implementation of an MCP server using iFlytek. It enables calling iFlytek workflows through MCP tools.

Python 27 8 Updated Mar 28, 2025

Marovlo / easyPyverbs

easy version of pyverbs

Python 6 2 Updated Apr 16, 2023

deepseek-ai / smallpond

A lightweight data processing framework built on DuckDB and 3FS.

Python 4,947 442 Updated Mar 5, 2025

deepseek-ai / EPLB

Expert Parallelism Load Balancer

Python 1,359 201 Updated Mar 24, 2025

deepseek-ai / profile-data

Analyze computation-communication overlap in V3/R1.

1,149 145 Updated Mar 21, 2025

deepseek-ai / DualPipe

A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.

Python 2,943 320 Updated Jan 14, 2026

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes for ML SYS.

Python 6,048 396 Updated Apr 8, 2026

deepseek-ai / DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 6,618 887 Updated Apr 17, 2026

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 9,136 1,154 Updated Apr 16, 2026

deepseek-ai / FlashMLA

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,555 1,008 Updated Apr 7, 2026

deepseek-ai / open-infra-index

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,973 287 Updated May 15, 2025

sgl-project / sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 26,074 5,444 Updated Apr 19, 2026

ml-tooling / opyrator

🪄 Turns your machine learning code into microservices with web API, interactive GUI, and more.

Python 3,137 166 Updated Mar 30, 2026

containers / nri-plugins

A collection of community maintained NRI plugins

Go 102 34 Updated Apr 16, 2026

ScilifelabDataCentre / serve

SciLifeLab Serve is a platform offering machine learning model serving, data science app hosting (Shiny, Gradio, Streamlit, Dash, etc.), and other tools to life science researchers affiliated with …

Python 14 3 Updated Apr 17, 2026

basetenlabs / truss-examples

Examples of models deployable with Truss

Python 223 60 Updated Apr 16, 2026