wkcn

🐳

Tell Your World 🎵

JackieWu wkcn

🐳

Tell Your World 🎵

1/6 out of the gravity

392 followers · 188 following

China

Achievements

x2 x3 x2

Achievements

x2 x3 x2

Highlights

Organizations

Stars

MLSys

34 repositories

tensorflow / mesh

Mesh TensorFlow: Model Parallelism Made Easier

Python 1,624 255 Updated Nov 17, 2023

triton-lang / triton

Development repository for the Triton language and compiler

MLIR 18,843 2,730 Updated Apr 5, 2026

flexflow / flexflow-train

Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training

C++ 1,870 250 Updated Mar 25, 2026

horovod / horovod

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Python 14,686 2,248 Updated Dec 1, 2025

deepspeedai / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,988 4,775 Updated Apr 3, 2026

microsoft / onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 19,756 3,806 Updated Apr 5, 2026

tinygrad / tinygrad

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 32,156 4,025 Updated Apr 5, 2026

OAID / AutoKernel

AutoKernel 是一个简单易用，低门槛的自动算子优化工具，提高深度学习算法部署效率。

C++ 747 82 Updated Sep 23, 2022

microsoft / nnfusion

A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.

C++ 1,003 167 Updated Sep 19, 2024

Oneflow-Inc / oneflow

OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.

C++ 9,391 1,015 Updated Dec 4, 2025

tensorflow / tensorflow

An Open Source Machine Learning Framework for Everyone

C++ 194,463 75,255 Updated Apr 5, 2026

optuna / optuna

A hyperparameter optimization framework

Python 13,852 1,291 Updated Apr 3, 2026

huawei-noah / bolt

Bolt is a deep learning library with high performance and heterogeneous flexibility.

C++ 957 164 Updated Apr 11, 2025

PaddlePaddle / Paddle

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice （『飞桨』核心框架，深度学习&机器学习高性能单机、分布式训练和跨平台部署）

C++ 23,805 5,981 Updated Apr 5, 2026

lightgbm-org / LightGBM

A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning …

C++ 18,217 3,994 Updated Apr 4, 2026

Tencent / TNN

TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is distinguished by several outstanding features, including its…

C++ 4,631 772 Updated May 9, 2025

dmlc / rabit

Reliable Allreduce and Broadcast Interface for distributed machine learning

C++ 513 180 Updated Nov 5, 2020

ray-project / ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 41,955 7,406 Updated Apr 5, 2026

Jittor / jittor

Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.

Python 3,215 321 Updated Mar 27, 2026

liwei-cpp / MetaNN

C++ 291 73 Updated Jan 18, 2021

OAID / Tengine

Tengine is a lite, high performance, modular inference engine for embedded device

C++ 4,513 977 Updated Mar 6, 2025

tiny-dnn / tiny-dnn

header only, dependency-free deep learning framework in C++14

C++ 6,020 1,396 Updated Apr 17, 2022

JDAI-CV / dabnn

dabnn is an accelerated binary neural networks inference framework for mobile platform

C++ 778 102 Updated Nov 12, 2019

dmlc / xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

C++ 28,226 8,863 Updated Mar 31, 2026

dmlc / ps-lite

A lightweight parameter server interface

C++ 1,562 546 Updated Mar 2, 2026

jax-ml / jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 35,303 3,504 Updated Apr 5, 2026

pytorch / pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 98,811 27,405 Updated Apr 5, 2026

BVLC / caffe

Caffe: a fast open framework for deep learning.

C++ 34,751 18,529 Updated Jul 31, 2024

apache / tvm

Open Machine Learning Compiler Framework

Python 13,249 3,848 Updated Apr 5, 2026

hpi-xnor / BMXNet

(New version is out: https://github.com/hpi-xnor/BMXNet-v2) BMXNet: An Open-Source Binary Neural Network Implementation Based on MXNet

C++ 351 94 Updated Nov 18, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

JackieWu wkcn

Achievements

Achievements

Highlights

Organizations

Block or report wkcn

MLSys

tensorflow / mesh

triton-lang / triton

flexflow / flexflow-train

horovod / horovod

deepspeedai / DeepSpeed

microsoft / onnxruntime

tinygrad / tinygrad

OAID / AutoKernel

microsoft / nnfusion

Oneflow-Inc / oneflow

tensorflow / tensorflow

optuna / optuna

huawei-noah / bolt

PaddlePaddle / Paddle

lightgbm-org / LightGBM

Tencent / TNN

dmlc / rabit

ray-project / ray

Jittor / jittor

liwei-cpp / MetaNN

OAID / Tengine

tiny-dnn / tiny-dnn

JDAI-CV / dabnn

dmlc / xgboost

dmlc / ps-lite

jax-ml / jax

pytorch / pytorch

BVLC / caffe

apache / tvm

hpi-xnor / BMXNet