MoFHeka

He Jia MoFHeka

Github suspend my old account, so this is my new account. My Gitee url is https://gitee.com/MoFHeka.

24 followers · 2 following

Beijing

Achievements

execution-ucx Public

A std::execution style runtime context and High Performance RPC Transport for using OpenUCX. Including CUDA/ROCM/... devices with RDMA.

training infrastructure machine-learning deep-learning rpc ray ucx

C++ 11 3 Apache License 2.0 Updated Sep 29, 2025
xla-launcher Public

XLA Launcher is a high-performance, lightweight C++ library designed to provide a simple interface for loading and executing computation graphs represented in the StableHLO format.

machine-learning tensorflow pytorch compiler-optimization onnx jax xla

C++ 2 1 Other Updated Aug 1, 2025
verl Public
Forked from volcengine/verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python Apache License 2.0 Updated Jul 1, 2025
CorelinkIDE Public

C Apache License 2.0 Updated Jun 16, 2025
CL32Q0-thirdparty Public

C Apache License 2.0 Updated Jun 16, 2025
CL32Q0-debug-bridge Public

C Apache License 2.0 Updated Jun 16, 2025
CL32Q0-driver-library Public

C Apache License 2.0 Updated Jun 16, 2025
CL32Q0-demo Public

Makefile Apache License 2.0 Updated Jun 16, 2025
CL32Q0-CMSIS-DSP Public

C Other Updated Jun 16, 2025
GeoCompass Public

Swift Updated Jun 16, 2025
AccurateHeartX Public

C++ GNU General Public License v3.0 Updated Jun 16, 2025
tensorflow-onnx Public
Forked from onnx/tensorflow-onnx

Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX

Jupyter Notebook Apache License 2.0 Updated Feb 19, 2025
interview-coder Public
Forked from arunsetty/interview-coder

An open-source invisible desktop application to help you pass your technical interviews.

TypeScript Updated Jan 24, 2025
AsterHiredis Public

A seastar implement for redis client.

C++ 1 Apache License 2.0 Updated Jan 16, 2025
recommenders-addons Public
Forked from tensorflow/recommenders-addons

Additional utils and helpers to extend TensorFlow when build recommendation systems, contributed and maintained by SIG Recommenders.

Cuda 1 3 Apache License 2.0 Updated Jan 13, 2025
bazel-central-registry Public
Forked from bazelbuild/bazel-central-registry

The central registry of Bazel modules for the Bzlmod external dependency system.

Starlark Apache License 2.0 Updated Jan 3, 2025
rules_nccl Public

Starlark MIT License Updated Jan 2, 2025
MeepoEmbedding Public

A distributed high-performance dynamic lookuptable-style Embedding designed for recommendation, search, CTR and advertising systems. Supports GPU, CPU, remote distributed KV (such as Redis), SSD, a…

4 3 Apache License 2.0 Updated Nov 26, 2024
Megatron-LM Public
Forked from NVIDIA/Megatron-LM

Ongoing research training transformer models at scale

Python Other Updated Apr 26, 2024
runtime Public
Forked from tensorflow/runtime

A performant and modular runtime for TensorFlow

C++ Apache License 2.0 Updated Mar 26, 2024
clash-for-linux-backup Public
Forked from zengpuzhang/clash-for-linux-backup

Linux最完整的Clash for Linux的备份仓库，完全可以使用！由Yizuko进行修复及维护

Shell MIT License Updated Feb 28, 2024
DeepSpeed Public
Forked from deepspeedai/DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python Apache License 2.0 Updated Feb 26, 2024
TransformerEngine Public
Forked from NVIDIA/TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper GPUs, to provide better performance with lower memory utilization in bot…

Python Apache License 2.0 Updated Jan 29, 2024
deepray Public
Forked from deepray-AI/deepray

Deepray for continuous integration development.

Python Apache License 2.0 Updated Dec 25, 2023
LLaMA-Megatron Public

A LLaMA1/LLaMA12 Megatron implement.

pytorch llama megatron megatron-lm llm llm-training llama2

Python 28 2 Apache License 2.0 Updated Dec 13, 2023
Megatron-AutoCkpt Public

A Megatron checkpoint auto-saving patch at the end of each iteration, inspired by Alibaba PAI EasyCkpt for Megatron.

Python 1 Apache License 2.0 Updated Nov 21, 2023
HierarchicalKV Public
Forked from NVIDIA-Merlin/HierarchicalKV

HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of Merlin-KV is to store key-value feature-embeddings on high-b…

Cuda Apache License 2.0 Updated Oct 12, 2023
unit-scaling Public
Forked from graphcore-research/unit-scaling

A library for unit scaling in PyTorch

Jupyter Notebook MIT License Updated Aug 23, 2023
NeMo Public
Forked from lhb8125/NeMo

NeMo: a toolkit for conversational AI

Python Apache License 2.0 Updated Aug 15, 2023
LLaMA-Alpa Public

A LLaMa pretrain code by using Alpa(https://github.com/alpa-projects/alpa).

Python 1 Apache License 2.0 Updated Jul 13, 2023

He Jia MoFHeka

Achievements

Achievements

execution-ucx Public

Uh oh!

xla-launcher Public

Uh oh!

verl Public

Uh oh!

CorelinkIDE Public

Uh oh!

CL32Q0-thirdparty Public

Uh oh!

CL32Q0-debug-bridge Public

Uh oh!

CL32Q0-driver-library Public

Uh oh!

CL32Q0-demo Public

Uh oh!

CL32Q0-CMSIS-DSP Public

Uh oh!

GeoCompass Public

Uh oh!

AccurateHeartX Public

Uh oh!

tensorflow-onnx Public

Uh oh!

interview-coder Public

Uh oh!

AsterHiredis Public

Uh oh!

recommenders-addons Public

Uh oh!

bazel-central-registry Public

Uh oh!

rules_nccl Public

Uh oh!

MeepoEmbedding Public

Uh oh!

Megatron-LM Public

Uh oh!

runtime Public

Uh oh!

clash-for-linux-backup Public

Uh oh!

DeepSpeed Public

Uh oh!

TransformerEngine Public

Uh oh!

deepray Public

Uh oh!

LLaMA-Megatron Public

Uh oh!

Megatron-AutoCkpt Public

Uh oh!

HierarchicalKV Public

Uh oh!

unit-scaling Public

Uh oh!

NeMo Public

Uh oh!

LLaMA-Alpa Public

Uh oh!