Mofan-Z
  • ByteDance
  • Shanghai
  • 01:00 (UTC +08:00)

Chan Zhong Shuo Chan technical analysis tool; Chan theory (Chanlun); stocks; futures; quant; quantitative trading

Python 4,859 1,436 Updated Apr 14, 2026

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,825 180 Updated Apr 14, 2026

ByteCheckpoint: A Unified Checkpointing Library for LFMs

Python 275 19 Updated Feb 2, 2026

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,972 285 Updated May 15, 2025

common in-memory tensor structure

C++ 1,191 160 Updated Jan 26, 2026

Serverless LLM Serving for Everyone.

Python 674 70 Updated Mar 6, 2026

Optimized primitives for collective multi-GPU communication

C++ 4,609 1,197 Updated Apr 14, 2026

An elegant PyTorch deep reinforcement learning library.

Python 10,539 1,291 Updated Apr 3, 2026

Simple, safe way to store and distribute tensors

Python 3,703 307 Updated Apr 14, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,678 3,652 Updated Apr 14, 2026

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 7,211 398 Updated Jul 11, 2024

A flexible and efficient training framework for large-scale alignment tasks

Python 452 39 Updated Oct 23, 2025

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.5, DeepSeek, gpt-oss locally.

Python 61,522 5,330 Updated Apr 14, 2026

A rule-based tunnel for Android.

Kotlin 37,458 2,451 Updated Apr 14, 2026

A V2Ray client for Android that supports Xray core and v2fly core

Kotlin 54,019 7,190 Updated Apr 14, 2026

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 16,973 1,264 Updated Apr 14, 2026

A Data Streaming Library for Efficient Neural Network Training

Python 1,491 188 Updated Feb 2, 2026

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,573 1,785 Updated Apr 9, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 76,579 15,570 Updated Apr 14, 2026

Zero Bubble Pipeline Parallelism

Python 452 34 Updated May 7, 2025

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 3,499 312 Updated Jul 17, 2025

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,342 917 Updated Apr 14, 2026

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

5,006 534 Updated Sep 25, 2024

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,086 674 Updated Apr 14, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 70,097 8,569 Updated Apr 12, 2026

A PyTorch native platform for training generative AI models

Python 5,236 782 Updated Apr 14, 2026

Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs

Python 1,004 61 Updated Mar 3, 2026

PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily write your own.

Python 1,451 113 Updated Apr 13, 2026

Training and serving large-scale neural networks with auto parallelization.

Python 3,187 362 Updated Dec 9, 2023