Skip to content
View Mofan-Z's full-sized avatar
  • ByteDance
  • Shanghai
  • 05:56 (UTC +08:00)

Block or report Mofan-Z

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

缠中说禅技术分析工具;缠论;股票;期货;Quant;量化交易

Python 4,852 1,434 Updated Apr 8, 2026

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,820 179 Updated Apr 13, 2026

ByteCheckpoint: An Unified Checkpointing Library for LFMs

Python 275 19 Updated Feb 2, 2026

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,971 286 Updated May 15, 2025

common in-memory tensor structure

C++ 1,189 160 Updated Jan 26, 2026

Serverless LLM Serving for Everyone.

Python 672 70 Updated Mar 6, 2026

Optimized primitives for collective multi-GPU communication

C++ 4,609 1,196 Updated Apr 12, 2026

An elegant PyTorch deep reinforcement learning library.

Python 10,531 1,291 Updated Apr 3, 2026

Simple, safe way to store and distribute tensors

Python 3,702 307 Updated Apr 13, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,652 3,647 Updated Apr 13, 2026

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 7,210 398 Updated Jul 11, 2024

A flexible and efficient training framework for large-scale alignment tasks

Python 452 39 Updated Oct 23, 2025

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.5, DeepSeek, gpt-oss locally.

Python 61,392 5,309 Updated Apr 13, 2026

A rule-based tunnel for Android.

Kotlin 37,387 2,448 Updated Apr 13, 2026

A V2Ray client for Android, support Xray core and v2fly core

Kotlin 53,940 7,186 Updated Apr 13, 2026

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 16,965 1,261 Updated Apr 13, 2026

A Data Streaming Library for Efficient Neural Network Training

Python 1,491 188 Updated Feb 2, 2026

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,566 1,784 Updated Apr 9, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 76,443 15,536 Updated Apr 13, 2026

Zero Bubble Pipeline Parallelism

Python 452 34 Updated May 7, 2025

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 3,499 310 Updated Jul 17, 2025

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,341 917 Updated Apr 13, 2026

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

5,006 534 Updated Sep 25, 2024

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,084 672 Updated Apr 13, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 70,043 8,561 Updated Apr 12, 2026

A PyTorch native platform for training generative AI models

Python 5,234 782 Updated Apr 13, 2026

Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs

Python 1,004 61 Updated Mar 3, 2026

PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily write your own.

Python 1,451 113 Updated Apr 13, 2026

Training and serving large-scale neural networks with auto parallelization.

Python 3,187 362 Updated Dec 9, 2023
Next