Weili17

Starred repositories

Tile-Based Runtime for Ultra-Low-Latency LLM Inference

Python 451 19 Updated Dec 8, 2025

slime is an LLM post-training framework for RL Scaling.

Python 2,918 351 Updated Dec 19, 2025

A framework for efficient model inference with omni-modality models

Python 1,027 141 Updated Dec 20, 2025

A Lightweight PyTorch Framework for Recommendation Models (PyTorch recommendation algorithm framework), easy to use and easy to extend. https://datawhalechina.github.io/torch-rechub/

Python 674 105 Updated Dec 17, 2025

Open Fabric Interfaces

C 742 459 Updated Dec 20, 2025

[ICLR 2025] Official PyTorch Implementation of Gated Delta Networks: Improving Mamba2 with Delta Rule

Python 398 23 Updated Sep 15, 2025

Perplexity open source garden for inference technology

Rust 307 25 Updated Dec 9, 2025

🌟 100+ original LLM / RL principle diagrams 📚, a major contribution from the author of 《大模型算法》 (Large Model Algorithms)! 💥 (100+ LLM/RL Algorithm Maps)

Python 2,022 216 Updated Dec 16, 2025

Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.

TypeScript 163,871 52,365 Updated Dec 20, 2025

A high-performance inference engine for LLMs, optimized for diverse AI accelerators.

C++ 827 101 Updated Dec 19, 2025

Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond

Python 722 73 Updated Nov 30, 2025

Xray, Penetrates Everything. Also the best v2ray-core. Where the magic happens. An open platform for various uses.

Go 33,540 4,811 Updated Dec 17, 2025

A Suite for Parallel Inference of Diffusion Transformers (DiTs) on multi-GPU Clusters

Python 52 1 Updated Jul 23, 2024
Python 1,660 99 Updated Sep 30, 2025

Democratizing Reinforcement Learning for LLMs

Python 4,880 467 Updated Dec 18, 2025

An industrial deep learning framework for high-dimension sparse data

PureBasic 4,305 1,029 Updated Sep 25, 2024

A unified architecture deep learning framework designed specifically for ultra-large-scale sparse models.

Python 278 16 Updated Nov 19, 2025

Speed Always Wins: A Survey on Efficient Architectures for Large Language Models

374 31 Updated Nov 11, 2025

[ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models

Python 35 3 Updated Jun 12, 2024

A PyTorch native platform for training generative AI models

Python 4,859 644 Updated Dec 20, 2025

dInfer: An Efficient Inference Framework for Diffusion Language Models

Python 363 33 Updated Dec 10, 2025

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 1,482 216 Updated Dec 15, 2025
C 87 36 Updated Jun 14, 2022

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,439 121 Updated Dec 20, 2025

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Python 2,239 178 Updated Mar 6, 2025

Distributed parallel 3D-Causal-VAE for efficient training and inference

Python 42 3 Updated Aug 20, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 12,924 1,505 Updated Dec 17, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels

C++ 4,263 350 Updated Dec 20, 2025