Stars
A high-throughput and memory-efficient inference and serving engine for LLMs
SGLang is a high-performance serving framework for large language models and multimodal models.
Work with remote image registries - retrieving information, images, signing content
A toolkit to run Ray applications on Kubernetes
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A curated list of awesome Go frameworks, libraries and software
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Kubernetes Virtualization API and runtime in order to define and manage virtual machines.
Kata Containers is an open source project and community working to build a standard implementation of lightweight Virtual Machines (VMs) that feel and perform like containers, but provide the workl…
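A modular development framework and operations scheduling system that helps applications solve development and operations pain points (resource savings, startup in seconds, flexible deployment, fast feature delivery, and so on) and helps existing applications evolve to a Serverless model at low cost; a modular development framework and serving platform that enables apps to evolve from monolithic to microservices and also …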
A feature-rich command-line audio/video downloader
Stable Diffusion web UI
An open cloud native capacity solution which helps you achieve ultimate resource utilization in an intelligent and risk-free way.
Kubetools - Curated List of Kubernetes Tools
Tensors and Dynamic neural networks in Python with strong GPU acceleration
A QoS-based scheduling system that brings optimal layout and status to workloads such as microservices, web services, big data jobs, AI jobs, etc.
Core components in the OCM project. Report here if you find any issues in OCM.
Kubernetes community content
Gin is a high-performance HTTP web framework written in Go. It provides a Martini-like API but with significantly better performance—up to 40 times faster—thanks to httprouter. Gin is designed for …
DLRover: An Automatic Distributed Deep Learning System
Add-on agent to generate and expose cluster-level metrics.
A curated list of software and architecture related design patterns.
The Prometheus monitoring system and time series database.
💥 A Lodash-style Go library based on Go 1.18+ Generics (map, filter, contains, find...)