Skip to content
@gpustack

GPUStack

Simple, scalable AI model deployment on GPU clusters

Pinned Loading

  1. gpustack gpustack Public

    Simple, scalable AI model deployment on GPU clusters

    Python 3.8k 385

  2. gguf-parser-go gguf-parser-go Public

    Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

    Go 208 22

  3. llama-box llama-box Public

    LM inference server implementation based on *.cpp.

    C++ 280 25

  4. vox-box vox-box Public

    A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.

    Python 164 24

Repositories

Showing 10 of 12 repositories
  • runtime Public
    gpustack/runtime’s past year of commit activity
    Python 0 Apache-2.0 0 0 0 Updated Oct 9, 2025
  • gpustack-ui Public
    gpustack/gpustack-ui’s past year of commit activity
    TypeScript 49 Apache-2.0 34 1 0 Updated Oct 9, 2025
  • gpustack Public

    Simple, scalable AI model deployment on GPU clusters

    gpustack/gpustack’s past year of commit activity
    Python 3,824 Apache-2.0 385 454 (1 issue needs help) 19 Updated Oct 9, 2025
  • runner Public
    gpustack/runner’s past year of commit activity
    Dockerfile 0 Apache-2.0 0 0 0 Updated Sep 18, 2025
  • gpustack/gpustack.github.io’s past year of commit activity
    HTML 0 2 0 0 Updated Sep 15, 2025
  • gpustack/gpustack-helper’s past year of commit activity
    Python 0 Apache-2.0 2 1 0 Updated Aug 26, 2025
  • gguf-parser-go Public

    Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

    gpustack/gguf-parser-go’s past year of commit activity
    Go 208 MIT 22 0 0 Updated Aug 18, 2025
  • llama-box Public

    LM inference server implementation based on *.cpp.

    gpustack/llama-box’s past year of commit activity
    C++ 280 MIT 25 4 0 Updated Aug 16, 2025
  • .github Public

    Meta-Github repository for all GPUStack repositories.

    gpustack/.github’s past year of commit activity
    Dockerfile 0 Apache-2.0 1 0 0 Updated Aug 11, 2025
  • vox-box Public

    A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.

    gpustack/vox-box’s past year of commit activity
    Python 164 Apache-2.0 24 16 0 Updated Jul 19, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.