⚠

Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.

Pythongpustack/gpustack

gpustack

A GPU cluster manager for high-performance AI model serving (vLLM, SGLang) and on-demand SSH-accessible GPU instances.

84.2/100

★ 5.4KForks: 590

View on GitHub →Homepage →

Loading report...

Similar Projects

sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

Python★ 30.7K

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python★ 87.1K

unsloth

Unsloth is a local UI for training and running Gemma 4, Qwen3.6, DeepSeek, Kimi, GLM and other models.

Python★ 68.9K

CowAgent

Open-source super AI assistant & Agent Harness. Plans tasks, runs tools and skills, self-evolves with memory and knowledge. Multi-model, multi-channel. Lightweight, extensible, one-line install. (formerly chatgpt-on-wechat)

Python★ 46.1K

← Back to List