Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Pythonovg-project/kvcached

kvcached

Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond

74.3/100
885Forks: 103
View on GitHub
Loading report...

Similar Projects

InferenceX

69

Open Source Continuous Inference Benchmarking Qwen3.5, DeepSeek, GPTOSS - GB200 NVL72 vs MI355X vs B200 vs GB300 NVL72 vs H100 & soon™ TPUv6e/v7/Trainium2/3

Python857

LMCache

87

Supercharge Your LLM with the Fastest KV Cache Layer

Python8.1K

UltraRAG

86

[GitHub Trending #2] A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines

Python5.5K

sparrow

88

Structured data extraction and instruction calling with ML, LLM and Vision LLM

Python5.2K
Back to List