Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Cmicrosoft/vattention

vattention

Dynamic Memory Management for Serving LLMs without PagedAttention

51.0/100
500Forks: 42
View on GitHub
Loading report...

Similar Projects

redis

94

For developers, who are building real-time data-driven applications, Redis is the preferred, fastest, and most feature-rich cache, data structure server, and document and vector query engine.

C75.2K

rufus

90

The Reliable USB Formatting Utility

C36.7K

codebase-memory-mcp

91

High-performance code intelligence MCP server. Indexes codebases into a persistent knowledge graph — average repo in milliseconds. 158 languages, sub-ms queries, 99% fewer tokens. Single static binary, zero dependencies.

C24.7K

lvgl

93

Embedded graphics library to create beautiful UIs for any MCU, MPU and display type.

C24.0K
Back to List