Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
C++aphrodite-engine/aphrodite-engine

aphrodite-engine

Large-scale LLM inference engine

81.5/100
1.7KForks: 193
View on GitHubHomepage →
Loading report...

Similar Projects

ZhiLight

65

A highly optimized LLM inference acceleration engine for Llama and its variants.

C++904

MNN

94

MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba, powering high-performance on-device LLMs and Edge AI.

C++15.0K

serving

92

A flexible, high-performance serving system for machine learning models

C++6.3K

rocketride-server

78

High-performance AI pipeline engine with a C++ core and 50+ Python-extensible nodes. Build, debug, and scale LLM workflows with 13+ model providers, 8+ vector databases, and agent orchestration, all from your IDE. Includes VS Code extension, TypeScript/Python SDKs, and Docker deployment.

C++1.9K
Back to List