Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
C++dphnAI/aphrodite-engine

aphrodite-engine

Large-scale LLM inference engine

68.3/100
1.8KForks: 199
View on GitHub
Loading report...

Similar Projects

ZhiLight

58

A highly optimized LLM inference acceleration engine for Llama and its variants.

C++905

FinceptTerminal

91

FinceptTerminal is a modern finance application offering advanced market analytics, investment research, and economic data tools, designed for interactive exploration and data-driven decision-making in a user-friendly environment.

C++26.2K

MNN

93

MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba, powering high-performance on-device LLMs and Edge AI.

C++15.5K

serving

92

A flexible, high-performance serving system for machine learning models

C++6.4K
Back to List