Large-scale LLM inference engine
A highly optimized LLM inference acceleration engine for Llama and its variants.
FinceptTerminal is a modern finance application offering advanced market analytics, investment research, and economic data tools, designed for interactive exploration and data-driven decision-making in a user-friendly environment.
MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba, powering high-performance on-device LLMs and Edge AI.
A flexible, high-performance serving system for machine learning models