Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
C++jd-opensource/xllm
xllm
A high-performance inference engine for LLMs, optimized for diverse AI accelerators.