Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Shellllm-d/llm-d
llm-d
Achieve state of the art inference performance with modern accelerators on Kubernetes