Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
PythonWenyueh/MinivLLM
MinivLLM
Based on Nano-vLLM, a simple replication of vLLM with self-contained paged attention and flash attention implementation