Nano vLLM
Large Language Model Text Generation Inference
A high-throughput and memory-efficient inference and serving engine for LLMs
LLM training code for Databricks foundation models
Pretrained model hub for Keras 3.