TokenSpeed is a speed-of-light LLM inference engine.
SGLang is a high-performance serving framework for large language models and multimodal models.
open-source healthcare ai
[GitHub Trending #2] A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL