Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Pythonlightseekorg/tokenspeed

tokenspeed

TokenSpeed is a speed-of-light LLM inference engine.

73.6/100
1.5KForks: 167
View on GitHubHomepage →
Loading report...

Similar Projects

sglang

91

SGLang is a high-performance serving framework for large language models and multimodal models.

Python29.5K

UltraRAG

86

A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines

Python5.6K

EasyR1

68

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python5.0K

AngelSlim

78

Model compression toolkit engineered for enhanced usability, comprehensiveness, and efficiency.

Python1.3K
Back to List