Notice: This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Python InternLM/lmdeploy

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

85.2/100
7.9K | Forks: 701

Similar Projects

OpenLLM

91

Run any open-source LLM, such as DeepSeek or Llama, as an OpenAI-compatible API endpoint in the cloud.

Python 12.4K

Chinese-LLaMA-Alpaca-2

73

Phase 2 of the Chinese LLaMA-2 & Alpaca-2 LLM project, plus 64K long-context models

Python 7.1K

lorax

82

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Python 3.8K

vllm

93

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 82.4K