Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Pythonvllm-project/llm-compressor

llm-compressor

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

84.1/100
3.1KForks: 489
View on GitHubHomepage →
Loading report...

Similar Projects

nncf

81

Neural Network Compression Framework for enhanced OpenVINO™ inference

Python1.2K

LlamaFactory

92

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python70.5K

faster-whisper

64

Faster Whisper transcription with CTranslate2

Python22.4K

Chinese-LLaMA-Alpaca

90

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python18.9K
Back to List