Notice: This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Python · bitsandbytes-foundation/bitsandbytes

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

90.5/100
8.0K · Forks: 831

Similar Projects

LlamaFactory

92

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python · 68.0K

ml-engineering

74

Machine Learning Engineering Open Book

Python · 17.3K

inference

89

Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.

Python · 9.1K

hqq

81

Official implementation of Half-Quadratic Quantization (HQQ)

Python · 915