Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Pythonxorbitsai/inference

inference

Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.

91.3/100
9.3KForks: 821
View on GitHubHomepage →
Loading report...

Similar Projects

pytorch-lightning

91

Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

Python31.1K

ml-engineering

69

Machine Learning Engineering Open Book

Python17.8K

bitsandbytes

90

Accessible large language models via k-bit quantization for PyTorch.

Python8.1K

chronos-forecasting

89

Chronos: Pretrained Models for Time Series Forecasting

Python5.2K
Back to List