Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Pythonxorbitsai/inference

inference

Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.

89.2/100
9.1KForks: 801
View on GitHubHomepage →
Loading report...

Similar Projects

pytorch-lightning

92

Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

Python30.9K

ml-engineering

74

Machine Learning Engineering Open Book

Python17.3K

bitsandbytes

91

Accessible large language models via k-bit quantization for PyTorch.

Python8.0K

agents-from-scratch

53

Build AI agents from first principles using a local LLM - no frameworks, no cloud APIs, no hidden reasoning.

Python568
Back to List