Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
C++PaddlePaddle/Serving

Serving

A flexible, high-performance carrier for machine learning models(『飞桨』服务化部署框架)

74.0/100
925Forks: 251
View on GitHub
Loading report...

Similar Projects

serving

77

A flexible, high-performance serving system for machine learning models

C++6.3K

deeplake

86

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

C++9.0K

model_server

85

A scalable inference server for models optimized with OpenVINO™

C++836

flash-tokenizer

60

EFFICIENT AND OPTIMIZED TOKENIZER ENGINE FOR LLM INFERENCE SERVING

C++504
Back to List