Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
PythonSemiAnalysisAI/InferenceX

InferenceX

Open Source Continuous Inference Benchmark Research Platform Kimi K2.6, DeepSeekv4, GLM5 - GB200 NVL72 vs MI355X vs B200 vs GB300 NVL72 & soon™ TPUv6e/v7/Trainium2/3

69.5/100
1.1KForks: 189
View on GitHubHomepage →
Loading report...

Similar Projects

LMCache

88

LMCache: Supercharge Your LLM with the Fastest KV Cache Layer

Python8.5K

vllm

93

A high-throughput and memory-efficient inference and serving engine for LLMs

Python82.4K

ramalama

86

RamaLama is an open-source developer tool that simplifies the local serving of AI models from any source and facilitates their use for inference in production, all through the familiar language of containers.

Python2.9K

ml-engineering

72

Machine Learning Engineering Open Book

Python18.1K
Back to List