Notice: This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Python · containers/ramalama

ramalama

RamaLama is an open-source developer tool that simplifies the local serving of AI models from any source and facilitates their use for inference in production, all through the familiar language of containers.

86.7/100
2.6K · Forks: 306

Similar Projects

InferenceX

69

Open Source Continuous Inference Benchmarking Qwen3.5, DeepSeek, GPTOSS - GB200 NVL72 vs MI355X vs B200 vs GB300 NVL72 vs H100 & soon™ TPUv6e/v7/Trainium2/3

Python · 638

LMCache

87

Supercharge Your LLM with the Fastest KV Cache Layer

Python · 7.6K

langchain

94

The agent engineering platform

Python · 128.7K

open-webui

94

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Python · 126.3K