Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Rustopeninfer-project/openinfer

openinfer

Pure Rust + CUDA LLM inference engine — no PyTorch, OpenAI-compatible, serves Qwen3 to Kimi-K2

68.7/100
501Forks: 75
View on GitHubHomepage →
Loading report...

Similar Projects

openinterpreter

89

A lightweight coding agent for open models like Deepseek, Kimi, and Qwen

Rust64.3K

vllm

93

A high-throughput and memory-efficient inference and serving engine for LLMs

Python85.2K

CodeWhale

89

Open-source, community-driven agent harness

Rust39.3K

chitu

87

High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

Python3.1K
Back to List