Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
C++Luce-Org/lucebox-hub

lucebox-hub

Fast LLM speculative inference server for consumer hardware.

74.3/100
2.4KForks: 219
View on GitHubHomepage →
Loading report...

Similar Projects

llamafile

89

Distribute and run LLMs with a single file.

C++24.8K

RCLI

62

Talk to your Mac, query your docs, no cloud required. On-device voice AI + RAG

C++1.5K

surogate

73

Training/Fine-tuning at the speed of light

C++796

lemonade

85

Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs. Join our discord: https://discord.gg/5xXzkMu8Zk

C++4.3K
Back to List