Low-latency AI engine for mobile devices & wearables
llama and other large language models on iOS and MacOS offline using GGML library.
MCP Monitoring with eBPF
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Production ready toolkit to run AI locally