Notice: This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
C++ · uccl-project/uccl

uccl

UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)

80.0/100
1.4K · Forks: 155

Similar Projects

tiny-vllm

52

Build your own high performance LLM inference engine in C++ and CUDA - a smaller version of vLLM

C++ · 776

deeplake

87

Deeplake is AI Data Runtime for Agents. It provides serverless postgres with a multimodal datalake, enabling scalable retrieval and training.

C++ · 9.2K

cactus

86

Low-latency AI engine for mobile devices & wearables

C++ · 5.3K

lemonade

85

Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs. Join our discord: https://discord.gg/5xXzkMu8Zk

C++ · 4.3K