Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Cudathu-ml/SpargeAttn
SpargeAttn
[ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.