Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Pythonxlite-dev/Awesome-LLM-Inference

Awesome-LLM-Inference

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

72.9/100
5.3KForks: 385
View on GitHub
Loading report...

Similar Projects

UltraRAG

86

A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines

Python5.6K

llm_note

54

LLM notes, including model inference, transformer model structure, and llm framework code analysis notes.

Python882

vllm-cli

56

A command-line interface tool for serving LLM using vLLM.

Python500

unsloth

93

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

Python66.1K
Back to List