Notice: This resource is provided by a third-party author. Review the code manually or with AI tools before use to ensure security and compatibility.
Python · xlite-dev/Awesome-LLM-Inference

Awesome-LLM-Inference

📚 A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc. 🎉

83.9/100
5.2K · Forks: 366

Similar Projects

UltraRAG

86

[GitHub Trending #2] A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines

Python · 5.5K

llm_note

60

LLM notes, including model inference, transformer model structure, and llm framework code analysis notes.

Python · 878

ragflow

93

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

Python · 78.8K

unsloth

92

Web UI for training and running open models like Gemma 4, Qwen3.5, DeepSeek, gpt-oss locally.

Python · 62.5K