Notice: This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Python · xlite-dev/Awesome-LLM-Inference

Awesome-LLM-Inference

📚 A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc. 🎉

81.3/100
5.0K · Forks: 348

Similar Projects

UltraRAG

85

A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines

Python · 5.4K

llm_note

46

LLM notes, including model inference, transformer model structure, and llm framework code analysis notes.

Python · 866

ragflow

93

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

Python · 74.4K

unsloth

93

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

Python · 53.5K