Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Pythonnoonghunna/club-3090

club-3090

Community recipes for serving LLMs on RTX 3090/4090/5090 CUDA gpus. Multi-engine (vLLM, llama.cpp, ik_llama) and model-agnostic. Currently shipping Qwen3.6-27B Qwen3.6 35B Gemma 4 26B Gemma 4 31B configs for 1× and 2× cards.

74.7/100
1.4KForks: 74
View on GitHub
Loading report...

Similar Projects

hermes-agent

91

The agent that grows with you

Python197.7K

AutoGPT

96

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python185.0K

transformers

98

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python161.7K

markitdown

83

Python tool for converting files and office documents to Markdown.

Python156.1K
Back to List