GraySoft

Models for 12 GB VRAM

Models that fit in 12 GB of VRAM with room for context and batching, sorted by Hugging Face downloads. The size shown for each model is its smallest GGUF file.

AbteeXAILab/lumynax-reasoning-gpt-oss-20b-gguf
AbteeXAILab · from 11.28 GB · 3,360 downloads
AbteeXAILab/lumynax-moe-moonlight-16b-a3b-gguf
AbteeXAILab · from 8.14 GB · 2,051 downloads
DavidAU/MN-DARKEST-UNIVERSE-29B-GGUF
DavidAU · from 10.25 GB · 1,946 downloads
AbteeXAILab/lumynax-reasoning-deepseek-distill-text-gguf
AbteeXAILab · from 8.37 GB · 953 downloads
ccharnkij/Phi-4-Uncensored-GGUF
ccharnkij · from 8.44 GB · 891 downloads
AbteeXAILab/lumynax-infused-phi-4-text-gguf
AbteeXAILab · from 8.44 GB · 704 downloads
AbteeXAILab/lumynax-infused-qwen3-14b-gguf
AbteeXAILab · from 8.38 GB · 533 downloads
deucebucket/Qwen3.6-27B-Cerebellum-Q2K-GGUF
deucebucket · from 9.98 GB · 221 downloads
deucebucket/Qwen3.6-27B-Cerebellum-GGUF
deucebucket · from 11.98 GB · 218 downloads
gdfhhjk/spectrida-re-gguf
gdfhhjk · from 8.11 GB · 124 downloads
DavidAU/MN-GRAND-Gutenberg-Lyra4-Lyra-23.5B-GGUF
DavidAU · from 8.29 GB · 104 downloads
pirola/context-1-mxfp4-gguf
pirola · from 11.28 GB · 0 downloads
temaq-org/Tema_Q-X4-Thinking-TaQuants-GGUF
temaq-org · from 10.30 GB · 0 downloads
ramagotchi/DeepSeek-Coder-V2-Lite-Instruct-GGUF
ramagotchi · from 9.65 GB · 0 downloads
Iambackup/Gemma-4-12B-OBLITERATED-GGUF
Iambackup · from 11.80 GB · 0 downloads
DevQuasar/CohereLabs.North-Mini-Code-1.0-GGUF
DevQuasar · from 10.58 GB · 0 downloads
DevQuasar/google.diffusiongemma-26B-A4B-it-GGUF
DevQuasar · from 9.86 GB · 0 downloads
build-small-hackathon/lfed-qwen2.5-coder-14b-sql-gguf
build-small-hackathon · from 8.37 GB · 0 downloads
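
As a rough way to read the sizes above, the sketch below subtracts a model's smallest GGUF file size from a 12 GB budget to estimate what is left for context and batching. It is only an approximation under stated assumptions: the whole file is held in VRAM, the listed GB and the card's GB use the same unit, and runtime overhead is ignored. The helper name `headroom_gb` is hypothetical, and the three entries are copied from the list.

```python
# Rough headroom estimate for a 12 GB card.
# Assumptions (not from the listing): the whole GGUF file sits in VRAM,
# and no extra runtime overhead is counted.
VRAM_GB = 12.0

# (repository, smallest GGUF file size in GB) copied from the list above
models = [
    ("AbteeXAILab/lumynax-moe-moonlight-16b-a3b-gguf", 8.14),
    ("DavidAU/MN-DARKEST-UNIVERSE-29B-GGUF", 10.25),
    ("deucebucket/Qwen3.6-27B-Cerebellum-GGUF", 11.98),
]


def headroom_gb(file_gb: float, vram_gb: float = VRAM_GB) -> float:
    """Return the VRAM left after loading the weights, rounded to 0.01 GB."""
    return round(vram_gb - file_gb, 2)


for repo, size_gb in models:
    print(f"{repo}: {headroom_gb(size_gb):.2f} GB left for context and batching")
```

Under these assumptions the headroom differs a lot between entries, so the files closest to 12 GB leave very little space for a long context.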

Run models locally with guIDE

Download guIDE — the AI-native code editor with local LLM inference and 69 built-in tools.
