Models for 12 GB VRAM
Models that fit 12 GB VRAM with room for context and batching. Sorted by Hugging Face downloads. Smallest GGUF file size shown per model.
Run models locally with guIDE
Download guIDE — the AI-native code editor with local LLM inference and 69 built-in tools.