Models for 16 GB VRAM
Desktop-class 16 GB tier — higher quants and small multimodal models. Sorted by Hugging Face downloads. Smallest GGUF file size shown per model.
Run models locally with guIDE
Download guIDE — the AI-native code editor with local LLM inference and 69 built-in tools.