Question 1

What is ZeZZm/aero-deuce-GGUF?

Accepted Answer

--- license: apache-2.0 base_model: google/gemma-4-12b-it tags: - gguf - q4_k_m - gemma - instruction-following - text-generation - llama.cpp inference: false --- # Aero-Deuce — GGUF Q4_K_M A fine-tuned Gemma 4 12B instruction-following model. This is the **GGUF quantized version** (~7 GB) that runs locally on CPU or GPU with no Python required. ## Download Click the **Files and versions** tab above and download `aero-deuce-q4km.gguf`. That's the only file you need. ## Which format should I use? | Format | Best for | Link | |---|---|---| | **GGUF** ← you are here | Local inference, llama.cpp, LM Studio, GPT4All | This repo | | [MLX 4-bit](https://huggingface.co/ZeZZm/aero-deuce-MLX) | Apple Silicon (Mac) | [ZeZZm/aero-deuce-MLX](https://huggingface.co/ZeZZm/aero-deuce-MLX) | | [LoRA Adapter](https://huggingface.co/ZeZZm/aero-deuce) | Merging with base model, further fine-tuning | [ZeZZm…

Question 2

What license applies to ZeZZm/aero-deuce-GGUF?

Accepted Answer

License: apache-2.0. Verify terms on Hugging Face before commercial use.

Question 3

How do I run ZeZZm/aero-deuce-GGUF locally?

Accepted Answer

Download a GGUF file from this page and load it in guIDE or llama.cpp. Pipeline task: text-generation.

Question 4

How much VRAM or disk space does ZeZZm/aero-deuce-GGUF need?

Accepted Answer

Runs locally from ~6.87 GB disk (8 GB VRAM class GPUs with llama.cpp / guIDE).

ZeZZm/aero-deuce-GGUF overview

Repository Files & Downloads

Model Details

Model README

Aero-Deuce — GGUF Q4_K_M

Download

Which format should I use?

Quick Start

Model Details

System Prompt

License

Run ZeZZm/aero-deuce-GGUF with guIDE

Model ID	ZeZZm/aero-deuce-GGUF
Author	ZeZZm
Pipeline	text-generation
License	apache-2.0
Base model	google/gemma-4-12b-it
Last modified	2026-06-07T22:01:09.000Z