Question 1

What is ysong21/gemma-4-12B-it-qat-assistant-MTP-Q4_0-GGUF?

Accepted Answer

---
license: apache-2.0
library_name: llama.cpp
base_model: google/gemma-4-12B-it-qat-q4_0-unquantized-assistant
tags:
- gguf
- llama.cpp
- gemma
- gemma-4
- qat
- mtp
- draft-model
- assistant
- speculative-decoding
pipeline_tag: text-generation
---

# Gemma 4 12B IT QAT Assistant MTP Q4_0 GGUF

This repository contains a Q4_0 GGUF conversion of Google's official Gemma 4 12B IT QAT assistant / drafter checkpoint:

- Source checkpoint: `google/gemma-4-12B-it-qat-q4_0-unquantized-assistant`
- Intended target model: `google/gemma-4-12B-it-qat-q4_0-gguf`
- Output file: `gemma-4-12B-it-qat-assistant-MTP-Q4_0.gguf`
- Runtime: llama.cpp with Gemma 4 MTP / `draft-mtp` support

This is not a standalone chat model. It is a draft model for speculative decoding and must be loaded together with a matching Gemma 4 12B IT QAT target model.

## Usage

```bash
llama-server \
  -hf google/gemma-4-12B-it-qat-q4_…
```

Question 2

What license applies to ysong21/gemma-4-12B-it-qat-assistant-MTP-Q4_0-GGUF?

Accepted Answer

License: apache-2.0. Verify terms on Hugging Face before commercial use.

Question 3

How do I run ysong21/gemma-4-12B-it-qat-assistant-MTP-Q4_0-GGUF locally?

Accepted Answer

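Download the GGUF file (`gemma-4-12B-it-qat-assistant-MTP-Q4_0.gguf`) from this page and load it in guIDE or llama.cpp as a draft model for speculative decoding. It is not a standalone chat model: per the model README, it must be loaded together with a matching Gemma 4 12B IT QAT target model (`google/gemma-4-12B-it-qat-q4_0-gguf`), and the llama.cpp build needs Gemma 4 MTP / `draft-mtp` support. Pipeline task: text-generation.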
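
As a rough sketch of the download step only, the snippet below builds the direct URL for the draft file and fetches it with `curl` when asked to. The repository ID and file name come from the model README; the assumption that the file sits at the root of the repository's `main` branch, and the `DOWNLOAD` switch itself, are illustrative and not taken from this page. The snippet does not start llama.cpp, since the full `llama-server` command is truncated in the README excerpt above.

```shell
# Repository and file name as given in the model README.
REPO="ysong21/gemma-4-12B-it-qat-assistant-MTP-Q4_0-GGUF"
FILE="gemma-4-12B-it-qat-assistant-MTP-Q4_0.gguf"

# Assumption: the file is stored at the root of the `main` branch.
URL="https://huggingface.co/${REPO}/resolve/main/${FILE}"

echo "Draft model URL: ${URL}"

# Hypothetical switch: download only when DOWNLOAD=1 is set,
# so the script can be dry-run without network access.
if [ "${DOWNLOAD:-0}" = "1" ]; then
    # -L follows redirects; --fail exits non-zero on HTTP errors.
    curl -L --fail -o "${FILE}" "${URL}"
fi
```

Running the script with `DOWNLOAD=1` set saves the draft model to the current directory. The matching target model still has to be obtained separately before the two can be loaded together.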

Question 4

How much VRAM or disk space does ysong21/gemma-4-12B-it-qat-assistant-MTP-Q4_0-GGUF need?

Accepted Answer

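The draft GGUF file is about 308 MB on disk, which this page places in the 4 GB VRAM class for llama.cpp / guIDE. That figure covers the draft model only; the matching target model it must be loaded with needs its own disk space and memory.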

ysong21/gemma-4-12B-it-qat-assistant-MTP-Q4_0-GGUF overview

Model ID	ysong21/gemma-4-12B-it-qat-assistant-MTP-Q4_0-GGUF
Author	ysong21
Pipeline	text-generation
License	apache-2.0
Base model	google/gemma-4-12B-it-qat-q4_0-unquantized-assistant
Last modified	2026-06-08T05:58:30.000Z