Question 1

What is ji-farthing/gemma-4-26B-A4B-it-DFlash-SWA-ik-llama-GGUF?

Accepted Answer

--- license: apache-2.0 base_model: - z-lab/gemma-4-26B-A4B-it-DFlash - google/gemma-4-26B-A4B-it tags: - gguf - gemma4 - dflash - speculative-decoding - sliding-window-attention - ik_llama --- # Gemma 4 26B-A4B DFlash SWA draft for ik_llama This repo contains an `ik_llama`-compatible DFlash draft GGUF converted from `z-lab/gemma-4-26B-A4B-it-DFlash`, carrying the per-layer sliding-window attention (SWA) pattern. This is not a standalone chat model. Use it as a `--model-draft` file next to a matching Gemma 4 26B-A4B IT target GGUF, with DFlash speculative decoding. Gemma 4 needs `--jinja`. ## Sliding-window attention The draft is sliding-window on every layer except a final full-attention (global) layer: `sliding_window_pattern = [true, true, true, true, false]`, `sliding_window = 2048`. ## Files | File | Quant | Draft window | | --- | --- | --- | | `gemma-4-26B-A4B-it-DFlash-SWA-ik_lla…

Question 2

What license applies to ji-farthing/gemma-4-26B-A4B-it-DFlash-SWA-ik-llama-GGUF?

Accepted Answer

License: apache-2.0. Verify terms on Hugging Face before commercial use.

Question 3

How do I run ji-farthing/gemma-4-26B-A4B-it-DFlash-SWA-ik-llama-GGUF locally?

Accepted Answer

Download a GGUF file from this page and load it in guIDE or llama.cpp. Pipeline task: text-generation.

Question 4

How much VRAM or disk space does ji-farthing/gemma-4-26B-A4B-it-DFlash-SWA-ik-llama-GGUF need?

Accepted Answer

Runs locally from ~450.5 MB disk (4 GB VRAM class GPUs with llama.cpp / guIDE).

Model ID	ji-farthing/gemma-4-26B-A4B-it-DFlash-SWA-ik-llama-GGUF
Author	ji-farthing
Pipeline	—
License	apache-2.0
Base model	z-lab/gemma-4-26B-A4B-it-DFlash,google/gemma-4-26B-A4B-it
Last modified	2026-06-24T13:33:20.000Z

ji-farthing/gemma-4-26B-A4B-it-DFlash-SWA-ik-llama-GGUF overview

Repository Files & Downloads

Model Details

Model README

Gemma 4 26B-A4B DFlash SWA draft for ik_llama

Sliding-window attention

Files

Use

Validation (RTX 4070, ik_llama DFlash SWA branch)

Conversion

Run ji-farthing/gemma-4-26B-A4B-it-DFlash-SWA-ik-llama-GGUF with guIDE