Question 1

What is ji-farthing/Qwen3.5-35B-A3B-DFlash-SWA-ik-llama-GGUF?

Accepted Answer

---
license: apache-2.0
base_model:
- z-lab/Qwen3.5-35B-A3B-DFlash
- Qwen/Qwen3.5-35B-A3B
tags:
- gguf
- qwen3.5
- dflash
- speculative-decoding
- sliding-window-attention
- ik_llama
---

# Qwen3.5-35B-A3B DFlash SWA draft for ik_llama

This repo contains an `ik_llama`-compatible DFlash draft GGUF converted from `z-lab/Qwen3.5-35B-A3B-DFlash`, carrying the per-layer sliding-window attention (SWA) pattern.

This is not a standalone chat model. Use it as a `--model-draft` file next to a matching Qwen3.5-35B-A3B target GGUF, with DFlash speculative decoding.

## Sliding-window attention

The draft is sliding-window on every layer except a final full-attention (global) layer: `sliding_window_pattern = [true, true, true, true, true, false]`, `sliding_window = 4096`.

## Files

| File | Quant | Draft window |
| --- | --- | --- |
| `Qwen3.5-35B-A3B-DFlash-SWA-ik_llama-Q8_0.gguf` | Q8_0 | 4096 (5 slid…
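To make the README's `sliding_window_pattern` and `sliding_window` values concrete, here is a minimal Python sketch of how many context positions each draft layer could see at a given context length. The two constants are quoted from the README. The helper names and the exact window convention (a sliding layer sees at most the most recent `sliding_window` positions) are assumptions for illustration, not taken from the ik_llama source.

```python
from typing import List

# Values quoted in the README.
SLIDING_WINDOW_PATTERN = [True, True, True, True, True, False]
SLIDING_WINDOW = 4096


def visible_context(context_len: int, is_sliding: bool,
                    window: int = SLIDING_WINDOW) -> int:
    """Number of context positions one layer can attend to.

    Assumed convention: a sliding layer sees at most the `window`
    most recent positions; a global layer sees the whole context.
    """
    if context_len < 0:
        raise ValueError("context_len must be non-negative")
    return min(context_len, window) if is_sliding else context_len


def per_layer_visibility(context_len: int) -> List[int]:
    """Apply the README's per-layer pattern to one context length."""
    return [visible_context(context_len, s) for s in SLIDING_WINDOW_PATTERN]


if __name__ == "__main__":
    for n in (1000, 10000):
        print(n, per_layer_visibility(n))
```

Under these assumptions, the five sliding layers stop growing once the context exceeds 4096 positions, and only the final global layer keeps seeing the full context.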

Question 2

What license applies to ji-farthing/Qwen3.5-35B-A3B-DFlash-SWA-ik-llama-GGUF?

Accepted Answer

License: apache-2.0. Verify terms on Hugging Face before commercial use.

Question 3

How do I run ji-farthing/Qwen3.5-35B-A3B-DFlash-SWA-ik-llama-GGUF locally?

Accepted Answer

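According to the README, this is not a standalone chat model, so it is not meant to be loaded on its own. Download the GGUF file from this page and pass it as the `--model-draft` file next to a matching Qwen3.5-35B-A3B target GGUF, using an `ik_llama` build with DFlash speculative decoding. Pipeline task: text-generation.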
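Since the README describes this file as a `--model-draft` companion to a target GGUF, the pairing can be sketched as a small Python helper that assembles the command line. Only `--model-draft` comes from the README. The binary name `llama-server`, the `--model` flag for the target, and both file paths are assumptions for illustration. Any DFlash-specific options belong to the README's "Use" section, which is not reproduced on this page, so none are guessed here.

```python
import shlex
from typing import List


def build_server_command(target_gguf: str, draft_gguf: str,
                         binary: str = "llama-server") -> List[str]:
    """Assemble an argv list pairing a target model with the draft.

    `binary` and `--model` are assumed; `--model-draft` is the flag
    named in the README. No DFlash-specific flags are added.
    """
    return [binary, "--model", target_gguf, "--model-draft", draft_gguf]


if __name__ == "__main__":
    # Hypothetical local paths; substitute the files you downloaded.
    cmd = build_server_command(
        "Qwen3.5-35B-A3B-target.gguf",
        "Qwen3.5-35B-A3B-DFlash-SWA-ik_llama-Q8_0.gguf",
    )
    print(shlex.join(cmd))
```

Returning a list rather than a single string lets the result be passed to `subprocess.run` without shell quoting issues. Check the full README for the exact options before relying on it.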

Question 4

How much VRAM or disk space does ji-farthing/Qwen3.5-35B-A3B-DFlash-SWA-ik-llama-GGUF need?

Accepted Answer

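The draft GGUF takes about 401.6 MB on disk (listed for 4 GB VRAM class GPUs with llama.cpp / guIDE). That figure covers the draft only: the matching Qwen3.5-35B-A3B target GGUF it runs alongside needs its own, much larger, disk and memory budget.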

Model ID	ji-farthing/Qwen3.5-35B-A3B-DFlash-SWA-ik-llama-GGUF
Author	ji-farthing
Pipeline	—
License	apache-2.0
Base model	z-lab/Qwen3.5-35B-A3B-DFlash, Qwen/Qwen3.5-35B-A3B
Last modified	2026-06-24T13:33:51.000Z

README outline: Qwen3.5-35B-A3B DFlash SWA draft for ik_llama; Sliding-window attention; Files; Use; Validation (RTX 4070, ik_llama DFlash SWA branch); Conversion.