Question 1

What is Cathul/Qwen3.6-35B-A3B-FP8-Q6_K-GGUF?

Accepted Answer

--- base_model: Qwen/Qwen3.6-35B-A3B-FP8 frameworks: - '' library_name: transformers license: apache-2.0 license_link: https://huggingface.co/Qwen/Qwen3.6-35B-A3B-FP8/blob/main/LICENSE pipeline_tag: image-text-to-text tasks: [] tags: - llama-cpp - gguf-my-repo --- # Cathul/Qwen3.6-35B-A3B-FP8-Q6_K-GGUF This model was converted to GGUF format from [`Qwen/Qwen3.6-35B-A3B-FP8`](https://huggingface.co/Qwen/Qwen3.6-35B-A3B-FP8) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space. Refer to the [original model card](https://huggingface.co/Qwen/Qwen3.6-35B-A3B-FP8) for more details on the model. ## Use with llama.cpp Install llama.cpp through brew (works on Mac and Linux) ```bash brew install llama.cpp ``` Invoke the llama.cpp server or the CLI. ### CLI: ```bash llama-cli --hf-repo Cathul/Qwen3.6-35B-A3B-FP8-Q6_K-GGUF --hf-file qwen3.6-35b…

Question 2

What license applies to Cathul/Qwen3.6-35B-A3B-FP8-Q6_K-GGUF?

Accepted Answer

License: apache-2.0. Verify terms on Hugging Face before commercial use.

Question 3

How do I run Cathul/Qwen3.6-35B-A3B-FP8-Q6_K-GGUF locally?

Accepted Answer

Download a GGUF file from this page and load it in guIDE or llama.cpp. Pipeline task: image-text-to-text.

Question 4

How much VRAM or disk space does Cathul/Qwen3.6-35B-A3B-FP8-Q6_K-GGUF need?

Accepted Answer

Runs locally from ~10.4 MB disk (4 GB VRAM class GPUs with llama.cpp / guIDE).

Cathul/Qwen3.6-35B-A3B-FP8-Q6_K-GGUF overview

Repository Files & Downloads

Model Details

Model README

Cathul/Qwen3.6-35B-A3B-FP8-Q6_K-GGUF

Use with llama.cpp

CLI:

Server:

Run Cathul/Qwen3.6-35B-A3B-FP8-Q6_K-GGUF with guIDE

Model ID	Cathul/Qwen3.6-35B-A3B-FP8-Q6_K-GGUF
Author	Cathul
Pipeline	image-text-to-text
License	apache-2.0
Base model	Qwen/Qwen3.6-35B-A3B-FP8
Last modified	2026-06-07T07:24:51.000Z