kuroto4ka/qwen3-vl-8b-instruct-unredacted-max-quants-gguf MAX.Q8_0 GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

kuroto4ka/qwen3-vl-8b-instruct-unredacted-max-quants-gguf overview

This repository contains high-quality GGUF quantizations for the prithivMLmods/Qwen3-VL-8B-Instruct-Unredacted-MAX model.

ggufvisionmultimodalqwenqwen-3unredactedimage-to-texttext-generationconversationalroleplayassistantvlvlmenruzhbase_model:prithivMLmods/Qwen3-VL-8B-Instruct-Unredacted-MAXbase_model:quantized:prithivMLmods/Qwen3-VL-8B-Instruct-Unredacted-MAXlicense:apache-2.0endpoints_compatibleregion:us

kuroto4ka/qwen3-vl-8b-instruct-unredacted-max-quants-gguf visual

Downloads

1,897

Likes

Pipeline

image-to-text

Library

gguf

Visibility

Public

Access

Open

Repository Files & Downloads

14 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
Qwen3-VL-8B-Instruct-Unredacted-MAX.F16.gguf	GGUF	F16	15.26 GB	Download
Qwen3-VL-8B-Instruct-Unredacted-MAX.Q2_K.gguf	GGUF	Q2_K	3.06 GB	Download
Qwen3-VL-8B-Instruct-Unredacted-MAX.Q3_K_L.gguf	GGUF	Q3_K_L	4.13 GB	Download
Qwen3-VL-8B-Instruct-Unredacted-MAX.Q3_K_M.gguf	GGUF	Q3_K_M	3.84 GB	Download
Qwen3-VL-8B-Instruct-Unredacted-MAX.Q4_K_M.gguf	GGUF	Q4_K_M	4.68 GB	Download
Qwen3-VL-8B-Instruct-Unredacted-MAX.Q4_K_S.gguf	GGUF	Q4_K_S	4.47 GB	Download
Qwen3-VL-8B-Instruct-Unredacted-MAX.Q5_K_M.gguf	GGUF	Q5_K_M	5.45 GB	Download
Qwen3-VL-8B-Instruct-Unredacted-MAX.Q5_K_S.gguf	GGUF	Q5_K_S	5.33 GB	Download
Qwen3-VL-8B-Instruct-Unredacted-MAX.Q6_K.gguf	GGUF	Q6_K	6.26 GB	Download
Qwen3-VL-8B-Instruct-Unredacted-MAX.Q8_0.gguf	GGUF	—	8.11 GB	Download
Qwen3-VL-8B-Instruct-Unredacted-MAX.mmproj-bf16.gguf	GGUF	BF16	1.08 GB	Download
Qwen3-VL-8B-Instruct-Unredacted-MAX.mmproj-f16.gguf	GGUF	F16	1.08 GB	Download
Qwen3-VL-8B-Instruct-Unredacted-MAX.mmproj-f32.gguf	GGUF	F32	2.15 GB	Download
Qwen3-VL-8B-Instruct-Unredacted-MAX.mmproj-q8_0.gguf	GGUF	—	717.44 MB	Download

Model Details Live

Model Slug

kuroto4ka/qwen3-vl-8b-instruct-unredacted-max-quants-gguf

Author

KuroTo4ka

Pipeline Task

image-to-text

Library

gguf

Created

2026-04-10

Last Modified

2026-04-13

Gated

Private

HF SHA

d2db200bba5cc65c50959a5d5624e34d6cc8e7bc

License

apache-2.0

Language

en, ru, zh

Base Model

prithivMLmods/Qwen3-VL-8B-Instruct-Unredacted-MAX

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "license": "apache-2.0",
    "language": [
      "en",
      "ru",
      "zh"
    ],
    "tags": [
      "vision",
      "multimodal",
      "gguf",
      "qwen",
      "qwen-3",
      "unredacted",
      "image-to-text",
      "text-generation",
      "conversational",
      "roleplay",
      "assistant",
      "vl",
      "vlm"
    ],
    "pipeline_tag": "image-to-text",
    "library_name": "gguf",
    "base_model": [
      "prithivMLmods/Qwen3-VL-8B-Instruct-Unredacted-MAX"
    ],
    "frontmatter": {
      "license": "apache-2.0",
      "language": [
        "en",
        "ru",
        "zh"
      ],
      "tags": [
        "vision",
        "multimodal",
        "gguf",
        "qwen",
        "qwen-3",
        "unredacted",
        "image-to-text",
        "text-generation",
        "conversational",
        "roleplay",
        "assistant",
        "vl",
        "vlm"
      ],
      "pipeline_tag": "image-to-text",
      "library_name": "gguf",
      "base_model": [
        "prithivMLmods/Qwen3-VL-8B-Instruct-Unredacted-MAX"
      ]
    },
    "hero_image_url": "https://camo.githubusercontent.com/17b4379eedbf639f0fc005e6512f2a629b9cb059d3cc4eacc1ec65fa21f92898/68747470733a2f2f7169616e77656e2d7265732e6f73732d616363656c65726174652e616c6979756e63732e636f6d2f5177656e332d564c2f7177656e33766c6c6f676f2e706e67",
    "summary": "This repository contains high-quality GGUF quantizations for the prithivMLmods/Qwen3-VL-8B-Instruct-Unredacted-MAX model.",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: apache-2.0\nlanguage:\n- en\n- ru\n- zh\ntags:\n- vision\n- multimodal\n- gguf\n- qwen\n- qwen-3\n- unredacted\n- image-to-text\n- text-generation\n- conversational\n- roleplay\n- assistant\n- vl\n- vlm\npipeline_tag: image-to-text\nlibrary_name: gguf\nbase_model:\n- prithivMLmods/Qwen3-VL-8B-Instruct-Unredacted-MAX\n---\n\n![1](https://camo.githubusercontent.com/17b4379eedbf639f0fc005e6512f2a629b9cb059d3cc4eacc1ec65fa21f92898/68747470733a2f2f7169616e77656e2d7265732e6f73732d616363656c65726174652e616c6979756e63732e636f6d2f5177656e332d564c2f7177656e33766c6c6f676f2e706e67)\n\n# Qwen3-VL-8B-Instruct-Unredacted-MAX-Quants-GGUF\n\nThis repository contains high-quality GGUF quantizations for the [prithivMLmods/Qwen3-VL-8B-Instruct-Unredacted-MAX](https://huggingface.co/prithivMLmods/Qwen3-VL-8B-Instruct-Unredacted-MAX) model.\n\n##  Highlights\n- **Unredacted & MAX**: Maximum performance version without restrictive filters.\n- **Full Vision Support**: Includes multiple versions of the vision projector (`mmproj`) for different hardware needs.\n- **Optimized**: Compatible with the latest `llama.cpp` and other GGUF-supported backends.\n\n##  Files Included\n\n### 1. Model Weights (LLM)\n\n| Filename | Quant Method | Description |\n| :--- | :--- | :--- |\n| `Q4_K_M.gguf` | Q4_K_M | **Recommended.** Best balance of speed and intelligence. |\n| `Q8_0.gguf` | Q8_0 | High quality, nearly identical to original weights. |\n| `Q6_K.gguf` | Q6_K | Very high quality, slightly slower than Q4. |\n| `Q5_K_M.gguf` | Q5_K_M | Good balance between Q4 and Q6. |\n| `Q3_K_M.gguf` | Q3_K_M | Low size, moderate quality loss. |\n| `Q2_K.gguf` | Q2_K | Smallest possible size, significant quality loss. |\n| `F16.gguf` | F16 | Baseline reference quality. |\n\n### 2. Vision Projectors (mmproj)\n*Required for image recognition tasks.*\n\n\n| Filename | Type | Description |\n| :--- | :--- | :--- |\n| `mmproj-f32.gguf` | F32 | Absolute maximum precision (2.3GB). |\n| `mmproj-f16.gguf` | F16 | Industry standard for high-quality vision. |\n| `mmproj-bf16.gguf` | BF16 | Optimized for modern NVIDIA GPUs (Ampere+). |\n| `mmproj-q8_0.gguf` | Q8_0 | Best for saving VRAM without losing recognition detail. |\n\n##  Usage\nTo use vision capabilities in `llama.cpp`, use the following command:\n\n```bash\n./llama-cli -m Qwen3-VL-8B-Instruct-Unredacted-MAX.Q4_K_M.gguf \\\n            --mmproj Qwen3-VL-8B-Instruct-Unredacted-MAX.mmproj-f16.gguf \\\n            --image path/to/your/image.jpg \\\n            -p \"Describe this image\"\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "vision",
    "multimodal",
    "qwen",
    "qwen-3",
    "unredacted",
    "image-to-text",
    "text-generation",
    "conversational",
    "roleplay",
    "assistant",
    "vl",
    "vlm",
    "en",
    "ru",
    "zh",
    "base_model:prithivMLmods/Qwen3-VL-8B-Instruct-Unredacted-MAX",
    "base_model:quantized:prithivMLmods/Qwen3-VL-8B-Instruct-Unredacted-MAX",
    "license:apache-2.0",
    "endpoints_compatible",
    "region:us"
  ],
  "likes": 1,
  "downloads": 1897,
  "gated": false,
  "private": false,
  "last_modified": "2026-04-13T09:23:21.000Z",
  "created_at": "2026-04-10T21:23:25.000Z",
  "pipeline_tag": "image-to-text",
  "library_name": "gguf"
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "69d96a4d44072dd46eed610d",
  "id": "KuroTo4ka/Qwen3-VL-8B-Instruct-Unredacted-MAX-Quants-GGUF",
  "modelId": "KuroTo4ka/Qwen3-VL-8B-Instruct-Unredacted-MAX-Quants-GGUF",
  "sha": "d2db200bba5cc65c50959a5d5624e34d6cc8e7bc",
  "createdAt": "2026-04-10T21:23:25.000Z",
  "lastModified": "2026-04-13T09:23:21.000Z",
  "author": "KuroTo4ka",
  "downloads": 1897,
  "likes": 1,
  "gated": false,
  "private": false,
  "pipeline_tag": "image-to-text",
  "library_name": "gguf",
  "siblings_count": 16
}

kuroto4ka/qwen3-vl-8b-instruct-unredacted-max-quants-gguf overview

Repository Files & Downloads

Model Details Live

Metadata Inspector

More models in this shard