beinsezii/mistral-small-4-119b-2603-gguf-halo q6k_ffn GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

beinsezii/mistral-small-4-119b-2603-gguf-halo overview

Quant optimized for quality / speed on a Strix Halo 128GiB system. Possibly also beneficial on DGX Spark and similar systems. The TL;DR is this quant achieves both superior quality and speed compared to homogenous Q6_K. See the GLM version for more details on theory and comparisons.

ggufbase_model:mistralai/Mistral-Small-4-119B-2603base_model:quantized:mistralai/Mistral-Small-4-119B-2603license:apache-2.0endpoints_compatibleregion:usimatrixconversational

beinsezii/mistral-small-4-119b-2603-gguf-halo visual

Downloads

152

Likes

Pipeline

—

Library

—

Visibility

Public

Access

Open

Repository Files & Downloads

2 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
imatrix.gguf	GGUF	—	113.30 MB	Download
mistral-small-4-119b-q80-q6k_ffn.gguf	GGUF	—	91.63 GB	Download

Model Details Live

Model Slug

beinsezii/mistral-small-4-119b-2603-gguf-halo

Author

Beinsezii

Pipeline Task

—

Library

—

Created

2026-03-17

Last Modified

2026-03-17

Gated

Private

HF SHA

8eeed09d3138747e43efefe53184926b330b508e

License

apache-2.0

Language

Unknown

Base Model

mistralai/Mistral-Small-4-119B-2603

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "license": "apache-2.0",
    "base_model": [
      "mistralai/Mistral-Small-4-119B-2603"
    ],
    "frontmatter": {
      "license": "apache-2.0",
      "base_model": [
        "mistralai/Mistral-Small-4-119B-2603"
      ]
    },
    "hero_image_url": "",
    "summary": "Quant optimized for quality / speed on a Strix Halo 128GiB system. Possibly also beneficial on DGX Spark and similar systems. The TL;DR is this quant achieves both superior quality and speed compared to homogenous Q6_K. See the GLM version for more details on theory and comparisons.",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: apache-2.0\nbase_model:\n- mistralai/Mistral-Small-4-119B-2603\n---\n\nQuant optimized for quality / speed on a Strix Halo 128GiB system. Possibly also beneficial on DGX Spark and similar systems.\n\nThe TL;DR is this quant achieves both superior quality and speed compared to homogenous Q6_K.\n\nSee the [GLM version](https://huggingface.co/Beinsezii/GLM-4.6V-GGUF-HALO) for more details on theory and comparisons.\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "base_model:mistralai/Mistral-Small-4-119B-2603",
    "base_model:quantized:mistralai/Mistral-Small-4-119B-2603",
    "license:apache-2.0",
    "endpoints_compatible",
    "region:us",
    "imatrix",
    "conversational"
  ],
  "likes": 2,
  "downloads": 152,
  "gated": false,
  "private": false,
  "last_modified": "2026-03-17T10:04:31.000Z",
  "created_at": "2026-03-17T09:53:31.000Z",
  "pipeline_tag": "",
  "library_name": ""
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "69b9249b04a9af463ffed840",
  "id": "Beinsezii/Mistral-Small-4-119B-2603-GGUF-HALO",
  "modelId": "Beinsezii/Mistral-Small-4-119B-2603-GGUF-HALO",
  "sha": "8eeed09d3138747e43efefe53184926b330b508e",
  "createdAt": "2026-03-17T09:53:31.000Z",
  "lastModified": "2026-03-17T10:04:31.000Z",
  "author": "Beinsezii",
  "downloads": 152,
  "likes": 2,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 5
}

beinsezii/mistral-small-4-119b-2603-gguf-halo overview

Repository Files & Downloads

Model Details Live

Metadata Inspector

More models in this shard