GraySoft

beinsezii/mistral-small-4-119b-2603-gguf-halo (q6k_ffn GGUF) is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. Use this page to decide whether the model suits local use in guIDE or another GGUF runtime. Related models in the same shard are listed below.

Model Intelligence Sheet

beinsezii/mistral-small-4-119b-2603-gguf-halo overview

Quant optimized for quality and speed on a Strix Halo 128 GiB system. It may also be beneficial on DGX Spark and similar systems. TL;DR: this quant achieves both superior quality and superior speed compared to homogeneous Q6_K. See the GLM version (https://huggingface.co/Beinsezii/GLM-4.6V-GGUF-HALO) for more details on theory and comparisons.
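To make the 128 GiB sizing claim concrete, here is a rough back-of-the-envelope sketch. It uses the 91.63 GB size listed for the quant file in this repository. Two things are assumptions and not stated by the source: that the listed size is in decimal gigabytes, and that the model has about 119 billion parameters (taken from "119B" in the model name). The helper names are illustrative only.

```python
# Rough sizing for the quant file on a 128 GiB machine.
# ASSUMPTIONS (not from the source): "GB" means decimal gigabytes, and the
# parameter count is ~119e9, inferred from "119B" in the model name.

FILE_SIZE_GB = 91.63      # listed size of mistral-small-4-119b-q80-q6k_ffn.gguf
ASSUMED_PARAMS = 119e9    # assumption, see note above
SYSTEM_RAM_GIB = 128      # Strix Halo configuration named in the summary


def gb_to_gib(size_gb: float) -> float:
    """Convert decimal gigabytes (10^9 bytes) to gibibytes (2^30 bytes)."""
    return size_gb * 1e9 / 2**30


def bits_per_weight(size_gb: float, params: float) -> float:
    """Average bits stored per parameter, counting the whole file."""
    return size_gb * 1e9 * 8 / params


file_gib = gb_to_gib(FILE_SIZE_GB)
headroom_gib = SYSTEM_RAM_GIB - file_gib
avg_bpw = bits_per_weight(FILE_SIZE_GB, ASSUMED_PARAMS)

print(f"file size:    {file_gib:.2f} GiB")
print(f"headroom:     {headroom_gib:.2f} GiB")
print(f"avg bits/wt:  {avg_bpw:.2f}")
```

Under these assumptions the file comes to roughly 85.3 GiB, leaving about 42.7 GiB for the KV cache, the operating system, and anything else sharing memory. The average is an upper-bound style estimate because the file also contains metadata, and the real parameter count may differ from the name.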

Tags: gguf, base_model:mistralai/Mistral-Small-4-119B-2603, base_model:quantized:mistralai/Mistral-Small-4-119B-2603, license:apache-2.0, endpoints_compatible, region:us, imatrix, conversational
Downloads: 152
Likes: 2
Pipeline: (not set)
Library: (not set)
Visibility: Public
Access: Open

Repository Files & Downloads

2 files detected
Direct downloads for all repository files
File                                    Type   Quantization   Size
imatrix.gguf                            GGUF   (not listed)   113.30 MB
mistral-small-4-119b-q80-q6k_ffn.gguf   GGUF   (not listed)   91.63 GB
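For scripted downloads, the files above can be addressed with the Hugging Face `resolve` URL pattern. The sketch below only builds the URLs and does not fetch anything. The helper name `resolve_url` is hypothetical, and pinning to the commit SHA shown on this page is one possible choice, not something the repository author prescribes.

```python
from urllib.parse import quote

HF_BASE = "https://huggingface.co"

# Repository id and commit SHA as shown in this page's metadata.
REPO_ID = "Beinsezii/Mistral-Small-4-119B-2603-GGUF-HALO"
PINNED_SHA = "8eeed09d3138747e43efefe53184926b330b508e"


def resolve_url(repo_id: str, filename: str, revision: str = "main") -> str:
    """Build a direct-download URL for one file in a Hugging Face repo.

    Hypothetical helper: it only formats the URL, it performs no request.
    """
    return (
        f"{HF_BASE}/{quote(repo_id)}/resolve/"
        f"{quote(revision, safe='')}/{quote(filename)}"
    )


# Pinning the revision to a commit SHA keeps the download reproducible
# even if the repository is updated later.
for name in ("imatrix.gguf", "mistral-small-4-119b-q80-q6k_ffn.gguf"):
    print(resolve_url(REPO_ID, name, revision=PINNED_SHA))
```

If the `huggingface_hub` package is available, its `hf_hub_download(repo_id=..., filename=..., revision=...)` function is an alternative that also handles caching; the URL form above is mainly useful with plain HTTP tools.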

Model Details

Model Slug: beinsezii/mistral-small-4-119b-2603-gguf-halo
Author: Beinsezii
Pipeline Task: (not set)
Library: (not set)
Created: 2026-03-17
Last Modified: 2026-03-17
Gated: No
Private: No
HF SHA: 8eeed09d3138747e43efefe53184926b330b508e
License: apache-2.0
Language: Unknown
Base Model: mistralai/Mistral-Small-4-119B-2603

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "license": "apache-2.0",
    "base_model": [
      "mistralai/Mistral-Small-4-119B-2603"
    ],
    "frontmatter": {
      "license": "apache-2.0",
      "base_model": [
        "mistralai/Mistral-Small-4-119B-2603"
      ]
    },
    "hero_image_url": "",
    "summary": "Quant optimized for quality / speed on a Strix Halo 128GiB system. Possibly also beneficial on DGX Spark and similar systems. The TL;DR is this quant achieves both superior quality and speed compared to homogenous Q6_K. See the GLM version for more details on theory and comparisons.",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: apache-2.0\nbase_model:\n- mistralai/Mistral-Small-4-119B-2603\n---\n\nQuant optimized for quality / speed on a Strix Halo 128GiB system. Possibly also beneficial on DGX Spark and similar systems.\n\nThe TL;DR is this quant achieves both superior quality and speed compared to homogenous Q6_K.\n\nSee the [GLM version](https://huggingface.co/Beinsezii/GLM-4.6V-GGUF-HALO) for more details on theory and comparisons.\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "base_model:mistralai/Mistral-Small-4-119B-2603",
    "base_model:quantized:mistralai/Mistral-Small-4-119B-2603",
    "license:apache-2.0",
    "endpoints_compatible",
    "region:us",
    "imatrix",
    "conversational"
  ],
  "likes": 2,
  "downloads": 152,
  "gated": false,
  "private": false,
  "last_modified": "2026-03-17T10:04:31.000Z",
  "created_at": "2026-03-17T09:53:31.000Z",
  "pipeline_tag": "",
  "library_name": ""
}
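The `tags` array in the normalized metadata mixes bare flags (such as `gguf`) with `key:value` entries (such as `license:apache-2.0`). A minimal sketch of separating the two follows. The function name `split_tags` and the choice to split on the first colon only are assumptions made for illustration, not a documented GraySoft or Hugging Face convention.

```python
# Tag list copied from the normalized metadata above.
TAGS = [
    "gguf",
    "base_model:mistralai/Mistral-Small-4-119B-2603",
    "base_model:quantized:mistralai/Mistral-Small-4-119B-2603",
    "license:apache-2.0",
    "endpoints_compatible",
    "region:us",
    "imatrix",
    "conversational",
]


def split_tags(tags):
    """Separate bare flags from key:value tags.

    Hypothetical helper. Splits on the FIRST colon only, so a tag like
    "base_model:quantized:..." keeps "quantized:..." as its value.
    """
    flags = []
    fields = {}
    for tag in tags:
        key, sep, value = tag.partition(":")
        if sep:
            # A key may repeat (base_model appears twice), so collect a list.
            fields.setdefault(key, []).append(value)
        else:
            flags.append(tag)
    return flags, fields


flags, fields = split_tags(TAGS)
print("flags:", flags)
for key, values in fields.items():
    print(f"{key}: {values}")
```

Collecting values in lists keeps both `base_model` entries, which a plain dictionary assignment would silently overwrite.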
Source payload excerpt (from Hugging Face API)
{
  "_id": "69b9249b04a9af463ffed840",
  "id": "Beinsezii/Mistral-Small-4-119B-2603-GGUF-HALO",
  "modelId": "Beinsezii/Mistral-Small-4-119B-2603-GGUF-HALO",
  "sha": "8eeed09d3138747e43efefe53184926b330b508e",
  "createdAt": "2026-03-17T09:53:31.000Z",
  "lastModified": "2026-03-17T10:04:31.000Z",
  "author": "Beinsezii",
  "downloads": 152,
  "likes": 2,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 5
}