enlistedghost/ministral-3-14b-instruct-2512-gguf Q6_K_L GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

enlistedghost/ministral-3-14b-instruct-2512-gguf overview

This release contains: Llama.cpp and Ollama compatible GGUF converted and Quantized model files (Compatible with both Ollama, and Llama.cpp) Quantized GGUF version of: Original Model Link: Description: This release includes GGUF (Ollama + Llama.cpp - compatible) model files and three working multi-modal projector(s) (mmproj) files for the Vision Projector; offering full capabilities in Ollama or Llama.cpp. What's New? As of February 2026 all available GGUF (both F16 and K-Quants) files have been completely overhauled in order to offer the community what I was able to come up with as best size vs quality possible for this model from MistralAI (Thank you MistralAI for your release!). This was done by first taking the BF16 model's safetensors and up-casting them to F32, from there the model was quantized down to the currently available files you find here. Beyond the F32-->Quant conversion that is a new methodology; These GGUF F16+Quantize files also include a completely custom tokenizer chat template. Why Up-Cast to F32 from BF16? Some will argue or perhaps be curious as to why the model was up-cast from BF16 to F32... This is because although it is true that there is no direct quality benefit from casting a BF16 release to a F32 bit file size, there is however a benefit of less quality loss when quantizing from an up-cast BF16 to F32 as the F32 has in a sense more "scratchpad" available mathematically and Quants derrived from an F32 inherently degrade less per-Quant compared to a direct F16 or BF16 conversion. (It's worth a Google if anyone is ever curious!) What is the "Custom Tokenizer Chat Template?" As apposed to the standard "Chat Template" made available by MistralAI - this release of GGUF converted and quantized files offer a totally unique and custom Tokenizer Chat Template in order to provide: Smoother, Faster, Efficient, and Sophisticated interaction/inference with the model. This template strips away all of the "fluff" or genuinely unneeded logic from the JINJA Chat/Tokenizer Template - allowing anyone who uses the model for inferencing can enjoy a significant improvement in speed, quality and context adherence without sacrificing any aspect of the initial release by MistralAI. For reference - here is the new JINJA Tokenizer Chat-Template: (This template features a sliding context window of EIGHT (8) interactions, which may be adjusted per-individual requirements simply by altering the third line of this template from the number nine (9) to either higher or lower numerical values to increase or decrease the sliding context window) No modifications, edits, or configurations are required to use this model with Ollama, it works natively! Both Vision and Text work with Ollama. (^.^) Coming Soon!!! Check back occasionally - as a automated installer/configure Python-3 script is making its way to all of my releases! This allows anyone who is interested in using these models a hassle-free and stress-free experience where the Python-3 script takes care of setting up the model for Ollama (specifically for Ollama, other software optimizations coming later). It is highly recommended to use the Ollama "create" command along with the supplied ".modelfile" to ensure proper configuration for anyone who wishes to get the most out of this particular release. Though, the Python-3 automated installer/configuration tool will handle such aspects if it is chosen to be used. Happy Inferencing! -- Jon Z (EnlistedGhost) ---------------------------------------------- ### Model Updates (As of: February 27th, 2026) Final Quantized and full-F16 modelfiles are being uploaded!!! - Check back for I-Matrix quant files and other static K-Quants if you do not see your desired edition (They are being uploaded, thank you for your patience!) ----------------------------------------------

ggufMistralAIMinistralMinistral-3OllamaLlama.cppGGUFImage-Text-to-TextConversationalQuantizeMultimodalMistral3image-text-to-textconversationalenfresdeitptnlzhjakoardataset:mistralai/MM-MT-Benchbase_model:mistralai/Ministral-3-14B-Instruct-2512-BF16base_model:quantized:mistralai/Ministral-3-14B-Instruct-2512-BF16license:apache-2.0endpoints_compatible

enlistedghost/ministral-3-14b-instruct-2512-gguf visual

Downloads

596

Likes

Pipeline

image-text-to-text

Library

—

Visibility

Public

Access

Open

Repository Files & Downloads

21 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
Ministral-3-14B-Instruct-2512-F16.gguf	GGUF	F16	25.17 GB	Download
Ministral-3-14B-Instruct-2512-Q2_K.gguf	GGUF	Q2_K	4.79 GB	Download
Ministral-3-14B-Instruct-2512-Q2_K_L.gguf	GGUF	Q2_K_L	6.47 GB	Download
Ministral-3-14B-Instruct-2512-Q2_K_S.gguf	GGUF	Q2_K_S	4.59 GB	Download
Ministral-3-14B-Instruct-2512-Q3_K_L.gguf	GGUF	Q3_K_L	6.72 GB	Download
Ministral-3-14B-Instruct-2512-Q3_K_M.gguf	GGUF	Q3_K_M	6.49 GB	Download
Ministral-3-14B-Instruct-2512-Q3_K_S.gguf	GGUF	Q3_K_S	6.01 GB	Download
Ministral-3-14B-Instruct-2512-Q3_K_XL.gguf	GGUF	Q3_K_XL	7.95 GB	Download
Ministral-3-14B-Instruct-2512-Q4_K_M.gguf	GGUF	Q4_K_M	8.22 GB	Download
Ministral-3-14B-Instruct-2512-Q4_K_S.gguf	GGUF	Q4_K_S	7.32 GB	Download
Ministral-3-14B-Instruct-2512-Q4_K_XL.gguf	GGUF	Q4_K_XL	8.77 GB	Download
Ministral-3-14B-Instruct-2512-Q5_K_M.gguf	GGUF	Q5_K_M	9.34 GB	Download
Ministral-3-14B-Instruct-2512-Q5_K_S.gguf	GGUF	Q5_K_S	8.74 GB	Download
Ministral-3-14B-Instruct-2512-Q5_K_XL.gguf	GGUF	Q5_K_XL	10.56 GB	Download
Ministral-3-14B-Instruct-2512-Q6_K.gguf	GGUF	Q6_K	10.33 GB	Download
Ministral-3-14B-Instruct-2512-Q6_K_L.gguf	GGUF	Q6_K_L	11.10 GB	Download
Ministral-3-14B-Instruct-2512-Q8_0.gguf	GGUF	—	13.37 GB	Download
Ministral-3-14B-Instruct-2512-Q8_0_L.gguf	GGUF	—	14.55 GB	Download
mmproj-Ministral-3-14B-Instruct-2512-F16.gguf	GGUF	F16	837.38 MB	Download
mmproj-Ministral-3-14B-Instruct-2512-F32.gguf	GGUF	F32	1.64 GB	Download
mmproj-Ministral-3-14B-Instruct-2512-Q8_0.gguf	GGUF	—	458.40 MB	Download

Model Details Live

Model Slug

enlistedghost/ministral-3-14b-instruct-2512-gguf

Author

EnlistedGhost

Pipeline Task

image-text-to-text

Library

—

Created

2025-12-27

Last Modified

2026-03-06

Gated

Private

HF SHA

e4ddc56e1b0c365975df459572255a7248b9c12f

License

apache-2.0

Language

en, fr, es, de, it, pt, nl, zh, ja, ko, ar

Base Model

mistralai/Ministral-3-14B-Instruct-2512-BF16

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "license": "apache-2.0",
    "datasets": [
      "mistralai/MM-MT-Bench"
    ],
    "language": [
      "en",
      "fr",
      "es",
      "de",
      "it",
      "pt",
      "nl",
      "zh",
      "ja",
      "ko",
      "ar"
    ],
    "base_model": [
      "mistralai/Ministral-3-14B-Instruct-2512-BF16"
    ],
    "new_version": "EnlistedGhost/Ministral-3-14B-Instruct-2512-GGUF",
    "pipeline_tag": "image-text-to-text",
    "tags": [
      "MistralAI",
      "Ministral",
      "Ministral-3",
      "Ollama",
      "Llama.cpp",
      "GGUF",
      "Image-Text-to-Text",
      "Conversational",
      "Quantize",
      "Multimodal",
      "Mistral3"
    ],
    "frontmatter": {
      "license": "apache-2.0",
      "datasets": [
        "mistralai/MM-MT-Bench"
      ],
      "language": [
        "en",
        "fr",
        "es",
        "de",
        "it",
        "pt",
        "nl",
        "zh",
        "ja",
        "ko",
        "ar"
      ],
      "base_model": [
        "mistralai/Ministral-3-14B-Instruct-2512-BF16"
      ],
      "new_version": "EnlistedGhost/Ministral-3-14B-Instruct-2512-GGUF",
      "pipeline_tag": "image-text-to-text",
      "tags": [
        "MistralAI",
        "Ministral",
        "Ministral-3",
        "Ollama",
        "Llama.cpp",
        "GGUF",
        "Image-Text-to-Text",
        "Conversational",
        "Quantize",
        "Multimodal",
        "Mistral3"
      ]
    },
    "hero_image_url": "https://ollama.com/assets/library/mistral-nemo/72045292-694a-4867-88c8-8635c9d97030",
    "summary": "**This release contains:**  Llama.cpp and Ollama compatible GGUF converted and Quantized model files *(Compatible with both Ollama, and Llama.cpp)* **Quantized GGUF version of:** **Original Model Link:** **Description:**  This release includes GGUF (Ollama + Llama.cpp - compatible) model files and three working multi-modal projector(s) (mmproj) files for the Vision Projector; offering full capabilities in Ollama or Llama.cpp. **What's New?**  As of February 2026 all available GGUF (both F16 and K-Quants) files have been completely overhauled in order to offer the community what I was able to come up with as best size vs quality possible for this model from MistralAI (Thank you MistralAI for your release!). This was done by first taking the BF16 model's safetensors and *up-casting* them to F32, from there the model was quantized down to the currently available files you find here. Beyond the F32-->Quant conversion that is a new methodology; **These GGUF F16+Quantize files also include a completely custom tokenizer chat template.**  **Why Up-Cast to F32 from BF16?**  Some will argue or perhaps be curious as to why the model was up-cast from BF16 to F32... This is because although it is true that there is no *direct* quality benefit from casting a BF16 release to a F32 bit file size, there *is however* a benefit of **less quality loss** when quantizing from an up-cast BF16 to F32 as the F32 has in a sense more \"scratchpad\" available mathematically and Quants derrived from an F32 inherently degrade less per-Quant compared to a direct F16 or BF16 conversion. (It's worth a Google if anyone is ever curious!)  **What is the \"Custom Tokenizer Chat Template?\"**  As apposed to the standard \"Chat Template\" made available by MistralAI - this release of GGUF converted and quantized files offer a totally unique and custom Tokenizer Chat Template in order to provide: Smoother, Faster, Efficient, and Sophisticated interaction/inference with the model. This template strips away all of the \"fluff\" or genuinely unneeded logic from the JINJA Chat/Tokenizer Template - allowing anyone who uses the model for inferencing can enjoy a significant improvement in speed, quality and context adherence without sacrificing any aspect of the initial release by MistralAI. **For reference - here is the new JINJA Tokenizer Chat-Template:**  *(This template features a sliding context window of EIGHT (8) interactions, which may be adjusted per-individual requirements simply by altering the third line of this template from the number nine (9) to either higher or lower numerical values to increase or decrease the sliding context window)* ``jinja {{- $remMessage := false }} {{- range $index, $_ := .Messages }} {{- if lt (len (slice $.Messages $index)) 9 }} {{- $remMessage = true }} {{- end }} {{- if eq .Role \"system\" }}[SYSTEM_PROMPT]{{ .Content }}[/SYSTEM_PROMPT]{{- end }} {{- if $remMessage }} {{- if eq .Role \"user\" }} [INST]{{ .Content }}[/INST] {{- else if eq .Role \"assistant\" }} {{- if .Content}}{{ .Content }}{{- end }} {{- end }} {{- end }} {{- end }} `` **No modifications, edits, or configurations are required** to use this model with Ollama, it works natively! **Both Vision and Text work with Ollama. (^.^)**  **Coming Soon!!!**  Check back occasionally - as a automated installer/configure Python-3 script is making its way to all of my releases! This allows anyone who is interested in using these models a hassle-free and stress-free experience where the Python-3 script takes care of setting up the model for Ollama (specifically for Ollama, other software optimizations coming later). It is highly recommended to use the Ollama \"create\" command along with the supplied \".modelfile\" to ensure proper configuration for anyone who wishes to get the most out of this particular release. Though, the Python-3 automated installer/configuration tool will handle such aspects if it is chosen to be used. Happy Inferencing!  -- Jon Z (EnlistedGhost) ---------------------------------------------- ### Model Updates (As of: February 27th, 2026) **Final Quantized and full-F16 modelfiles are being uploaded!!!** - Check back for I-Matrix quant files and other static K-Quants if you do not see your desired edition (They are being uploaded, thank you for your patience!) ----------------------------------------------",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: apache-2.0\ndatasets:\n- mistralai/MM-MT-Bench\nlanguage:\n- en\n- fr\n- es\n- de\n- it\n- pt\n- nl\n- zh\n- ja\n- ko\n- ar\nbase_model:\n- mistralai/Ministral-3-14B-Instruct-2512-BF16\nnew_version: EnlistedGhost/Ministral-3-14B-Instruct-2512-GGUF\npipeline_tag: image-text-to-text\ntags:\n- MistralAI\n- Ministral\n- Ministral-3\n- Ollama\n- Llama.cpp\n- GGUF\n- Image-Text-to-Text\n- Conversational\n- Quantize\n- Multimodal\n- Mistral3\n---\n\n<img src=\"https://ollama.com/assets/library/mistral-nemo/72045292-694a-4867-88c8-8635c9d97030\" alt=\"Example image\" width=\"168\" height=\"128\"> \n\n\n<img src=\"https://ollama.com/assets/library/ministral-3/83fa3859-d87f-492c-bd81-596cfbceeccb\" alt=\"Example image\" width=\"64\" height=\"64\">\n\n## ------------------------------------------------<br /> - Model Details and Specifications: -<br />------------------------------------------------\n# Ministral-3 14B Instruct 2512 (GGUF)\n\n**This release contains:** <br />\nLlama.cpp and Ollama compatible GGUF converted and Quantized model files \n*(Compatible with both Ollama, and Llama.cpp)*\n\n**Quantized GGUF version of:**\n- Ministral-3-14B-Instruct-2512-BF16 <br /> *(by MistralAI)*\n\n**Original Model Link:**\n- [mistralai/Ministral-3-14B-Instruct-2512-BF16](https://huggingface.co/mistralai/Ministral-3-14B-Instruct-2512-BF16)\n\n**Description:** <br />\nThis release includes GGUF (Ollama + Llama.cpp - compatible) model files and three working multi-modal projector(s) \n(mmproj) files for the Vision Projector; offering full capabilities in Ollama or Llama.cpp.<br />\n\n**What's New?** <br />\nAs of February 2026 all available GGUF (both F16 and K-Quants) files have been completely overhauled in order to offer the community\nwhat I was able to come up with as best size vs quality possible for this model from MistralAI (Thank you MistralAI for your release!).\nThis was done by first taking the BF16 model's safetensors and *up-casting* them to F32, from there the model was quantized down to the\ncurrently available files you find here. Beyond the F32-->Quant conversion that is a new methodology; **These GGUF F16+Quantize files also\ninclude a completely custom tokenizer chat template.** <br /><br />\n\n**Why Up-Cast to F32 from BF16?** <br />\nSome will argue or perhaps be curious as to why the model was up-cast from BF16 to F32... This is because although it is true that there \nis no *direct* quality benefit from casting a BF16 release to a F32 bit file size, there *is however* a benefit of **less quality loss** \nwhen quantizing from an up-cast BF16 to F32 as the F32 has in a sense more \"scratchpad\" available mathematically and Quants derrived from an\nF32 inherently degrade less per-Quant compared to a direct F16 or BF16 conversion. (It's worth a Google if anyone is ever curious!)\n<br /><br />\n\n**What is the \"Custom Tokenizer Chat Template?\"** <br />\nAs apposed to the standard \"Chat Template\" made available by MistralAI - this release of GGUF converted and quantized files offer a totally \nunique and custom Tokenizer Chat Template in order to provide: Smoother, Faster, Efficient, and Sophisticated interaction/inference with the model.\nThis template strips away all of the \"fluff\" or genuinely unneeded logic from the JINJA Chat/Tokenizer Template - allowing anyone who uses the model\nfor inferencing can enjoy a significant improvement in speed, quality and context adherence without sacrificing any aspect of the initial release by MistralAI.\n\n**For reference - here is the new JINJA Tokenizer Chat-Template:** <br />\n*(This template features a sliding context window of EIGHT (8) interactions, which may be adjusted per-individual requirements simply by altering\nthe third line of this template from the number nine (9) to either higher or lower numerical values to increase or decrease the sliding context window)*\n```jinja\n{{- $remMessage := false }}\n{{- range $index, $_ := .Messages }}\n{{- if lt (len (slice $.Messages $index)) 9 }}\n{{- $remMessage = true }}\n{{- end }}\n{{- if eq .Role \"system\" }}[SYSTEM_PROMPT]{{ .Content }}[/SYSTEM_PROMPT]{{- end }}\n{{- if $remMessage }}\n{{- if eq .Role \"user\" }}\n[INST]{{ .Content }}[/INST]\n{{- else if eq .Role \"assistant\" }}\n{{- if .Content}}{{ .Content }}{{- end }}\n{{- end }}\n{{- end }}\n{{- end }}\n```\n\n**No modifications, edits, or configurations are required** to use this model with\nOllama, it works natively! **Both Vision and Text work with Ollama. (^.^)** <br />\n\n**Coming Soon!!!** <br />\nCheck back occasionally - as a automated installer/configure Python-3 script is making its way to all of my releases! \nThis allows anyone who is interested in using these models a hassle-free and stress-free experience where the Python-3\nscript takes care of setting up the model for Ollama (specifically for Ollama, other software optimizations coming later). \nIt is highly recommended to use the Ollama \"create\" command along with the supplied \".modelfile\" to ensure proper configuration\nfor anyone who wishes to get the most out of this particular release. Though, the Python-3 automated installer/configuration tool\nwill handle such aspects if it is chosen to be used.\n\nHappy Inferencing! <br />\n -- Jon Z (EnlistedGhost)\n\n----------------------------------------------\n\n### Model Updates (As of: February 27th, 2026)\n- Updated: All GGUF model file(s) with significantly altered/reworked GGUF conversion(s) and Quant file(s)<br />\n**Final Quantized and full-F16 modelfiles are being uploaded!!!** - Check back for I-Matrix quant files and other static\n  K-Quants if you do not see your desired edition (They are being uploaded, thank you for your patience!)\n\n----------------------------------------------\n\n## -------------------------------------------------------------<br /> - GGUF Conversion and Quantization Details: -<br />-------------------------------------------------------------\n\n**Software used to convert Safetensors to GGUF:**\n- <a href=\"https://github.com/ggml-org/llama.cpp/\">llama.cpp</a>\n\n**Software used to create Quantized GGUF Files:**\n- <a href=\"https://github.com/ggml-org/llama.cpp/\">llama.cpp</a> \n\n**Specific GitHub Commit Point:**\n- <a href=\"https://github.com/ggml-org/llama.cpp/commit/94b0200a01a753eff5897dab9311f51a7bc1c62f\">b8118</a>\n\n**Converted to GGUF and Quantized by:**\n- [EnlistedGhost](https://huggingface.co/EnlistedGhost)\n\n----------------------------------------------\n\n## --------------------------<br /> ---- Original Info ---- <br /> --------------------------\n*(Crossposted from the link in the above section: \"Model Details\"):*\n<br />\n<br />\n<br />\n<br />\n\n# Ministral 3 14B Instruct 2512 BF16\n\nThe largest model in the Ministral 3 family, **Ministral 3 14B** offers frontier capabilities and performance comparable to its larger [Mistral Small 3.2 24B](https://huggingface.co/mistralai/Mistral-Small-3.2-Instruct-2506) counterpart. A powerful and efficient language model with vision capabilities.\n\nThis model is the instruct post-trained version, fine-tuned for instruction tasks, making it ideal for chat and instruction based use cases.\n\nThe Ministral 3 family is designed for edge deployment, capable of running on a wide range of hardware. Ministral 3 14B can even be deployed locally, capable of fitting in 32GB of VRAM in BF16, and less than 24GB of RAM/VRAM when quantized.\n\nWe provide a no-loss FP8 version [here](https://huggingface.co/mistralai/Ministral-3-14B-Instruct-2512), you can find other formats and quantizations in the [Ministral 3 - Additional Checkpoints](https://huggingface.co/collections/mistralai/ministral-3-additional-checkpoints) collection.\n\n## Key Features\nMinistral 3 14B consists of two main architectural components:\n- **13.5B Language Model**\n- **0.4B Vision Encoder**\n\nThe Ministral 3 14B Instruct model offers the following capabilities:\n- **Vision**: Enables the model to analyze images and provide insights based on visual content, in addition to text.\n- **Multilingual**: Supports dozens of languages, including English, French, Spanish, German, Italian, Portuguese, Dutch, Chinese, Japanese, Korean, Arabic.\n- **System Prompt**: Maintains strong adherence and support for system prompts.\n- **Agentic**: Offers best-in-class agentic capabilities with native function calling and JSON outputting.\n- **Edge-Optimized**: Delivers best-in-class performance at a small scale, deployable anywhere.\n- **Apache 2.0 License**: Open-source license allowing usage and modification for both commercial and non-commercial purposes.\n- **Large Context Window**: Supports a 256k context window.\n\n### Use Cases\nPrivate AI deployments where advanced capabilities meet practical hardware constraints:\n- Private/custom chat and AI assistant deployments in constrained environments\n- Advanced local agentic use cases\n- Fine-tuning and specialization\n- And more...\n  \nBringing advanced AI capabilities to most environments.\n\n## Ministral 3 Family\n\n| Model Name                     | Type               | Precision | Link                                                                                     |\n|--------------------------------|--------------------|-----------|------------------------------------------------------------------------------------------|\n| Ministral 3 3B Base 2512       | Base pre-trained   | BF16      | [Hugging Face](https://huggingface.co/mistralai/Ministral-3-3B-Base-2512)                |\n| Ministral 3 3B Instruct 2512   | Instruct post-trained | BF16   | [Hugging Face](https://huggingface.co/mistralai/Ministral-3-3B-Instruct-2512)            |\n| Ministral 3 3B Reasoning 2512  | Reasoning capable  | BF16      | [Hugging Face](https://huggingface.co/mistralai/Ministral-3-3B-Reasoning-2512)           |\n| Ministral 3 8B Base 2512       | Base pre-trained   | BF16      | [Hugging Face](https://huggingface.co/mistralai/Ministral-3-8B-Base-2512)                |\n| Ministral 3 8B Instruct 2512   | Instruct post-trained | BF16    | [Hugging Face](https://huggingface.co/mistralai/Ministral-3-8B-Instruct-2512)            |\n| Ministral 3 8B Reasoning 2512  | Reasoning capable  | BF16      | [Hugging Face](https://huggingface.co/mistralai/Ministral-3-8B-Reasoning-2512)           |\n| Ministral 3 14B Base 2512      | Base pre-trained   | BF16      | [Hugging Face](https://huggingface.co/mistralai/Ministral-3-14B-Base-2512)               |\n| **Ministral 3 14B Instruct 2512**  | **Instruct post-trained** | **BF16**    | [Hugging Face](https://huggingface.co/mistralai/Ministral-3-14B-Instruct-2512)           |\n| Ministral 3 14B Reasoning 2512 | Reasoning capable  | BF16      | [Hugging Face](https://huggingface.co/mistralai/Ministral-3-14B-Reasoning-2512)          |\n\nOther formats available [here](https://huggingface.co/collections/mistralai/ministral-3-additional-checkpoints).\n\n## Benchmark Results\n\nWe compare Ministral 3 to similar sized models.\n\n### Reasoning\n\n| Model                     | AIME25      | AIME24      | GPQA Diamond | LiveCodeBench |\n|---------------------------|-------------|-------------|--------------|---------------|\n| **Ministral 3 14B**       | <u>0.850</u>| <u>0.898</u>| <u>0.712</u> | <u>0.646</u>  |\n| Qwen3-14B (Thinking)      | 0.737       | 0.837       | 0.663        | 0.593         |\n|                           |             |             |              |               |\n| **Ministral 3 8B**        | 0.787       | <u>0.860</u>| 0.668        | <u>0.616</u>  |\n| Qwen3-VL-8B-Thinking      | <u>0.798</u>| <u>0.860</u>| <u>0.671</u> | 0.580         |\n|                           |             |             |              |               |\n| **Ministral 3 3B**        | <u>0.721</u>| <u>0.775</u>| 0.534        | <u>0.548</u>  |\n| Qwen3-VL-4B-Thinking      | 0.697       | 0.729       | <u>0.601</u> | 0.513         |\n\n### Instruct\n\n| Model                     | Arena Hard  | WildBench  | MATH Maj@1  | MM MTBench       |\n|---------------------------|-------------|------------|-------------|------------------|\n| **Ministral 3 14B**       | <u>0.551</u>| <u>68.5</u>| <u>0.904</u>| <u>8.49</u>      |\n| Qwen3 14B (Non-Thinking)  | 0.427       | 65.1       | 0.870       | NOT MULTIMODAL   |\n| Gemma3-12B-Instruct       | 0.436       | 63.2       | 0.854       | 6.70             |\n|                           |             |            |             |                  |\n| **Ministral 3 8B**        | 0.509       | <u>66.8</u>| 0.876       | <u>8.08</u>      |\n| Qwen3-VL-8B-Instruct      | <u>0.528</u>| 66.3       | <u>0.946</u>| 8.00             |\n|                           |             |            |             |                  |\n| **Ministral 3 3B**        | 0.305       | <u>56.8</u>| 0.830       | 7.83             |\n| Qwen3-VL-4B-Instruct      | <u>0.438</u>| <u>56.8</u>| <u>0.900</u>| <u>8.01</u>      |\n| Qwen3-VL-2B-Instruct      | 0.163       | 42.2       | 0.786       | 6.36             |\n| Gemma3-4B-Instruct        | 0.318       | 49.1       | 0.759       | 5.23             |\n\n### Base\n\n| Model               | Multilingual MMLU | MATH CoT 2-Shot | AGIEval 5-shot | MMLU Redux 5-shot | MMLU 5-shot | TriviaQA 5-shot |\n|---------------------|-------------------|-----------------|----------------|-------------------|-------------|-----------------|\n| **Ministral 3 14B** | 0.742             | <u>0.676</u>    | 0.648          | 0.820             | 0.794       | 0.749           |\n| Qwen3 14B Base      | <u>0.754</u>      | 0.620           | <u>0.661</u>   | <u>0.837</u>      | <u>0.804</u>| 0.703           |\n| Gemma 3 12B Base    | 0.690             | 0.487           | 0.587          | 0.766             | 0.745       | <u>0.788</u>    |\n|                     |                   |                 |                |                   |             |                 |\n| **Ministral 3 8B**  | <u>0.706</u>      | <u>0.626</u>    | 0.591          | 0.793             | <u>0.761</u>| <u>0.681</u>    |\n| Qwen 3 8B Base      | 0.700             | 0.576           | <u>0.596</u>   | <u>0.794</u>      | 0.760       | 0.639           |\n|                     |                   |                 |                |                   |             |                 |\n| **Ministral 3 3B**  | 0.652             | <u>0.601</u>    | 0.511          | 0.735             | 0.707       | 0.592           |\n| Qwen 3 4B Base      | <u>0.677</u>      | 0.405           | <u>0.570</u>   | <u>0.759</u>      | <u>0.713</u>| 0.530           |\n| Gemma 3 4B Base     | 0.516             | 0.294           | 0.430          | 0.626             | 0.589       | <u>0.640</u>    |\n\n## License\n\nThis model is licensed under the [Apache 2.0 License](https://www.apache.org/licenses/LICENSE-2.0.txt).\n\n*You must not use this model in a manner that infringes, misappropriates, or otherwise violates any third party’s rights, including intellectual property rights.*",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "MistralAI",
    "Ministral",
    "Ministral-3",
    "Ollama",
    "Llama.cpp",
    "GGUF",
    "Image-Text-to-Text",
    "Conversational",
    "Quantize",
    "Multimodal",
    "Mistral3",
    "image-text-to-text",
    "conversational",
    "en",
    "fr",
    "es",
    "de",
    "it",
    "pt",
    "nl",
    "zh",
    "ja",
    "ko",
    "ar",
    "dataset:mistralai/MM-MT-Bench",
    "base_model:mistralai/Ministral-3-14B-Instruct-2512-BF16",
    "base_model:quantized:mistralai/Ministral-3-14B-Instruct-2512-BF16",
    "license:apache-2.0",
    "endpoints_compatible",
    "region:us"
  ],
  "likes": 0,
  "downloads": 596,
  "gated": false,
  "private": false,
  "last_modified": "2026-03-06T02:43:05.000Z",
  "created_at": "2025-12-27T16:03:10.000Z",
  "pipeline_tag": "image-text-to-text",
  "library_name": ""
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "6950033ea8d391e3fb926eb7",
  "id": "EnlistedGhost/Ministral-3-14B-Instruct-2512-GGUF",
  "modelId": "EnlistedGhost/Ministral-3-14B-Instruct-2512-GGUF",
  "sha": "e4ddc56e1b0c365975df459572255a7248b9c12f",
  "createdAt": "2025-12-27T16:03:10.000Z",
  "lastModified": "2026-03-06T02:43:05.000Z",
  "author": "EnlistedGhost",
  "downloads": 596,
  "likes": 0,
  "gated": false,
  "private": false,
  "pipeline_tag": "image-text-to-text",
  "library_name": "",
  "siblings_count": 26
}

enlistedghost/ministral-3-14b-instruct-2512-gguf overview

Repository Files & Downloads

Model Details Live

Metadata Inspector

More models in this shard