inferenceillusionist/mixtral-8x7b-instruct-v0.1-limarp-zloss-imat-gguf (IQ3_XS GGUF) is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. Use this page to pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

inferenceillusionist/mixtral-8x7b-instruct-v0.1-limarp-zloss-imat-gguf overview

Quantized from fp32 with love. Quantizations were made using the mixtral-8x7b.imatrix file from the ikawrakow/imatrix-from-wiki-train dataset repo (https://huggingface.co/datasets/ikawrakow/imatrix-from-wiki-train); special thanks to ikawrakow. For a brief rundown of iMatrix quant performance, see llama.cpp PR #5747 (https://github.com/ggerganov/llama.cpp/pull/5747). All quants are verified working before being uploaded to the repo. Importance matrix quantizations are a work in progress; IQ3 and above is recommended for best results. Tip: for best speed, pick a size that fits in your GPU while still leaving some room for context. Leave extra headroom if you are also running image generation or TTS. The original model card is at https://huggingface.co/Doctor-Shotgun/Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss
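The sizing tip above can be sketched as a small helper. This is a minimal illustration, not a tool from the repo: the function name `pick_quant` and the default 2 GB headroom are assumptions, and file size is only a rough proxy for the memory a runtime will actually use. The sizes are copied from the file table on this page.

```python
# File sizes in GB, copied from the "Repository Files & Downloads" table.
QUANT_SIZES_GB = {
    "IQ1_M": 10.10, "IQ2_XXS": 11.69, "IQ2_XS": 12.97, "IQ2_S": 13.16,
    "IQ2_M": 14.43, "Q2_K": 16.12, "IQ3_XXS": 16.99, "IQ3_XS": 18.02,
    "IQ3_S": 19.03, "IQ3_M": 19.96, "Q3_K_M": 21.00, "IQ4_XS": 23.36,
    "Q4_K_S": 24.91, "Q4_K_M": 26.49, "Q5_K_S": 30.02, "Q5_K_M": 30.95,
    "Q6_K": 35.74, "Q8_0": 46.22,
}


def pick_quant(vram_gb, headroom_gb=2.0, sizes=QUANT_SIZES_GB):
    """Return the largest quant whose file fits in vram_gb minus headroom_gb.

    headroom_gb is a hypothetical allowance for context and other GPU
    workloads (image generation, TTS); tune it for your setup.
    Returns None when nothing fits.
    """
    budget = vram_gb - headroom_gb
    fitting = {name: size for name, size in sizes.items() if size <= budget}
    if not fitting:
        return None
    # Larger file means more bits per weight, so prefer the biggest that fits.
    return max(fitting, key=fitting.get)


if __name__ == "__main__":
    for vram in (12, 16, 24, 48):
        print(vram, "GB ->", pick_quant(vram))
```

Raising `headroom_gb` models the extra padding the tip recommends when other GPU workloads run alongside the model.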

Tags: gguf, merge, iMat, license:apache-2.0, endpoints_compatible, region:us, conversational

Downloads

81

Likes

1

Pipeline

—

Library

—

Visibility

Public

Access

Open

Repository Files & Downloads

18 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-IQ1_M.gguf	GGUF	IQ1_M	10.10 GB	Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-IQ2_M.gguf	GGUF	IQ2_M	14.43 GB	Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-IQ2_S.gguf	GGUF	IQ2_S	13.16 GB	Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-IQ2_XS.gguf	GGUF	IQ2_XS	12.97 GB	Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-IQ2_XXS.gguf	GGUF	IQ2_XXS	11.69 GB	Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-IQ3_M.gguf	GGUF	IQ3_M	19.96 GB	Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-IQ3_S.gguf	GGUF	IQ3_S	19.03 GB	Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-IQ3_XS.gguf	GGUF	IQ3_XS	18.02 GB	Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-IQ3_XXS.gguf	GGUF	IQ3_XXS	16.99 GB	Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-IQ4_XS.gguf	GGUF	IQ4_XS	23.36 GB	Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-Q2_K.gguf	GGUF	Q2_K	16.12 GB	Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-Q3_K_M.gguf	GGUF	Q3_K_M	21.00 GB	Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-Q4_K_M.gguf	GGUF	Q4_K_M	26.49 GB	Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-Q4_K_S.gguf	GGUF	Q4_K_S	24.91 GB	Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-Q5_K_M.gguf	GGUF	Q5_K_M	30.95 GB	Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-Q5_K_S.gguf	GGUF	Q5_K_S	30.02 GB	Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-Q6_K.gguf	GGUF	Q6_K	35.74 GB	Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-Q8_0.gguf	GGUF	Q8_0	46.22 GB	Download
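Every file in the table follows the same naming pattern, so a direct link can be built from the quantization label alone. The sketch below assumes the standard Hugging Face `resolve` URL layout and the `main` branch; the helper names `gguf_filename` and `download_url` are hypothetical, and the repository id is taken from the Hugging Face API excerpt on this page.

```python
from urllib.parse import quote

# Repository id as reported in the Hugging Face API excerpt on this page.
REPO_ID = "InferenceIllusionist/Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-GGUF"
# File name stem shared by every row of the table above.
FILE_STEM = "Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat"


def gguf_filename(quant: str) -> str:
    """Build the file name for a quantization label such as 'IQ3_XS'."""
    return f"{FILE_STEM}-{quant}.gguf"


def download_url(quant: str, revision: str = "main") -> str:
    """Build a direct download URL.

    Assumption: files are served from the usual
    https://huggingface.co/<repo>/resolve/<revision>/<file> path.
    """
    name = quote(gguf_filename(quant))
    return f"https://huggingface.co/{REPO_ID}/resolve/{quote(revision)}/{name}"


if __name__ == "__main__":
    print(download_url("IQ3_XS"))
```

The code only builds strings and makes no network request, so a label that is not in the table still produces a URL; check the label against the table before downloading.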

Model Details

Model Slug

inferenceillusionist/mixtral-8x7b-instruct-v0.1-limarp-zloss-imat-gguf

Author

InferenceIllusionist

Pipeline Task

—

Library

—

Created

2024-04-16

Last Modified

2024-04-17

Gated

No

Private

No

HF SHA

6df7236f8ad53a63ac3c2d5cc3384cb384c6f108

License

apache-2.0

Language

Unknown

Base Model

Unknown

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "license": "apache-2.0",
    "tags": [
      "merge",
      "gguf",
      "iMat"
    ],
    "frontmatter": {
      "license": "apache-2.0",
      "tags": [
        "merge",
        "gguf",
        "iMat"
      ]
    },
    "hero_image_url": "https://i.imgur.com/P68dXux.png",
    "summary": "Quantized from fp32 with love. * Quantizations made possible using mixtral-8x7b.imatrix file from this repo (special thanks to ikawrakow). For a brief rundown of iMatrix quant performance please see this PR All quants are verified working prior to uploading to repo for your safety and convenience.  Importance matrix quantizations are a work in progress, IQ3 and above is recommended for best results. Tip: Pick a size that can fit in your GPU while still allowing some room for context for best speed. You may need to pad this further depending on if you are running image gen or TTS as well. Original model card can be found here",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: apache-2.0\ntags:\n- merge\n- gguf\n- iMat\n---\n<img src=\"https://i.imgur.com/P68dXux.png\" width=\"400\"/>\n\n# Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-GGUF\n\nQuantized from fp32 with love.\n* Quantizations made possible using mixtral-8x7b.imatrix file from [this](https://huggingface.co/datasets/ikawrakow/imatrix-from-wiki-train) repo (special thanks to [ikawrakow](https://huggingface.co/ikawrakow)).\n\nFor a brief rundown of iMatrix quant performance please see this [PR](https://github.com/ggerganov/llama.cpp/pull/5747)\n\n<i>All quants are verified working prior to uploading to repo for your safety and convenience. </i>\n\nImportance matrix quantizations are a work in progress, IQ3 and above is recommended for best results. \n\n<b>Tip:</b> Pick a size that can fit in your GPU while still allowing some room for context for best speed. You may need to pad this further depending on if you are running image gen or TTS as well.\n\nOriginal model card can be found [here](https://huggingface.co/Doctor-Shotgun/Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss)",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "merge",
    "iMat",
    "license:apache-2.0",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 1,
  "downloads": 81,
  "gated": false,
  "private": false,
  "last_modified": "2024-04-17T18:33:12.000Z",
  "created_at": "2024-04-16T01:49:55.000Z",
  "pipeline_tag": "",
  "library_name": ""
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "661dd94385f70e208d91f33b",
  "id": "InferenceIllusionist/Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-GGUF",
  "modelId": "InferenceIllusionist/Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-GGUF",
  "sha": "6df7236f8ad53a63ac3c2d5cc3384cb384c6f108",
  "createdAt": "2024-04-16T01:49:55.000Z",
  "lastModified": "2024-04-17T18:33:12.000Z",
  "author": "InferenceIllusionist",
  "downloads": 81,
  "likes": 1,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 20
}

More models in this shard