GraySoft
Projects Models About FAQ Contact Download guIDE →

inferenceillusionist/mixtral-8x7b-instruct-v0.1-limarp-zloss-imat-gguf Q6_K GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

inferenceillusionist/mixtral-8x7b-instruct-v0.1-limarp-zloss-imat-gguf overview

Quantized from fp32 with love. * Quantizations made possible using mixtral-8x7b.imatrix file from this repo (special thanks to ikawrakow). For a brief rundown of iMatrix quant performance please see this PR All quants are verified working prior to uploading to repo for your safety and convenience. Importance matrix quantizations are a work in progress, IQ3 and above is recommended for best results. Tip: Pick a size that can fit in your GPU while still allowing some room for context for best speed. You may need to pad this further depending on if you are running image gen or TTS as well. Original model card can be found here

ggufmergeiMatlicense:apache-2.0endpoints_compatibleregion:usconversational
inferenceillusionist/mixtral-8x7b-instruct-v0.1-limarp-zloss-imat-gguf visual
Downloads
81
Likes
1
Pipeline
Library
Visibility
Public
Access
Open

Repository Files & Downloads

18 files detected
Direct downloads for all repository files
FileTypeQuantizationSizeLink
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-IQ1_M.gguf GGUF IQ1_M 10.10 GB Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-IQ2_M.gguf GGUF IQ2_M 14.43 GB Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-IQ2_S.gguf GGUF IQ2_S 13.16 GB Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-IQ2_XS.gguf GGUF IQ2_XS 12.97 GB Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-IQ2_XXS.gguf GGUF IQ2_XXS 11.69 GB Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-IQ3_M.gguf GGUF IQ3_M 19.96 GB Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-IQ3_S.gguf GGUF IQ3_S 19.03 GB Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-IQ3_XS.gguf GGUF IQ3_XS 18.02 GB Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-IQ3_XXS.gguf GGUF IQ3_XXS 16.99 GB Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-IQ4_XS.gguf GGUF IQ4_XS 23.36 GB Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-Q2_K.gguf GGUF Q2_K 16.12 GB Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-Q3_K_M.gguf GGUF Q3_K_M 21.00 GB Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-Q4_K_M.gguf GGUF Q4_K_M 26.49 GB Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-Q4_K_S.gguf GGUF Q4_K_S 24.91 GB Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-Q5_K_M.gguf GGUF Q5_K_M 30.95 GB Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-Q5_K_S.gguf GGUF Q5_K_S 30.02 GB Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-Q6_K.gguf GGUF Q6_K 35.74 GB Download
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-Q8_0.gguf GGUF 46.22 GB Download

Model Details Live

Model Slug
inferenceillusionist/mixtral-8x7b-instruct-v0.1-limarp-zloss-imat-gguf
Author
InferenceIllusionist
Pipeline Task
Library
Created
2024-04-16
Last Modified
2024-04-17
Gated
No
Private
No
HF SHA
6df7236f8ad53a63ac3c2d5cc3384cb384c6f108
License
apache-2.0
Language
Unknown
Base Model
Unknown

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "license": "apache-2.0",
    "tags": [
      "merge",
      "gguf",
      "iMat"
    ],
    "frontmatter": {
      "license": "apache-2.0",
      "tags": [
        "merge",
        "gguf",
        "iMat"
      ]
    },
    "hero_image_url": "https://i.imgur.com/P68dXux.png",
    "summary": "Quantized from fp32 with love. * Quantizations made possible using mixtral-8x7b.imatrix file from this repo (special thanks to ikawrakow). For a brief rundown of iMatrix quant performance please see this PR All quants are verified working prior to uploading to repo for your safety and convenience.  Importance matrix quantizations are a work in progress, IQ3 and above is recommended for best results. Tip: Pick a size that can fit in your GPU while still allowing some room for context for best speed. You may need to pad this further depending on if you are running image gen or TTS as well. Original model card can be found here",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: apache-2.0\ntags:\n- merge\n- gguf\n- iMat\n---\n<img src=\"https://i.imgur.com/P68dXux.png\" width=\"400\"/>\n\n# Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-GGUF\n\nQuantized from fp32 with love.\n* Quantizations made possible using mixtral-8x7b.imatrix file from [this](https://huggingface.co/datasets/ikawrakow/imatrix-from-wiki-train) repo (special thanks to [ikawrakow](https://huggingface.co/ikawrakow)).\n\nFor a brief rundown of iMatrix quant performance please see this [PR](https://github.com/ggerganov/llama.cpp/pull/5747)\n\n<i>All quants are verified working prior to uploading to repo for your safety and convenience. </i>\n\nImportance matrix quantizations are a work in progress, IQ3 and above is recommended for best results. \n\n<b>Tip:</b> Pick a size that can fit in your GPU while still allowing some room for context for best speed. You may need to pad this further depending on if you are running image gen or TTS as well.\n\nOriginal model card can be found [here](https://huggingface.co/Doctor-Shotgun/Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss)",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "merge",
    "iMat",
    "license:apache-2.0",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 1,
  "downloads": 81,
  "gated": false,
  "private": false,
  "last_modified": "2024-04-17T18:33:12.000Z",
  "created_at": "2024-04-16T01:49:55.000Z",
  "pipeline_tag": "",
  "library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
  "_id": "661dd94385f70e208d91f33b",
  "id": "InferenceIllusionist/Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-GGUF",
  "modelId": "InferenceIllusionist/Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-iMat-GGUF",
  "sha": "6df7236f8ad53a63ac3c2d5cc3384cb384c6f108",
  "createdAt": "2024-04-16T01:49:55.000Z",
  "lastModified": "2024-04-17T18:33:12.000Z",
  "author": "InferenceIllusionist",
  "downloads": 81,
  "likes": 1,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 20
}