GraySoft
Projects Models About FAQ Contact Download guIDE →
Model Intelligence Sheet

quant-cartel/magnum-72b-v1-imat-gguf overview

ggufchatqwenopuslicense:otherendpoints_compatibleregion:usimatrixconversational
quant-cartel/magnum-72b-v1-imat-gguf visual
Downloads
116
Likes
3
Pipeline
Library
Visibility
Public
Access
Open

Repository Files & Downloads

22 files detected
Direct downloads for all repository files
FileTypeQuantizationSizeLink
magnum-72b-v1-iMat-IQ1_M.gguf GGUF IQ1_M 22.11 GB Download
magnum-72b-v1-iMat-IQ1_S.gguf GGUF IQ1_S 21.13 GB Download
magnum-72b-v1-iMat-IQ2_S.gguf GGUF IQ2_S 26.02 GB Download
magnum-72b-v1-iMat-IQ2_XS.gguf GGUF IQ2_XS 25.20 GB Download
magnum-72b-v1-iMat-IQ2_XXS.gguf GGUF IQ2_XXS 23.74 GB Download
magnum-72b-v1-iMat-IQ3_M.gguf GGUF IQ3_M 33.07 GB Download
magnum-72b-v1-iMat-IQ3_S.gguf GGUF IQ3_S 32.12 GB Download
magnum-72b-v1-iMat-IQ3_XS.gguf GGUF IQ3_XS 30.59 GB Download
magnum-72b-v1-iMat-IQ3_XXS.gguf GGUF IQ3_XXS 29.66 GB Download
magnum-72b-v1-iMat-IQ4_XS.gguf GGUF IQ4_XS 36.98 GB Download
magnum-72b-v1-iMat-Q2_K.gguf GGUF Q2_K 27.76 GB Download
magnum-72b-v1-iMat-Q3_K_M.gguf GGUF Q3_K_M 35.11 GB Download
magnum-72b-v1-iMat-Q4_K_M.gguf GGUF Q4_K_M 44.16 GB Download
magnum-72b-v1-iMat-Q4_K_S.gguf GGUF Q4_K_S 40.88 GB Download
magnum-72b-v1-iMat-Q5_K_M-00001-of-00002.gguf GGUF Q5_K_M 41.90 GB Download
magnum-72b-v1-iMat-Q5_K_M-00002-of-00002.gguf GGUF Q5_K_M 8.81 GB Download
magnum-72b-v1-iMat-Q5_K_S-00001-of-00002.gguf GGUF Q5_K_S 41.88 GB Download
magnum-72b-v1-iMat-Q5_K_S-00002-of-00002.gguf GGUF Q5_K_S 5.97 GB Download
magnum-72b-v1-iMat-Q6_K-00001-of-00002.gguf GGUF Q6_K 41.69 GB Download
magnum-72b-v1-iMat-Q6_K-00002-of-00002.gguf GGUF Q6_K 18.24 GB Download
magnum-72b-v1-iMat-Q8_0-00001-of-00002.gguf GGUF 41.91 GB Download
magnum-72b-v1-iMat-Q8_0-00002-of-00002.gguf GGUF 30.04 GB Download

Model Details Live

Model Slug
quant-cartel/magnum-72b-v1-imat-gguf
Author
Quant-Cartel
Pipeline Task
Library
Created
2024-06-18
Last Modified
2024-06-19
Gated
No
Private
No
HF SHA
688f25d3273f391948542003841be5996c062c23
License
other
Language
Unknown
Base Model
Unknown

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "license": "other",
    "license_name": "tongyi-qianwen",
    "license_link": "LICENSE",
    "tags": [
      "chat",
      "qwen",
      "opus"
    ],
    "frontmatter": {
      "license": "other",
      "license_name": "tongyi-qianwen",
      "license_link": "LICENSE",
      "tags": [
        "chat",
        "qwen",
        "opus"
      ]
    },
    "hero_image_url": "",
    "summary": "`` e88 88e                               d8 d888 888b  8888 8888  ,\"Y88b 888 8e   d88 C8888 8888D 8888 8888 \"8\" 888 888 88b d88888 Y888 888P  Y888 888P ,ee 888 888 888  888 \"88 88\"    \"88 88\"  \"88 888 888 888  888 b 8b, e88'Y88                  d8           888 d888  'Y  ,\"Y88b 888,8,  d88    ,e e,  888 C8888     \"8\" 888 888 \"  d88888 d88 88b 888 Y888  ,d ,ee 888 888     888   888   , 888 \"88,d88 \"88 888 888     888    \"YeeP\" 888 PROUDLY PRESENTS ``",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: other\nlicense_name: tongyi-qianwen\nlicense_link: LICENSE\ntags:\n- chat\n- qwen\n- opus\n---\n\n```\n  e88 88e                               d8     \n d888 888b  8888 8888  ,\"Y88b 888 8e   d88     \nC8888 8888D 8888 8888 \"8\" 888 888 88b d88888   \n Y888 888P  Y888 888P ,ee 888 888 888  888     \n  \"88 88\"    \"88 88\"  \"88 888 888 888  888     \n      b                                        \n      8b,                                      \n \n  e88'Y88                  d8           888    \n d888  'Y  ,\"Y88b 888,8,  d88    ,e e,  888    \nC8888     \"8\" 888 888 \"  d88888 d88 88b 888    \n Y888  ,d ,ee 888 888     888   888   , 888    \n  \"88,d88 \"88 888 888     888    \"YeeP\" 888    \n                                               \nPROUDLY PRESENTS         \n```\n\n## magnum-72b-v1-iMat-GGUF\n\n\n\nQuantized from fp16 with love.\n* Weighted quantizations were created using fp16 GGUF and [groups_merged.txt](https://github.com/ggerganov/llama.cpp/discussions/5263#discussioncomment-8395384) in 92 chunks and n_ctx=512\n\nFor a brief rundown of iMatrix quant performance please see this [PR](https://github.com/ggerganov/llama.cpp/pull/5747)\n\n<b>All quants are verified working prior to uploading to repo for your safety and convenience. </b>\n\nOriginal model card [here](https://huggingface.co/alpindale/magnum-72b-v1)",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "chat",
    "qwen",
    "opus",
    "license:other",
    "endpoints_compatible",
    "region:us",
    "imatrix",
    "conversational"
  ],
  "likes": 3,
  "downloads": 116,
  "gated": false,
  "private": false,
  "last_modified": "2024-06-19T05:18:52.000Z",
  "created_at": "2024-06-18T05:36:58.000Z",
  "pipeline_tag": "",
  "library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
  "_id": "66711cfaabdd1ea72bee75e4",
  "id": "Quant-Cartel/magnum-72b-v1-iMat-GGUF",
  "modelId": "Quant-Cartel/magnum-72b-v1-iMat-GGUF",
  "sha": "688f25d3273f391948542003841be5996c062c23",
  "createdAt": "2024-06-18T05:36:58.000Z",
  "lastModified": "2024-06-19T05:18:52.000Z",
  "author": "Quant-Cartel",
  "downloads": 116,
  "likes": 3,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 25
}