Model Intelligence Sheet

quant-cartel/magnum-72b-v1-imat-gguf overview

ggufchatqwenopuslicense:otherendpoints_compatibleregion:usimatrixconversational

quant-cartel/magnum-72b-v1-imat-gguf visual

Downloads

116

Likes

Pipeline

—

Library

—

Visibility

Public

Access

Open

Repository Files & Downloads

22 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
magnum-72b-v1-iMat-IQ1_M.gguf	GGUF	IQ1_M	22.11 GB	Download
magnum-72b-v1-iMat-IQ1_S.gguf	GGUF	IQ1_S	21.13 GB	Download
magnum-72b-v1-iMat-IQ2_S.gguf	GGUF	IQ2_S	26.02 GB	Download
magnum-72b-v1-iMat-IQ2_XS.gguf	GGUF	IQ2_XS	25.20 GB	Download
magnum-72b-v1-iMat-IQ2_XXS.gguf	GGUF	IQ2_XXS	23.74 GB	Download
magnum-72b-v1-iMat-IQ3_M.gguf	GGUF	IQ3_M	33.07 GB	Download
magnum-72b-v1-iMat-IQ3_S.gguf	GGUF	IQ3_S	32.12 GB	Download
magnum-72b-v1-iMat-IQ3_XS.gguf	GGUF	IQ3_XS	30.59 GB	Download
magnum-72b-v1-iMat-IQ3_XXS.gguf	GGUF	IQ3_XXS	29.66 GB	Download
magnum-72b-v1-iMat-IQ4_XS.gguf	GGUF	IQ4_XS	36.98 GB	Download
magnum-72b-v1-iMat-Q2_K.gguf	GGUF	Q2_K	27.76 GB	Download
magnum-72b-v1-iMat-Q3_K_M.gguf	GGUF	Q3_K_M	35.11 GB	Download
magnum-72b-v1-iMat-Q4_K_M.gguf	GGUF	Q4_K_M	44.16 GB	Download
magnum-72b-v1-iMat-Q4_K_S.gguf	GGUF	Q4_K_S	40.88 GB	Download
magnum-72b-v1-iMat-Q5_K_M-00001-of-00002.gguf	GGUF	Q5_K_M	41.90 GB	Download
magnum-72b-v1-iMat-Q5_K_M-00002-of-00002.gguf	GGUF	Q5_K_M	8.81 GB	Download
magnum-72b-v1-iMat-Q5_K_S-00001-of-00002.gguf	GGUF	Q5_K_S	41.88 GB	Download
magnum-72b-v1-iMat-Q5_K_S-00002-of-00002.gguf	GGUF	Q5_K_S	5.97 GB	Download
magnum-72b-v1-iMat-Q6_K-00001-of-00002.gguf	GGUF	Q6_K	41.69 GB	Download
magnum-72b-v1-iMat-Q6_K-00002-of-00002.gguf	GGUF	Q6_K	18.24 GB	Download
magnum-72b-v1-iMat-Q8_0-00001-of-00002.gguf	GGUF	—	41.91 GB	Download
magnum-72b-v1-iMat-Q8_0-00002-of-00002.gguf	GGUF	—	30.04 GB	Download

Model Details Live

Model Slug

quant-cartel/magnum-72b-v1-imat-gguf

Author

Quant-Cartel

Pipeline Task

—

Library

—

Created

2024-06-18

Last Modified

2024-06-19

Gated

Private

HF SHA

688f25d3273f391948542003841be5996c062c23

License

other

Language

Unknown

Base Model

Unknown

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "license": "other",
    "license_name": "tongyi-qianwen",
    "license_link": "LICENSE",
    "tags": [
      "chat",
      "qwen",
      "opus"
    ],
    "frontmatter": {
      "license": "other",
      "license_name": "tongyi-qianwen",
      "license_link": "LICENSE",
      "tags": [
        "chat",
        "qwen",
        "opus"
      ]
    },
    "hero_image_url": "",
    "summary": "`` e88 88e                               d8 d888 888b  8888 8888  ,\"Y88b 888 8e   d88 C8888 8888D 8888 8888 \"8\" 888 888 88b d88888 Y888 888P  Y888 888P ,ee 888 888 888  888 \"88 88\"    \"88 88\"  \"88 888 888 888  888 b 8b, e88'Y88                  d8           888 d888  'Y  ,\"Y88b 888,8,  d88    ,e e,  888 C8888     \"8\" 888 888 \"  d88888 d88 88b 888 Y888  ,d ,ee 888 888     888   888   , 888 \"88,d88 \"88 888 888     888    \"YeeP\" 888 PROUDLY PRESENTS ``",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: other\nlicense_name: tongyi-qianwen\nlicense_link: LICENSE\ntags:\n- chat\n- qwen\n- opus\n---\n\n```\n  e88 88e                               d8     \n d888 888b  8888 8888  ,\"Y88b 888 8e   d88     \nC8888 8888D 8888 8888 \"8\" 888 888 88b d88888   \n Y888 888P  Y888 888P ,ee 888 888 888  888     \n  \"88 88\"    \"88 88\"  \"88 888 888 888  888     \n      b                                        \n      8b,                                      \n \n  e88'Y88                  d8           888    \n d888  'Y  ,\"Y88b 888,8,  d88    ,e e,  888    \nC8888     \"8\" 888 888 \"  d88888 d88 88b 888    \n Y888  ,d ,ee 888 888     888   888   , 888    \n  \"88,d88 \"88 888 888     888    \"YeeP\" 888    \n                                               \nPROUDLY PRESENTS         \n```\n\n## magnum-72b-v1-iMat-GGUF\n\n\n\nQuantized from fp16 with love.\n* Weighted quantizations were created using fp16 GGUF and [groups_merged.txt](https://github.com/ggerganov/llama.cpp/discussions/5263#discussioncomment-8395384) in 92 chunks and n_ctx=512\n\nFor a brief rundown of iMatrix quant performance please see this [PR](https://github.com/ggerganov/llama.cpp/pull/5747)\n\n<b>All quants are verified working prior to uploading to repo for your safety and convenience. </b>\n\nOriginal model card [here](https://huggingface.co/alpindale/magnum-72b-v1)",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "chat",
    "qwen",
    "opus",
    "license:other",
    "endpoints_compatible",
    "region:us",
    "imatrix",
    "conversational"
  ],
  "likes": 3,
  "downloads": 116,
  "gated": false,
  "private": false,
  "last_modified": "2024-06-19T05:18:52.000Z",
  "created_at": "2024-06-18T05:36:58.000Z",
  "pipeline_tag": "",
  "library_name": ""
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "66711cfaabdd1ea72bee75e4",
  "id": "Quant-Cartel/magnum-72b-v1-iMat-GGUF",
  "modelId": "Quant-Cartel/magnum-72b-v1-iMat-GGUF",
  "sha": "688f25d3273f391948542003841be5996c062c23",
  "createdAt": "2024-06-18T05:36:58.000Z",
  "lastModified": "2024-06-19T05:18:52.000Z",
  "author": "Quant-Cartel",
  "downloads": 116,
  "likes": 3,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 25
}