maziyarpanahi/calme-2.2-qwen2-72b-gguf Q5_K_M GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

maziyarpanahi/calme-2.2-qwen2-72b-gguf overview

The GGUF and quantized models here are based on MaziyarPanahi/calme-2.2-qwen2-72b model

ggufqwenqwen-2quantized2-bit3-bit4-bit5-bit6-bit8-bit16-bitGGUFtext-generationlicense:otherregion:usimatrixconversational

maziyarpanahi/calme-2.2-qwen2-72b-gguf visual

Downloads

244

Likes

Pipeline

text-generation

Library

—

Visibility

Public

Access

Open

Repository Files & Downloads

43 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
calme-2.2-qwen2-72b.IQ1_M.gguf	GGUF	IQ1_M	22.11 GB	Download
calme-2.2-qwen2-72b.IQ1_S.gguf	GGUF	IQ1_S	21.13 GB	Download
calme-2.2-qwen2-72b.IQ2_XS.gguf	GGUF	IQ2_XS	25.20 GB	Download
calme-2.2-qwen2-72b.IQ3_XS.gguf	GGUF	IQ3_XS	30.59 GB	Download
calme-2.2-qwen2-72b.IQ4_XS.gguf	GGUF	IQ4_XS	36.98 GB	Download
calme-2.2-qwen2-72b.Q2_K.gguf	GGUF	Q2_K	27.76 GB	Download
calme-2.2-qwen2-72b.Q3_K_L.gguf	GGUF	Q3_K_L	36.79 GB	Download
calme-2.2-qwen2-72b.Q3_K_M.gguf	GGUF	Q3_K_M	35.11 GB	Download
calme-2.2-qwen2-72b.Q3_K_S.gguf	GGUF	Q3_K_S	32.12 GB	Download
calme-2.2-qwen2-72b.Q4_K_M.gguf	GGUF	Q4_K_M	44.16 GB	Download
calme-2.2-qwen2-72b.Q4_K_S.gguf	GGUF	Q4_K_S	40.88 GB	Download
calme-2.2-qwen2-72b.Q5_K_M.gguf-00001-of-00008.gguf	GGUF	Q5_K_M	7.53 GB	Download
calme-2.2-qwen2-72b.Q5_K_M.gguf-00002-of-00008.gguf	GGUF	Q5_K_M	6.70 GB	Download
calme-2.2-qwen2-72b.Q5_K_M.gguf-00003-of-00008.gguf	GGUF	Q5_K_M	6.23 GB	Download
calme-2.2-qwen2-72b.Q5_K_M.gguf-00004-of-00008.gguf	GGUF	Q5_K_M	6.22 GB	Download
calme-2.2-qwen2-72b.Q5_K_M.gguf-00005-of-00008.gguf	GGUF	Q5_K_M	6.54 GB	Download
calme-2.2-qwen2-72b.Q5_K_M.gguf-00006-of-00008.gguf	GGUF	Q5_K_M	6.46 GB	Download
calme-2.2-qwen2-72b.Q5_K_M.gguf-00007-of-00008.gguf	GGUF	Q5_K_M	6.75 GB	Download
calme-2.2-qwen2-72b.Q5_K_M.gguf-00008-of-00008.gguf	GGUF	Q5_K_M	4.28 GB	Download
calme-2.2-qwen2-72b.Q6_K.gguf-00001-of-00008.gguf	GGUF	Q6_K	8.50 GB	Download
calme-2.2-qwen2-72b.Q6_K.gguf-00002-of-00008.gguf	GGUF	Q6_K	8.11 GB	Download
calme-2.2-qwen2-72b.Q6_K.gguf-00003-of-00008.gguf	GGUF	Q6_K	7.55 GB	Download
calme-2.2-qwen2-72b.Q6_K.gguf-00004-of-00008.gguf	GGUF	Q6_K	7.55 GB	Download
calme-2.2-qwen2-72b.Q6_K.gguf-00005-of-00008.gguf	GGUF	Q6_K	7.88 GB	Download
calme-2.2-qwen2-72b.Q6_K.gguf-00006-of-00008.gguf	GGUF	Q6_K	7.78 GB	Download
calme-2.2-qwen2-72b.Q6_K.gguf-00007-of-00008.gguf	GGUF	Q6_K	7.88 GB	Download
calme-2.2-qwen2-72b.Q6_K.gguf-00008-of-00008.gguf	GGUF	Q6_K	4.69 GB	Download
calme-2.2-qwen2-72b.Q8_0.gguf-00001-of-00008.gguf	GGUF	—	10.30 GB	Download
calme-2.2-qwen2-72b.Q8_0.gguf-00002-of-00008.gguf	GGUF	—	9.65 GB	Download
calme-2.2-qwen2-72b.Q8_0.gguf-00003-of-00008.gguf	GGUF	—	9.07 GB	Download
calme-2.2-qwen2-72b.Q8_0.gguf-00004-of-00008.gguf	GGUF	—	9.07 GB	Download
calme-2.2-qwen2-72b.Q8_0.gguf-00005-of-00008.gguf	GGUF	—	9.42 GB	Download
calme-2.2-qwen2-72b.Q8_0.gguf-00006-of-00008.gguf	GGUF	—	9.30 GB	Download
calme-2.2-qwen2-72b.Q8_0.gguf-00007-of-00008.gguf	GGUF	—	9.42 GB	Download
calme-2.2-qwen2-72b.Q8_0.gguf-00008-of-00008.gguf	GGUF	—	5.72 GB	Download
calme-2.2-qwen2-72b.fp16.gguf-00001-of-00008.gguf	GGUF	—	19.39 GB	Download
calme-2.2-qwen2-72b.fp16.gguf-00002-of-00008.gguf	GGUF	—	18.17 GB	Download
calme-2.2-qwen2-72b.fp16.gguf-00003-of-00008.gguf	GGUF	—	17.08 GB	Download
calme-2.2-qwen2-72b.fp16.gguf-00004-of-00008.gguf	GGUF	—	17.07 GB	Download
calme-2.2-qwen2-72b.fp16.gguf-00005-of-00008.gguf	GGUF	—	17.73 GB	Download
calme-2.2-qwen2-72b.fp16.gguf-00006-of-00008.gguf	GGUF	—	17.50 GB	Download
calme-2.2-qwen2-72b.fp16.gguf-00007-of-00008.gguf	GGUF	—	17.73 GB	Download
calme-2.2-qwen2-72b.fp16.gguf-00008-of-00008.gguf	GGUF	—	10.76 GB	Download

Model Details Live

Model Slug

maziyarpanahi/calme-2.2-qwen2-72b-gguf

Author

MaziyarPanahi

Pipeline Task

text-generation

Library

—

Created

2024-08-02

Last Modified

2024-08-06

Gated

Private

HF SHA

963dd0d5decaf252021b589569245ac4e053d4ad

License

other

Language

Unknown

Base Model

Unknown

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "pipeline_tag": "text-generation",
    "tags": [
      "qwen",
      "qwen-2",
      "quantized",
      "2-bit",
      "3-bit",
      "4-bit",
      "5-bit",
      "6-bit",
      "8-bit",
      "16-bit",
      "GGUF"
    ],
    "inference": false,
    "model_creator": "MaziyarPanahi",
    "model_name": "calme-2.2-qwen2-72b-GGUF",
    "quantized_by": "MaziyarPanahi",
    "license": "other",
    "license_name": "tongyi-qianwen",
    "license_link": "https://huggingface.co/Qwen/Qwen2-72B-Instruct/blob/main/LICENSE",
    "frontmatter": {
      "pipeline_tag": "text-generation",
      "tags": [
        "qwen",
        "qwen-2",
        "quantized",
        "2-bit",
        "3-bit",
        "4-bit",
        "5-bit",
        "6-bit",
        "8-bit",
        "16-bit",
        "GGUF"
      ],
      "inference": "false",
      "model_creator": "MaziyarPanahi",
      "model_name": "calme-2.2-qwen2-72b-GGUF",
      "quantized_by": "MaziyarPanahi",
      "license": "other",
      "license_name": "tongyi-qianwen",
      "license_link": "https://huggingface.co/Qwen/Qwen2-72B-Instruct/blob/main/LICENSE"
    },
    "hero_image_url": "",
    "summary": "The GGUF and quantized models here are based on MaziyarPanahi/calme-2.2-qwen2-72b model",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\npipeline_tag: text-generation\ntags:\n- qwen\n- qwen-2\n- quantized\n- 2-bit\n- 3-bit\n- 4-bit\n- 5-bit\n- 6-bit\n- 8-bit\n- 16-bit\n- GGUF\ninference: false\nmodel_creator: MaziyarPanahi\nmodel_name: calme-2.2-qwen2-72b-GGUF\nquantized_by: MaziyarPanahi\nlicense: other\nlicense_name: tongyi-qianwen\nlicense_link: https://huggingface.co/Qwen/Qwen2-72B-Instruct/blob/main/LICENSE\n---\n\n\n# MaziyarPanahi/calme-2.2-qwen2-72b-GGUF\n\nThe GGUF and quantized models here are based on [MaziyarPanahi/calme-2.2-qwen2-72b](https://huggingface.co/MaziyarPanahi/calme-2.2-qwen2-72b) model\n\n## How to download\nYou can download only the quants you need instead of cloning the entire repository as follows:\n\n```\nhuggingface-cli download MaziyarPanahi/calme-2.2-qwen2-72b-GGUF --local-dir . --include '*Q2_K*gguf'\n```\n\n## Load GGUF models\n\n\n```sh\n./llama.cpp/main -m mode_name.Q2_K.gguf -p \"<|im_start|>user\\nJust say 1, 2, 3 hi and NOTHING else\\n<|im_end|>\\n<|im_start|>assistant\\n\" -n 1024\n```\n\n\n\n\n## Original README\n\n---\n\n# MaziyarPanahi/calme-2.2-qwen2-72b\n\nThis is a fine-tuned version of the `Qwen/Qwen2-72B-Instruct` model. It aims to improve the base model across all benchmarks.\n\n# ⚡ Quantized GGUF\n\nAll GGUF models are available here: [MaziyarPanahi/calme-2.2-qwen2-72b-GGUF](https://huggingface.co/MaziyarPanahi/calme-2.2-qwen2-72b-GGUF)\n\n# 🏆 [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)\n\n\n\n|    Tasks     |Version|Filter|n-shot|Metric|Value |   |Stderr|\n|--------------|------:|------|-----:|------|-----:|---|-----:|\n|truthfulqa_mc2|      2|none  |     0|acc   |0.6761|±  |0.0148|\n\n|  Tasks   |Version|Filter|n-shot|Metric|Value |   |Stderr|\n|----------|------:|------|-----:|------|-----:|---|-----:|\n|winogrande|      1|none  |     5|acc   |0.8248|±  |0.0107|\n\n|    Tasks    |Version|Filter|n-shot| Metric |Value |   |Stderr|\n|-------------|------:|------|-----:|--------|-----:|---|-----:|\n|arc_challenge|      1|none  |    25|acc     |0.6852|±  |0.0136|\n|             |       |none  |    25|acc_norm|0.7184|±  |0.0131|\n\n|Tasks|Version|     Filter     |n-shot|  Metric   |Value |   |Stderr|\n|-----|------:|----------------|-----:|-----------|-----:|---|-----:|\n|gsm8k|      3|strict-match    |     5|exact_match|0.8582|±  |0.0096|\n|     |       |flexible-extract|     5|exact_match|0.8893|±  |0.0086|\n\n# Prompt Template\n\nThis model uses `ChatML` prompt template:\n\n```\n<|im_start|>system\n{System}\n<|im_end|>\n<|im_start|>user\n{User}\n<|im_end|>\n<|im_start|>assistant\n{Assistant}\n````\n\n# How to use\n\n\n```python\n\n# Use a pipeline as a high-level helper\n\nfrom transformers import pipeline\n\nmessages = [\n    {\"role\": \"user\", \"content\": \"Who are you?\"},\n]\npipe = pipeline(\"text-generation\", model=\"MaziyarPanahi/calme-2.2-qwen2-72b\")\npipe(messages)\n\n\n# Load model directly\n\nfrom transformers import AutoTokenizer, AutoModelForCausalLM\n\ntokenizer = AutoTokenizer.from_pretrained(\"MaziyarPanahi/calme-2.2-qwen2-72b\")\nmodel = AutoModelForCausalLM.from_pretrained(\"MaziyarPanahi/calme-2.2-qwen2-72b\")\n```\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "qwen",
    "qwen-2",
    "quantized",
    "2-bit",
    "3-bit",
    "4-bit",
    "5-bit",
    "6-bit",
    "8-bit",
    "16-bit",
    "GGUF",
    "text-generation",
    "license:other",
    "region:us",
    "imatrix",
    "conversational"
  ],
  "likes": 2,
  "downloads": 244,
  "gated": false,
  "private": false,
  "last_modified": "2024-08-06T11:06:38.000Z",
  "created_at": "2024-08-02T03:55:41.000Z",
  "pipeline_tag": "text-generation",
  "library_name": ""
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "66ac58bd9c108b7a2adf2d07",
  "id": "MaziyarPanahi/calme-2.2-qwen2-72b-GGUF",
  "modelId": "MaziyarPanahi/calme-2.2-qwen2-72b-GGUF",
  "sha": "963dd0d5decaf252021b589569245ac4e053d4ad",
  "createdAt": "2024-08-02T03:55:41.000Z",
  "lastModified": "2024-08-06T11:06:38.000Z",
  "author": "MaziyarPanahi",
  "downloads": 244,
  "likes": 2,
  "gated": false,
  "private": false,
  "pipeline_tag": "text-generation",
  "library_name": "",
  "siblings_count": 45
}

maziyarpanahi/calme-2.2-qwen2-72b-gguf overview

Repository Files & Downloads

Model Details Live

Metadata Inspector

More models in this shard