Model Intelligence Sheet

richarderkhov/maziyarpanahi_-_calme-2.1-qwen2-72b-gguf overview

This model is a fine-tuned version of the powerful Qwen/Qwen2-72B-Instruct, pushing the boundaries of natural language understanding and generation even further. My goal was to create a versatile and robust model that excels across a wide range of benchmarks and real-world applications.

ggufendpoints_compatibleregion:usconversational

richarderkhov/maziyarpanahi_-_calme-2.1-qwen2-72b-gguf visual

Downloads

Likes

Pipeline

—

Library

—

Visibility

Public

Access

Open

Repository Files & Downloads

35 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
calme-2.1-qwen2-72b.IQ3_M.gguf	GGUF	IQ3_M	33.06 GB	Download
calme-2.1-qwen2-72b.IQ3_S.gguf	GGUF	IQ3_S	32.12 GB	Download
calme-2.1-qwen2-72b.IQ3_XS.gguf	GGUF	IQ3_XS	30.58 GB	Download
calme-2.1-qwen2-72b.Q2_K.gguf	GGUF	Q2_K	27.76 GB	Download
calme-2.1-qwen2-72b.Q3_K.gguf	GGUF	Q3_K	35.11 GB	Download
calme-2.1-qwen2-72b.Q3_K_L.gguf	GGUF	Q3_K_L	36.79 GB	Download
calme-2.1-qwen2-72b.Q3_K_M.gguf	GGUF	Q3_K_M	35.11 GB	Download
calme-2.1-qwen2-72b.Q3_K_S.gguf	GGUF	Q3_K_S	Unknown	Download
calme-2.1-qwen2-72b_IQ4_NL-00001-of-00002.gguf	GGUF	IQ4_NL	27.32 GB	Download
calme-2.1-qwen2-72b_IQ4_XS-00001-of-00002.gguf	GGUF	IQ4_XS	36.45 GB	Download
calme-2.1-qwen2-72b_IQ4_XS-00002-of-00002.gguf	GGUF	IQ4_XS	971.85 MB	Download
calme-2.1-qwen2-72b_Q4_0-00001-of-00002.gguf	GGUF	—	37.24 GB	Download
calme-2.1-qwen2-72b_Q4_0-00002-of-00002.gguf	GGUF	—	1.16 GB	Download
calme-2.1-qwen2-72b_Q4_1-00001-of-00002.gguf	GGUF	—	37.15 GB	Download
calme-2.1-qwen2-72b_Q4_1-00002-of-00002.gguf	GGUF	—	5.41 GB	Download
calme-2.1-qwen2-72b_Q4_K-00001-of-00002.gguf	GGUF	Q4_K	37.25 GB	Download
calme-2.1-qwen2-72b_Q4_K-00002-of-00002.gguf	GGUF	Q4_K	6.90 GB	Download
calme-2.1-qwen2-72b_Q4_K_M-00001-of-00002.gguf	GGUF	Q4_K_M	37.25 GB	Download
calme-2.1-qwen2-72b_Q4_K_M-00002-of-00002.gguf	GGUF	Q4_K_M	6.90 GB	Download
calme-2.1-qwen2-72b_Q4_K_S-00001-of-00002.gguf	GGUF	Q4_K_S	37.14 GB	Download
calme-2.1-qwen2-72b_Q4_K_S-00002-of-00002.gguf	GGUF	Q4_K_S	3.73 GB	Download
calme-2.1-qwen2-72b_Q5_0-00001-of-00002.gguf	GGUF	—	37.18 GB	Download
calme-2.1-qwen2-72b_Q5_0-00002-of-00002.gguf	GGUF	—	9.53 GB	Download
calme-2.1-qwen2-72b_Q5_1-00001-of-00002.gguf	GGUF	—	37.22 GB	Download
calme-2.1-qwen2-72b_Q5_1-00002-of-00002.gguf	GGUF	—	13.66 GB	Download
calme-2.1-qwen2-72b_Q5_K-00001-of-00002.gguf	GGUF	Q5_K	37.23 GB	Download
calme-2.1-qwen2-72b_Q5_K-00002-of-00002.gguf	GGUF	Q5_K	13.47 GB	Download
calme-2.1-qwen2-72b_Q5_K_M-00001-of-00002.gguf	GGUF	Q5_K_M	37.23 GB	Download
calme-2.1-qwen2-72b_Q5_K_M-00002-of-00002.gguf	GGUF	Q5_K_M	13.47 GB	Download
calme-2.1-qwen2-72b_Q5_K_S-00001-of-00002.gguf	GGUF	Q5_K_S	37.10 GB	Download
calme-2.1-qwen2-72b_Q5_K_S-00002-of-00002.gguf	GGUF	Q5_K_S	10.74 GB	Download
calme-2.1-qwen2-72b_Q6_K-00001-of-00002.gguf	GGUF	Q6_K	37.22 GB	Download
calme-2.1-qwen2-72b_Q6_K-00002-of-00002.gguf	GGUF	Q6_K	22.71 GB	Download
calme-2.1-qwen2-72b_Q8_0-00001-of-00002.gguf	GGUF	—	37.09 GB	Download
calme-2.1-qwen2-72b_Q8_0-00002-of-00002.gguf	GGUF	—	34.86 GB	Download

Model Details Live

Model Slug

richarderkhov/maziyarpanahi_-_calme-2.1-qwen2-72b-gguf

Author

RichardErkhov

Pipeline Task

—

Library

—

Created

2024-10-09

Last Modified

2024-10-10

Gated

Private

HF SHA

eb9b2e1975defc5babe32f14572ca74c6a00cb6f

License

Unknown

Language

Unknown

Base Model

Unknown

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "frontmatter": {},
    "hero_image_url": "./qwen2-fine-tunes-maziyar-panahi.webp",
    "summary": "This model is a fine-tuned version of the powerful Qwen/Qwen2-72B-Instruct, pushing the boundaries of natural language understanding and generation even further. My goal was to create a versatile and robust model that excels across a wide range of benchmarks and real-world applications.",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "Quantization made by Richard Erkhov.\n\n[Github](https://github.com/RichardErkhov)\n\n[Discord](https://discord.gg/pvy7H8DZMG)\n\n[Request more models](https://github.com/RichardErkhov/quant_request)\n\n\ncalme-2.1-qwen2-72b - GGUF\n- Model creator: https://huggingface.co/MaziyarPanahi/\n- Original model: https://huggingface.co/MaziyarPanahi/calme-2.1-qwen2-72b/\n\n\n| Name | Quant method | Size |\n| ---- | ---- | ---- |\n| [calme-2.1-qwen2-72b.Q2_K.gguf](https://huggingface.co/RichardErkhov/MaziyarPanahi_-_calme-2.1-qwen2-72b-gguf/blob/main/calme-2.1-qwen2-72b.Q2_K.gguf) | Q2_K | 27.76GB |\n| [calme-2.1-qwen2-72b.IQ3_XS.gguf](https://huggingface.co/RichardErkhov/MaziyarPanahi_-_calme-2.1-qwen2-72b-gguf/blob/main/calme-2.1-qwen2-72b.IQ3_XS.gguf) | IQ3_XS | 30.58GB |\n| [calme-2.1-qwen2-72b.IQ3_S.gguf](https://huggingface.co/RichardErkhov/MaziyarPanahi_-_calme-2.1-qwen2-72b-gguf/blob/main/calme-2.1-qwen2-72b.IQ3_S.gguf) | IQ3_S | 32.12GB |\n| [calme-2.1-qwen2-72b.Q3_K_S.gguf](https://huggingface.co/RichardErkhov/MaziyarPanahi_-_calme-2.1-qwen2-72b-gguf/blob/main/calme-2.1-qwen2-72b.Q3_K_S.gguf) | Q3_K_S | 32.12GB |\n| [calme-2.1-qwen2-72b.IQ3_M.gguf](https://huggingface.co/RichardErkhov/MaziyarPanahi_-_calme-2.1-qwen2-72b-gguf/blob/main/calme-2.1-qwen2-72b.IQ3_M.gguf) | IQ3_M | 33.06GB |\n| [calme-2.1-qwen2-72b.Q3_K.gguf](https://huggingface.co/RichardErkhov/MaziyarPanahi_-_calme-2.1-qwen2-72b-gguf/blob/main/calme-2.1-qwen2-72b.Q3_K.gguf) | Q3_K | 35.11GB |\n| [calme-2.1-qwen2-72b.Q3_K_M.gguf](https://huggingface.co/RichardErkhov/MaziyarPanahi_-_calme-2.1-qwen2-72b-gguf/blob/main/calme-2.1-qwen2-72b.Q3_K_M.gguf) | Q3_K_M | 35.11GB |\n| [calme-2.1-qwen2-72b.Q3_K_L.gguf](https://huggingface.co/RichardErkhov/MaziyarPanahi_-_calme-2.1-qwen2-72b-gguf/blob/main/calme-2.1-qwen2-72b.Q3_K_L.gguf) | Q3_K_L | 36.79GB |\n| [calme-2.1-qwen2-72b.IQ4_XS.gguf](https://huggingface.co/RichardErkhov/MaziyarPanahi_-_calme-2.1-qwen2-72b-gguf/tree/main/) | IQ4_XS | 37.4GB |\n| [calme-2.1-qwen2-72b.Q4_0.gguf](https://huggingface.co/RichardErkhov/MaziyarPanahi_-_calme-2.1-qwen2-72b-gguf/tree/main/) | Q4_0 | 38.4GB |\n| [calme-2.1-qwen2-72b.IQ4_NL.gguf](https://huggingface.co/RichardErkhov/MaziyarPanahi_-_calme-2.1-qwen2-72b-gguf/tree/main/) | IQ4_NL | 38.9GB |\n| [calme-2.1-qwen2-72b.Q4_K_S.gguf](https://huggingface.co/RichardErkhov/MaziyarPanahi_-_calme-2.1-qwen2-72b-gguf/tree/main/) | Q4_K_S | 40.87GB |\n| [calme-2.1-qwen2-72b.Q4_K.gguf](https://huggingface.co/RichardErkhov/MaziyarPanahi_-_calme-2.1-qwen2-72b-gguf/tree/main/) | Q4_K | 44.15GB |\n| [calme-2.1-qwen2-72b.Q4_K_M.gguf](https://huggingface.co/RichardErkhov/MaziyarPanahi_-_calme-2.1-qwen2-72b-gguf/tree/main/) | Q4_K_M | 44.15GB |\n| [calme-2.1-qwen2-72b.Q4_1.gguf](https://huggingface.co/RichardErkhov/MaziyarPanahi_-_calme-2.1-qwen2-72b-gguf/tree/main/) | Q4_1 | 42.55GB |\n| [calme-2.1-qwen2-72b.Q5_0.gguf](https://huggingface.co/RichardErkhov/MaziyarPanahi_-_calme-2.1-qwen2-72b-gguf/tree/main/) | Q5_0 | 46.71GB |\n| [calme-2.1-qwen2-72b.Q5_K_S.gguf](https://huggingface.co/RichardErkhov/MaziyarPanahi_-_calme-2.1-qwen2-72b-gguf/tree/main/) | Q5_K_S | 47.84GB |\n| [calme-2.1-qwen2-72b.Q5_K.gguf](https://huggingface.co/RichardErkhov/MaziyarPanahi_-_calme-2.1-qwen2-72b-gguf/tree/main/) | Q5_K | 50.7GB |\n| [calme-2.1-qwen2-72b.Q5_K_M.gguf](https://huggingface.co/RichardErkhov/MaziyarPanahi_-_calme-2.1-qwen2-72b-gguf/tree/main/) | Q5_K_M | 50.7GB |\n| [calme-2.1-qwen2-72b.Q5_1.gguf](https://huggingface.co/RichardErkhov/MaziyarPanahi_-_calme-2.1-qwen2-72b-gguf/tree/main/) | Q5_1 | 50.87GB |\n| [calme-2.1-qwen2-72b.Q6_K.gguf](https://huggingface.co/RichardErkhov/MaziyarPanahi_-_calme-2.1-qwen2-72b-gguf/tree/main/) | Q6_K | 59.92GB |\n| [calme-2.1-qwen2-72b.Q8_0.gguf](https://huggingface.co/RichardErkhov/MaziyarPanahi_-_calme-2.1-qwen2-72b-gguf/tree/main/) | Q8_0 | 71.95GB |\n\n\n\n\nOriginal model description:\n---\nlanguage:\n- en\nlicense: other\nlibrary_name: transformers\ntags:\n- chat\n- qwen\n- qwen2\n- finetune\n- chatml\nbase_model: Qwen/Qwen2-72B-Instruct\nlicense_name: tongyi-qianwen\nlicense_link: https://huggingface.co/Qwen/Qwen2-72B-Instruct/blob/main/LICENSE\npipeline_tag: text-generation\ninference: false\nmodel_creator: MaziyarPanahi\nquantized_by: MaziyarPanahi\nmodel-index:\n- name: calme-2.1-qwen2-72b\n  results:\n  - task:\n      type: text-generation\n      name: Text Generation\n    dataset:\n      name: IFEval (0-Shot)\n      type: HuggingFaceH4/ifeval\n      args:\n        num_few_shot: 0\n    metrics:\n    - type: inst_level_strict_acc and prompt_level_strict_acc\n      value: 81.63\n      name: strict accuracy\n    source:\n      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=MaziyarPanahi/calme-2.1-qwen2-72b\n      name: Open LLM Leaderboard\n  - task:\n      type: text-generation\n      name: Text Generation\n    dataset:\n      name: BBH (3-Shot)\n      type: BBH\n      args:\n        num_few_shot: 3\n    metrics:\n    - type: acc_norm\n      value: 57.33\n      name: normalized accuracy\n    source:\n      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=MaziyarPanahi/calme-2.1-qwen2-72b\n      name: Open LLM Leaderboard\n  - task:\n      type: text-generation\n      name: Text Generation\n    dataset:\n      name: MATH Lvl 5 (4-Shot)\n      type: hendrycks/competition_math\n      args:\n        num_few_shot: 4\n    metrics:\n    - type: exact_match\n      value: 36.03\n      name: exact match\n    source:\n      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=MaziyarPanahi/calme-2.1-qwen2-72b\n      name: Open LLM Leaderboard\n  - task:\n      type: text-generation\n      name: Text Generation\n    dataset:\n      name: GPQA (0-shot)\n      type: Idavidrein/gpqa\n      args:\n        num_few_shot: 0\n    metrics:\n    - type: acc_norm\n      value: 17.45\n      name: acc_norm\n    source:\n      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=MaziyarPanahi/calme-2.1-qwen2-72b\n      name: Open LLM Leaderboard\n  - task:\n      type: text-generation\n      name: Text Generation\n    dataset:\n      name: MuSR (0-shot)\n      type: TAUR-Lab/MuSR\n      args:\n        num_few_shot: 0\n    metrics:\n    - type: acc_norm\n      value: 20.15\n      name: acc_norm\n    source:\n      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=MaziyarPanahi/calme-2.1-qwen2-72b\n      name: Open LLM Leaderboard\n  - task:\n      type: text-generation\n      name: Text Generation\n    dataset:\n      name: MMLU-PRO (5-shot)\n      type: TIGER-Lab/MMLU-Pro\n      config: main\n      split: test\n      args:\n        num_few_shot: 5\n    metrics:\n    - type: acc\n      value: 49.05\n      name: accuracy\n    source:\n      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=MaziyarPanahi/calme-2.1-qwen2-72b\n      name: Open LLM Leaderboard\n---\n\n<img src=\"./qwen2-fine-tunes-maziyar-panahi.webp\" alt=\"Qwen2 fine-tune\" width=\"800\" style=\"margin-left:'auto' margin-right:'auto' display:'block'\"/>\n\n# MaziyarPanahi/calme-2.1-qwen2-72b\n\nThis model is a fine-tuned version of the powerful `Qwen/Qwen2-72B-Instruct`, pushing the boundaries of natural language understanding and generation even further. My goal was to create a versatile and robust model that excels across a wide range of benchmarks and real-world applications.\n\n## Use Cases\n\nThis model is suitable for a wide range of applications, including but not limited to:\n\n- Advanced question-answering systems\n- Intelligent chatbots and virtual assistants\n- Content generation and summarization\n- Code generation and analysis\n- Complex problem-solving and decision support\n\n# ⚡ Quantized GGUF\n\nAll GGUF models are available here: [MaziyarPanahi/calme-2.1-qwen2-72b-GGUF](https://huggingface.co/MaziyarPanahi/calme-2.1-qwen2-72b-GGUF)\n\n# 🏆 [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)\n\nDetailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_MaziyarPanahi__calme-2.1-qwen2-72b)\n\n|      Metric       |Value|\n|-------------------|----:|\n|Avg.               |43.61|\n|IFEval (0-Shot)    |81.63|\n|BBH (3-Shot)       |57.33|\n|MATH Lvl 5 (4-Shot)|36.03|\n|GPQA (0-shot)      |17.45|\n|MuSR (0-shot)      |20.15|\n|MMLU-PRO (5-shot)  |49.05|\n\n\n\n|    Tasks     |Version|Filter|n-shot|Metric|Value |   |Stderr|\n|--------------|------:|------|-----:|------|-----:|---|-----:|\n|truthfulqa_mc2|      2|none  |     0|acc   |0.6761|±  |0.0148|\n\n|  Tasks   |Version|Filter|n-shot|Metric|Value |   |Stderr|\n|----------|------:|------|-----:|------|-----:|---|-----:|\n|winogrande|      1|none  |     5|acc   |0.8248|±  |0.0107|\n\n|    Tasks    |Version|Filter|n-shot| Metric |Value |   |Stderr|\n|-------------|------:|------|-----:|--------|-----:|---|-----:|\n|arc_challenge|      1|none  |    25|acc     |0.6852|±  |0.0136|\n|             |       |none  |    25|acc_norm|0.7184|±  |0.0131|\n\n|Tasks|Version|     Filter     |n-shot|  Metric   |Value |   |Stderr|\n|-----|------:|----------------|-----:|-----------|-----:|---|-----:|\n|gsm8k|      3|strict-match    |     5|exact_match|0.8582|±  |0.0096|\n|     |       |flexible-extract|     5|exact_match|0.8893|±  |0.0086|\n\n# Prompt Template\n\nThis model uses `ChatML` prompt template:\n\n```\n<|im_start|>system\n{System}\n<|im_end|>\n<|im_start|>user\n{User}\n<|im_end|>\n<|im_start|>assistant\n{Assistant}\n````\n\n# How to use\n\n\n```python\n\n# Use a pipeline as a high-level helper\n\nfrom transformers import pipeline\n\nmessages = [\n    {\"role\": \"user\", \"content\": \"Who are you?\"},\n]\npipe = pipeline(\"text-generation\", model=\"MaziyarPanahi/calme-2.1-qwen2-72b\")\npipe(messages)\n\n\n# Load model directly\n\nfrom transformers import AutoTokenizer, AutoModelForCausalLM\n\ntokenizer = AutoTokenizer.from_pretrained(\"MaziyarPanahi/calme-2.1-qwen2-72b\")\nmodel = AutoModelForCausalLM.from_pretrained(\"MaziyarPanahi/calme-2.1-qwen2-72b\")\n```\n\n# Ethical Considerations\n\nAs with any large language model, users should be aware of potential biases and limitations. We recommend implementing appropriate safeguards and human oversight when deploying this model in production environments.\n\n\n\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 0,
  "downloads": 86,
  "gated": false,
  "private": false,
  "last_modified": "2024-10-10T09:52:14.000Z",
  "created_at": "2024-10-09T06:58:26.000Z",
  "pipeline_tag": "",
  "library_name": ""
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "67062992e343e345b77d0734",
  "id": "RichardErkhov/MaziyarPanahi_-_calme-2.1-qwen2-72b-gguf",
  "modelId": "RichardErkhov/MaziyarPanahi_-_calme-2.1-qwen2-72b-gguf",
  "sha": "eb9b2e1975defc5babe32f14572ca74c6a00cb6f",
  "createdAt": "2024-10-09T06:58:26.000Z",
  "lastModified": "2024-10-10T09:52:14.000Z",
  "author": "RichardErkhov",
  "downloads": 86,
  "likes": 0,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 37
}