duyntnet/qwen2.5-coder-3b-instruct-imatrix-gguf IQ3_M GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

duyntnet/qwen2.5-coder-3b-instruct-imatrix-gguf overview

Comprehensive model page for duyntnet/qwen2.5-coder-3b-instruct-imatrix-gguf

transformersggufimatrixQwen2.5-Coder-3B-Instructtext-generationenarxiv:2409.12186license:otherregion:usconversational

duyntnet/qwen2.5-coder-3b-instruct-imatrix-gguf visual

Downloads

327

Likes

Pipeline

text-generation

Library

transformers

Visibility

Public

Access

Open

Repository Files & Downloads

27 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
Qwen2.5-Coder-3B-Instruct-IQ1_M.gguf	GGUF	IQ1_M	810.65 MB	Download
Qwen2.5-Coder-3B-Instruct-IQ1_S.gguf	GGUF	IQ1_S	754.45 MB	Download
Qwen2.5-Coder-3B-Instruct-IQ2_M.gguf	GGUF	IQ2_M	1.06 GB	Download
Qwen2.5-Coder-3B-Instruct-IQ2_S.gguf	GGUF	IQ2_S	1012.74 MB	Download
Qwen2.5-Coder-3B-Instruct-IQ2_XS.gguf	GGUF	IQ2_XS	983.76 MB	Download
Qwen2.5-Coder-3B-Instruct-IQ2_XXS.gguf	GGUF	IQ2_XXS	904.32 MB	Download
Qwen2.5-Coder-3B-Instruct-IQ3_M.gguf	GGUF	IQ3_M	1.39 GB	Download
Qwen2.5-Coder-3B-Instruct-IQ3_S.gguf	GGUF	IQ3_S	1.36 GB	Download
Qwen2.5-Coder-3B-Instruct-IQ3_XS.gguf	GGUF	IQ3_XS	1.30 GB	Download
Qwen2.5-Coder-3B-Instruct-IQ3_XXS.gguf	GGUF	IQ3_XXS	1.19 GB	Download
Qwen2.5-Coder-3B-Instruct-IQ4_NL.gguf	GGUF	IQ4_NL	1.70 GB	Download
Qwen2.5-Coder-3B-Instruct-IQ4_XS.gguf	GGUF	IQ4_XS	1.62 GB	Download
Qwen2.5-Coder-3B-Instruct-Q2_K.gguf	GGUF	Q2_K	1.19 GB	Download
Qwen2.5-Coder-3B-Instruct-Q2_K_S.gguf	GGUF	Q2_K_S	1.12 GB	Download
Qwen2.5-Coder-3B-Instruct-Q3_K_L.gguf	GGUF	Q3_K_L	1.59 GB	Download
Qwen2.5-Coder-3B-Instruct-Q3_K_M.gguf	GGUF	Q3_K_M	1.48 GB	Download
Qwen2.5-Coder-3B-Instruct-Q3_K_S.gguf	GGUF	Q3_K_S	1.35 GB	Download
Qwen2.5-Coder-3B-Instruct-Q4_0.gguf	GGUF	—	1.70 GB	Download
Qwen2.5-Coder-3B-Instruct-Q4_1.gguf	GGUF	—	1.86 GB	Download
Qwen2.5-Coder-3B-Instruct-Q4_K_M.gguf	GGUF	Q4_K_M	1.80 GB	Download
Qwen2.5-Coder-3B-Instruct-Q4_K_S.gguf	GGUF	Q4_K_S	1.71 GB	Download
Qwen2.5-Coder-3B-Instruct-Q5_0.gguf	GGUF	—	2.03 GB	Download
Qwen2.5-Coder-3B-Instruct-Q5_1.gguf	GGUF	—	2.18 GB	Download
Qwen2.5-Coder-3B-Instruct-Q5_K_M.gguf	GGUF	Q5_K_M	2.07 GB	Download
Qwen2.5-Coder-3B-Instruct-Q5_K_S.gguf	GGUF	Q5_K_S	2.02 GB	Download
Qwen2.5-Coder-3B-Instruct-Q6_K.gguf	GGUF	Q6_K	2.36 GB	Download
Qwen2.5-Coder-3B-Instruct-Q8_0.gguf	GGUF	—	3.06 GB	Download

Model Details Live

Model Slug

duyntnet/qwen2.5-coder-3b-instruct-imatrix-gguf

Author

duyntnet

Pipeline Task

text-generation

Library

transformers

Created

2024-11-14

Last Modified

2024-11-14

Gated

Private

HF SHA

5e7806a04c868e85327b681cfdd901b9efa9e278

License

other

Language

Base Model

Unknown

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "license": "other",
    "language": [
      "en"
    ],
    "pipeline_tag": "text-generation",
    "inference": false,
    "tags": [
      "transformers",
      "gguf",
      "imatrix",
      "Qwen2.5-Coder-3B-Instruct"
    ],
    "frontmatter": {
      "license": "other",
      "language": [
        "en"
      ],
      "pipeline_tag": "text-generation",
      "inference": "false",
      "tags": [
        "transformers",
        "gguf",
        "imatrix",
        "Qwen2.5-Coder-3B-Instruct"
      ]
    },
    "hero_image_url": "",
    "summary": "",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: other\nlanguage:\n- en\npipeline_tag: text-generation\ninference: false\ntags:\n- transformers\n- gguf\n- imatrix\n- Qwen2.5-Coder-3B-Instruct\n---\nQuantizations of https://huggingface.co/Qwen/Qwen2.5-Coder-3B-Instruct\n\n\n### Inference Clients/UIs\n* [llama.cpp](https://github.com/ggerganov/llama.cpp)\n* [KoboldCPP](https://github.com/LostRuins/koboldcpp)\n* [ollama](https://github.com/ollama/ollama)\n* [text-generation-webui](https://github.com/oobabooga/text-generation-webui)\n* [GPT4All](https://github.com/nomic-ai/gpt4all)\n* [jan](https://github.com/janhq/jan)\n---\n\n# From original readme\n\n## Introduction\n\nQwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly known as CodeQwen). As of now, Qwen2.5-Coder has covered six mainstream model sizes, 0.5, 1.5, 3, 7, 14, 32 billion parameters, to meet the needs of different developers. Qwen2.5-Coder brings the following improvements upon CodeQwen1.5:\n\n- Significantly improvements in **code generation**, **code reasoning** and **code fixing**. Base on the strong Qwen2.5, we scale up the training tokens into 5.5 trillion including source code, text-code grounding, Synthetic data, etc. Qwen2.5-Coder-32B has become the current state-of-the-art open-source codeLLM, with its coding abilities matching those of GPT-4o.\n- A more comprehensive foundation for real-world applications such as **Code Agents**. Not only enhancing coding capabilities but also maintaining its strengths in mathematics and general competencies.\n\n**This repo contains the instruction-tuned 3B Qwen2.5-Coder model**, which has the following features:\n- Type: Causal Language Models\n- Training Stage: Pretraining & Post-training\n- Architecture: transformers with RoPE, SwiGLU, RMSNorm, Attention QKV bias and tied word embeddings\n- Number of Parameters: 3.09B\n- Number of Paramaters (Non-Embedding): 2.77B\n- Number of Layers: 36\n- Number of Attention Heads (GQA): 16 for Q and 2 for KV\n- Context Length: Full 32,768 tokens\n  \nFor more details, please refer to our [blog](https://qwenlm.github.io/blog/qwen2.5-coder-family/), [GitHub](https://github.com/QwenLM/Qwen2.5-Coder), [Documentation](https://qwen.readthedocs.io/en/latest/), [Arxiv](https://arxiv.org/abs/2409.12186).\n\n## Requirements\n\nThe code of Qwen2.5-Coder has been in the latest Hugging face `transformers` and we advise you to use the latest version of `transformers`.\n\nWith `transformers<4.37.0`, you will encounter the following error:\n```\nKeyError: 'qwen2'\n```\n\n## Quickstart\n\nHere provides a code snippet with `apply_chat_template` to show you how to load the tokenizer and model and how to generate contents.\n\n```python\nfrom transformers import AutoModelForCausalLM, AutoTokenizer\n\nmodel_name = \"Qwen/Qwen2.5-Coder-3B-Instruct\"\n\nmodel = AutoModelForCausalLM.from_pretrained(\n    model_name,\n    torch_dtype=\"auto\",\n    device_map=\"auto\"\n)\ntokenizer = AutoTokenizer.from_pretrained(model_name)\n\nprompt = \"write a quick sort algorithm.\"\nmessages = [\n    {\"role\": \"system\", \"content\": \"You are Qwen, created by Alibaba Cloud. You are a helpful assistant.\"},\n    {\"role\": \"user\", \"content\": prompt}\n]\ntext = tokenizer.apply_chat_template(\n    messages,\n    tokenize=False,\n    add_generation_prompt=True\n)\nmodel_inputs = tokenizer([text], return_tensors=\"pt\").to(model.device)\n\ngenerated_ids = model.generate(\n    **model_inputs,\n    max_new_tokens=512\n)\ngenerated_ids = [\n    output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)\n]\n\nresponse = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]\n```",
    "related_quantizations": []
  },
  "tags": [
    "transformers",
    "gguf",
    "imatrix",
    "Qwen2.5-Coder-3B-Instruct",
    "text-generation",
    "en",
    "arxiv:2409.12186",
    "license:other",
    "region:us",
    "conversational"
  ],
  "likes": 0,
  "downloads": 327,
  "gated": false,
  "private": false,
  "last_modified": "2024-11-14T19:07:50.000Z",
  "created_at": "2024-11-14T18:09:07.000Z",
  "pipeline_tag": "text-generation",
  "library_name": "transformers"
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "67363cc32a3c30eac406cfd5",
  "id": "duyntnet/Qwen2.5-Coder-3B-Instruct-imatrix-GGUF",
  "modelId": "duyntnet/Qwen2.5-Coder-3B-Instruct-imatrix-GGUF",
  "sha": "5e7806a04c868e85327b681cfdd901b9efa9e278",
  "createdAt": "2024-11-14T18:09:07.000Z",
  "lastModified": "2024-11-14T19:07:50.000Z",
  "author": "duyntnet",
  "downloads": 327,
  "likes": 0,
  "gated": false,
  "private": false,
  "pipeline_tag": "text-generation",
  "library_name": "transformers",
  "siblings_count": 29
}

duyntnet/qwen2.5-coder-3b-instruct-imatrix-gguf overview

Repository Files & Downloads

Model Details Live

Metadata Inspector

More models in this shard