Model Intelligence Sheet

samuelchristlie/deepseek-r1-0528-qwen3-8b-gguf overview

Direct GGUF Quantizations of DeepSeek-R1-0528-Qwen3-8B This repository provides GGUF quantized models for deepseek-ai/DeepSeek-R1-0528-Qwen3-8B. DeepSeek-R1-0528-Qwen3-8B is a powerful 8 billion parameter Large Language Model developed by DeepSeek AI. It is an Instruct model based on the Qwen3 architecture, excelling in a wide range of text generation tasks including chat, coding, and reasoning. These GGUF versions are optimized for efficient CPU and GPU inference using llama.cpp and compatible tools. This release includes various quantization levels (e.g., Q2K, Q3KM, Q4KM, Q5KM, Q6K, Q8_0) to suit different hardware capabilities and performance requirements.

ggufbase_model:Qwen/Qwen3-8Bbase_model:quantized:Qwen/Qwen3-8Blicense:mitendpoints_compatibleregion:usconversational

samuelchristlie/deepseek-r1-0528-qwen3-8b-gguf visual

Downloads

1,122

Likes

Pipeline

—

Library

—

Visibility

Public

Access

Open

Repository Files & Downloads

18 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
DeepSeek-R1-0528-Qwen3-8B-Q2_K.gguf	GGUF	Q2_K	3.06 GB	Download
DeepSeek-R1-0528-Qwen3-8B-Q3_K.gguf	GGUF	Q3_K	3.84 GB	Download
DeepSeek-R1-0528-Qwen3-8B-Q3_K_L.gguf	GGUF	Q3_K_L	4.13 GB	Download
DeepSeek-R1-0528-Qwen3-8B-Q3_K_M.gguf	GGUF	Q3_K_M	3.84 GB	Download
DeepSeek-R1-0528-Qwen3-8B-Q3_K_S.gguf	GGUF	Q3_K_S	3.51 GB	Download
DeepSeek-R1-0528-Qwen3-8B-Q4_0.gguf	GGUF	—	4.45 GB	Download
DeepSeek-R1-0528-Qwen3-8B-Q4_1.gguf	GGUF	—	4.89 GB	Download
DeepSeek-R1-0528-Qwen3-8B-Q4_K.gguf	GGUF	Q4_K	4.68 GB	Download
DeepSeek-R1-0528-Qwen3-8B-Q4_K_M.gguf	GGUF	Q4_K_M	4.68 GB	Download
DeepSeek-R1-0528-Qwen3-8B-Q4_K_S.gguf	GGUF	Q4_K_S	4.47 GB	Download
DeepSeek-R1-0528-Qwen3-8B-Q5_0.gguf	GGUF	—	5.33 GB	Download
DeepSeek-R1-0528-Qwen3-8B-Q5_1.gguf	GGUF	—	5.77 GB	Download
DeepSeek-R1-0528-Qwen3-8B-Q5_K.gguf	GGUF	Q5_K	5.45 GB	Download
DeepSeek-R1-0528-Qwen3-8B-Q5_K_M.gguf	GGUF	Q5_K_M	5.45 GB	Download
DeepSeek-R1-0528-Qwen3-8B-Q5_K_S.gguf	GGUF	Q5_K_S	5.33 GB	Download
DeepSeek-R1-0528-Qwen3-8B-Q6_K.gguf	GGUF	Q6_K	6.26 GB	Download
DeepSeek-R1-0528-Qwen3-8B-Q8_0.gguf	GGUF	—	8.11 GB	Download
DeepSeek-R1-0528-Qwen3-8B-f16.gguf	GGUF	F16	15.26 GB	Download

Model Details Live

Model Slug

samuelchristlie/deepseek-r1-0528-qwen3-8b-gguf

Author

samuelchristlie

Pipeline Task

—

Library

—

Created

2025-05-29

Last Modified

2025-05-29

Gated

Private

HF SHA

a10e157e66ed2502ec28f008c6354f42c0bc6010

License

mit

Language

Unknown

Base Model

deepseek-ai/DeepSeek-R1-0528-Qwen3-8B, Qwen/Qwen3-8B

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "license": "mit",
    "base_model": [
      "deepseek-ai/DeepSeek-R1-0528-Qwen3-8B",
      "Qwen/Qwen3-8B"
    ],
    "frontmatter": {
      "license": "mit",
      "base_model": [
        "deepseek-ai/DeepSeek-R1-0528-Qwen3-8B",
        "Qwen/Qwen3-8B"
      ]
    },
    "hero_image_url": "",
    "summary": "## Direct GGUF Quantizations of DeepSeek-R1-0528-Qwen3-8B This repository provides GGUF quantized models for **deepseek-ai/DeepSeek-R1-0528-Qwen3-8B**. DeepSeek-R1-0528-Qwen3-8B is a powerful 8 billion parameter Large Language Model developed by DeepSeek AI. It is an Instruct model based on the Qwen3 architecture, excelling in a wide range of text generation tasks including chat, coding, and reasoning. These GGUF versions are optimized for efficient CPU and GPU inference using llama.cpp and compatible tools. This release includes various quantization levels (e.g., Q2_K, Q3_K_M, Q4_K_M, Q5_K_M, Q6_K, Q8_0) to suit different hardware capabilities and performance requirements.",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: mit\nbase_model:\n- deepseek-ai/DeepSeek-R1-0528-Qwen3-8B\n- Qwen/Qwen3-8B\n---\n\n# DeepSeek-R1-0528-Qwen3-8B-GGUF\n## Direct GGUF Quantizations of DeepSeek-R1-0528-Qwen3-8B\n\nThis repository provides GGUF quantized models for **deepseek-ai/DeepSeek-R1-0528-Qwen3-8B**.\n\nDeepSeek-R1-0528-Qwen3-8B is a powerful 8 billion parameter Large Language Model developed by DeepSeek AI. It is an Instruct model based on the Qwen3 architecture, excelling in a wide range of text generation tasks including chat, coding, and reasoning. These GGUF versions are optimized for efficient CPU and GPU inference using `llama.cpp` and compatible tools.\n\nThis release includes various quantization levels (e.g., Q2_K, Q3_K_M, Q4_K_M, Q5_K_M, Q6_K, Q8_0) to suit different hardware capabilities and performance requirements.\n\n## Table of Contents 📝\n1. ▶ [Usage](#usage)\n2. 📃 [License](#license)\n3. 🙏 [Acknowledgements](#acknowledgements)\n\n<a name=\"usage\"/>\n\n## ▶ Usage\n\n### 1. Download Models\nDownload models using `huggingface-cli`:\n```bash\npip install \"huggingface_hub[cli]\"\nhuggingface-cli download samuelchristlie/DeepSeek-R1-0528-Qwen3-8B-GGUF --local-dir ./DeepSeek-R1-0528-Qwen3-8B-GGUF\n```\nYou can also download directly from [this page](https://huggingface.co/samuelchristlie/DeepSeek-R1-0528-Qwen3-8B-GGUF/tree/main)\n\n### 2. Inference\nTo use these GGUF files, you'll need a compatible inference engine like `llama.cpp` or clients built on top of it (e.g., `Ollama`, `LM Studio`, `KoboldCpp`, `text-generation-webui` with `llama.cpp` backend).\n\n<a name=\"license\"/>\n\n## 📃 License\nThis model is a GGUF conversion of the original `deepseek-ai/DeepSeek-R1-0528-Qwen3-8B` model. The original model is licensed under the **MIT License**, and this derivative work adheres to the terms of that license. Please review the original license for full details.\n\n<a name=\"acknowledgements\"/>\n\n## 🙏 Acknowledgements\n*   **DeepSeek AI** for developing and open-sourcing the powerful DeepSeek-R1-0528-Qwen3-8B model:\n    *   [DeepSeek-R1-0528-Qwen3-8B on Hugging Face](https://huggingface.co/deepseek-ai/DeepSeek-R1-0528-Qwen3-8B)\n*   The **llama.cpp** project and its contributors for the GGUF format and the incredible tooling that makes local LLM inference accessible.\n    *   [llama.cpp GitHub Repository](https://github.com/ggerganov/llama.cpp)\n*   **city96**:\n    *   [https://huggingface.co/city96](https://huggingface.co/city96)\n\n</div>",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "base_model:Qwen/Qwen3-8B",
    "base_model:quantized:Qwen/Qwen3-8B",
    "license:mit",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 0,
  "downloads": 1122,
  "gated": false,
  "private": false,
  "last_modified": "2025-05-29T18:24:37.000Z",
  "created_at": "2025-05-29T13:48:18.000Z",
  "pipeline_tag": "",
  "library_name": ""
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "683865a2cce754234b5a84fc",
  "id": "samuelchristlie/DeepSeek-R1-0528-Qwen3-8B-GGUF",
  "modelId": "samuelchristlie/DeepSeek-R1-0528-Qwen3-8B-GGUF",
  "sha": "a10e157e66ed2502ec28f008c6354f42c0bc6010",
  "createdAt": "2025-05-29T13:48:18.000Z",
  "lastModified": "2025-05-29T18:24:37.000Z",
  "author": "samuelchristlie",
  "downloads": 1122,
  "likes": 0,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 20
}