GraySoft
Model Intelligence Sheet

samuelchristlie/deepseek-r1-0528-qwen3-8b-gguf overview

Direct GGUF Quantizations of DeepSeek-R1-0528-Qwen3-8B

This repository provides GGUF quantized models for deepseek-ai/DeepSeek-R1-0528-Qwen3-8B.

DeepSeek-R1-0528-Qwen3-8B is a powerful 8 billion parameter Large Language Model developed by DeepSeek AI. It is an Instruct model based on the Qwen3 architecture, excelling in a wide range of text generation tasks including chat, coding, and reasoning. These GGUF versions are optimized for efficient CPU and GPU inference using llama.cpp and compatible tools.

This release includes various quantization levels (e.g., Q2_K, Q3_K_M, Q4_K_M, Q5_K_M, Q6_K, Q8_0) to suit different hardware capabilities and performance requirements.
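The repository README (quoted in the metadata further down) downloads the whole repository with `huggingface-cli`. When only one quantization is needed, a minimal Python sketch along these lines may be enough. It assumes the `huggingface_hub` package is installed; the helper names `gguf_filename` and `download_quant` and the `./models` directory are hypothetical, while the repository id and file naming pattern are taken from this page.

```python
# Repository id as reported in the Hugging Face API payload on this page.
REPO_ID = "samuelchristlie/DeepSeek-R1-0528-Qwen3-8B-GGUF"


def gguf_filename(quant: str) -> str:
    """Build a file name following the pattern used in the file list."""
    return f"DeepSeek-R1-0528-Qwen3-8B-{quant}.gguf"


def download_quant(quant: str, local_dir: str = "./models") -> str:
    """Download a single GGUF file and return its local path.

    The import is kept inside the function so that the naming helper
    above can be used without huggingface_hub being installed.
    """
    from huggingface_hub import hf_hub_download  # pip install huggingface_hub

    return hf_hub_download(
        repo_id=REPO_ID,
        filename=gguf_filename(quant),
        local_dir=local_dir,
    )
```

Calling `download_quant("Q4_K_M")` would fetch only the Q4_K_M file. The resulting path can then be passed to llama.cpp or a client built on it.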

Tags: gguf, base_model:Qwen/Qwen3-8B, base_model:quantized:Qwen/Qwen3-8B, license:mit, endpoints_compatible, region:us, conversational
Downloads: 1,122
Likes: 0
Pipeline: (not set)
Library: (not set)
Visibility: Public
Access: Open

Repository Files & Downloads

18 files detected
Direct downloads for all repository files
| File | Type | Quantization | Size |
| --- | --- | --- | --- |
| DeepSeek-R1-0528-Qwen3-8B-Q2_K.gguf | GGUF | Q2_K | 3.06 GB |
| DeepSeek-R1-0528-Qwen3-8B-Q3_K.gguf | GGUF | Q3_K | 3.84 GB |
| DeepSeek-R1-0528-Qwen3-8B-Q3_K_L.gguf | GGUF | Q3_K_L | 4.13 GB |
| DeepSeek-R1-0528-Qwen3-8B-Q3_K_M.gguf | GGUF | Q3_K_M | 3.84 GB |
| DeepSeek-R1-0528-Qwen3-8B-Q3_K_S.gguf | GGUF | Q3_K_S | 3.51 GB |
| DeepSeek-R1-0528-Qwen3-8B-Q4_0.gguf | GGUF | Q4_0 | 4.45 GB |
| DeepSeek-R1-0528-Qwen3-8B-Q4_1.gguf | GGUF | Q4_1 | 4.89 GB |
| DeepSeek-R1-0528-Qwen3-8B-Q4_K.gguf | GGUF | Q4_K | 4.68 GB |
| DeepSeek-R1-0528-Qwen3-8B-Q4_K_M.gguf | GGUF | Q4_K_M | 4.68 GB |
| DeepSeek-R1-0528-Qwen3-8B-Q4_K_S.gguf | GGUF | Q4_K_S | 4.47 GB |
| DeepSeek-R1-0528-Qwen3-8B-Q5_0.gguf | GGUF | Q5_0 | 5.33 GB |
| DeepSeek-R1-0528-Qwen3-8B-Q5_1.gguf | GGUF | Q5_1 | 5.77 GB |
| DeepSeek-R1-0528-Qwen3-8B-Q5_K.gguf | GGUF | Q5_K | 5.45 GB |
| DeepSeek-R1-0528-Qwen3-8B-Q5_K_M.gguf | GGUF | Q5_K_M | 5.45 GB |
| DeepSeek-R1-0528-Qwen3-8B-Q5_K_S.gguf | GGUF | Q5_K_S | 5.33 GB |
| DeepSeek-R1-0528-Qwen3-8B-Q6_K.gguf | GGUF | Q6_K | 6.26 GB |
| DeepSeek-R1-0528-Qwen3-8B-Q8_0.gguf | GGUF | Q8_0 | 8.11 GB |
| DeepSeek-R1-0528-Qwen3-8B-f16.gguf | GGUF | F16 | 15.26 GB |
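The listed sizes can guide the choice of quantization for a given machine. The sketch below picks the largest file that fits a memory budget. It is an illustration only: the function name `pick_quant` and the 1.5 GB default reserved for context and runtime buffers are assumptions, not figures from the repository, and real memory use depends on context length and backend.

```python
# File sizes in GB, copied from the file list on this page.
# Q3_K, Q4_K and Q5_K are left out because they are listed with the
# same sizes as their _M counterparts.
QUANT_SIZES_GB = {
    "Q2_K": 3.06, "Q3_K_S": 3.51, "Q3_K_M": 3.84, "Q3_K_L": 4.13,
    "Q4_0": 4.45, "Q4_K_S": 4.47, "Q4_K_M": 4.68, "Q4_1": 4.89,
    "Q5_0": 5.33, "Q5_K_S": 5.33, "Q5_K_M": 5.45, "Q5_1": 5.77,
    "Q6_K": 6.26, "Q8_0": 8.11, "F16": 15.26,
}


def pick_quant(budget_gb, overhead_gb=1.5):
    """Return the largest listed quantization whose file fits in the budget.

    overhead_gb is a hypothetical allowance for the KV cache and runtime
    buffers. Returns None when nothing fits.
    """
    usable = budget_gb - overhead_gb
    fitting = {q: size for q, size in QUANT_SIZES_GB.items() if size <= usable}
    if not fitting:
        return None
    # Largest file that still fits; on a tie the first listed entry wins.
    return max(fitting, key=fitting.get)


print(pick_quant(8.0))   # 6.5 GB usable
print(pick_quant(4.0))   # 2.5 GB usable, smaller than every listed file
```

File size is used here only as a rough proxy for memory footprint; a larger file generally means less quantization loss but slower inference on constrained hardware.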

Model Details

Model Slug: samuelchristlie/deepseek-r1-0528-qwen3-8b-gguf
Author: samuelchristlie
Pipeline Task: (not set)
Library: (not set)
Created: 2025-05-29
Last Modified: 2025-05-29
Gated: No
Private: No
HF SHA: a10e157e66ed2502ec28f008c6354f42c0bc6010
License: mit
Language: Unknown
Base Model: deepseek-ai/DeepSeek-R1-0528-Qwen3-8B, Qwen/Qwen3-8B

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "license": "mit",
    "base_model": [
      "deepseek-ai/DeepSeek-R1-0528-Qwen3-8B",
      "Qwen/Qwen3-8B"
    ],
    "frontmatter": {
      "license": "mit",
      "base_model": [
        "deepseek-ai/DeepSeek-R1-0528-Qwen3-8B",
        "Qwen/Qwen3-8B"
      ]
    },
    "hero_image_url": "",
    "summary": "## Direct GGUF Quantizations of DeepSeek-R1-0528-Qwen3-8B This repository provides GGUF quantized models for **deepseek-ai/DeepSeek-R1-0528-Qwen3-8B**. DeepSeek-R1-0528-Qwen3-8B is a powerful 8 billion parameter Large Language Model developed by DeepSeek AI. It is an Instruct model based on the Qwen3 architecture, excelling in a wide range of text generation tasks including chat, coding, and reasoning. These GGUF versions are optimized for efficient CPU and GPU inference using llama.cpp and compatible tools. This release includes various quantization levels (e.g., Q2_K, Q3_K_M, Q4_K_M, Q5_K_M, Q6_K, Q8_0) to suit different hardware capabilities and performance requirements.",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: mit\nbase_model:\n- deepseek-ai/DeepSeek-R1-0528-Qwen3-8B\n- Qwen/Qwen3-8B\n---\n\n# DeepSeek-R1-0528-Qwen3-8B-GGUF\n## Direct GGUF Quantizations of DeepSeek-R1-0528-Qwen3-8B\n\nThis repository provides GGUF quantized models for **deepseek-ai/DeepSeek-R1-0528-Qwen3-8B**.\n\nDeepSeek-R1-0528-Qwen3-8B is a powerful 8 billion parameter Large Language Model developed by DeepSeek AI. It is an Instruct model based on the Qwen3 architecture, excelling in a wide range of text generation tasks including chat, coding, and reasoning. These GGUF versions are optimized for efficient CPU and GPU inference using `llama.cpp` and compatible tools.\n\nThis release includes various quantization levels (e.g., Q2_K, Q3_K_M, Q4_K_M, Q5_K_M, Q6_K, Q8_0) to suit different hardware capabilities and performance requirements.\n\n## Table of Contents 📝\n1. ▶ [Usage](#usage)\n2. 📃 [License](#license)\n3. 🙏 [Acknowledgements](#acknowledgements)\n\n<a name=\"usage\"/>\n\n## ▶ Usage\n\n### 1. Download Models\nDownload models using `huggingface-cli`:\n```bash\npip install \"huggingface_hub[cli]\"\nhuggingface-cli download samuelchristlie/DeepSeek-R1-0528-Qwen3-8B-GGUF --local-dir ./DeepSeek-R1-0528-Qwen3-8B-GGUF\n```\nYou can also download directly from [this page](https://huggingface.co/samuelchristlie/DeepSeek-R1-0528-Qwen3-8B-GGUF/tree/main)\n\n### 2. Inference\nTo use these GGUF files, you'll need a compatible inference engine like `llama.cpp` or clients built on top of it (e.g., `Ollama`, `LM Studio`, `KoboldCpp`, `text-generation-webui` with `llama.cpp` backend).\n\n<a name=\"license\"/>\n\n## 📃 License\nThis model is a GGUF conversion of the original `deepseek-ai/DeepSeek-R1-0528-Qwen3-8B` model. The original model is licensed under the **MIT License**, and this derivative work adheres to the terms of that license. \nPlease review the original license for full details.\n\n<a name=\"acknowledgements\"/>\n\n## 🙏 Acknowledgements\n*   **DeepSeek AI** for developing and open-sourcing the powerful DeepSeek-R1-0528-Qwen3-8B model:\n    *   [DeepSeek-R1-0528-Qwen3-8B on Hugging Face](https://huggingface.co/deepseek-ai/DeepSeek-R1-0528-Qwen3-8B)\n*   The **llama.cpp** project and its contributors for the GGUF format and the incredible tooling that makes local LLM inference accessible.\n    *   [llama.cpp GitHub Repository](https://github.com/ggerganov/llama.cpp)\n*   **city96**:\n    *   [https://huggingface.co/city96](https://huggingface.co/city96)\n\n</div>",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "base_model:Qwen/Qwen3-8B",
    "base_model:quantized:Qwen/Qwen3-8B",
    "license:mit",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 0,
  "downloads": 1122,
  "gated": false,
  "private": false,
  "last_modified": "2025-05-29T18:24:37.000Z",
  "created_at": "2025-05-29T13:48:18.000Z",
  "pipeline_tag": "",
  "library_name": ""
}
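The `tags` array in the normalized metadata mixes plain flags (such as `gguf`) with `key:value` pairs (such as `license:mit`). One possible way to separate the two is sketched below. The function name `group_tags` is hypothetical, and splitting on the first colon is an assumption about the tag convention, which is why `base_model:quantized:Qwen/Qwen3-8B` keeps `quantized:` as part of its value.

```python
from collections import defaultdict

# Tags copied from the normalized metadata above.
TAGS = [
    "gguf",
    "base_model:Qwen/Qwen3-8B",
    "base_model:quantized:Qwen/Qwen3-8B",
    "license:mit",
    "endpoints_compatible",
    "region:us",
    "conversational",
]


def group_tags(tags):
    """Split tags into plain flags and key -> list-of-values fields."""
    flags = []
    fields = defaultdict(list)
    for tag in tags:
        # partition() splits on the first colon only.
        key, sep, value = tag.partition(":")
        if sep:
            fields[key].append(value)
        else:
            flags.append(tag)
    return flags, dict(fields)


flags, fields = group_tags(TAGS)
print(flags)
print(fields)
```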
Source payload excerpt (from Hugging Face API)
{
  "_id": "683865a2cce754234b5a84fc",
  "id": "samuelchristlie/DeepSeek-R1-0528-Qwen3-8B-GGUF",
  "modelId": "samuelchristlie/DeepSeek-R1-0528-Qwen3-8B-GGUF",
  "sha": "a10e157e66ed2502ec28f008c6354f42c0bc6010",
  "createdAt": "2025-05-29T13:48:18.000Z",
  "lastModified": "2025-05-29T18:24:37.000Z",
  "author": "samuelchristlie",
  "downloads": 1122,
  "likes": 0,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 20
}
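The `createdAt` and `lastModified` fields in the payload are ISO 8601 timestamps in UTC. A small sketch of parsing them and computing how long the repository was being updated after creation follows. The helper name `parse_hf_timestamp` is hypothetical, and the format string assumes the millisecond-plus-`Z` layout shown in the excerpt.

```python
from datetime import datetime, timezone


def parse_hf_timestamp(value):
    """Parse a timestamp such as 2025-05-29T13:48:18.000Z into an aware datetime."""
    parsed = datetime.strptime(value, "%Y-%m-%dT%H:%M:%S.%fZ")
    # The trailing Z denotes UTC, so attach that timezone explicitly.
    return parsed.replace(tzinfo=timezone.utc)


# Values copied from the payload excerpt above.
created = parse_hf_timestamp("2025-05-29T13:48:18.000Z")
modified = parse_hf_timestamp("2025-05-29T18:24:37.000Z")

elapsed = modified - created
print(elapsed)  # 4:36:19
```

Using `strptime` with an explicit format avoids relying on `datetime.fromisoformat` accepting the `Z` suffix, which only works on newer Python versions.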