Model Intelligence Sheet
samuelchristlie/deepseek-r1-0528-qwen3-8b-gguf overview
Direct GGUF Quantizations of DeepSeek-R1-0528-Qwen3-8B This repository provides GGUF quantized models for deepseek-ai/DeepSeek-R1-0528-Qwen3-8B. DeepSeek-R1-0528-Qwen3-8B is a powerful 8 billion parameter Large Language Model developed by DeepSeek AI. It is an Instruct model based on the Qwen3 architecture, excelling in a wide range of text generation tasks including chat, coding, and reasoning. These GGUF versions are optimized for efficient CPU and GPU inference using llama.cpp and compatible tools. This release includes various quantization levels (e.g., Q2K, Q3KM, Q4KM, Q5KM, Q6K, Q8_0) to suit different hardware capabilities and performance requirements.
Downloads
1,122
Likes
0
Pipeline
—
Library
—
Visibility
Public
Access
Open
Repository Files & Downloads
18 files detected
Direct downloads for all repository files
| File | Type | Quantization | Size | Link |
|---|---|---|---|---|
| DeepSeek-R1-0528-Qwen3-8B-Q2_K.gguf | GGUF | Q2_K | 3.06 GB | Download |
| DeepSeek-R1-0528-Qwen3-8B-Q3_K.gguf | GGUF | Q3_K | 3.84 GB | Download |
| DeepSeek-R1-0528-Qwen3-8B-Q3_K_L.gguf | GGUF | Q3_K_L | 4.13 GB | Download |
| DeepSeek-R1-0528-Qwen3-8B-Q3_K_M.gguf | GGUF | Q3_K_M | 3.84 GB | Download |
| DeepSeek-R1-0528-Qwen3-8B-Q3_K_S.gguf | GGUF | Q3_K_S | 3.51 GB | Download |
| DeepSeek-R1-0528-Qwen3-8B-Q4_0.gguf | GGUF | — | 4.45 GB | Download |
| DeepSeek-R1-0528-Qwen3-8B-Q4_1.gguf | GGUF | — | 4.89 GB | Download |
| DeepSeek-R1-0528-Qwen3-8B-Q4_K.gguf | GGUF | Q4_K | 4.68 GB | Download |
| DeepSeek-R1-0528-Qwen3-8B-Q4_K_M.gguf | GGUF | Q4_K_M | 4.68 GB | Download |
| DeepSeek-R1-0528-Qwen3-8B-Q4_K_S.gguf | GGUF | Q4_K_S | 4.47 GB | Download |
| DeepSeek-R1-0528-Qwen3-8B-Q5_0.gguf | GGUF | — | 5.33 GB | Download |
| DeepSeek-R1-0528-Qwen3-8B-Q5_1.gguf | GGUF | — | 5.77 GB | Download |
| DeepSeek-R1-0528-Qwen3-8B-Q5_K.gguf | GGUF | Q5_K | 5.45 GB | Download |
| DeepSeek-R1-0528-Qwen3-8B-Q5_K_M.gguf | GGUF | Q5_K_M | 5.45 GB | Download |
| DeepSeek-R1-0528-Qwen3-8B-Q5_K_S.gguf | GGUF | Q5_K_S | 5.33 GB | Download |
| DeepSeek-R1-0528-Qwen3-8B-Q6_K.gguf | GGUF | Q6_K | 6.26 GB | Download |
| DeepSeek-R1-0528-Qwen3-8B-Q8_0.gguf | GGUF | — | 8.11 GB | Download |
| DeepSeek-R1-0528-Qwen3-8B-f16.gguf | GGUF | F16 | 15.26 GB | Download |
Model Details Live
Metadata Inspector
Normalized metadata (stored in metadata_json)
{
"metadata": {},
"card_data": {
"license": "mit",
"base_model": [
"deepseek-ai/DeepSeek-R1-0528-Qwen3-8B",
"Qwen/Qwen3-8B"
],
"frontmatter": {
"license": "mit",
"base_model": [
"deepseek-ai/DeepSeek-R1-0528-Qwen3-8B",
"Qwen/Qwen3-8B"
]
},
"hero_image_url": "",
"summary": "## Direct GGUF Quantizations of DeepSeek-R1-0528-Qwen3-8B This repository provides GGUF quantized models for **deepseek-ai/DeepSeek-R1-0528-Qwen3-8B**. DeepSeek-R1-0528-Qwen3-8B is a powerful 8 billion parameter Large Language Model developed by DeepSeek AI. It is an Instruct model based on the Qwen3 architecture, excelling in a wide range of text generation tasks including chat, coding, and reasoning. These GGUF versions are optimized for efficient CPU and GPU inference using llama.cpp and compatible tools. This release includes various quantization levels (e.g., Q2_K, Q3_K_M, Q4_K_M, Q5_K_M, Q6_K, Q8_0) to suit different hardware capabilities and performance requirements.",
"quick_links": [],
"benchmark_table_html": "",
"readme_markdown": "---\nlicense: mit\nbase_model:\n- deepseek-ai/DeepSeek-R1-0528-Qwen3-8B\n- Qwen/Qwen3-8B\n---\n\n# DeepSeek-R1-0528-Qwen3-8B-GGUF\n## Direct GGUF Quantizations of DeepSeek-R1-0528-Qwen3-8B\n\nThis repository provides GGUF quantized models for **deepseek-ai/DeepSeek-R1-0528-Qwen3-8B**.\n\nDeepSeek-R1-0528-Qwen3-8B is a powerful 8 billion parameter Large Language Model developed by DeepSeek AI. It is an Instruct model based on the Qwen3 architecture, excelling in a wide range of text generation tasks including chat, coding, and reasoning. These GGUF versions are optimized for efficient CPU and GPU inference using `llama.cpp` and compatible tools.\n\nThis release includes various quantization levels (e.g., Q2_K, Q3_K_M, Q4_K_M, Q5_K_M, Q6_K, Q8_0) to suit different hardware capabilities and performance requirements.\n\n## Table of Contents 📝\n1. ▶ [Usage](#usage)\n2. 📃 [License](#license)\n3. 🙏 [Acknowledgements](#acknowledgements)\n\n<a name=\"usage\"/>\n\n## ▶ Usage\n\n### 1. Download Models\nDownload models using `huggingface-cli`:\n```bash\npip install \"huggingface_hub[cli]\"\nhuggingface-cli download samuelchristlie/DeepSeek-R1-0528-Qwen3-8B-GGUF --local-dir ./DeepSeek-R1-0528-Qwen3-8B-GGUF\n```\nYou can also download directly from [this page](https://huggingface.co/samuelchristlie/DeepSeek-R1-0528-Qwen3-8B-GGUF/tree/main)\n\n### 2. Inference\nTo use these GGUF files, you'll need a compatible inference engine like `llama.cpp` or clients built on top of it (e.g., `Ollama`, `LM Studio`, `KoboldCpp`, `text-generation-webui` with `llama.cpp` backend).\n\n<a name=\"license\"/>\n\n## 📃 License\nThis model is a GGUF conversion of the original `deepseek-ai/DeepSeek-R1-0528-Qwen3-8B` model. The original model is licensed under the **MIT License**, and this derivative work adheres to the terms of that license. Please review the original license for full details.\n\n<a name=\"acknowledgements\"/>\n\n## 🙏 Acknowledgements\n* **DeepSeek AI** for developing and open-sourcing the powerful DeepSeek-R1-0528-Qwen3-8B model:\n * [DeepSeek-R1-0528-Qwen3-8B on Hugging Face](https://huggingface.co/deepseek-ai/DeepSeek-R1-0528-Qwen3-8B)\n* The **llama.cpp** project and its contributors for the GGUF format and the incredible tooling that makes local LLM inference accessible.\n * [llama.cpp GitHub Repository](https://github.com/ggerganov/llama.cpp)\n* **city96**:\n * [https://huggingface.co/city96](https://huggingface.co/city96)\n\n</div>",
"related_quantizations": []
},
"tags": [
"gguf",
"base_model:Qwen/Qwen3-8B",
"base_model:quantized:Qwen/Qwen3-8B",
"license:mit",
"endpoints_compatible",
"region:us",
"conversational"
],
"likes": 0,
"downloads": 1122,
"gated": false,
"private": false,
"last_modified": "2025-05-29T18:24:37.000Z",
"created_at": "2025-05-29T13:48:18.000Z",
"pipeline_tag": "",
"library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
"_id": "683865a2cce754234b5a84fc",
"id": "samuelchristlie/DeepSeek-R1-0528-Qwen3-8B-GGUF",
"modelId": "samuelchristlie/DeepSeek-R1-0528-Qwen3-8B-GGUF",
"sha": "a10e157e66ed2502ec28f008c6354f42c0bc6010",
"createdAt": "2025-05-29T13:48:18.000Z",
"lastModified": "2025-05-29T18:24:37.000Z",
"author": "samuelchristlie",
"downloads": 1122,
"likes": 0,
"gated": false,
"private": false,
"pipeline_tag": "",
"library_name": "",
"siblings_count": 20
}