glogwa68/Qwen3-0.6B-DISTILL-glm-4.7-think-GGUF is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. Use this page to pick a quantization for guIDE or another local runtime.
Model Intelligence Sheet
glogwa68/qwen3-0.6b-distill-glm-4.7-think-gguf overview
GGUF-quantized versions of a fine-tuned Qwen3-0.6B model.
| Field | Value |
|---|---|
| Downloads | 142 |
| Likes | 1 |
| Pipeline | text-generation |
| Library | gguf |
| Visibility | Public |
| Access | Open |
Repository Files & Downloads
15 GGUF files detected
Direct downloads for all repository files
| File | Type | Quantization | Size | Link |
|---|---|---|---|---|
| Qwen3-0.6B-DISTILL-glm-4.7-think-f16.gguf | GGUF | F16 | 1.41 GB | Download |
| Qwen3-0.6B-DISTILL-glm-4.7-think-q2_k.gguf | GGUF | Q2_K | 331.20 MB | Download |
| Qwen3-0.6B-DISTILL-glm-4.7-think-q3_k_l.gguf | GGUF | Q3_K_L | 415.18 MB | Download |
| Qwen3-0.6B-DISTILL-glm-4.7-think-q3_k_m.gguf | GGUF | Q3_K_M | 394.80 MB | Download |
| Qwen3-0.6B-DISTILL-glm-4.7-think-q3_k_s.gguf | GGUF | Q3_K_S | 371.86 MB | Download |
| Qwen3-0.6B-DISTILL-glm-4.7-think-q4_0.gguf | GGUF | Q4_0 | 447.35 MB | Download |
| Qwen3-0.6B-DISTILL-glm-4.7-think-q4_1.gguf | GGUF | Q4_1 | 482.87 MB | Download |
| Qwen3-0.6B-DISTILL-glm-4.7-think-q4_k_m.gguf | GGUF | Q4_K_M | 461.79 MB | Download |
| Qwen3-0.6B-DISTILL-glm-4.7-think-q4_k_s.gguf | GGUF | Q4_K_S | 448.98 MB | Download |
| Qwen3-0.6B-DISTILL-glm-4.7-think-q5_0.gguf | GGUF | Q5_0 | 518.40 MB | Download |
| Qwen3-0.6B-DISTILL-glm-4.7-think-q5_1.gguf | GGUF | Q5_1 | 553.92 MB | Download |
| Qwen3-0.6B-DISTILL-glm-4.7-think-q5_k_m.gguf | GGUF | Q5_K_M | 525.84 MB | Download |
| Qwen3-0.6B-DISTILL-glm-4.7-think-q5_k_s.gguf | GGUF | Q5_K_S | 518.40 MB | Download |
| Qwen3-0.6B-DISTILL-glm-4.7-think-q6_k.gguf | GGUF | Q6_K | 593.89 MB | Download |
| Qwen3-0.6B-DISTILL-glm-4.7-think-q8_0.gguf | GGUF | Q8_0 | 767.47 MB | Download |
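To make the size trade-off in the table concrete, here is a minimal Python sketch that picks the largest listed file fitting a size budget and gives a rough bits-per-weight estimate. The helper names (`largest_quant_within`, `approx_bits_per_weight`) are hypothetical, the sizes are copied from the table, and two assumptions are baked in: that "MB" and "GB" are binary units (1 GB = 1024 MB), and that the F16 file stores roughly 16 bits per weight. File size is only a proxy for quality, so treat the result as a starting point.

```python
from typing import Optional

# File sizes in MB, copied from the table above.
# Assumption: units are binary, so the 1.41 GB F16 file is 1.41 * 1024 MB.
SIZES_MB = {
    "F16": 1.41 * 1024,
    "Q2_K": 331.20,
    "Q3_K_S": 371.86,
    "Q3_K_M": 394.80,
    "Q3_K_L": 415.18,
    "Q4_0": 447.35,
    "Q4_K_S": 448.98,
    "Q4_K_M": 461.79,
    "Q4_1": 482.87,
    "Q5_0": 518.40,
    "Q5_K_S": 518.40,
    "Q5_K_M": 525.84,
    "Q5_1": 553.92,
    "Q6_K": 593.89,
    "Q8_0": 767.47,
}


def largest_quant_within(budget_mb: float) -> Optional[str]:
    """Return the listed quantization with the largest file that fits the budget."""
    fitting = {quant: size for quant, size in SIZES_MB.items() if size <= budget_mb}
    if not fitting:
        return None
    return max(fitting, key=fitting.get)


def approx_bits_per_weight(quant: str) -> float:
    """Rough estimate: scale 16 bits by the file-size ratio to the F16 file.

    Ignores GGUF metadata overhead and any tensors kept at higher precision.
    """
    return 16.0 * SIZES_MB[quant] / SIZES_MB["F16"]


if __name__ == "__main__":
    print(largest_quant_within(500))
    print(round(approx_bits_per_weight("Q8_0"), 2))
```

The file on disk is a lower bound on memory use: a runtime also needs room for the KV cache and working buffers, so leave headroom when choosing a budget.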
Model Details
Metadata Inspector
Normalized metadata (stored in metadata_json)
{
"metadata": {},
"card_data": {
"base_model": "Qwen/Qwen3-0.6B",
"library_name": "gguf",
"license": "apache-2.0",
"language": [
"en"
],
"tags": [
"qwen3",
"gguf",
"quantized",
"distillation"
],
"pipeline_tag": "text-generation",
"frontmatter": {},
"hero_image_url": "",
"summary": "GGUF quantized versions of Qwen3-0.6B fine-tuned model.",
"quick_links": [],
"benchmark_table_html": "",
"readme_markdown": "---\r\nbase_model: Qwen/Qwen3-0.6B\r\nlibrary_name: gguf\r\nlicense: apache-2.0\r\nlanguage:\r\n- en\r\ntags:\r\n- qwen3\r\n- gguf\r\n- quantized\r\n- distillation\r\npipeline_tag: text-generation\r\n---\r\n\r\n# Qwen3-0.6B-DISTILL-glm-4.7-think-GGUF\r\n\r\nGGUF quantized versions of Qwen3-0.6B fine-tuned model.\r\n\r\n## Available Formats\r\n\r\n| Filename | Size | Quant Type | Description |\r\n|----------|------|------------|-------------|\r\n| Qwen3-0.6B-DISTILL-glm-4.7-think-f16.gguf | 1.41 GB | F16 | Largest, original FP16 |\r\n| Qwen3-0.6B-DISTILL-glm-4.7-think-q2_k.gguf | 0.32 GB | Q2_K | Smallest, significant quality loss |\r\n| Qwen3-0.6B-DISTILL-glm-4.7-think-q3_k_l.gguf | 0.41 GB | Q3_K_L | Small, better quality |\r\n| Qwen3-0.6B-DISTILL-glm-4.7-think-q3_k_m.gguf | 0.39 GB | Q3_K_M | Small, balanced |\r\n| Qwen3-0.6B-DISTILL-glm-4.7-think-q3_k_s.gguf | 0.36 GB | Q3_K_S | Small, moderate quality loss |\r\n| Qwen3-0.6B-DISTILL-glm-4.7-think-q4_0.gguf | 0.44 GB | Q4_0 | Medium, legacy format |\r\n| Qwen3-0.6B-DISTILL-glm-4.7-think-q4_1.gguf | 0.47 GB | Q4_1 | |\r\n| Qwen3-0.6B-DISTILL-glm-4.7-think-q4_k_m.gguf | 0.45 GB | Q4_K_M | **Recommended** - Best balance |\r\n| Qwen3-0.6B-DISTILL-glm-4.7-think-q4_k_s.gguf | 0.44 GB | Q4_K_S | Medium, good balance |\r\n| Qwen3-0.6B-DISTILL-glm-4.7-think-q5_0.gguf | 0.51 GB | Q5_0 | Medium-large, good quality |\r\n| Qwen3-0.6B-DISTILL-glm-4.7-think-q5_1.gguf | 0.54 GB | Q5_1 | |\r\n| Qwen3-0.6B-DISTILL-glm-4.7-think-q5_k_m.gguf | 0.51 GB | Q5_K_M | Medium-large, high quality |\r\n| Qwen3-0.6B-DISTILL-glm-4.7-think-q5_k_s.gguf | 0.51 GB | Q5_K_S | Medium-large, high quality |\r\n| Qwen3-0.6B-DISTILL-glm-4.7-think-q6_k.gguf | 0.58 GB | Q6_K | Large, very high quality |\r\n| Qwen3-0.6B-DISTILL-glm-4.7-think-q8_0.gguf | 0.75 GB | Q8_0 | Large, near lossless |\r\n\r\n\r\n## Quick Start\r\n\r\n### Ollama\r\n```bash\r\nollama run hf.co/glogwa68/Qwen3-0.6B-DISTILL-glm-4.7-think-GGUF:Q4_K_M\r\n```\r\n\r\n### llama.cpp\r\n```bash\r\nllama-cli --hf-repo glogwa68/Qwen3-0.6B-DISTILL-glm-4.7-think-GGUF --hf-file Qwen3-0.6B-DISTILL-glm-4.7-think-q4_k_m.gguf -p \"Hello\"\r\n```\r\n",
"related_quantizations": []
},
"tags": [
"gguf",
"qwen3",
"quantized",
"distillation",
"text-generation",
"en",
"base_model:Qwen/Qwen3-0.6B",
"base_model:quantized:Qwen/Qwen3-0.6B",
"license:apache-2.0",
"endpoints_compatible",
"region:us",
"conversational"
],
"likes": 1,
"downloads": 142,
"gated": false,
"private": false,
"last_modified": "2025-12-24T22:37:16.000Z",
"created_at": "2025-12-24T22:31:50.000Z",
"pipeline_tag": "text-generation",
"library_name": "gguf"
}
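The top-level `tags` array mixes plain labels with `key:value` entries such as `license:apache-2.0` and `base_model:quantized:Qwen/Qwen3-0.6B`. The following Python sketch shows one way a consumer might separate them; `split_tags` is a hypothetical helper, not part of any Hugging Face library, and it assumes only the first colon separates key from value.

```python
# Tags copied from the normalized metadata above.
TAGS = [
    "gguf",
    "qwen3",
    "quantized",
    "distillation",
    "text-generation",
    "en",
    "base_model:Qwen/Qwen3-0.6B",
    "base_model:quantized:Qwen/Qwen3-0.6B",
    "license:apache-2.0",
    "endpoints_compatible",
    "region:us",
    "conversational",
]


def split_tags(tags):
    """Split tags into plain labels and a dict of key -> list of values.

    Only the first ':' is treated as the separator, so a value may itself
    contain colons (for example 'quantized:Qwen/Qwen3-0.6B').
    """
    plain = []
    keyed = {}
    for tag in tags:
        key, sep, value = tag.partition(":")
        if sep:
            keyed.setdefault(key, []).append(value)
        else:
            plain.append(tag)
    return plain, keyed


plain, keyed = split_tags(TAGS)

if __name__ == "__main__":
    print(plain)
    print(keyed)
```

Keeping every value per key, instead of overwriting, preserves both `base_model` entries: the bare base model and the one that records the `quantized` relationship.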
Source payload excerpt (from Hugging Face API)
{
"_id": "694c69d6329f4825321109b8",
"id": "glogwa68/Qwen3-0.6B-DISTILL-glm-4.7-think-GGUF",
"modelId": "glogwa68/Qwen3-0.6B-DISTILL-glm-4.7-think-GGUF",
"sha": "764b476a11fcf5eab06e4767dbe747846de5d407",
"createdAt": "2025-12-24T22:31:50.000Z",
"lastModified": "2025-12-24T22:37:16.000Z",
"author": "glogwa68",
"downloads": 142,
"likes": 1,
"gated": false,
"private": false,
"pipeline_tag": "text-generation",
"library_name": "gguf",
"siblings_count": 17
}
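The `id` and `sha` fields in the payload are enough to build a download URL pinned to this exact commit. The Python sketch below assumes the Hugging Face Hub's usual `resolve` URL layout (`/<repo_id>/resolve/<revision>/<filename>`); the page's own Download links are not shown in this excerpt, so whether they use the same form is an assumption, and `resolve_url` is a hypothetical helper. Filenames are passed exactly as listed in the file table, including letter case.

```python
from urllib.parse import quote

# Values copied from the source payload excerpt above.
REPO_ID = "glogwa68/Qwen3-0.6B-DISTILL-glm-4.7-think-GGUF"
COMMIT_SHA = "764b476a11fcf5eab06e4767dbe747846de5d407"


def resolve_url(repo_id: str, filename: str, revision: str = "main") -> str:
    """Build a Hub file URL for a repo, revision (branch or commit), and filename.

    Assumes the standard Hub layout; the filename must match the repository
    listing exactly, including case.
    """
    return (
        "https://huggingface.co/"
        f"{repo_id}/resolve/{quote(revision, safe='')}/{quote(filename)}"
    )


if __name__ == "__main__":
    print(resolve_url(REPO_ID, "Qwen3-0.6B-DISTILL-glm-4.7-think-q4_k_m.gguf", COMMIT_SHA))
```

Pinning to the commit hash instead of `main` keeps a download reproducible even if the repository is updated after `lastModified`.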