lovedheart/GLM-4.5-Air-GGUF-IQ1_M is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. Every quantization variant in this repository is split into five GGUF shards. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.
lovedheart/glm-4.5-air-gguf-iq1_m overview
Use unsloth BF16 GGUF to quantize IQ1_M/S. Blk.46 is not being used in llama.cpp therefore the weights of blk.46 are quantized to TQ1_0 to have minimum memory allocation.

---

Added MXFP4 version:
1) MXFP4: Embedding, Output are kept with Q6_K. The attn layers use IQ4_XS. All ffn expert layers including shared experts are quantized to SOTA MXFP4.
2) MXFP4 Max: Embedding, Output and attn layers are kept with Q6_K. First layer uses full precision. The rest of ffn expert layers are quantized to SOTA MXFP4. The shared experts weights keep BF16.
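The two MXFP4 recipes can be read as an ordered list of rules that map a tensor name to a quantization type. The sketch below is only an illustration of that reading, not the author's tooling. The tensor-name patterns (`token_embd`, `output`, `blk.N.attn_*`, `*_exps`, `*_shexp`) follow common llama.cpp GGUF naming and are assumptions, as is treating "full precision" for the first layer as BF16 applied to every tensor in `blk.0`.

```python
import re

# Ordered (pattern, type) rules; the first match wins.
# Patterns are assumed from common llama.cpp tensor naming, not read from the repo.
RECIPES = {
    "MXFP4": [
        (r"^(token_embd|output)\.weight$", "Q6_K"),
        (r"^blk\.\d+\.attn_(q|k|v|output)\.weight$", "IQ4_XS"),
        (r"^blk\.\d+\.ffn_.*(_exps|_shexp)\.weight$", "MXFP4"),
    ],
    "MXFP4 Max": [
        (r"^(token_embd|output)\.weight$", "Q6_K"),
        (r"^blk\.0\.", "BF16"),  # assumption: "full precision" means BF16 for all of blk.0
        (r"^blk\.\d+\.attn_(q|k|v|output)\.weight$", "Q6_K"),
        (r"^blk\.\d+\.ffn_.*_shexp\.weight$", "BF16"),
        (r"^blk\.\d+\.ffn_.*_exps\.weight$", "MXFP4"),
    ],
}


def pick_type(recipe, tensor_name):
    """Return the quantization type the recipe assigns, or None if the text does not cover it."""
    for pattern, qtype in RECIPES[recipe]:
        if re.search(pattern, tensor_name):
            return qtype
    return None


if __name__ == "__main__":
    for name in ("token_embd.weight", "blk.5.attn_q.weight", "blk.5.ffn_up_shexp.weight"):
        print(name, pick_type("MXFP4", name), pick_type("MXFP4 Max", name))
```

Tensors the description does not mention, such as norms or the expert router, return `None` here rather than a guessed type.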
Repository Files & Downloads
| File | Type | Quantization | Size | Link |
|---|---|---|---|---|
| GLM-4.5-Air-IQ1_M_L-00001-of-00005.gguf | GGUF | IQ1_M_L | 8.11 GB | Download |
| GLM-4.5-Air-IQ1_M_L-00002-of-00005.gguf | GGUF | IQ1_M_L | 7.90 GB | Download |
| GLM-4.5-Air-IQ1_M_L-00003-of-00005.gguf | GGUF | IQ1_M_L | 7.56 GB | Download |
| GLM-4.5-Air-IQ1_M_L-00004-of-00005.gguf | GGUF | IQ1_M_L | 7.90 GB | Download |
| GLM-4.5-Air-IQ1_M_L-00005-of-00005.gguf | GGUF | IQ1_M_L | 3.73 GB | Download |
| GLM-4.5-Air-IQ1_M_XS-00001-of-00005.gguf | GGUF | IQ1_M_XS | 7.95 GB | Download |
| GLM-4.5-Air-IQ1_M_XS-00002-of-00005.gguf | GGUF | IQ1_M_XS | 7.85 GB | Download |
| GLM-4.5-Air-IQ1_M_XS-00003-of-00005.gguf | GGUF | IQ1_M_XS | 7.50 GB | Download |
| GLM-4.5-Air-IQ1_M_XS-00004-of-00005.gguf | GGUF | IQ1_M_XS | 7.85 GB | Download |
| GLM-4.5-Air-IQ1_M_XS-00005-of-00005.gguf | GGUF | IQ1_M_XS | 3.62 GB | Download |
| GLM-4.5-Air-IQ1_S_XS-00001-of-00005.gguf | GGUF | IQ1_S_XS | 7.61 GB | Download |
| GLM-4.5-Air-IQ1_S_XS-00002-of-00005.gguf | GGUF | IQ1_S_XS | 7.51 GB | Download |
| GLM-4.5-Air-IQ1_S_XS-00003-of-00005.gguf | GGUF | IQ1_S_XS | 7.16 GB | Download |
| GLM-4.5-Air-IQ1_S_XS-00004-of-00005.gguf | GGUF | IQ1_S_XS | 7.51 GB | Download |
| GLM-4.5-Air-IQ1_S_XS-00005-of-00005.gguf | GGUF | IQ1_S_XS | 3.51 GB | Download |
| GLM-4.5-Air-IQ2_XXS-00001-of-00005.gguf | GGUF | IQ2_XXS | 8.92 GB | Download |
| GLM-4.5-Air-IQ2_XXS-00002-of-00005.gguf | GGUF | IQ2_XXS | 8.39 GB | Download |
| GLM-4.5-Air-IQ2_XXS-00003-of-00005.gguf | GGUF | IQ2_XXS | 8.08 GB | Download |
| GLM-4.5-Air-IQ2_XXS-00004-of-00005.gguf | GGUF | IQ2_XXS | 8.39 GB | Download |
| GLM-4.5-Air-IQ2_XXS-00005-of-00005.gguf | GGUF | IQ2_XXS | 4.50 GB | Download |
| GLM-4.5-Air-IQ3_XXS_M-00001-of-00005.gguf | GGUF | IQ3_XXS_M | 10.98 GB | Download |
| GLM-4.5-Air-IQ3_XXS_M-00002-of-00005.gguf | GGUF | IQ3_XXS_M | 10.84 GB | Download |
| GLM-4.5-Air-IQ3_XXS_M-00003-of-00005.gguf | GGUF | IQ3_XXS_M | 10.49 GB | Download |
| GLM-4.5-Air-IQ3_XXS_M-00004-of-00005.gguf | GGUF | IQ3_XXS_M | 10.84 GB | Download |
| GLM-4.5-Air-IQ3_XXS_M-00005-of-00005.gguf | GGUF | IQ3_XXS_M | 4.93 GB | Download |
| GLM-4.5-Air-MXFP4_MOE-00001-of-00005.gguf | GGUF | MXFP4_MOE | 12.68 GB | Download |
| GLM-4.5-Air-MXFP4_MOE-00002-of-00005.gguf | GGUF | MXFP4_MOE | 12.33 GB | Download |
| GLM-4.5-Air-MXFP4_MOE-00003-of-00005.gguf | GGUF | MXFP4_MOE | 12.03 GB | Download |
| GLM-4.5-Air-MXFP4_MOE-00004-of-00005.gguf | GGUF | MXFP4_MOE | 12.33 GB | Download |
| GLM-4.5-Air-MXFP4_MOE-00005-of-00005.gguf | GGUF | MXFP4_MOE | 6.04 GB | Download |
| GLM-4.5-Air-MXFP4_MOE_Max-00001-of-00005.gguf | GGUF | MXFP4_MOE_Max | 13.42 GB | Download |
| GLM-4.5-Air-MXFP4_MOE_Max-00002-of-00005.gguf | GGUF | MXFP4_MOE_Max | 12.86 GB | Download |
| GLM-4.5-Air-MXFP4_MOE_Max-00003-of-00005.gguf | GGUF | MXFP4_MOE_Max | 12.62 GB | Download |
| GLM-4.5-Air-MXFP4_MOE_Max-00004-of-00005.gguf | GGUF | MXFP4_MOE_Max | 12.86 GB | Download |
| GLM-4.5-Air-MXFP4_MOE_Max-00005-of-00005.gguf | GGUF | MXFP4_MOE_Max | 6.25 GB | Download |
| GLM-4.5-Air-MXFP4_MOE_S-00001-of-00005.gguf | GGUF | MXFP4_MOE_S | 10.69 GB | Download |
| GLM-4.5-Air-MXFP4_MOE_S-00002-of-00005.gguf | GGUF | MXFP4_MOE_S | 10.46 GB | Download |
| GLM-4.5-Air-MXFP4_MOE_S-00003-of-00005.gguf | GGUF | MXFP4_MOE_S | 11.40 GB | Download |
| GLM-4.5-Air-MXFP4_MOE_S-00004-of-00005.gguf | GGUF | MXFP4_MOE_S | 10.81 GB | Download |
| GLM-4.5-Air-MXFP4_MOE_S-00005-of-00005.gguf | GGUF | MXFP4_MOE_S | 5.23 GB | Download |
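Because each variant is split into five shards, the number that matters when choosing one is the combined size. The sketch below sums the shard sizes listed in the table; the variant names are taken from the file names, and the helper names are hypothetical. Treat the totals as download sizes only: the memory needed at run time is higher, since the runtime also allocates context and compute buffers.

```python
# Per-shard sizes in GB, transcribed from the table above (five shards per variant).
SHARD_SIZES_GB = {
    "IQ1_M_L": [8.11, 7.90, 7.56, 7.90, 3.73],
    "IQ1_M_XS": [7.95, 7.85, 7.50, 7.85, 3.62],
    "IQ1_S_XS": [7.61, 7.51, 7.16, 7.51, 3.51],
    "IQ2_XXS": [8.92, 8.39, 8.08, 8.39, 4.50],
    "IQ3_XXS_M": [10.98, 10.84, 10.49, 10.84, 4.93],
    "MXFP4_MOE": [12.68, 12.33, 12.03, 12.33, 6.04],
    "MXFP4_MOE_Max": [13.42, 12.86, 12.62, 12.86, 6.25],
    "MXFP4_MOE_S": [10.69, 10.46, 11.40, 10.81, 5.23],
}


def total_size_gb(variant):
    """Sum the shard sizes of one variant, rounded to two decimals."""
    return round(sum(SHARD_SIZES_GB[variant]), 2)


def variants_within(budget_gb):
    """Return the variants whose combined size fits the budget, largest first."""
    fitting = [(total_size_gb(v), v) for v in SHARD_SIZES_GB if total_size_gb(v) <= budget_gb]
    return [v for _, v in sorted(fitting, reverse=True)]


if __name__ == "__main__":
    for name in SHARD_SIZES_GB:
        print(f"{name}: {total_size_gb(name):.2f} GB")
    print("Fits in 40 GB:", variants_within(40))
```

Sorting largest first puts the least aggressively quantized variant that still fits at the front of the list.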
Model Details
Metadata Inspector
Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "license": "mit",
    "base_model": [
      "zai-org/GLM-4.5-Air"
    ],
    "frontmatter": {
      "license": "mit",
      "base_model": [
        "zai-org/GLM-4.5-Air"
      ]
    },
    "hero_image_url": "",
    "summary": "Use unsloth BF16 GGUF to quantize IQ1_M/S. Blk.46 is not being used in llama.cpp therefore the weights of blk.46 are quantized to TQ1_0 to have minimum memory allocation. --- Added MXFP4 version: 1) MXFP4: Embedding, Output are kept with Q6_K. The attn layers use IQ4_XS. All ffn expert layers including shared experts are quantized to SOTA MXFP4. 2) MXFP4 Max: Embedding, Output and attn layers are kept with Q6_K. First layer uses full precision. The rest of ffn expert layers are quantized to SOTA MXFP4. The shared experts weights keep BF16.",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: mit\nbase_model:\n- zai-org/GLM-4.5-Air\n---\n\nUse unsloth BF16 GGUF to quantize IQ1_M/S. Blk.46 is not being used in llama.cpp therefore the weights of blk.46 are quantized to TQ1_0 to have minimum memory allocation.\n\n---\nAdded MXFP4 version:\n1) MXFP4: Embedding, Output are kept with Q6_K. The attn layers use IQ4_XS. All ffn expert layers including shared experts are quantized to SOTA MXFP4.\n2) MXFP4 Max: Embedding, Output and attn layers are kept with Q6_K. First layer uses full precision. The rest of ffn expert layers are quantized to SOTA MXFP4. The shared experts weights keep BF16.",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "base_model:zai-org/GLM-4.5-Air",
    "base_model:quantized:zai-org/GLM-4.5-Air",
    "license:mit",
    "endpoints_compatible",
    "region:us",
    "imatrix",
    "conversational"
  ],
  "likes": 4,
  "downloads": 250,
  "gated": false,
  "private": false,
  "last_modified": "2025-08-26T05:57:05.000Z",
  "created_at": "2025-08-05T15:42:13.000Z",
  "pipeline_tag": "",
  "library_name": ""
}
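The `tags` array mixes bare flags (`gguf`, `imatrix`) with `key:value` entries such as `license:mit`. A minimal sketch of splitting them into a lookup table follows; the function name and the `flags` bucket are hypothetical, and splitting on the last colon is one possible convention, chosen here so that `base_model:quantized:...` keeps its qualifier in the key.

```python
# Tags copied from the normalized metadata above.
TAGS = [
    "gguf",
    "base_model:zai-org/GLM-4.5-Air",
    "base_model:quantized:zai-org/GLM-4.5-Air",
    "license:mit",
    "endpoints_compatible",
    "region:us",
    "imatrix",
    "conversational",
]


def parse_tags(tags):
    """Group 'key:value' tags by key; tags without a colon go under 'flags'."""
    parsed = {"flags": []}
    for tag in tags:
        # Split on the LAST colon so 'base_model:quantized:<repo>' keeps 'base_model:quantized' as key.
        key, sep, value = tag.rpartition(":")
        if sep:
            parsed.setdefault(key, []).append(value)
        else:
            parsed["flags"].append(tag)
    return parsed


if __name__ == "__main__":
    info = parse_tags(TAGS)
    print("license:", info["license"])
    print("quantized from:", info["base_model:quantized"])
    print("flags:", info["flags"])
```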
Source payload excerpt (from Hugging Face API)
{
  "_id": "689226559416d9247e8b1d7f",
  "id": "lovedheart/GLM-4.5-Air-GGUF-IQ1_M",
  "modelId": "lovedheart/GLM-4.5-Air-GGUF-IQ1_M",
  "sha": "dad9c6ab5eec5f95ca3518489cd3a27e465c1acb",
  "createdAt": "2025-08-05T15:42:13.000Z",
  "lastModified": "2025-08-26T05:57:05.000Z",
  "author": "lovedheart",
  "downloads": 250,
  "likes": 4,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 42
}
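The `id` and `sha` fields are enough to construct direct download URLs for a variant's shards. The sketch below assumes that the files sit at the repository root and that the usual Hugging Face `resolve/<revision>/<filename>` URL layout applies; the helper names are hypothetical. Pinning the revision to the `sha` from the payload keeps the five shards consistent with one another even if the repository is updated later.

```python
# Values copied from the payload excerpt above.
REPO_ID = "lovedheart/GLM-4.5-Air-GGUF-IQ1_M"
REVISION = "dad9c6ab5eec5f95ca3518489cd3a27e465c1acb"  # the "sha" field


def shard_names(variant, count=5):
    """Build the shard file names for a variant, e.g. ...-00001-of-00005.gguf."""
    return [f"GLM-4.5-Air-{variant}-{i:05d}-of-{count:05d}.gguf" for i in range(1, count + 1)]


def shard_urls(variant, revision=REVISION):
    """Build download URLs; assumes files are at the repo root (not confirmed by this page)."""
    base = f"https://huggingface.co/{REPO_ID}/resolve/{revision}"
    return [f"{base}/{name}" for name in shard_names(variant)]


if __name__ == "__main__":
    for url in shard_urls("IQ2_XXS"):
        print(url)
```

All five shards of a variant need to be in the same directory. Runtimes built on llama.cpp are typically pointed at the first shard (`-00001-of-00005.gguf`) and locate the rest from there.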