GraySoft

lovedheart/glm-4.5-air-gguf-iq1_m is a GGUF model repository indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

lovedheart/glm-4.5-air-gguf-iq1_m overview

The IQ1_M/S quants are produced from the unsloth BF16 GGUF. Blk.46 is not used by llama.cpp, so the weights of blk.46 are quantized to TQ1_0 to minimize memory allocation.

Added MXFP4 versions:
1) MXFP4: Embedding and output are kept at Q6_K. The attn layers use IQ4_XS. All ffn expert layers, including shared experts, are quantized to SOTA MXFP4.
2) MXFP4 Max: Embedding, output, and attn layers are kept at Q6_K. The first layer uses full precision. The remaining ffn expert layers are quantized to SOTA MXFP4. The shared expert weights stay in BF16.
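The two MXFP4 recipes can be read as a per-tensor lookup. The following is a minimal sketch of that reading, not the author's quantization script: the function name `planned_type`, the variant strings, and the tensor-name patterns (common llama.cpp GGUF naming such as `blk.N.attn_q.weight` or `blk.N.ffn_gate_exps.weight`) are assumptions. It also assumes the "first layer" rule of MXFP4 Max covers every weight of blk.0, which the card does not spell out.

```python
import re

# Rules transcribed from the model card. Tensor-name patterns follow common
# llama.cpp GGUF naming and are an assumption, not read from the repository.
ATTN_WEIGHT = re.compile(r"attn_(q|k|v|output)\.weight$")
BLOCK_NAME = re.compile(r"blk\.(\d+)\.(.+)$")


def planned_type(tensor_name: str, variant: str) -> str:
    """Return the type the card describes for a tensor, or 'unspecified'."""
    if variant not in ("MXFP4", "MXFP4 Max"):
        raise ValueError(f"unknown variant: {variant!r}")
    is_max = variant == "MXFP4 Max"

    # Embedding and output are Q6_K in both variants.
    if tensor_name in ("token_embd.weight", "output.weight"):
        return "Q6_K"

    match = BLOCK_NAME.match(tensor_name)
    if match is None:
        return "unspecified"
    layer, rest = int(match.group(1)), match.group(2)

    # Assumption: "first layer uses full precision" applies to all of blk.0.
    if is_max and layer == 0:
        return "full precision"
    if ATTN_WEIGHT.match(rest):
        return "Q6_K" if is_max else "IQ4_XS"
    if "shexp" in rest:  # shared experts
        return "BF16" if is_max else "MXFP4"
    if rest.endswith("_exps.weight"):  # routed experts
        return "MXFP4"
    # Norms, biases, router weights: the card does not say.
    return "unspecified"
```

Tensors the card does not mention fall through to `unspecified` rather than being guessed.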

Tags: gguf, base_model:zai-org/GLM-4.5-Air, base_model:quantized:zai-org/GLM-4.5-Air, license:mit, endpoints_compatible, region:us, imatrix, conversational
Downloads: 250
Likes: 4
Pipeline: not specified
Library: not specified
Visibility: Public
Access: Open

Repository Files & Downloads

40 files detected
Direct downloads for all repository files
File Type Quantization Size Link
GLM-4.5-Air-IQ1_M_L-00001-of-00005.gguf GGUF IQ1_M_L 8.11 GB Download
GLM-4.5-Air-IQ1_M_L-00002-of-00005.gguf GGUF IQ1_M_L 7.90 GB Download
GLM-4.5-Air-IQ1_M_L-00003-of-00005.gguf GGUF IQ1_M_L 7.56 GB Download
GLM-4.5-Air-IQ1_M_L-00004-of-00005.gguf GGUF IQ1_M_L 7.90 GB Download
GLM-4.5-Air-IQ1_M_L-00005-of-00005.gguf GGUF IQ1_M_L 3.73 GB Download
GLM-4.5-Air-IQ1_M_XS-00001-of-00005.gguf GGUF IQ1_M_XS 7.95 GB Download
GLM-4.5-Air-IQ1_M_XS-00002-of-00005.gguf GGUF IQ1_M_XS 7.85 GB Download
GLM-4.5-Air-IQ1_M_XS-00003-of-00005.gguf GGUF IQ1_M_XS 7.50 GB Download
GLM-4.5-Air-IQ1_M_XS-00004-of-00005.gguf GGUF IQ1_M_XS 7.85 GB Download
GLM-4.5-Air-IQ1_M_XS-00005-of-00005.gguf GGUF IQ1_M_XS 3.62 GB Download
GLM-4.5-Air-IQ1_S_XS-00001-of-00005.gguf GGUF IQ1_S_XS 7.61 GB Download
GLM-4.5-Air-IQ1_S_XS-00002-of-00005.gguf GGUF IQ1_S_XS 7.51 GB Download
GLM-4.5-Air-IQ1_S_XS-00003-of-00005.gguf GGUF IQ1_S_XS 7.16 GB Download
GLM-4.5-Air-IQ1_S_XS-00004-of-00005.gguf GGUF IQ1_S_XS 7.51 GB Download
GLM-4.5-Air-IQ1_S_XS-00005-of-00005.gguf GGUF IQ1_S_XS 3.51 GB Download
GLM-4.5-Air-IQ2_XXS-00001-of-00005.gguf GGUF IQ2_XXS 8.92 GB Download
GLM-4.5-Air-IQ2_XXS-00002-of-00005.gguf GGUF IQ2_XXS 8.39 GB Download
GLM-4.5-Air-IQ2_XXS-00003-of-00005.gguf GGUF IQ2_XXS 8.08 GB Download
GLM-4.5-Air-IQ2_XXS-00004-of-00005.gguf GGUF IQ2_XXS 8.39 GB Download
GLM-4.5-Air-IQ2_XXS-00005-of-00005.gguf GGUF IQ2_XXS 4.50 GB Download
GLM-4.5-Air-IQ3_XXS_M-00001-of-00005.gguf GGUF IQ3_XXS_M 10.98 GB Download
GLM-4.5-Air-IQ3_XXS_M-00002-of-00005.gguf GGUF IQ3_XXS_M 10.84 GB Download
GLM-4.5-Air-IQ3_XXS_M-00003-of-00005.gguf GGUF IQ3_XXS_M 10.49 GB Download
GLM-4.5-Air-IQ3_XXS_M-00004-of-00005.gguf GGUF IQ3_XXS_M 10.84 GB Download
GLM-4.5-Air-IQ3_XXS_M-00005-of-00005.gguf GGUF IQ3_XXS_M 4.93 GB Download
GLM-4.5-Air-MXFP4_MOE-00001-of-00005.gguf GGUF MXFP4_MOE 12.68 GB Download
GLM-4.5-Air-MXFP4_MOE-00002-of-00005.gguf GGUF MXFP4_MOE 12.33 GB Download
GLM-4.5-Air-MXFP4_MOE-00003-of-00005.gguf GGUF MXFP4_MOE 12.03 GB Download
GLM-4.5-Air-MXFP4_MOE-00004-of-00005.gguf GGUF MXFP4_MOE 12.33 GB Download
GLM-4.5-Air-MXFP4_MOE-00005-of-00005.gguf GGUF MXFP4_MOE 6.04 GB Download
GLM-4.5-Air-MXFP4_MOE_Max-00001-of-00005.gguf GGUF MXFP4_MOE_Max 13.42 GB Download
GLM-4.5-Air-MXFP4_MOE_Max-00002-of-00005.gguf GGUF MXFP4_MOE_Max 12.86 GB Download
GLM-4.5-Air-MXFP4_MOE_Max-00003-of-00005.gguf GGUF MXFP4_MOE_Max 12.62 GB Download
GLM-4.5-Air-MXFP4_MOE_Max-00004-of-00005.gguf GGUF MXFP4_MOE_Max 12.86 GB Download
GLM-4.5-Air-MXFP4_MOE_Max-00005-of-00005.gguf GGUF MXFP4_MOE_Max 6.25 GB Download
GLM-4.5-Air-MXFP4_MOE_S-00001-of-00005.gguf GGUF MXFP4_MOE_S 10.69 GB Download
GLM-4.5-Air-MXFP4_MOE_S-00002-of-00005.gguf GGUF MXFP4_MOE_S 10.46 GB Download
GLM-4.5-Air-MXFP4_MOE_S-00003-of-00005.gguf GGUF MXFP4_MOE_S 11.40 GB Download
GLM-4.5-Air-MXFP4_MOE_S-00004-of-00005.gguf GGUF MXFP4_MOE_S 10.81 GB Download
GLM-4.5-Air-MXFP4_MOE_S-00005-of-00005.gguf GGUF MXFP4_MOE_S 5.23 GB Download
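Each quantization is split into five shards, so the size that matters when picking one is the sum per set. The sketch below adds up the shard sizes listed in the table; the labels for the MXFP4 sets are taken from the file names, and the helper names `total_gb` and `variants_under` are hypothetical.

```python
# Shard sizes in GB, copied from the file table (five shards per set).
SHARD_SIZES_GB = {
    "IQ1_M_L": [8.11, 7.90, 7.56, 7.90, 3.73],
    "IQ1_M_XS": [7.95, 7.85, 7.50, 7.85, 3.62],
    "IQ1_S_XS": [7.61, 7.51, 7.16, 7.51, 3.51],
    "IQ2_XXS": [8.92, 8.39, 8.08, 8.39, 4.50],
    "IQ3_XXS_M": [10.98, 10.84, 10.49, 10.84, 4.93],
    "MXFP4_MOE": [12.68, 12.33, 12.03, 12.33, 6.04],
    "MXFP4_MOE_Max": [13.42, 12.86, 12.62, 12.86, 6.25],
    "MXFP4_MOE_S": [10.69, 10.46, 11.40, 10.81, 5.23],
}


def total_gb(label: str) -> float:
    """Total download size of one quantization set, rounded to 2 decimals."""
    return round(sum(SHARD_SIZES_GB[label]), 2)


def variants_under(budget_gb: float) -> list:
    """Labels whose total size is at most budget_gb, smallest first."""
    fitting = [label for label in SHARD_SIZES_GB if total_gb(label) <= budget_gb]
    return sorted(fitting, key=total_gb)


if __name__ == "__main__":
    for label in sorted(SHARD_SIZES_GB, key=total_gb):
        print(f"{label:14s} {total_gb(label):6.2f} GB")
```

These totals are file sizes only. Runtime memory also depends on context length and other runtime buffers, so treat the budget check as a lower bound.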

Model Details

Model Slug: lovedheart/glm-4.5-air-gguf-iq1_m
Author: lovedheart
Pipeline Task: not specified
Library: not specified
Created: 2025-08-05
Last Modified: 2025-08-26
Gated: No
Private: No
HF SHA: dad9c6ab5eec5f95ca3518489cd3a27e465c1acb
License: mit
Language: Unknown
Base Model: zai-org/GLM-4.5-Air
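The HF SHA above identifies one exact revision of the repository, which makes a download reproducible. The sketch below builds the five shard names for a quantization and fetches them at that revision with `hf_hub_download` from the third-party `huggingface_hub` package. It assumes the shards sit at the repository root, as the file listing suggests; `shard_names` and `download_quant` are hypothetical helper names.

```python
REPO_ID = "lovedheart/GLM-4.5-Air-GGUF-IQ1_M"
REVISION = "dad9c6ab5eec5f95ca3518489cd3a27e465c1acb"  # HF SHA from this page


def shard_names(quant: str, total: int = 5) -> list:
    """File names of all shards for one quantization label, e.g. 'IQ1_M_L'."""
    return [
        f"GLM-4.5-Air-{quant}-{index:05d}-of-{total:05d}.gguf"
        for index in range(1, total + 1)
    ]


def download_quant(quant: str, local_dir: str) -> list:
    """Download every shard of one quantization, pinned to REVISION.

    Requires `pip install huggingface_hub`; imported lazily so that
    shard_names() stays usable without the package installed.
    """
    from huggingface_hub import hf_hub_download

    return [
        hf_hub_download(
            repo_id=REPO_ID,
            filename=name,
            revision=REVISION,
            local_dir=local_dir,
        )
        for name in shard_names(quant)
    ]
```

Keeping all five shards in one directory matters: llama.cpp is typically pointed at the first shard of a split GGUF and locates the remaining ones next to it.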

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "license": "mit",
    "base_model": [
      "zai-org/GLM-4.5-Air"
    ],
    "frontmatter": {
      "license": "mit",
      "base_model": [
        "zai-org/GLM-4.5-Air"
      ]
    },
    "hero_image_url": "",
    "summary": "Use unsloth BF16 GGUF to quantize IQ1_M/S. Blk.46 is not being used in llama.cpp therefore the weights of blk.46 are quantized to TQ1_0 to have minimum memory allocation. --- Added MXFP4 version: 1) MXFP4: Embedding, Output are kept with Q6_K. The attn layers use IQ4_XS. All ffn expert layers including shared experts are quantized to SOTA MXFP4. 2) MXFP4 Max: Embedding, Output and attn layers are kept with Q6_K. First layer uses full precision. The rest of ffn expert layers are quantized to SOTA MXFP4. The shared experts weights keep BF16.",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: mit\nbase_model:\n- zai-org/GLM-4.5-Air\n---\n\nUse unsloth BF16 GGUF to quantize IQ1_M/S. Blk.46 is not being used in llama.cpp therefore the weights of blk.46 are quantized to TQ1_0 to have minimum memory allocation.\n\n---\nAdded MXFP4 version:\n1) MXFP4: Embedding, Output are kept with Q6_K. The attn layers use IQ4_XS. All ffn expert layers including shared experts are quantized to SOTA MXFP4.\n2) MXFP4 Max: Embedding, Output and attn layers are kept with Q6_K. First layer uses full precision. The rest of ffn expert layers are quantized to SOTA MXFP4. The shared experts weights keep BF16.",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "base_model:zai-org/GLM-4.5-Air",
    "base_model:quantized:zai-org/GLM-4.5-Air",
    "license:mit",
    "endpoints_compatible",
    "region:us",
    "imatrix",
    "conversational"
  ],
  "likes": 4,
  "downloads": 250,
  "gated": false,
  "private": false,
  "last_modified": "2025-08-26T05:57:05.000Z",
  "created_at": "2025-08-05T15:42:13.000Z",
  "pipeline_tag": "",
  "library_name": ""
}
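The `tags` array mixes `key:value` entries with bare flags, and a key such as `base_model` can appear more than once. A minimal sketch of splitting them apart follows; the function name `split_tags` is hypothetical and the tag list is copied from the normalized metadata above.

```python
TAGS = [
    "gguf",
    "base_model:zai-org/GLM-4.5-Air",
    "base_model:quantized:zai-org/GLM-4.5-Air",
    "license:mit",
    "endpoints_compatible",
    "region:us",
    "imatrix",
    "conversational",
]


def split_tags(tags):
    """Return (pairs, flags): pairs maps each key to a list of its values."""
    pairs = {}
    flags = []
    for tag in tags:
        # Split on the first colon only, so 'quantized:zai-org/...' stays whole.
        key, sep, value = tag.partition(":")
        if sep:
            pairs.setdefault(key, []).append(value)
        else:
            flags.append(tag)
    return pairs, flags


if __name__ == "__main__":
    pairs, flags = split_tags(TAGS)
    print(pairs["license"], flags)
```

Splitting on the first colon only keeps the `quantized:` relation prefix attached to the base model name, so a caller can tell a plain base-model reference from a quantized-from one.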
Source payload excerpt (from Hugging Face API)
{
  "_id": "689226559416d9247e8b1d7f",
  "id": "lovedheart/GLM-4.5-Air-GGUF-IQ1_M",
  "modelId": "lovedheart/GLM-4.5-Air-GGUF-IQ1_M",
  "sha": "dad9c6ab5eec5f95ca3518489cd3a27e465c1acb",
  "createdAt": "2025-08-05T15:42:13.000Z",
  "lastModified": "2025-08-26T05:57:05.000Z",
  "author": "lovedheart",
  "downloads": 250,
  "likes": 4,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 42
}
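Two values in the payload are worth cross-checking against the rest of the page: the timestamps and `siblings_count`. The sketch below parses the ISO timestamps with the standard library and compares the sibling count with the 40 GGUF files listed earlier. The helper names are hypothetical, and what the extra siblings are (for example a README) is not stated by the payload.

```python
from datetime import datetime, timezone

PAYLOAD = {
    "createdAt": "2025-08-05T15:42:13.000Z",
    "lastModified": "2025-08-26T05:57:05.000Z",
    "siblings_count": 42,
}
LISTED_GGUF_FILES = 40  # "40 files detected" in the file table


def parse_hf_timestamp(value: str) -> datetime:
    """Parse a timestamp like '2025-08-05T15:42:13.000Z' as UTC."""
    parsed = datetime.strptime(value, "%Y-%m-%dT%H:%M:%S.%fZ")
    return parsed.replace(tzinfo=timezone.utc)


def time_between_updates(payload: dict):
    """Time elapsed from repository creation to the last modification."""
    created = parse_hf_timestamp(payload["createdAt"])
    modified = parse_hf_timestamp(payload["lastModified"])
    return modified - created


def unlisted_siblings(payload: dict, listed: int = LISTED_GGUF_FILES) -> int:
    """Repository files counted by the API but absent from the GGUF table."""
    return payload["siblings_count"] - listed


if __name__ == "__main__":
    print(time_between_updates(PAYLOAD), unlisted_siblings(PAYLOAD))
```

`strptime` with an explicit format is used instead of `datetime.fromisoformat` because older Python versions reject the trailing `Z`.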