Model Intelligence Sheet

octopusmegalopod/some-glmocr-ggufs overview

GLM-OCR converted for use with llama.cpp. As of llama.cpp build b8094, it may crash if flash-attn is on or if certain backends are used (e.g., CPU). ~~Regrettably, does not seem to perform well for single CJK line texts out of the box. Or perhaps it expects the image in a specific resolution, due to its dependence on PaddleLayout.~~ GLM-OCR in llama.cpp behaves differently depending on the maximum context size, for reasons that are not yet clear. For example, setting -c 2000 typically gives wrong outputs, whereas -c 9000 has produced much better results. So far in my tests:
- performs poorly on Korean horizontal text lines.
- performs decently on Japanese horizontal text lines.
- performs poorly on Japanese vertical text lines.
- performs excellently on Chinese horizontal text lines.
- relatively robust to quantization.
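The notes above suggest running with a larger context, flash attention disabled, and a non-CPU backend. A minimal sketch of assembling such a command is shown here. The binary name `llama-mtmd-cli`, the `--flash-attn off` and `-ngl` spellings, the default file pairing, and the prompt text are assumptions not taken from the model card; check them against `--help` for your llama.cpp build.

```python
import shlex


def build_glmocr_command(
    image_path,
    prompt,
    model="glmocr-Q8_0.gguf",            # any model file from this repo
    mmproj="mmproj-glmocr-Q8_0.gguf",    # any mmproj file from this repo
    ctx_size=9000,                       # the card reports -c 2000 gives wrong outputs
    gpu_layers=99,                       # assumption: offload to GPU to avoid the CPU backend
    binary="llama-mtmd-cli",             # assumption: llama.cpp multimodal CLI
):
    """Return an argv list for a single OCR run (nothing is executed here)."""
    return [
        binary,
        "-m", model,
        "--mmproj", mmproj,
        "--image", image_path,
        "-p", prompt,
        "-c", str(ctx_size),
        "--flash-attn", "off",           # assumption: flag spelling in recent builds
        "-ngl", str(gpu_layers),
    ]


# Hypothetical image path and prompt, for illustration only.
cmd = build_glmocr_command("page.png", "Transcribe the text in this image.")
print(shlex.join(cmd))
```

Passing the resulting list to `subprocess.run(cmd, check=True)` would launch the run. Keeping the context size as a parameter makes it easy to compare outputs at different `-c` values.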

gguf, image-text-to-text, base_model:zai-org/GLM-OCR, base_model:quantized:zai-org/GLM-OCR, endpoints_compatible, region:us, conversational

octopusmegalopod/some-glmocr-ggufs visual

Downloads

451

Likes

1

Pipeline

image-text-to-text

Library

—

Visibility

Public

Access

Open

Repository Files & Downloads

8 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
glmocr-BF16.gguf	GGUF	BF16	1.66 GB	Download
glmocr-Q4_K_M.gguf	GGUF	Q4_K_M	523.10 MB	Download
glmocr-Q8_0.gguf	GGUF	Q8_0	906.41 MB	Download
glmocr-f16.gguf	GGUF	F16	1.66 GB	Download
mmproj-glmocr-BF16.gguf	GGUF	BF16	843.06 MB	Download
mmproj-glmocr-Q4_1.gguf	GGUF	Q4_1	284.12 MB	Download
mmproj-glmocr-Q8_0.gguf	GGUF	Q8_0	461.96 MB	Download
mmproj-glmocr-f16.gguf	GGUF	F16	828.76 MB	Download
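Each run needs one model file plus one mmproj file from the table. The sketch below builds direct download URLs and adds up the size of a chosen pair. It assumes the usual Hugging Face `resolve/main` URL layout. The pairings are only illustrative, since the card does not say which mmproj goes with which model.

```python
from urllib.parse import quote

REPO_ID = "octopusmegalopod/some-glmocr-ggufs"

# Sizes in MB, copied from the table above (quantized files only).
SIZES_MB = {
    "glmocr-Q4_K_M.gguf": 523.10,
    "glmocr-Q8_0.gguf": 906.41,
    "mmproj-glmocr-Q4_1.gguf": 284.12,
    "mmproj-glmocr-Q8_0.gguf": 461.96,
}


def download_url(filename, revision="main"):
    """Direct-download URL, assuming the standard resolve/<revision> layout."""
    return f"https://huggingface.co/{REPO_ID}/resolve/{revision}/{quote(filename)}"


def pair_size_mb(model, mmproj):
    """Total download size in MB for one model file plus one mmproj file."""
    return round(SIZES_MB[model] + SIZES_MB[mmproj], 2)


# Illustrative pairings (an assumption, not a recommendation from the card).
for model, mmproj in [
    ("glmocr-Q4_K_M.gguf", "mmproj-glmocr-Q4_1.gguf"),
    ("glmocr-Q8_0.gguf", "mmproj-glmocr-Q8_0.gguf"),
]:
    print(download_url(model))
    print(download_url(mmproj))
    print(f"total: {pair_size_mb(model, mmproj)} MB")
```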

Model Details Live

Model Slug

octopusmegalopod/some-glmocr-ggufs

Author

octopusmegalopod

Pipeline Task

image-text-to-text

Library

—

Created

2026-02-18

Last Modified

2026-02-19

Gated

No

Private

No

HF SHA

318c473f8b924c4ccbd748d85ecc22a76d499c50

License

Unknown

Language

Unknown

Base Model

zai-org/GLM-OCR

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "base_model": [
      "zai-org/GLM-OCR"
    ],
    "pipeline_tag": "image-text-to-text",
    "frontmatter": {
      "base_model": [
        "zai-org/GLM-OCR"
      ],
      "pipeline_tag": "image-text-to-text"
    },
    "hero_image_url": "",
    "summary": "GLM-OCR converted for use with llama.cpp. As of b8094, it may crash in llama.cpp if flash-attn is on or certain backends are used (e.g: CPU). ~~Regrettably, does not seem to perform well for single CJK line texts out of the box. Or perhaps it expects the image in a specific resolution, due to its dependence on PaddleLayout.~~ GLM-OCR in llama.cpp behaves differently depending on the maximum context for some reason. Setting -c 2000 gives wrong outputs typically, for example. Whereas -c 9000 has produced much better results so far in my tests. So far in my tests:",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nbase_model:\n- zai-org/GLM-OCR\npipeline_tag: image-text-to-text\n---\n[GLM-OCR](https://huggingface.co/zai-org/GLM-OCR) converted for use with llama.cpp. As of b8094, it may crash in llama.cpp if `flash-attn` is on or certain backends are used (e.g: CPU).\n\n~~Regrettably, does not seem to perform well for single CJK line texts out of the box. Or perhaps it expects the image in a specific resolution, due to its dependence on PaddleLayout.~~\n\nGLM-OCR in llama.cpp behaves differently depending on the maximum context for some reason. Setting `-c 2000` gives wrong outputs typically, for example. Whereas `-c 9000` has produced much better results so far in my tests.\n\nSo far in my tests:\n- performs poorly on Korean horizontal text lines.\n- performs decently on Japanese horizontal text lines.\n- performs poorly on Japanese vertical text lines.\n- performs excellently on Chinese horizontal text lines.\n- relatively robust to quantization",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "image-text-to-text",
    "base_model:zai-org/GLM-OCR",
    "base_model:quantized:zai-org/GLM-OCR",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 1,
  "downloads": 451,
  "gated": false,
  "private": false,
  "last_modified": "2026-02-19T01:29:09.000Z",
  "created_at": "2026-02-18T21:47:46.000Z",
  "pipeline_tag": "image-text-to-text",
  "library_name": ""
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "69963382fb4584a79829750d",
  "id": "octopusmegalopod/some-glmocr-ggufs",
  "modelId": "octopusmegalopod/some-glmocr-ggufs",
  "sha": "318c473f8b924c4ccbd748d85ecc22a76d499c50",
  "createdAt": "2026-02-18T21:47:46.000Z",
  "lastModified": "2026-02-19T01:29:09.000Z",
  "author": "octopusmegalopod",
  "downloads": 451,
  "likes": 1,
  "gated": false,
  "private": false,
  "pipeline_tag": "image-text-to-text",
  "library_name": "",
  "siblings_count": 10
}
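An excerpt like the one above could be produced from the public Hugging Face model endpoint. The following sketch assumes that `siblings_count` is simply the length of the API's `siblings` list, which is a guess about how this page derived it. The function names are hypothetical.

```python
import json
from urllib.request import urlopen

EXCERPT_KEYS = [
    "_id", "id", "modelId", "sha", "createdAt", "lastModified",
    "author", "downloads", "likes", "gated", "private",
    "pipeline_tag", "library_name",
]


def fetch_model_payload(repo_id):
    """Fetch the raw model JSON from the Hugging Face API (needs network access)."""
    with urlopen(f"https://huggingface.co/api/models/{repo_id}") as resp:
        return json.load(resp)


def to_excerpt(payload):
    """Keep only the excerpt fields and replace the file list with a count."""
    excerpt = {key: payload.get(key) for key in EXCERPT_KEYS}
    # Assumption: siblings_count is the number of entries in "siblings".
    excerpt["siblings_count"] = len(payload.get("siblings", []))
    return excerpt
```

Calling `to_excerpt(fetch_model_payload("octopusmegalopod/some-glmocr-ggufs"))` would return the current values, which may differ from the snapshot shown here.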