GraySoft
Projects Models About FAQ Contact Download guIDE →
Model Intelligence Sheet

octopusmegalopod/some-glmocr-ggufs overview

GLM-OCR converted for use with llama.cpp. As of b8094, it may crash in llama.cpp if flash-attn is on or certain backends are used (e.g: CPU). ~~Regrettably, does not seem to perform well for single CJK line texts out of the box. Or perhaps it expects the image in a specific resolution, due to its dependence on PaddleLayout.~~ GLM-OCR in llama.cpp behaves differently depending on the maximum context for some reason. Setting -c 2000 gives wrong outputs typically, for example. Whereas -c 9000 has produced much better results so far in my tests. So far in my tests:

ggufimage-text-to-textbase_model:zai-org/GLM-OCRbase_model:quantized:zai-org/GLM-OCRendpoints_compatibleregion:usconversational
octopusmegalopod/some-glmocr-ggufs visual
Downloads
451
Likes
1
Pipeline
image-text-to-text
Library
Visibility
Public
Access
Open

Repository Files & Downloads

8 files detected
Direct downloads for all repository files
FileTypeQuantizationSizeLink
glmocr-BF16.gguf GGUF BF16 1.66 GB Download
glmocr-Q4_K_M.gguf GGUF Q4_K_M 523.10 MB Download
glmocr-Q8_0.gguf GGUF 906.41 MB Download
glmocr-f16.gguf GGUF F16 1.66 GB Download
mmproj-glmocr-BF16.gguf GGUF BF16 843.06 MB Download
mmproj-glmocr-Q4_1.gguf GGUF 284.12 MB Download
mmproj-glmocr-Q8_0.gguf GGUF 461.96 MB Download
mmproj-glmocr-f16.gguf GGUF F16 828.76 MB Download

Model Details Live

Model Slug
octopusmegalopod/some-glmocr-ggufs
Author
octopusmegalopod
Pipeline Task
image-text-to-text
Library
Created
2026-02-18
Last Modified
2026-02-19
Gated
No
Private
No
HF SHA
318c473f8b924c4ccbd748d85ecc22a76d499c50
License
Unknown
Language
Unknown
Base Model
zai-org/GLM-OCR

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "base_model": [
      "zai-org/GLM-OCR"
    ],
    "pipeline_tag": "image-text-to-text",
    "frontmatter": {
      "base_model": [
        "zai-org/GLM-OCR"
      ],
      "pipeline_tag": "image-text-to-text"
    },
    "hero_image_url": "",
    "summary": "GLM-OCR converted for use with llama.cpp. As of b8094, it may crash in llama.cpp if flash-attn is on or certain backends are used (e.g: CPU). ~~Regrettably, does not seem to perform well for single CJK line texts out of the box. Or perhaps it expects the image in a specific resolution, due to its dependence on PaddleLayout.~~ GLM-OCR in llama.cpp behaves differently depending on the maximum context for some reason. Setting -c 2000 gives wrong outputs typically, for example. Whereas -c 9000 has produced much better results so far in my tests. So far in my tests:",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nbase_model:\n- zai-org/GLM-OCR\npipeline_tag: image-text-to-text\n---\n[GLM-OCR](https://huggingface.co/zai-org/GLM-OCR) converted for use with llama.cpp. As of b8094, it may crash in llama.cpp if `flash-attn` is on or certain backends are used (e.g: CPU).\n\n~~Regrettably, does not seem to perform well for single CJK line texts out of the box. Or perhaps it expects the image in a specific resolution, due to its dependence on PaddleLayout.~~\n\nGLM-OCR in llama.cpp behaves differently depending on the maximum context for some reason. Setting `-c 2000` gives wrong outputs typically, for example. Whereas `-c 9000` has produced much better results so far in my tests.\n\nSo far in my tests:\n- performs poorly on Korean horizontal text lines.\n- performs decently on Japanese horizontal text lines.\n- performs poorly on Japanese vertical text lines.\n- performs excellently on Chinese horizontal text lines.\n- relatively robust to quantization",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "image-text-to-text",
    "base_model:zai-org/GLM-OCR",
    "base_model:quantized:zai-org/GLM-OCR",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 1,
  "downloads": 451,
  "gated": false,
  "private": false,
  "last_modified": "2026-02-19T01:29:09.000Z",
  "created_at": "2026-02-18T21:47:46.000Z",
  "pipeline_tag": "image-text-to-text",
  "library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
  "_id": "69963382fb4584a79829750d",
  "id": "octopusmegalopod/some-glmocr-ggufs",
  "modelId": "octopusmegalopod/some-glmocr-ggufs",
  "sha": "318c473f8b924c4ccbd748d85ecc22a76d499c50",
  "createdAt": "2026-02-18T21:47:46.000Z",
  "lastModified": "2026-02-19T01:29:09.000Z",
  "author": "octopusmegalopod",
  "downloads": 451,
  "likes": 1,
  "gated": false,
  "private": false,
  "pipeline_tag": "image-text-to-text",
  "library_name": "",
  "siblings_count": 10
}