Question 1

What is cstr/qari-ocr-crispembed-GGUF?

Accepted Answer

---
license: apache-2.0
tags:
- ocr
- arabic
- gguf
- crispembed
- qwen2-vl
base_model: NAMAA-Space/Qari-OCR-0.2.2.1-VL-2B-Instruct
language: ar
pipeline_tag: image-text-to-text
---

Qari-OCR CrispEmbed GGUF

GGUF conversion of NAMAA-Space/Qari-OCR-0.2.2.1-VL-2B-Instruct (Apache-2.0) for use with CrispEmbed.

Model

Arabic OCR with full diacritics (tashkeel) support. Fine-tuned from Qwen2-VL-2B-Instruct via LoRA (r=16, alpha=16, 324 adapter pairs) on 50K Arabic OCR samples.

Architecture: Qwen2-VL-2B (32L ViT 1280d + spatial merger + 28L Qwen2 LLM 1536d, GQA 12/2)
Parameters: 2B
Performance: WER=0.221, CER=0.059, BLEU=0.597
Training: 50K Arabic OCR records, 1 epoch, LoRA on attention+MLP

Files

| File | Type | Size |
|------|------|------|
| qari-ocr-2b-f16.gguf | F16 | 4.7 GB |
| qari-ocr-2b-q8_0.gguf | Q8_0 | 2.3 GB |
| qari-ocr-2b-q4_k.gguf | Q4_K | 1.6 GB |

Usage

This model runs on the same qwen2vl engine as the other Qwen2-VL models in CrispEmbed.

Conversion

The LoRA adapter was merged into the full-precision Qwen2-VL-2B-Instruct base weights (324 pairs, tensor by tensor), and the merged model was then converted to GGUF with the CrispEmbed converter. The quantized files were produced with crispembed-quantize, keeping the vision weights at a Q8_0 floor.
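The merge step can be illustrated with a minimal NumPy sketch. It assumes the standard LoRA update, W' = W + (alpha / r) * B A, which with the r=16, alpha=16 stated above gives a scale of 1. The function name, the toy shapes, and the random data are hypothetical and are not taken from the CrispEmbed converter; the actual merge script may differ.

```python
import numpy as np


def merge_lora(w_base, lora_a, lora_b, r=16, alpha=16):
    """Fold one LoRA adapter pair into a base weight matrix.

    Assumes the standard LoRA scaling alpha / r (an assumption, not
    confirmed by the model card).
    w_base: (out_dim, in_dim), lora_a: (r, in_dim), lora_b: (out_dim, r)
    """
    scale = alpha / r
    return w_base + scale * (lora_b @ lora_a)


# Toy sizes for illustration only; real tensors are far larger.
rng = np.random.default_rng(0)
out_dim, in_dim, rank = 8, 6, 2
w = rng.standard_normal((out_dim, in_dim))
a = rng.standard_normal((rank, in_dim))
b = rng.standard_normal((out_dim, rank))

merged = merge_lora(w, a, b, r=rank, alpha=rank)

# The merged layer gives the same output as base path + adapter path.
x = rng.standard_normal(in_dim)
assert np.allclose(merged @ x, w @ x + b @ (a @ x))
```

Applying this once per adapter pair (324 times, according to the card) yields plain dense weights, so the converted GGUF needs no LoRA support at inference time.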

License

Apache-2.0 (NAMAA-Space/Qari-OCR).

Question 2

What license applies to cstr/qari-ocr-crispembed-GGUF?

Accepted Answer

License: apache-2.0. Verify terms on Hugging Face before commercial use.

Question 3

How do I run cstr/qari-ocr-crispembed-GGUF locally?

Accepted Answer

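Download one of the GGUF files listed in the model card (F16, Q8_0, or Q4_K) and load it in guIDE or llama.cpp. Note that the model card describes the files as converted for CrispEmbed and its qwen2vl engine, so compatibility with other runtimes is not confirmed there. Pipeline task: image-text-to-text.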
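A rough sketch of the download step, using only the Python standard library. It assumes the files live on the repository's default `main` branch and relies on the Hugging Face `resolve` URL pattern; the helper names are hypothetical. The magic-byte check only confirms the file starts with the `GGUF` signature, not that a given runtime can load it.

```python
import urllib.request
from urllib.parse import quote

REPO_ID = "cstr/qari-ocr-crispembed-GGUF"


def gguf_url(repo_id, filename, revision="main"):
    """Build a direct download URL (revision 'main' is an assumption)."""
    return (
        f"https://huggingface.co/{repo_id}/resolve/"
        f"{quote(revision)}/{quote(filename)}"
    )


def download(url, dest):
    """Fetch the file to dest; these files are 1.6 GB or larger."""
    urllib.request.urlretrieve(url, dest)
    return dest


def looks_like_gguf(path):
    """GGUF files begin with the 4-byte magic b'GGUF'."""
    with open(path, "rb") as f:
        return f.read(4) == b"GGUF"


# Example (not executed here because it needs network access):
# path = download(gguf_url(REPO_ID, "qari-ocr-2b-q4_k.gguf"),
#                 "qari-ocr-2b-q4_k.gguf")
# print(looks_like_gguf(path))
```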

Question 4

How much VRAM or disk space does cstr/qari-ocr-crispembed-GGUF need?

Accepted Answer

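Disk space depends on the file you choose: 1.6 GB (Q4_K), 2.3 GB (Q8_0), or 4.7 GB (F16), per the Files table in the model card. The page lists the model for 4 GB VRAM class GPUs with llama.cpp / guIDE, which in practice points to the Q4_K or Q8_0 file.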
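As a rough aid, the sketch below picks the largest file from the model card's Files table that fits a given memory budget. The 1 GB headroom for the KV cache, activations, and runtime overhead is an assumed placeholder, not a measured figure, and the function name is hypothetical.

```python
# File sizes in GB, copied from the Files table in the model card.
FILES = {
    "qari-ocr-2b-f16.gguf": 4.7,
    "qari-ocr-2b-q8_0.gguf": 2.3,
    "qari-ocr-2b-q4_k.gguf": 1.6,
}


def pick_file(budget_gb, headroom_gb=1.0):
    """Return the largest file that fits, or None if none does.

    headroom_gb is an assumed allowance for memory used beyond the
    weights themselves; tune it for your runtime and image sizes.
    """
    usable = budget_gb - headroom_gb
    fitting = {name: size for name, size in FILES.items() if size <= usable}
    if not fitting:
        return None
    return max(fitting, key=fitting.get)


print(pick_file(4.0))  # 4 GB budget, 3 GB usable after headroom
```

The file size is only a lower bound on memory use, so treat the result as a starting point and verify on your own hardware.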
