PaddleOCR-VL-1.6 — CrispEmbed GGUF

CrispEmbed-native GGUF quantizations of PaddlePaddle/PaddleOCR-VL-1.6.

Latest PaddleOCR-VL model with improved accuracy on OmniDocBench (96.3% SOTA). End-to-end VLM-based OCR: text recognition, table extraction, formula recognition, chart understanding. 109+ languages.

Files

| File | Size | Description |

|------|------|-------------|

| paddleocr-vl-1.6-q4_k.gguf | 1.3 GB | 4-bit K-quant — smallest |

| paddleocr-vl-1.6-q8_0.gguf | 1.4 GB | 8-bit quantization — recommended |

| paddleocr-vl-1.6-f16.gguf | 2.3 GB | fp16 reference |

Model

Architecture: NaViT-style ViT (27L, 1152d) + ERNIE-4.5-0.3B LLM (18L, 1024d, 16/2 GQA, MRoPE, SwiGLU)
Parameters: ~0.9B (same architecture as PaddleOCR-VL-0.9B, improved training)
OmniDocBench: 96.3% (SOTA)
Languages: 109+ (multilingual)
License: Apache 2.0

Usage

./crispembed -m paddleocr-vl-1.6-q8_0.gguf --ocr document.png

License

Apache 2.0

Question 2

What license applies to cstr/paddleocr-vl-1.6-GGUF?

Accepted Answer

License: apache-2.0. Verify terms on Hugging Face before commercial use.

Question 3

How do I run cstr/paddleocr-vl-1.6-GGUF locally?

Accepted Answer

Download a GGUF file from this page and load it in guIDE or llama.cpp. Pipeline task: text-generation.

Question 4

How much VRAM or disk space does cstr/paddleocr-vl-1.6-GGUF need?

Accepted Answer

Runs locally from ~1.21 GB disk (4 GB VRAM class GPUs with llama.cpp / guIDE).

cstr/paddleocr-vl-1.6-GGUF overview

Repository Files & Downloads

Model Details

Model README

PaddleOCR-VL-1.6 — CrispEmbed GGUF

Files

Model

Usage

License

Run cstr/paddleocr-vl-1.6-GGUF with guIDE

File	Type	Quantization	Size	Link
paddleocr-vl-1.6-f16.gguf	GGUF	F16	2.23 GB	Download
paddleocr-vl-1.6-q4_k.gguf	GGUF	Q4_K	1.21 GB	Download
paddleocr-vl-1.6-q8_0.gguf	GGUF	Q8_0	1.38 GB	Download

Model ID	cstr/paddleocr-vl-1.6-GGUF
Author	cstr
Pipeline	—
License	apache-2.0
Base model	PaddlePaddle/PaddleOCR-VL-1.6
Last modified	2026-06-19T09:36:24.000Z