cstr/paddleocr-vl-1.6-GGUF overview
PaddleOCR VL 1.6 — CrispEmbed GGUF CrispEmbed native GGUF quantizations of PaddlePaddle/PaddleOCR VL 1.6 https://huggingface.co/PaddlePaddle/PaddleOCR VL 1.6 .…
Runs locally from ~1.21 GB disk (4 GB VRAM class GPUs with llama.cpp / guIDE).
Repository Files & Downloads
Model Details
Model README
---
base_model: PaddlePaddle/PaddleOCR-VL-1.6
language:
- multilingual
license: apache-2.0
tags:
- gguf
- ocr
- document-understanding
- crispembed
- paddleocr
---
PaddleOCR-VL-1.6 — CrispEmbed GGUF
CrispEmbed-native GGUF quantizations of PaddlePaddle/PaddleOCR-VL-1.6.
Latest PaddleOCR-VL model with improved accuracy on OmniDocBench (96.3% SOTA). End-to-end VLM-based OCR: text recognition, table extraction, formula recognition, chart understanding. 109+ languages.
Files
| File | Size | Description |
|------|------|-------------|
| paddleocr-vl-1.6-q4_k.gguf | 1.3 GB | 4-bit K-quant — smallest |
| paddleocr-vl-1.6-q8_0.gguf | 1.4 GB | 8-bit quantization — recommended |
| paddleocr-vl-1.6-f16.gguf | 2.3 GB | fp16 reference |
Model
- Architecture: NaViT-style ViT (27L, 1152d) + ERNIE-4.5-0.3B LLM (18L, 1024d, 16/2 GQA, MRoPE, SwiGLU)
- Parameters: ~0.9B (same architecture as PaddleOCR-VL-0.9B, improved training)
- OmniDocBench: 96.3% (SOTA)
- Languages: 109+ (multilingual)
- License: Apache 2.0
Usage
./crispembed -m paddleocr-vl-1.6-q8_0.gguf --ocr document.png
License
Apache 2.0
Run cstr/paddleocr-vl-1.6-GGUF with guIDE
Download guIDE — the AI-native code editor with local LLM inference and 69 built-in tools.
Source: Hugging Face · Compare models