Question 1

What is cstr/smoldocling-GGUF?

Accepted Answer

--- license: apache-2.0 base_model: ds4sd/SmolDocling-256M-preview tags: - ocr - document-understanding - doctags - vision-language - gguf - crispembed - ggml language: - en --- # SmolDocling-256M GGUF GGUF conversions of [ds4sd/SmolDocling-256M-preview](https://huggingface.co/ds4sd/SmolDocling-256M-preview) for [CrispEmbed](https://github.com/CrispStrobe/CrispEmbed) inference. Ultra-compact document conversion model (256M params). Generates DocTags structured markup from page images — OCR, layout, tables, formulas, code, charts. ## Model variants | File | Quant | Size | Notes | |------|-------|------|-------| | `smoldocling-f16.gguf` | F16 | 491 MB | Full precision | | `smoldocling-q8_0.gguf` | Q8_0 | 261 MB | Recommended | | `smoldocling-q4_k.gguf` | Q4_K | 153 MB | Max compression | ## Architecture - **Vision**: SigLIP ViT (12L, 768d, 12 heads, patch=16, 512px) - **Connector**: Pixel…

Question 2

What license applies to cstr/smoldocling-GGUF?

Accepted Answer

License: apache-2.0. Verify terms on Hugging Face before commercial use.

Question 3

How do I run cstr/smoldocling-GGUF locally?

Accepted Answer

Download a GGUF file from this page and load it in guIDE or llama.cpp. Pipeline task: text-generation.

Question 4

How much VRAM or disk space does cstr/smoldocling-GGUF need?

Accepted Answer

Runs locally from ~154.2 MB disk (4 GB VRAM class GPUs with llama.cpp / guIDE).

cstr/smoldocling-GGUF overview

Repository Files & Downloads

Model Details

Model README

SmolDocling-256M GGUF

Model variants

Architecture

Usage

License

Credits

Run cstr/smoldocling-GGUF with guIDE

File	Type	Quantization	Size	Link
smoldocling-f16.gguf	GGUF	F16	490.9 MB	Download
smoldocling-q4_k.gguf	GGUF	Q4_K	154.2 MB	Download
smoldocling-q8_0.gguf	GGUF	Q8_0	262.3 MB	Download

Model ID	cstr/smoldocling-GGUF
Author	cstr
Pipeline	—
License	apache-2.0
Base model	ds4sd/SmolDocling-256M-preview
Last modified	2026-06-19T13:22:04.000Z