GraySoft
Projects Models Compare Cloud benchmarks FAQ Download guIDE →
Model Intelligence Sheet

cstr/text-super-resolution-gguf overview

Text Super Resolution & Restoration GGUF Models Lightweight super resolution and image restoration models converted to GGUF for CrispEmbed https://github.com/C…

crispembedggufsuper-resolutionimage-restorationocrtext-enhancementlicense:apache-2.0region:us

Runs locally from ~0.1 MB disk (4 GB VRAM class GPUs with llama.cpp / guIDE).

Downloads
327
Likes
0
Pipeline
Author

Repository Files & Downloads

21 GGUF files detected
Direct downloads for local inference
FileTypeQuantizationSizeLink
adair-ref.ggufGGUFGGUF0.1 MBDownload
dat-light-x2-f16.ggufGGUFF1637.7 MBDownload
dat-light-x2-f32.ggufGGUFF3238.8 MBDownload
dat-ref.ggufGGUFGGUF18.0 MBDownload
hat-ref.ggufGGUFGGUF5.8 MBDownload
hat-sr-x4-f16.ggufGGUFF1640.3 MBDownload
hat-sr-x4-q4_k.ggufGGUFQ4_K39.4 MBDownload
hat-sr-x4-q8_0.ggufGGUFQ8_039.7 MBDownload
pan-ref.ggufGGUFGGUF0.2 MBDownload
pan-x4-f16.ggufGGUFF160.5 MBDownload
pan-x4-q4_k.ggufGGUFQ4_K0.5 MBDownload
pan-x4-q8_0.ggufGGUFQ8_00.5 MBDownload
restormer-denoise-f16.ggufGGUFF1649.9 MBDownload
restormer-denoise-q4_k.ggufGGUFQ4_K28.0 MBDownload
restormer-denoise-q8_0.ggufGGUFQ8_035.6 MBDownload
swinir-light-x4-f16.ggufGGUFF1614.2 MBDownload
swinir-ref.ggufGGUFGGUF5.5 MBDownload
tbsrn-ref.ggufGGUFGGUF4.1 MBDownload
tbsrn-telescope-f16.ggufGGUFF162.2 MBDownload
tbsrn-telescope-q4_k.ggufGGUFQ4_K0.7 MBDownload
tbsrn-telescope-q8_0.ggufGGUFQ8_01.2 MBDownload

Model Details

Model IDcstr/text-super-resolution-gguf
Authorcstr
Pipeline
Licenseapache-2.0
Base model
Last modified2026-06-21T15:41:35.000Z

Model README

---

license: apache-2.0

tags:

- super-resolution

- image-restoration

- ocr

- text-enhancement

- gguf

- crispembed

library_name: crispembed

---

Text Super-Resolution & Restoration GGUF Models

Lightweight super-resolution and image restoration models converted to GGUF for CrispEmbed OCR preprocessing.

Models

| File | Architecture | Params | Scale | Size | License | Paper |

|------|-------------|--------|-------|------|---------|-------|

| tbsrn-telescope-f16.gguf | TBSRN (text-line SR) | 1.13M | 2x | 2.2 MB | Apache-2.0 | CVPR 2021 |

| pan-x4-f16.gguf | PAN (pixel attention) | 272K | 4x | 0.5 MB | Apache-2.0 | ECCV 2020W |

| hat-sr-x4-f16.gguf | HAT (hybrid attention transformer) | 21M | 4x | 40 MB | MIT | CVPR 2023 |

| dat-light-x2-f16.gguf | DAT-light (dual aggregation transformer) | 830K | 2x | 38 MB | Apache-2.0 | ICCV 2023 |

| restormer-denoise-f16.gguf | Restormer (denoising) | 26M | 1x | 50 MB | Apache-2.0 | CVPR 2022 |

TBSRN Telescope (text-line SR)

  • Task: Enhance individual detected text lines before recognition
  • Input: Text-line crop resized to 16x64 -> Output: 32x128 (2x)
  • Source: PaddleOCR sr_telescope (Apache-2.0)

PAN (whole-image 4x SR)

  • Task: Upscale full document pages (rescues 75dpi text)
  • Input: Any RGB image (tiled) -> Output: 4x upscale
  • Source: PaddleGAN pan_x4 (Apache-2.0)

HAT (hybrid attention transformer, 4x SR)

  • Task: High-quality 4x upscaling (CVPR 2023 SOTA on multiple SR benchmarks)
  • Input: Any RGB image (tiled) -> Output: 4x upscale
  • Architecture: Swin Transformer + overlapping cross-attention + channel attention
  • Source: XPixelGroup/HAT (MIT)

DAT-light (dual aggregation transformer, 2x SR)

  • Task: High-quality 2x upscaling with dual spatial+channel attention
  • Input: Any RGB image (tiled) -> Output: 2x upscale
  • Architecture: Split-channel windowed spatial attention + L2-normalized transposed channel attention + AIM + SGFN
  • Source: zhengchen1999/DAT (Apache-2.0)

Restormer (image denoising/restoration)

  • Task: Remove noise from document scans
  • Input: Any RGB image -> Output: Denoised (same size)
  • Architecture: Multi-Dconv head transposed attention, U-Net encoder-decoder
  • Source: swz30/Restormer (Apache-2.0)

Parity Verification

All models pass the CrispEmbed diff harness (Python reference vs C++ engine):

| Model | cos_sim | Status |

|-------|---------|--------|

| TBSRN | 0.999985 | PASS |

| PAN | 0.999654 | PASS |

| HAT | 0.999990 | PASS |

| DAT-light | 0.999956 | PASS |

| Restormer | 1.000000 | PASS |

Usage with CrispEmbed

from crispembed import CrispPanSr, CrispDatSr

# PAN: 4x upscale
sr = CrispPanSr("pan-x4-f16.gguf")
out, ow, oh = sr.process(pixels, width, height)

# DAT: 2x upscale (higher quality)
sr = CrispDatSr("dat-light-x2-f16.gguf")
out, ow, oh = sr.process(pixels, width, height)
# CLI
crispembed --pan-model pan-x4-f16.gguf --pan-sr input.png > output.ppm
crispembed --dat-model dat-light-x2-f16.gguf --dat-sr input.png > output.ppm

License

Apache-2.0 for all models except HAT (MIT). Both licenses are permissive.

Run cstr/text-super-resolution-gguf with guIDE

Download guIDE — the AI-native code editor with local LLM inference and 69 built-in tools.

Download guIDE → · Browse 524k+ models · Compare models

Source: Hugging Face · Compare models