Model Comparison

davidau/glm-4.7-flash-uncensored-heretic-neo-code-imatrix-max-gguf vs qwen/qwen3-vl-8b-instruct-gguf

Side-by-side comparison of davidau/glm-4.7-flash-uncensored-heretic-neo-code-imatrix-max-gguf and qwen/qwen3-vl-8b-instruct-gguf: downloads, license, context length, tasks, and benchmarks.

davidau/glm-4.7-flash-uncensored-heretic-neo-code-imatrix-max-gguf

DavidAU · text-generation

GLM-4.7-Flash-Uncensored-Heretic-NEO-CODE-Imatrix-MAX-GGUF Specialized and Enhanced UNCENSORED/HERETIC GGUF quants for the new GLM-4.7-Flash, 30B-A3B MOE, mixture of experts model. [ https://huggingface.co/zai-org/GLM-4.7-Flash ] This model can be run on the GPU(s) and/or CPU du…

qwen/qwen3-vl-8b-instruct-gguf

Qwen · image-text-to-text

This repository provides GGUF-format weights for Qwen3-VL-8B-Instruct, split into two components: These files are compatible with llama.cpp, Ollama, and other GGUF-based tools, supporting inference on CPU, NVIDIA GPU (CUDA), Apple Silicon (Metal), Intel GPUs (SYCL), and more. Yo…

Side-by-side Specifications

	davidau/glm-4.7-flash-uncensored-heretic-neo-code-imatrix-max-gguf	qwen/qwen3-vl-8b-instruct-gguf
Author	DavidAU	Qwen
Pipeline Task	text-generation	image-text-to-text
Library	—	transformers
Downloads	53,412	34,761
Likes	334	82
License	Unknown	apache-2.0
Context Length	—	—
Created	2026-01-22	2025-10-31
Last Modified	2026-01-28	2025-11-01
Tags	gguf, GLM 4.7 Flash, thinking, reasoning, NEO Imatrix, MAX Quants, 16 bit precision output tensor, heretic, uncensored, abliterated	transformers, gguf, image-text-to-text, arxiv:2505.09388, arxiv:2502.13923, arxiv:2409.12191, arxiv:2308.12966, base_model:Qwen/Qwen3-VL-8B-Instruct, base_model:quantized:Qwen/Qwen3-VL-8B-Instruct, license:apache-2.0
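
Several of the Qwen tags carry metadata as prefixed `key:value` entries (license, base model, arXiv IDs) rather than plain labels. The following is a minimal sketch of how such a tag list could be split into plain labels and metadata, assuming the tags are already available as a list of strings. The `parse_tags` helper and its `known_keys` tuple are hypothetical names chosen for illustration, not part of any Hub API.

```python
def parse_tags(tags, known_keys=("license", "base_model", "arxiv")):
    """Split a tag list into plain labels and key:value metadata.

    Hypothetical helper: only prefixes listed in known_keys are treated
    as metadata; every other tag is kept as a plain label.
    """
    plain, meta = [], {}
    for tag in tags:
        # Split on the first colon only, so values may contain colons.
        key, sep, value = tag.partition(":")
        if sep and key in known_keys:
            meta.setdefault(key, []).append(value)
        else:
            plain.append(tag)
    return plain, meta


# Tags copied from the qwen/qwen3-vl-8b-instruct-gguf column above.
qwen_tags = [
    "transformers", "gguf", "image-text-to-text",
    "arxiv:2505.09388", "arxiv:2502.13923",
    "arxiv:2409.12191", "arxiv:2308.12966",
    "base_model:Qwen/Qwen3-VL-8B-Instruct",
    "base_model:quantized:Qwen/Qwen3-VL-8B-Instruct",
    "license:apache-2.0",
]

plain, meta = parse_tags(qwen_tags)
print(plain)
print(meta["license"])
```

Splitting on the first colon only means an entry such as `base_model:quantized:Qwen/Qwen3-VL-8B-Instruct` is stored under `base_model` with the value `quantized:Qwen/Qwen3-VL-8B-Instruct`.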
