GraySoft
Model Comparison

davidau/glm-4.7-flash-uncensored-heretic-neo-code-imatrix-max-ggufvsqwen/qwen3-vl-8b-instruct-gguf

Side-by-side comparison of davidau/glm-4.7-flash-uncensored-heretic-neo-code-imatrix-max-gguf and qwen/qwen3-vl-8b-instruct-gguf: downloads, license, context length, tasks, and benchmarks.

davidau/glm-4.7-flash-uncensored-heretic-neo-code-imatrix-max-gguf

DavidAU · text-generation

GLM-4.7-Flash-Uncensored-Heretic-NEO-CODE-Imatrix-MAX-GGUF Specialized and Enhanced UNCENSORED/HERETIC GGUF quants for the new GLM-4.7-Flash, 30B-A3B MOE, mixture of experts model. [ https://huggingface.co/zai-org/GLM-4.7-Flash ] This model can be run on the GPU(s) and/or CPU du…

qwen/qwen3-vl-8b-instruct-gguf

Qwen · image-text-to-text

This repository provides GGUF-format weights for Qwen3-VL-8B-Instruct, split into two components: These files are compatible with llama.cpp, Ollama, and other GGUF-based tools, supporting inference on CPU, NVIDIA GPU (CUDA), Apple Silicon (Metal), Intel GPUs (SYCL), and more. Yo…

Side-by-side Specifications

davidau/glm-4.7-flash-uncensored-heretic-neo-code-imatrix-max-ggufqwen/qwen3-vl-8b-instruct-gguf
AuthorDavidAUQwen
Pipeline Tasktext-generationimage-text-to-text
Librarytransformers
Downloads53,41234,761
Likes33482
LicenseUnknownUnknown
Context Length
Created2026-01-222025-10-31
Last Modified2026-01-282025-11-01
Tags
ggufGLM 4.7 FlashthinkingreasoningNEO ImatrixMAX Quants16 bit precision output tensorhereticuncensoredabliterated
transformersggufimage-text-to-textarxiv:2505.09388arxiv:2502.13923arxiv:2409.12191arxiv:2308.12966base_model:Qwen/Qwen3-VL-8B-Instructbase_model:quantized:Qwen/Qwen3-VL-8B-Instructlicense:apache-2.0

View full details: davidau/glm-4.7-flash-uncensored-heretic-neo-code-imatrix-max-gguf · qwen/qwen3-vl-8b-instruct-gguf