Model Comparison

ggml-org/embeddinggemma-300m-qat-q8_0-ggufvsluffythefox/qwen3.5-9b-claude-4.6-opus-uncensored-distilled-gguf

Side-by-side comparison of ggml-org/embeddinggemma-300m-qat-q8_0-gguf and luffythefox/qwen3.5-9b-claude-4.6-opus-uncensored-distilled-gguf: downloads, license, context length, tasks, and benchmarks.

ggml-org/embeddinggemma-300m-qat-q8_0-gguf

ggml-org · feature-extraction

# embeddinggemma-300m-qat-q8_0 GGUF Recommended way to run this model: ``sh llama-server -hf ggml-org/embeddinggemma-300m-qat-q8_0-GGUF --embeddings ` Then the endpoint can be accessed at http://localhost:8080/embedding, for example using curl: `console curl --request POST \ --u…

luffythefox/qwen3.5-9b-claude-4.6-opus-uncensored-distilled-gguf

LuffyTheFox · text-generation

Thinking is disabled by default in this model via modified chat template file baked in gguf. If you want to enable thinking set variable: {%- set enable_thinking = False %} to True in chat template. I extracted uncensored tensors made by HauhauCS via this script: https://pastebi…

Side-by-side Specifications

	ggml-org/embeddinggemma-300m-qat-q8_0-gguf	luffythefox/qwen3.5-9b-claude-4.6-opus-uncensored-distilled-gguf
Author	ggml-org	LuffyTheFox
Pipeline Task	feature-extraction	text-generation
Library	sentence-transformers	—
Downloads	47,806	79,621
Likes	15	100
License	Unknown	Unknown
Context Length	—	—
Created	2025-09-04	2026-03-15
Last Modified	2025-09-15	2026-03-18
Tags	sentence-transformersggufsentence-similarityfeature-extractionbase_model:google/embeddinggemma-300m-qat-q8_0-unquantizedbase_model:quantized:google/embeddinggemma-300m-qat-q8_0-unquantizedlicense:gemmaendpoints_compatibleregion:us	ggufqwen3_5unslothqwenqwen3.5reasoningchain-of-thoughtlorauncensorednot-for-all-audiences

View full details: ggml-org/embeddinggemma-300m-qat-q8_0-gguf · luffythefox/qwen3.5-9b-claude-4.6-opus-uncensored-distilled-gguf