Model Comparison

ccssne/gemma-4-31b-it-heretic-ara-gguf vs ggml-org/embeddinggemma-300m-qat-q8_0-gguf

Side-by-side comparison of ccssne/gemma-4-31b-it-heretic-ara-gguf and ggml-org/embeddinggemma-300m-qat-q8_0-gguf: downloads, license, context length, tasks, and benchmarks.

ccssne/gemma-4-31b-it-heretic-ara-gguf

CCSSNE · —

Gemma-4-31B-It-Heretic-GGUF: Quantized using Unsloth on RTX 4070 Laptop. This version includes the vision adapter (mmproj) and is optimized for local inference.

ggml-org/embeddinggemma-300m-qat-q8_0-gguf

ggml-org · feature-extraction

embeddinggemma-300m-qat-q8_0 GGUF. Recommended way to run this model: `llama-server -hf ggml-org/embeddinggemma-300m-qat-q8_0-GGUF --embeddings`. Then the endpoint can be accessed at http://localhost:8080/embedding, for example using curl: `curl --request POST \ --u…`
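The curl example in the model card excerpt is truncated, so as a rough illustration here is a minimal Python sketch of calling the same endpoint once `llama-server` is running with `--embeddings`. Only the URL comes from the excerpt. The function names `build_embedding_request` and `fetch_embedding` are hypothetical, and the JSON field name `content` is an assumption about the request body that should be checked against the llama.cpp server documentation for the installed version.

```python
import json
import urllib.request

# Endpoint quoted in the model card excerpt; assumes llama-server runs locally.
EMBEDDING_URL = "http://localhost:8080/embedding"


def build_embedding_request(text, url=EMBEDDING_URL):
    """Build (but do not send) a JSON POST request for the embedding endpoint."""
    # Assumption: the server reads the input text from a "content" field.
    payload = json.dumps({"content": text}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def fetch_embedding(text, url=EMBEDDING_URL, timeout=30):
    """Send the request and return the decoded JSON response unchanged."""
    request = build_embedding_request(text, url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_embedding("Hello embeddings")` returns whatever JSON the server sends back. The sketch deliberately does not assume a response layout, since that is not shown in the excerpt.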

Side-by-side Specifications

	ccssne/gemma-4-31b-it-heretic-ara-gguf	ggml-org/embeddinggemma-300m-qat-q8_0-gguf
Author	CCSSNE	ggml-org
Pipeline Task	—	feature-extraction
Library	gguf	sentence-transformers
Downloads	48,714	47,806
Likes	4	15
License	Unknown	gemma
Context Length	—	—
Created	2026-04-03	2025-09-04
Last Modified	2026-04-03	2025-09-15
Tags	gguf, unsloth, gemma-4, quantized, base_model:trohrbaugh/gemma-4-31b-it-heretic-ara, base_model:quantized:trohrbaugh/gemma-4-31b-it-heretic-ara, endpoints_compatible, region:us, conversational	sentence-transformers, gguf, sentence-similarity, feature-extraction, base_model:google/embeddinggemma-300m-qat-q8_0-unquantized, base_model:quantized:google/embeddinggemma-300m-qat-q8_0-unquantized, license:gemma, endpoints_compatible, region:us
