GraySoft
Model Comparison

ccssne/gemma-4-31b-it-heretic-ara-ggufvsggml-org/embeddinggemma-300m-qat-q8_0-gguf

Side-by-side comparison of ccssne/gemma-4-31b-it-heretic-ara-gguf and ggml-org/embeddinggemma-300m-qat-q8_0-gguf: downloads, license, context length, tasks, and benchmarks.

ccssne/gemma-4-31b-it-heretic-ara-gguf

CCSSNE · —

# Gemma-4-31B-It-Heretic-GGUF Quantized using Unsloth on RTX 4070 Laptop . This version includes the vision adapter (mmproj) and is optimized for local inference.

ggml-org/embeddinggemma-300m-qat-q8_0-gguf

ggml-org · feature-extraction

# embeddinggemma-300m-qat-q8_0 GGUF Recommended way to run this model: ``sh llama-server -hf ggml-org/embeddinggemma-300m-qat-q8_0-GGUF --embeddings ` Then the endpoint can be accessed at http://localhost:8080/embedding, for example using curl: `console curl --request POST \ --u…

Side-by-side Specifications

ccssne/gemma-4-31b-it-heretic-ara-ggufggml-org/embeddinggemma-300m-qat-q8_0-gguf
AuthorCCSSNEggml-org
Pipeline Taskfeature-extraction
Libraryggufsentence-transformers
Downloads48,71447,806
Likes415
LicenseUnknownUnknown
Context Length
Created2026-04-032025-09-04
Last Modified2026-04-032025-09-15
Tags
ggufunslothgemma-4quantizedbase_model:trohrbaugh/gemma-4-31b-it-heretic-arabase_model:quantized:trohrbaugh/gemma-4-31b-it-heretic-araendpoints_compatibleregion:usconversational
sentence-transformersggufsentence-similarityfeature-extractionbase_model:google/embeddinggemma-300m-qat-q8_0-unquantizedbase_model:quantized:google/embeddinggemma-300m-qat-q8_0-unquantizedlicense:gemmaendpoints_compatibleregion:us

View full details: ccssne/gemma-4-31b-it-heretic-ara-gguf · ggml-org/embeddinggemma-300m-qat-q8_0-gguf