Model Comparison

ggml-org/embeddinggemma-300m-qat-q8_0-ggufvsmaziyarpanahi/wizardlm-2-7b-gguf

Side-by-side comparison of ggml-org/embeddinggemma-300m-qat-q8_0-gguf and maziyarpanahi/wizardlm-2-7b-gguf: downloads, license, context length, tasks, and benchmarks.

ggml-org/embeddinggemma-300m-qat-q8_0-gguf

ggml-org · feature-extraction

# embeddinggemma-300m-qat-q8_0 GGUF Recommended way to run this model: ``sh llama-server -hf ggml-org/embeddinggemma-300m-qat-q8_0-GGUF --embeddings ` Then the endpoint can be accessed at http://localhost:8080/embedding, for example using curl: `console curl --request POST \ --u…

maziyarpanahi/wizardlm-2-7b-gguf

MaziyarPanahi · text-generation

pip install llama-cpp-python # With NVidia CUDA acceleration CMAKE_ARGS="-DLLAMA_CUBLAS=on" pip install llama-cpp-python # Or with OpenBLAS acceleration CMAKE_ARGS="-DLLAMA_BLAS=ON -DLLAMA_BLAS_VENDOR=OpenBLAS" pip install llama-cpp-python # Or with CLBLast acceleration CMAKE_AR…

Side-by-side Specifications

	ggml-org/embeddinggemma-300m-qat-q8_0-gguf	maziyarpanahi/wizardlm-2-7b-gguf
Author	ggml-org	MaziyarPanahi
Pipeline Task	feature-extraction	text-generation
Library	sentence-transformers	transformers
Downloads	47,806	86,217
Likes	15	83
License	Unknown	Unknown
Context Length	—	—
Created	2025-09-04	2024-04-15
Last Modified	2025-09-15	2024-04-15
Tags	sentence-transformersggufsentence-similarityfeature-extractionbase_model:google/embeddinggemma-300m-qat-q8_0-unquantizedbase_model:quantized:google/embeddinggemma-300m-qat-q8_0-unquantizedlicense:gemmaendpoints_compatibleregion:us	transformersggufmistralquantized2-bit3-bit4-bit5-bit6-bit8-bit

View full details: ggml-org/embeddinggemma-300m-qat-q8_0-gguf · maziyarpanahi/wizardlm-2-7b-gguf