Question 1

What is bandtor/gemma-4-E4B-it-GGUF?

Accepted Answer

--- base_model: google/gemma-4-E4B-it license: gemma tags: - gguf - ollama - gemma4 - q4_k_m - moe - llama-cpp - multimodal - image-text-to-text language: - en - pt - multilingual library_name: gguf pipeline_tag: image-text-to-text --- # Gemma 4 E4B Instruct — GGUF Q4_K_M Quantização **Q4_K_M** do modelo [google/gemma-4-E4B-it](https://huggingface.co/google/gemma-4-E4B-it), obtida de [unsloth/gemma-4-E4B-it-GGUF](https://huggingface.co/unsloth/gemma-4-E4B-it-GGUF). | Arquivo | Tamanho | Tipo | |---|---|---| | `gemma-4-E4B-it-Q4_K_M.gguf` | ~4.6 GB | Modelo principal (MoE sparse) | | `Modelfile` | — | Template Ollama pronto para uso | ## Especificações | Campo | Valor | |---|---| | **Parâmetros (total)** | 7.5B (MoE sparse) | | **Parâmetros (ativos)** | ~4.5B por token | | **Arquitetura** | `gemma4` MoE | | **Contexto máximo** | 128K tokens (131 072) | | **Quantização** | Q4_K_M | | **Ta…

Question 2

What license applies to bandtor/gemma-4-E4B-it-GGUF?

Accepted Answer

License: gemma. Verify terms on Hugging Face before commercial use.

Question 3

How do I run bandtor/gemma-4-E4B-it-GGUF locally?

Accepted Answer

Download a GGUF file from this page and load it in guIDE or llama.cpp. Pipeline task: image-text-to-text.

Question 4

How much VRAM or disk space does bandtor/gemma-4-E4B-it-GGUF need?

Accepted Answer

Runs locally from ~4.64 GB disk (8 GB VRAM class GPUs with llama.cpp / guIDE).

bandtor/gemma-4-E4B-it-GGUF overview

Repository Files & Downloads

Model Details

Model README

Gemma 4 E4B Instruct — GGUF Q4_K_M

Especificações

Uso com Ollama

Pré-requisito: KV cache Q4 para contexto 128K

Uso com llama.cpp

Referências

Run bandtor/gemma-4-E4B-it-GGUF with guIDE

Model ID	bandtor/gemma-4-E4B-it-GGUF
Author	bandtor
Pipeline	image-text-to-text
License	gemma
Base model	google/gemma-4-E4B-it
Last modified	2026-06-08T03:06:37.000Z