Question 1

What is forkjoin-ai/gemma3-4b-it-gguf?

Accepted Answer

---
language:
- en
license: gemma
library_name: gguf
tags:
- gguf
- gemma3
- affectively
- edgework
- aether
- distributed-inference
- edge-deployment
base_model: google/gemma-3-4b-it
base_model_relation: quantized
pipeline_tag: text-generation
---

# Gemma3 4b IT (GGUF, Q4_K_M)

> **Production-ready** GGUF quantization of [google/gemma-3-4b-it](https://huggingface.co/google/gemma-3-4b-it) for distributed text generation and conversation, powered by the [Aether](https://github.com/forkjoin-ai/aether) edge inference runtime on [Edgework.ai](https://edgework.ai).

## Model Details

| Property | Value |
|----------|-------|
| Base model | [google/gemma-3-4b-it](https://huggingface.co/google/gemma-3-4b-it) |
| Parameters | 4B |
| Architecture | Gemma3 |
| Quantization | Q4_K_M |
| Format | GGUF |
| Size | ~2.8 GB |
| License | gemma |

## Usage

### With llama.cpp

```bash
./llama-cli -m google_g…
```

Question 2

What license applies to forkjoin-ai/gemma3-4b-it-gguf?

Accepted Answer

License: gemma. Verify terms on Hugging Face before commercial use.

Question 3

How do I run forkjoin-ai/gemma3-4b-it-gguf locally?

Accepted Answer

Download a GGUF file from this page and load it in guIDE or llama.cpp. Pipeline task: text-generation.
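
The download-and-load steps above could be scripted roughly as follows. This is a minimal sketch, not the repository's documented procedure: it assumes `huggingface-cli` (from the `huggingface_hub` package) and llama.cpp's `llama-cli` are installed and on `PATH`. `MODEL_DIR`, `DRY_RUN`, and the helper functions `run`, `fetch_model`, `first_gguf`, and `chat` are hypothetical names introduced for illustration. Because the exact GGUF filename is not given here, the sketch downloads every `*.gguf` file in the repository and picks the first one it finds.

```shell
#!/usr/bin/env bash
# Sketch only: helper names and MODEL_DIR are illustrative assumptions.
set -eu

REPO_ID="forkjoin-ai/gemma3-4b-it-gguf"
MODEL_DIR="${MODEL_DIR:-./models/gemma3-4b-it-gguf}"

# Print the command when DRY_RUN=1 (the default); execute it when DRY_RUN=0.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    printf '%s\n' "$*"
  else
    "$@"
  fi
}

# Download only the GGUF files from the repository into MODEL_DIR.
fetch_model() {
  run huggingface-cli download "$REPO_ID" --include "*.gguf" --local-dir "$MODEL_DIR"
}

# Print the path of the first GGUF file found in MODEL_DIR (empty if none).
first_gguf() {
  find "$MODEL_DIR" -name '*.gguf' | sort | head -n 1
}

# Run a one-shot text-generation prompt through llama.cpp.
# $1 = path to a .gguf file, $2 = prompt text.
chat() {
  run llama-cli -m "$1" -p "$2" -n 128
}

# With the default DRY_RUN=1 this only prints the download command.
fetch_model
```

To actually download and run the model, set `DRY_RUN=0`, call `fetch_model`, then `chat "$(first_gguf)" "Hello"`. The dry-run default is a deliberate choice so the multi-gigabyte download never starts by accident.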

Question 4

How much VRAM or disk space does forkjoin-ai/gemma3-4b-it-gguf need?

Accepted Answer

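The GGUF file takes about 2.32 GB of disk space and is aimed at GPUs in the 4 GB VRAM class when run with llama.cpp or guIDE. The model README lists the size as ~2.8 GB, so confirm the exact figure in the repository file listing.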
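
A quick way to check the disk requirement before downloading is sketched below. It assumes "GB" means 10^9 bytes and relies on the POSIX `df -Pk` output format. The function names `gb_to_kib`, `free_kib`, and `has_room` are hypothetical helpers, and the size passed in is whichever figure you choose to trust (2.32 from this answer, or the ~2.8 listed in the README).

```shell
#!/usr/bin/env bash
# Sketch only: helper names are illustrative; GB is assumed to be 10^9 bytes.
set -eu

# Convert a size in decimal gigabytes to KiB (1024-byte blocks), rounded.
gb_to_kib() {
  awk -v gb="$1" 'BEGIN { printf "%d\n", gb * 1000000000 / 1024 + 0.5 }'
}

# Print the free space, in KiB, on the filesystem that holds directory $1.
free_kib() {
  df -Pk "$1" | awk 'NR == 2 { print $4 }'
}

# Succeed if directory $1 has at least $2 gigabytes free.
has_room() {
  [ "$(free_kib "$1")" -ge "$(gb_to_kib "$2")" ]
}

# Check the current directory against the 2.32 GB figure quoted above.
if has_room . 2.32; then
  echo "enough free disk space for the GGUF file"
else
  echo "not enough free disk space for the GGUF file"
fi
```

This checks disk space only. VRAM use also depends on context length and how many layers are offloaded to the GPU, so the 4 GB class figure is best treated as a rough guide.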

forkjoin-ai/gemma3-4b-it-gguf overview

Repository Files & Downloads

Model Details

Model README

Gemma3 4b IT (GGUF, Q4_K_M)

Model Details

Usage

With llama.cpp

With Aether (Distributed Inference)

Also available: `.knot` (sovereign format)

Deployment Architecture

About

Run forkjoin-ai/gemma3-4b-it-gguf with guIDE

Model ID	forkjoin-ai/gemma3-4b-it-gguf
Author	forkjoin-ai
Pipeline	text-generation
License	gemma
Base model	google/gemma-3-4b-it
Last modified	2026-06-08T21:16:04.000Z
