What is mradermacher/G4-Alice-v1.2-31B-GGUF?

Static GGUF quants of https://huggingface.co/AliceThirty/G4-Alice-v1.2-31B, quantized by mradermacher. The model card lists English as the language, transformers as the library, and three datasets: zerofata/Instruct-Anime, zerofata/Gemini-3.1-Pro-GLM5-Characters, and zerofata/Gemini-3.1-Pro-SmallWiki. For a convenient overview and download list, visit the model page at https://hf.tst.eu/model#G4-Alice-v1.2-31B-GGUF. Weighted/imatrix quants are available at https://huggingface.co/mradermacher/G4-…

What license applies to mradermacher/G4-Alice-v1.2-31B-GGUF?

License: apache-2.0. Verify terms on Hugging Face before commercial use.

How do I run mradermacher/G4-Alice-v1.2-31B-GGUF locally?

Download a GGUF file from this page and load it in guIDE or llama.cpp. Pipeline task: text-generation.
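
As a rough sketch of those two steps, the snippet below builds a direct-download URL and a llama.cpp command line for one of the listed files. It assumes the repository's default branch is `main`, that the llama.cpp binary is named `llama-cli` (older builds used a different name), and that the GPU-layer and context values shown are placeholders to adjust for your hardware. Nothing is downloaded or executed; the snippet only prints the strings.

```python
from urllib.parse import quote

REPO_ID = "mradermacher/G4-Alice-v1.2-31B-GGUF"


def gguf_download_url(filename: str, repo_id: str = REPO_ID, revision: str = "main") -> str:
    """Build the Hugging Face 'resolve' URL for a file in a model repository.

    The revision "main" is an assumption about the default branch.
    """
    return f"https://huggingface.co/{repo_id}/resolve/{revision}/{quote(filename)}"


def llama_cli_command(model_path: str, n_gpu_layers: int = 99, ctx_size: int = 4096) -> list[str]:
    """Assemble an argument list for llama.cpp's llama-cli.

    -m   path to the GGUF model file
    -ngl number of layers to offload to the GPU (illustrative value)
    -c   context size in tokens (illustrative value)
    """
    return [
        "llama-cli",
        "-m", model_path,
        "-ngl", str(n_gpu_layers),
        "-c", str(ctx_size),
    ]


url = gguf_download_url("G4-Alice-v1.2-31B.Q4_K_M.gguf")
cmd = llama_cli_command("G4-Alice-v1.2-31B.Q4_K_M.gguf")
print(url)
print(" ".join(cmd))
```

Lowering `n_gpu_layers` keeps more of the model in system RAM, which is the usual adjustment when the chosen file does not fit in VRAM.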

Model Intelligence Sheet

mradermacher/G4-Alice-v1.2-31B-GGUF overview

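How much VRAM or disk space does mradermacher/G4-Alice-v1.2-31B-GGUF need?

The smallest model file listed, Q2_K, is 11.10 GB on disk, and the largest, Q8_0, is 30.39 GB. The 772.0 MB file is the mmproj (multimodal projector) companion file, not a standalone model, so it does not set the minimum footprint. Keeping the whole model on the GPU needs VRAM of at least the file size plus room for context; with less VRAM, llama.cpp can offload only some layers to the GPU and keep the rest in system RAM.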

Quantization metadata: quantize version: 2; output tensor quantised: 1; convert type: hf; vocab type: (empty); tags: (empty); quants: x f16 Q4_K_S Q2_K Q8_0 Q6_K Q3_K_M Q3_K_S Q3_K_L Q4…

Tags: transformers, gguf, en, dataset:zerofata/Instruct-Anime, dataset:zerofata/Gemini-3.1-Pro-GLM5-Characters, dataset:zerofata/Gemini-3.1-Pro-SmallWiki, base_model:AliceThirty/G4-Alice-v1.2-31B, base_model:quantized:AliceThirty/G4-Alice-v1.2-31B, license:apache-2.0, endpoints_compatible, region:us, conversational

Author

mradermacher

Repository Files & Downloads

13 GGUF files detected

Direct downloads for local inference

File	Type	Quantization	Size	Link
G4-Alice-v1.2-31B.IQ4_XS.gguf	GGUF	IQ4_XS	15.70 GB	Download
G4-Alice-v1.2-31B.Q2_K.gguf	GGUF	Q2_K	11.10 GB	Download
G4-Alice-v1.2-31B.Q3_K_L.gguf	GGUF	Q3_K_L	15.49 GB	Download
G4-Alice-v1.2-31B.Q3_K_M.gguf	GGUF	Q3_K_M	14.24 GB	Download
G4-Alice-v1.2-31B.Q3_K_S.gguf	GGUF	Q3_K_S	12.82 GB	Download
G4-Alice-v1.2-31B.Q4_K_M.gguf	GGUF	Q4_K_M	17.40 GB	Download
G4-Alice-v1.2-31B.Q4_K_S.gguf	GGUF	Q4_K_S	16.54 GB	Download
G4-Alice-v1.2-31B.Q5_K_M.gguf	GGUF	Q5_K_M	20.35 GB	Download
G4-Alice-v1.2-31B.Q5_K_S.gguf	GGUF	Q5_K_S	19.85 GB	Download
G4-Alice-v1.2-31B.Q6_K.gguf	GGUF	Q6_K	23.47 GB	Download
G4-Alice-v1.2-31B.Q8_0.gguf	GGUF	Q8_0	30.39 GB	Download
G4-Alice-v1.2-31B.mmproj-Q8_0.gguf	GGUF	Q8_0 (mmproj)	772.0 MB	Download
G4-Alice-v1.2-31B.mmproj-f16.gguf	GGUF	F16 (mmproj)	1.12 GB	Download
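
The quantization type of each file can be read from its name. The sketch below assumes the naming pattern seen in the list above (`<model>.<quant>.gguf`, with an optional `mmproj-` prefix on the quant part); the helper `parse_gguf_name` is hypothetical and not part of any library. It also shows why the mmproj files should be excluded when looking for the smallest model file. Only three rows of the table are copied in, for brevity.

```python
import re

# Pattern assumed from the file names in the table above.
_NAME_RE = re.compile(r"\.(?P<mmproj>mmproj-)?(?P<quant>[A-Za-z0-9_]+)\.gguf$")


def parse_gguf_name(filename: str) -> tuple[str, bool]:
    """Return (quant_type, is_mmproj) for a GGUF file name."""
    match = _NAME_RE.search(filename)
    if match is None:
        raise ValueError(f"unrecognised GGUF file name: {filename}")
    return match.group("quant"), match.group("mmproj") is not None


# A few rows from the table, sizes as listed (772.0 MB written as 0.772).
files = {
    "G4-Alice-v1.2-31B.Q2_K.gguf": 11.10,
    "G4-Alice-v1.2-31B.Q4_K_M.gguf": 17.40,
    "G4-Alice-v1.2-31B.mmproj-Q8_0.gguf": 0.772,
}

# Drop projector files before comparing sizes: they are companions, not models.
model_files = {name: size for name, size in files.items() if not parse_gguf_name(name)[1]}
smallest = min(model_files, key=model_files.get)
print(smallest, model_files[smallest])
```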

Model Details

Model ID	mradermacher/G4-Alice-v1.2-31B-GGUF
Author	mradermacher
Pipeline	—
License	apache-2.0
Base model	AliceThirty/G4-Alice-v1.2-31B
Last modified	2026-06-21T07:38:09.000Z

Model README

---
base_model: AliceThirty/G4-Alice-v1.2-31B
datasets:
- zerofata/Instruct-Anime
- zerofata/Gemini-3.1-Pro-GLM5-Characters
- zerofata/Gemini-3.1-Pro-SmallWiki
language:
- en
library_name: transformers
license: apache-2.0
mradermacher:
  readme_rev: 1
quantized_by: mradermacher
---

About

static quants of https://huggingface.co/AliceThirty/G4-Alice-v1.2-31B

For a convenient overview and download list, visit our [model page for this model](https://hf.tst.eu/model#G4-Alice-v1.2-31B-GGUF).

weighted/imatrix quants are available at https://huggingface.co/mradermacher/G4-Alice-v1.2-31B-i1-GGUF

Usage

If you are unsure how to use GGUF files, refer to one of [TheBloke's READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for more details, including on how to concatenate multi-part files.
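
For orientation, concatenating plainly split files is a byte-for-byte join in part order. The sketch below illustrates that with tiny stand-in files; the `.part1of2` style names are hypothetical, and none of the 13 files listed for this repository is split, so this is only relevant to other, larger downloads.

```python
import shutil
import tempfile
from pathlib import Path


def concatenate_parts(parts: list[Path], output: Path) -> Path:
    """Join split files byte-for-byte, in the order given."""
    with output.open("wb") as out:
        for part in parts:
            with part.open("rb") as src:
                # Stream in chunks so large parts are never held fully in memory.
                shutil.copyfileobj(src, out)
    return output


# Demo with small stand-in files; real parts would be many gigabytes.
with tempfile.TemporaryDirectory() as tmp:
    tmp_dir = Path(tmp)
    part1 = tmp_dir / "example.gguf.part1of2"  # hypothetical name
    part2 = tmp_dir / "example.gguf.part2of2"  # hypothetical name
    part1.write_bytes(b"GGUF-first-")
    part2.write_bytes(b"second")
    joined = concatenate_parts([part1, part2], tmp_dir / "example.gguf")
    joined_bytes = joined.read_bytes()

print(joined_bytes)
```

The order of `parts` matters: passing them out of sequence produces a file of the right size that will not load.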

Provided Quants

(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)

| Link | Type | Size/GB | Notes |
|:-----|:-----|--------:|:------|
| GGUF | Q2_K | 12.0 | |
| GGUF | Q3_K_S | 13.9 | |
| GGUF | Q3_K_M | 15.4 | lower quality |
| GGUF | Q3_K_L | 16.7 | |
| GGUF | Q4_K_S | 17.9 | fast, recommended |
| GGUF | Q4_K_M | 18.8 | fast, recommended |
| GGUF | Q5_K_S | 21.4 | |
| GGUF | Q5_K_M | 21.9 | |
| GGUF | Q6_K | 25.3 | very good quality |
| GGUF | Q8_0 | 32.7 | fast, best quality |
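
One way to use this table is to pick the largest quant that fits a memory budget. The sketch below copies the sizes from the table and reserves a fixed overhead for context and runtime buffers; the 2.0 GB overhead is a placeholder assumption, not a measured figure, and larger size is only a rough proxy for quality, as the note above the table says. The file list earlier on this page reports smaller figures for the same quants (for example 11.10 versus 12.0 for Q2_K), which would be consistent with one table using GiB and the other GB; that reading is an inference, not something the source states.

```python
from typing import Optional

# Sizes in GB, copied from the "Provided Quants" table.
QUANT_SIZES_GB = {
    "Q2_K": 12.0,
    "Q3_K_S": 13.9,
    "Q3_K_M": 15.4,
    "Q3_K_L": 16.7,
    "Q4_K_S": 17.9,
    "Q4_K_M": 18.8,
    "Q5_K_S": 21.4,
    "Q5_K_M": 21.9,
    "Q6_K": 25.3,
    "Q8_0": 32.7,
}


def largest_quant_within(budget_gb: float, overhead_gb: float = 2.0) -> Optional[str]:
    """Return the largest quant whose file fits in budget_gb minus overhead_gb.

    overhead_gb is a hypothetical allowance for context and buffers.
    Returns None when no quant fits.
    """
    usable = budget_gb - overhead_gb
    fitting = {quant: size for quant, size in QUANT_SIZES_GB.items() if size <= usable}
    if not fitting:
        return None
    return max(fitting, key=fitting.get)


print(largest_quant_within(24.0))
print(largest_quant_within(16.0))
print(largest_quant_within(8.0))
```

When nothing fits, the remaining options are partial GPU offload or running from system RAM, both at a cost in speed.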

Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better):

[Figure: image.png]

And here are Artefact2's thoughts on the matter:

https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9

FAQ / Model Request

See https://huggingface.co/mradermacher/model_requests for some answers to questions you might have and/or if you want some other model quantized.

Thanks

I thank my company, nethype GmbH, for letting me use its servers and providing upgrades to my workstation to enable this work in my free time.

Source: Hugging Face