GraySoft
Projects Models Compare Cloud benchmarks FAQ Download guIDE β†’
Model Intelligence Sheet

deadbydawn101/ravenx-Gemma4-12B-MTP-OBLITERATED-OpenMAI-OpenMythos-deep-reasoning-GGUF overview

πŸ₯‡ WORLD FIRST: RavenX Γ— Gemma 4 12B MTP OBLITERATED β€” Deep Reasoning <p align="center" <b The first trained Gemma 4 12B on the planet.</b <br <b Proprietary t…

ggufmlxgemma4deep-reasoningapple-siliconravenxMTPabliteratedOBLITERATEDcommunity-projecttext-generationdataset:deadbydawn101/ravenx-gemma4-deep-reasoning-dataarxiv:2606.01444base_model:OBLITERATUS/Gemma-4-12B-OBLITERATEDbase_model:quantized:OBLITERATUS/Gemma-4-12B-OBLITERATEDlicense:otherendpoints_compatibleregion:usconversational

Runs locally from ~22.20 GB disk (24 GB VRAM class GPUs with llama.cpp / guIDE).

Downloads
0
Likes
0
Pipeline
text-generation

Repository Files & Downloads

1 GGUF files detected
Direct downloads for local inference
FileTypeQuantizationSizeLink
ravenx-Gemma4-12B-MTP-F16.ggufGGUFF1622.20 GBDownload

Model Details

Model IDdeadbydawn101/ravenx-Gemma4-12B-MTP-OBLITERATED-OpenMAI-OpenMythos-deep-reasoning-GGUF
Authordeadbydawn101
Pipelinetext-generation
Licenseother
Base modelOBLITERATUS/Gemma-4-12B-OBLITERATED
Last modified2026-06-11T00:28:50.000Z

Model README

---

license: other

license_name: gemma

license_link: https://ai.google.dev/gemma/terms

library_name: gguf

tags:

- mlx

- gemma4

- deep-reasoning

- apple-silicon

- ravenx

- MTP

- abliterated

- OBLITERATED

- community-project

base_model: OBLITERATUS/Gemma-4-12B-OBLITERATED

datasets:

- deadbydawn101/ravenx-gemma4-deep-reasoning-data

pipeline_tag: text-generation

---

πŸ₯‡ WORLD FIRST: RavenX Γ— Gemma 4 12B MTP OBLITERATED β€” Deep Reasoning

<p align="center">

<b>The first trained Gemma 4 12B on the planet.</b><br>

<b>Proprietary training methodology. MTP-ready architecture. $0 cloud cost.</b><br>

<b>GGUF F16 format β€” runs on ANY hardware (Ollama, LM Studio, llama.cpp, vLLM).</b>

</p>

> Gemma 4 was released June 3, 2026 β€” its gemma4_unified architecture wasn't supported by ANY training framework. We developed proprietary techniques to train it successfully.

---

⚠️ DISCLAIMER

This model is an experimental research proof of concept. Provided AS-IS for educational and research purposes only. The base model is abliterated (refusal filters removed). Use responsibly.

---

Community Project

This is a community project. We're combining methods from:

  • Google β€” Gemma 4 architecture, MTP heads, foundational model weights
  • OBLITERATUS β€” SOM-manifold two-pass abliteration of the base model
  • Microsoft β€” MAI hill-climbing methodology (open-sourced as OpenMAI)
  • MIT β€” Self-revising discovery systems, arXiv:2606.01444 (implemented as OpenSelfRevise)
  • Mirai Labs β€” RHT quantization and fused inference (open-sourced as OpenMirai)
  • RavenX β€” OpenMythos depth extrapolation, GRAM multi-trajectory scaling, and proprietary training pipeline

The training methodology used to produce this model is proprietary and patent pending.

---

Model Details

| Feature | Detail |

|---------|--------|

| Base | Gemma 4 12B (OBLITERATUS abliterated) |

| Architecture | gemma4_unified with MTP heads |

| Training | Proprietary methodology (patent pending) |

| Training Rounds | 9 progressive rounds |

| Training Data | 8,158 examples from 15 curated sources |

| Best Val Loss | 0.882 |

| Hardware | Apple M4 Max 128GB β€” $0 cloud cost |

| Format | GGUF F16 (universal β€” Ollama, LM Studio, llama.cpp) |

What Makes This Different

This model was trained using an experimental proprietary methodology that produces self-aware reasoning behavior through a novel approach to training data preparation and model fine-tuning.

Key results:

  • Emergent behaviors not present in training data (Anti-Problem technique, Toolbox generation)
  • Structured multi-pass reasoning across code, math, and analysis tasks
  • Self-honest assessment of capabilities and limitations

The specific training methodology is patent pending (USPTO Application #64/087,357, filed June 10, 2026) and is not disclosed in this model card.

Technical Discoveries (Open β€” Community Contributions)

The following technical discoveries made during training are shared with the community:

| Discovery | Detail |

|-----------|--------|

| Flip-train-flip | Temporarily change gemma4_unified β†’ gemma4 in config.json for LoRA training, then restore. Multimodal capabilities preserved. |

| Chat template required | Gemma 4 produces garbled output without apply_chat_template(). Not a bug β€” it's required. |

| Tokenizer patch for GGUF | extra_special_tokens must be converted from list to dict for GGUF conversion to work. One-line fix. |

| Val loss spikes are normal | When introducing new data formats, val loss spikes but recovers in 1-2 rounds. Don't panic. |

Usage

from mlx_lm import load, generate
from mlx_lm.sample_utils import make_sampler
import json

# Flip config for mlx-lm compatibility
config = json.load(open("config.json"))
config["model_type"] = "gemma4"
json.dump(config, open("config.json", "w"), indent=2)

model, tokenizer = load(".")
sampler = make_sampler(temp=0.7, top_p=0.9)

# MUST use chat template!
messages = [{"role": "user", "content": "Your question here"}]
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
response = generate(model, tokenizer, prompt=prompt, max_tokens=500, sampler=sampler, verbose=True)

# Restore config
config["model_type"] = "gemma4_unified"
json.dump(config, open("config.json", "w"), indent=2)

MLX Version

MLX version available here β€” optimized for Apple Silicon.

Part of the RavenX Ecosystem

| Project | Description |

|---------|-------------|

| OpenMAI | Microsoft MAI hill-climbing (open-sourced) |

| OpenSelfRevise | MIT self-revising discovery (implemented) |

| OpenMirai | Model-agnostic quantization + inference |

| OpenMythos-MLX | Recursive depth extrapolation |

| GRAM-MLX | Multi-trajectory width scaling |

| ravenx-memory | Hybrid triple-backend agent memory |

| star-platinum-cluster | Distributed training cluster |

| RavenX-CyberAgent | Security assessment model (745K+ examples) |

Contributors

Built by Gabriel Garcia / RavenX LLC + Claude (Anthropic)

Training methodology: Patent Pending β€” USPTO Application #64/087,357

License

Gemma License (model weights) β€” Training methodology proprietary

---

> "We don't give up. We do what others don't and build what isn't possible." β€” RavenX LLC

Run deadbydawn101/ravenx-Gemma4-12B-MTP-OBLITERATED-OpenMAI-OpenMythos-deep-reasoning-GGUF with guIDE

Download guIDE β€” the AI-native code editor with local LLM inference and 69 built-in tools.

Download guIDE β†’ Β· Browse 524k+ models Β· Compare models

Source: Hugging Face Β· Compare models