deadbydawn101/ravenx-Gemma4-12B-MTP-OBLITERATED-OpenMAI-OpenMythos-deep-reasoning-GGUF overview
π₯ WORLD FIRST: RavenX Γ Gemma 4 12B MTP OBLITERATED β Deep Reasoning <p align="center" <b The first trained Gemma 4 12B on the planet.</b <br <b Proprietary tβ¦
Runs locally from ~22.20 GB disk (24 GB VRAM class GPUs with llama.cpp / guIDE).
Repository Files & Downloads
| File | Type | Quantization | Size | Link |
|---|---|---|---|---|
| ravenx-Gemma4-12B-MTP-F16.gguf | GGUF | F16 | 22.20 GB | Download |
Model Details
| Model ID | deadbydawn101/ravenx-Gemma4-12B-MTP-OBLITERATED-OpenMAI-OpenMythos-deep-reasoning-GGUF |
|---|---|
| Author | deadbydawn101 |
| Pipeline | text-generation |
| License | other |
| Base model | OBLITERATUS/Gemma-4-12B-OBLITERATED |
| Last modified | 2026-06-11T00:28:50.000Z |
Model README
---
license: other
license_name: gemma
license_link: https://ai.google.dev/gemma/terms
library_name: gguf
tags:
- mlx
- gemma4
- deep-reasoning
- apple-silicon
- ravenx
- MTP
- abliterated
- OBLITERATED
- community-project
base_model: OBLITERATUS/Gemma-4-12B-OBLITERATED
datasets:
- deadbydawn101/ravenx-gemma4-deep-reasoning-data
pipeline_tag: text-generation
---
π₯ WORLD FIRST: RavenX Γ Gemma 4 12B MTP OBLITERATED β Deep Reasoning
<p align="center">
<b>The first trained Gemma 4 12B on the planet.</b><br>
<b>Proprietary training methodology. MTP-ready architecture. $0 cloud cost.</b><br>
<b>GGUF F16 format β runs on ANY hardware (Ollama, LM Studio, llama.cpp, vLLM).</b>
</p>
> Gemma 4 was released June 3, 2026 β its gemma4_unified architecture wasn't supported by ANY training framework. We developed proprietary techniques to train it successfully.
---
β οΈ DISCLAIMER
This model is an experimental research proof of concept. Provided AS-IS for educational and research purposes only. The base model is abliterated (refusal filters removed). Use responsibly.
---
Community Project
This is a community project. We're combining methods from:
- Google β Gemma 4 architecture, MTP heads, foundational model weights
- OBLITERATUS β SOM-manifold two-pass abliteration of the base model
- Microsoft β MAI hill-climbing methodology (open-sourced as OpenMAI)
- MIT β Self-revising discovery systems, arXiv:2606.01444 (implemented as OpenSelfRevise)
- Mirai Labs β RHT quantization and fused inference (open-sourced as OpenMirai)
- RavenX β OpenMythos depth extrapolation, GRAM multi-trajectory scaling, and proprietary training pipeline
The training methodology used to produce this model is proprietary and patent pending.
---
Model Details
| Feature | Detail |
|---------|--------|
| Base | Gemma 4 12B (OBLITERATUS abliterated) |
| Architecture | gemma4_unified with MTP heads |
| Training | Proprietary methodology (patent pending) |
| Training Rounds | 9 progressive rounds |
| Training Data | 8,158 examples from 15 curated sources |
| Best Val Loss | 0.882 |
| Hardware | Apple M4 Max 128GB β $0 cloud cost |
| Format | GGUF F16 (universal β Ollama, LM Studio, llama.cpp) |
What Makes This Different
This model was trained using an experimental proprietary methodology that produces self-aware reasoning behavior through a novel approach to training data preparation and model fine-tuning.
Key results:
- Emergent behaviors not present in training data (Anti-Problem technique, Toolbox generation)
- Structured multi-pass reasoning across code, math, and analysis tasks
- Self-honest assessment of capabilities and limitations
The specific training methodology is patent pending (USPTO Application #64/087,357, filed June 10, 2026) and is not disclosed in this model card.
Technical Discoveries (Open β Community Contributions)
The following technical discoveries made during training are shared with the community:
| Discovery | Detail |
|-----------|--------|
| Flip-train-flip | Temporarily change gemma4_unified β gemma4 in config.json for LoRA training, then restore. Multimodal capabilities preserved. |
| Chat template required | Gemma 4 produces garbled output without apply_chat_template(). Not a bug β it's required. |
| Tokenizer patch for GGUF | extra_special_tokens must be converted from list to dict for GGUF conversion to work. One-line fix. |
| Val loss spikes are normal | When introducing new data formats, val loss spikes but recovers in 1-2 rounds. Don't panic. |
Usage
from mlx_lm import load, generate
from mlx_lm.sample_utils import make_sampler
import json
# Flip config for mlx-lm compatibility
config = json.load(open("config.json"))
config["model_type"] = "gemma4"
json.dump(config, open("config.json", "w"), indent=2)
model, tokenizer = load(".")
sampler = make_sampler(temp=0.7, top_p=0.9)
# MUST use chat template!
messages = [{"role": "user", "content": "Your question here"}]
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
response = generate(model, tokenizer, prompt=prompt, max_tokens=500, sampler=sampler, verbose=True)
# Restore config
config["model_type"] = "gemma4_unified"
json.dump(config, open("config.json", "w"), indent=2)
MLX Version
MLX version available here β optimized for Apple Silicon.
Part of the RavenX Ecosystem
| Project | Description |
|---------|-------------|
| OpenMAI | Microsoft MAI hill-climbing (open-sourced) |
| OpenSelfRevise | MIT self-revising discovery (implemented) |
| OpenMirai | Model-agnostic quantization + inference |
| OpenMythos-MLX | Recursive depth extrapolation |
| GRAM-MLX | Multi-trajectory width scaling |
| ravenx-memory | Hybrid triple-backend agent memory |
| star-platinum-cluster | Distributed training cluster |
| RavenX-CyberAgent | Security assessment model (745K+ examples) |
Contributors
Built by Gabriel Garcia / RavenX LLC + Claude (Anthropic)
Training methodology: Patent Pending β USPTO Application #64/087,357
License
Gemma License (model weights) β Training methodology proprietary
---
> "We don't give up. We do what others don't and build what isn't possible." β RavenX LLC
Run deadbydawn101/ravenx-Gemma4-12B-MTP-OBLITERATED-OpenMAI-OpenMythos-deep-reasoning-GGUF with guIDE
Download guIDE β the AI-native code editor with local LLM inference and 69 built-in tools.
Source: Hugging Face Β· Compare models