lhca521/MiniMax-M2.7-Abliterated-Heretic-GGUF is indexed on GraySoft with repository links, GGUF quant files (including Q4_K_M), and Hugging Face metadata. Use this page to pick a quantization for guIDE or another local runtime. See related models in the same shard below.
Model Intelligence Sheet
lhca521/minimax-m2.7-abliterated-heretic-gguf overview
This is a GGUF release of an abliterated version of MiniMaxAI's MiniMax-M2.7. By applying Heretic's Ablated Refusal Adaptation (ARA), the base refusal behavior was removed at the weight level. The result keeps MiniMax-M2.7's sparse MoE reasoning, long-context instruction following, and general capability profile, but no longer defaults to the original refusal pattern.
| Field | Value |
|---|---|
| Downloads | 222 |
| Likes | 0 |
| Pipeline | text-generation |
| Library | gguf |
| Visibility | Public |
| Access | Open |
Repository Files & Downloads
29 files detected
Direct downloads for all repository files
| File | Type | Quantization | Size | Link |
|---|---|---|---|---|
| MiniMax-M2.7-abliterated-BF16-00001-of-00010.gguf | GGUF | BF16 | 44.50 GB | Download |
| MiniMax-M2.7-abliterated-BF16-00002-of-00010.gguf | GGUF | BF16 | 45.60 GB | Download |
| MiniMax-M2.7-abliterated-BF16-00003-of-00010.gguf | GGUF | BF16 | 45.51 GB | Download |
| MiniMax-M2.7-abliterated-BF16-00004-of-00010.gguf | GGUF | BF16 | 45.60 GB | Download |
| MiniMax-M2.7-abliterated-BF16-00005-of-00010.gguf | GGUF | BF16 | 45.60 GB | Download |
| MiniMax-M2.7-abliterated-BF16-00006-of-00010.gguf | GGUF | BF16 | 45.51 GB | Download |
| MiniMax-M2.7-abliterated-BF16-00007-of-00010.gguf | GGUF | BF16 | 45.60 GB | Download |
| MiniMax-M2.7-abliterated-BF16-00008-of-00010.gguf | GGUF | BF16 | 45.60 GB | Download |
| MiniMax-M2.7-abliterated-BF16-00009-of-00010.gguf | GGUF | BF16 | 45.51 GB | Download |
| MiniMax-M2.7-abliterated-BF16-00010-of-00010.gguf | GGUF | BF16 | 17.06 GB | Download |
| MiniMax-M2.7-abliterated-Q2_K.gguf-00001-of-00002.gguf | GGUF | Q2_K | 41.62 GB | Download |
| MiniMax-M2.7-abliterated-Q2_K.gguf-00002-of-00002.gguf | GGUF | Q2_K | 35.96 GB | Download |
| MiniMax-M2.7-abliterated-Q3_K_M.gguf-00001-of-00003.gguf | GGUF | Q3_K_M | 41.74 GB | Download |
| MiniMax-M2.7-abliterated-Q3_K_M.gguf-00002-of-00003.gguf | GGUF | Q3_K_M | 41.69 GB | Download |
| MiniMax-M2.7-abliterated-Q3_K_M.gguf-00003-of-00003.gguf | GGUF | Q3_K_M | 18.34 GB | Download |
| MiniMax-M2.7-abliterated-Q4_K_M.gguf-00001-of-00004.gguf | GGUF | Q4_K_M | 41.85 GB | Download |
| MiniMax-M2.7-abliterated-Q4_K_M.gguf-00002-of-00004.gguf | GGUF | Q4_K_M | 41.81 GB | Download |
| MiniMax-M2.7-abliterated-Q4_K_M.gguf-00003-of-00004.gguf | GGUF | Q4_K_M | 41.69 GB | Download |
| MiniMax-M2.7-abliterated-Q4_K_M.gguf-00004-of-00004.gguf | GGUF | Q4_K_M | 3.48 GB | Download |
| MiniMax-M2.7-abliterated-Q6_K.gguf-00001-of-00005.gguf | GGUF | Q6_K | 41.18 GB | Download |
| MiniMax-M2.7-abliterated-Q6_K.gguf-00002-of-00005.gguf | GGUF | Q6_K | 41.15 GB | Download |
| MiniMax-M2.7-abliterated-Q6_K.gguf-00003-of-00005.gguf | GGUF | Q6_K | 41.12 GB | Download |
| MiniMax-M2.7-abliterated-Q6_K.gguf-00004-of-00005.gguf | GGUF | Q6_K | 41.15 GB | Download |
| MiniMax-M2.7-abliterated-Q6_K.gguf-00005-of-00005.gguf | GGUF | Q6_K | 10.26 GB | Download |
| MiniMax-M2.7-abliterated-Q8_0-00001-of-00005.gguf | GGUF | Q8_0 | 46.05 GB | Download |
| MiniMax-M2.7-abliterated-Q8_0-00002-of-00005.gguf | GGUF | Q8_0 | 46.03 GB | Download |
| MiniMax-M2.7-abliterated-Q8_0-00003-of-00005.gguf | GGUF | Q8_0 | 45.98 GB | Download |
| MiniMax-M2.7-abliterated-Q8_0-00004-of-00005.gguf | GGUF | Q8_0 | 46.02 GB | Download |
| MiniMax-M2.7-abliterated-Q8_0-00005-of-00005.gguf | GGUF | Q8_0 | 42.35 GB | Download |
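Because every quantization is split into several shards, the per-file sizes above do not show the total download at a glance. The following is a minimal Python sketch that adds them up per quantization. The sizes are copied from the table; the names `SHARD_SIZES_GB` and `total_size_gb` are hypothetical helpers introduced only for this illustration.

```python
# Shard sizes in GB, copied from the file table above.
SHARD_SIZES_GB = {
    "BF16": [44.50, 45.60, 45.51, 45.60, 45.60, 45.51, 45.60, 45.60, 45.51, 17.06],
    "Q2_K": [41.62, 35.96],
    "Q3_K_M": [41.74, 41.69, 18.34],
    "Q4_K_M": [41.85, 41.81, 41.69, 3.48],
    "Q6_K": [41.18, 41.15, 41.12, 41.15, 10.26],
    "Q8_0": [46.05, 46.03, 45.98, 46.02, 42.35],
}


def total_size_gb(quant: str) -> float:
    """Return the combined size of all shards for one quantization."""
    return round(sum(SHARD_SIZES_GB[quant]), 2)


if __name__ == "__main__":
    # Print one summary line per quantization: shard count and total size.
    for quant, shards in SHARD_SIZES_GB.items():
        print(f"{quant:7s} {len(shards):2d} shards  {total_size_gb(quant):7.2f} GB")
```

By this arithmetic Q4_K_M totals about 128.83 GB across four shards and BF16 about 426.09 GB across ten. These are disk footprints only; the memory needed at runtime also depends on the context length and KV cache settings you choose.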
Model Details Live
Metadata Inspector
Normalized metadata (stored in metadata_json)
{
"metadata": {},
"card_data": {
"base_model": "MiniMaxAI/MiniMax-M2.7",
"library_name": "gguf",
"pipeline_tag": "text-generation",
"license": "other",
"license_name": "non-commercial",
"license_link": "https://github.com/MiniMax-AI/MiniMax-M2.7/blob/main/LICENSE",
"tags": [
"gguf",
"minimax",
"minimax_m2",
"moe",
"mixture-of-experts",
"abliterated",
"uncensored",
"heretic",
"ara",
"llama-cpp"
],
"quantized_by": "Youssofal",
"frontmatter": {
"base_model": "MiniMaxAI/MiniMax-M2.7",
"library_name": "gguf",
"pipeline_tag": "text-generation",
"license": "other",
"license_name": "non-commercial",
"license_link": "https://github.com/MiniMax-AI/MiniMax-M2.7/blob/main/LICENSE",
"tags": [
"gguf",
"minimax",
"minimax_m2",
"moe",
"mixture-of-experts",
"abliterated",
"uncensored",
"heretic",
"ara",
"llama-cpp"
],
"quantized_by": "Youssofal"
},
"hero_image_url": "",
"summary": "This is a GGUF release of an abliterated version of MiniMaxAI's MiniMax-M2.7. By applying Heretic's Ablated Refusal Adaptation (ARA), the base refusal behavior was removed at the weight level. The result keeps MiniMax-M2.7's sparse MoE reasoning, long-context instruction following, and general capability profile, but no longer defaults to the original refusal pattern.",
"quick_links": [],
"benchmark_table_html": "",
"readme_markdown": "---\nbase_model: MiniMaxAI/MiniMax-M2.7\nlibrary_name: gguf\npipeline_tag: text-generation\nlicense: other\nlicense_name: non-commercial\nlicense_link: https://github.com/MiniMax-AI/MiniMax-M2.7/blob/main/LICENSE\ntags:\n - gguf\n - minimax\n - minimax_m2\n - moe\n - mixture-of-experts\n - abliterated\n - uncensored\n - heretic\n - ara\n - llama-cpp\nquantized_by: Youssofal\n---\n\n# MiniMax-M2.7-Abliterated-Heretic-GGUF\n\nThis is a GGUF release of an abliterated version of MiniMaxAI's MiniMax-M2.7.\n\nBy applying Heretic's Ablated Refusal Adaptation (ARA), the base refusal behavior was removed at the weight level. The result keeps MiniMax-M2.7's sparse MoE reasoning, long-context instruction following, and general capability profile, but no longer defaults to the original refusal pattern.\n\n## Methodology & Model Notes\n\nMiniMax-M2.7 is a 229B sparse MoE model with 10B active parameters per token, 62 layers, hybrid attention, 256 local experts with 8 active per token, and a 200K context window.\n\nThis release was produced with a direct Heretic ARA run using the fixed parameter set below:\n\n- `start_layer_index = 30`\n- `end_layer_index = 51`\n- `preserve_good_behavior_weight = 0.4512`\n- `steer_bad_behavior_weight = 0.0037`\n- `overcorrect_relative_weight = 0.8804`\n- `neighbor_count = 14`\n\nThe direct ARA run completed with `Refusals: 0/25`.\n\nThe resulting abliterated checkpoint was exported to BF16 and then converted to GGUF for llama.cpp-compatible deployment.\n\n## Files\n\n- `MiniMax-M2.7-abliterated-BF16/`: BF16 GGUF split into 10 parts\n- `MiniMax-M2.7-abliterated-Q8_0/`: Q8_0 GGUF split into 5 parts\n- `MiniMax-M2.7-abliterated-Q3_K_M/`: Q3_K_M GGUF split for Hub delivery\n- Additional quants will be added from the same abliterated BF16 GGUF source\n\n## Prompt Format\n\n```text\n]~!b[]~b]system\n{system_prompt}[e~[\n]~b]user\n{prompt}[e~[\n]~b]ai\n<think>\n```\n\n## Running\n\n```bash\nllama-server \\\n -m <quant-file.gguf> \\\n -ngl 999 -c 32768 --jinja \\\n --reasoning-format auto -fa \\\n --temp 1.0 --top-p 0.95 --top-k 40\n```\n\n## Model Architecture\n\n| Spec | Value |\n|---|---|\n| Total Parameters | 229B (sparse MoE) |\n| Active Parameters | 10B per token |\n| Experts | 256 local, 8 per token |\n| Layers | 62 |\n| Attention | Hybrid: 7 Lightning + 1 softmax per 8-block |\n| Context | 200K |\n| Base Model | MiniMaxAI/MiniMax-M2.7 |\n\n## Disclaimer\n\nThis model has had refusal behavior removed at the weight level. It will answer prompts that the base model would normally refuse. You are responsible for how you use it.\n\n## Credits\n\n- Base model: [MiniMaxAI/MiniMax-M2.7](https://huggingface.co/MiniMaxAI/MiniMax-M2.7)\n- Refusal removal pipeline: [Heretic](https://github.com/andyrdt/heretic) with the ARA method\n- GGUF runtime and quantization: [llama.cpp](https://github.com/ggml-org/llama.cpp)\n\n## License\n\nThis release inherits the base MiniMax-M2.7 license.\n\n**NON-COMMERCIAL.** Commercial use requires written authorization from MiniMax.\n",
"related_quantizations": []
},
"tags": [
"gguf",
"minimax",
"minimax_m2",
"moe",
"mixture-of-experts",
"abliterated",
"uncensored",
"heretic",
"ara",
"llama-cpp",
"text-generation",
"base_model:MiniMaxAI/MiniMax-M2.7",
"base_model:quantized:MiniMaxAI/MiniMax-M2.7",
"license:other",
"endpoints_compatible",
"region:us",
"conversational"
],
"likes": 0,
"downloads": 222,
"gated": false,
"private": false,
"last_modified": "2026-04-15T13:41:50.000Z",
"created_at": "2026-04-15T13:41:50.000Z",
"pipeline_tag": "text-generation",
"library_name": "gguf"
}
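The `readme_markdown` field above records a raw prompt format with `system`, `user`, and `ai` turns. A minimal Python sketch of filling that template by hand follows, assuming the template text is exactly as quoted in the README and that the trailing newline after `<think>` is intended. `PROMPT_TEMPLATE` and `build_prompt` are hypothetical names, not part of any library.

```python
# Raw prompt template, transcribed from the "Prompt Format" section of the README.
PROMPT_TEMPLATE = (
    "]~!b[]~b]system\n"
    "{system_prompt}[e~[\n"
    "]~b]user\n"
    "{prompt}[e~[\n"
    "]~b]ai\n"
    "<think>\n"
)


def build_prompt(system_prompt: str, prompt: str) -> str:
    """Fill the README's prompt template with one system and one user turn."""
    # str.format substitutes the two placeholders; braces inside the
    # substituted values are inserted literally and not re-interpreted.
    return PROMPT_TEMPLATE.format(system_prompt=system_prompt, prompt=prompt)


if __name__ == "__main__":
    print(build_prompt("You are a concise assistant.", "Summarize GGUF in one line."))
```

Manual formatting like this is only likely to matter for raw completion requests. The README's `llama-server` command passes `--jinja`, in which case the server applies the chat template embedded in the GGUF file to chat requests.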
Source payload excerpt (from Hugging Face API)
{
"_id": "69df959e0373e07ad9a11065",
"id": "lhca521/MiniMax-M2.7-Abliterated-Heretic-GGUF",
"modelId": "lhca521/MiniMax-M2.7-Abliterated-Heretic-GGUF",
"sha": "7670790baedc41e7e0b8a6e038bc88ba51856e44",
"createdAt": "2026-04-15T13:41:50.000Z",
"lastModified": "2026-04-15T13:41:50.000Z",
"author": "lhca521",
"downloads": 222,
"likes": 0,
"gated": false,
"private": false,
"pipeline_tag": "text-generation",
"library_name": "gguf",
"siblings_count": 31
}
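The payload above carries the repository `id` and commit `sha`, which is enough to fetch only the shards of one quantization instead of all 29 GGUF files. The sketch below assumes the `huggingface_hub` package is installed and uses its `snapshot_download` function with `allow_patterns`. The helpers `shard_pattern`, `select_shards`, and `download_quant` are hypothetical, and the glob pattern assumes the file naming shown in the file table.

```python
from fnmatch import fnmatch

# Repository id and commit sha, copied from the Hugging Face payload above.
REPO_ID = "lhca521/MiniMax-M2.7-Abliterated-Heretic-GGUF"
REVISION = "7670790baedc41e7e0b8a6e038bc88ba51856e44"


def shard_pattern(quant: str) -> str:
    """Glob pattern intended to match every shard of one quantization."""
    return f"*-{quant}*.gguf"


def select_shards(filenames, quant):
    """Return the sorted file names that belong to the given quantization."""
    pattern = shard_pattern(quant)
    return sorted(name for name in filenames if fnmatch(name, pattern))


def download_quant(quant: str, local_dir: str) -> str:
    """Download only the shards of one quantization; returns the local path."""
    # Imported lazily so the helpers above work without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=REPO_ID,
        revision=REVISION,  # pin to the commit listed in the payload
        allow_patterns=[shard_pattern(quant)],
        local_dir=local_dir,
    )


if __name__ == "__main__":
    # Downloads roughly 129 GB for Q4_K_M; make sure the disk has room first.
    print(download_quant("Q4_K_M", "./minimax-m2.7-q4_k_m"))
```

Once the shards are on disk, llama.cpp is generally pointed at the first shard (`...-00001-of-0000N.gguf`) and picks up the remaining parts from the same directory, provided the split naming is one it recognizes.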