duyntnet/mistral-nemo-12b-arliai-rpmax-v1.1-imatrix-gguf IQ3_XXS GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.
duyntnet/mistral-nemo-12b-arliai-rpmax-v1.1-imatrix-gguf overview
ArliAI-RPMax-12B-v1.1 is a variant based on Mistral Nemo 12B Instruct 2407. This is arguably the most successful RPMax model due to how Mistral is already very uncensored in the first place. ### Training Details Sequence Length: 8192 Training Duration: Approximately 2 days on 2x3090Ti Epochs: 1 epoch training for minimized repetition sickness QLORA: 64-rank 128-alpha, resulting in ~2% trainable weights Learning Rate: 0.00001 Gradient accumulation: Very low 32 for better learning. ### Suggested Prompt Format Mistral Instruct Prompt Format ### Open LLM Leaderboard Evaluation Results Detailed results can be found here | Metric |Value| |-------------------|----:| |Avg. |20.64| |IFEval (0-Shot) |53.49| |BBH (3-Shot) |24.81| |MATH Lvl 5 (4-Shot)| 9.21| |GPQA (0-shot) | 4.25| |MuSR (0-shot) | 5.56| |MMLU-PRO (5-shot) |26.49|
Repository Files & Downloads
| File | Type | Quantization | Size | Link |
|---|---|---|---|---|
| Mistral-Nemo-12B-ArliAI-RPMax-v1.1-IQ1_M.gguf | GGUF | IQ1_M | 3.00 GB | Download |
| Mistral-Nemo-12B-ArliAI-RPMax-v1.1-IQ1_S.gguf | GGUF | IQ1_S | 2.79 GB | Download |
| Mistral-Nemo-12B-ArliAI-RPMax-v1.1-IQ2_M.gguf | GGUF | IQ2_M | 4.13 GB | Download |
| Mistral-Nemo-12B-ArliAI-RPMax-v1.1-IQ2_S.gguf | GGUF | IQ2_S | 3.85 GB | Download |
| Mistral-Nemo-12B-ArliAI-RPMax-v1.1-IQ2_XS.gguf | GGUF | IQ2_XS | 3.65 GB | Download |
| Mistral-Nemo-12B-ArliAI-RPMax-v1.1-IQ2_XXS.gguf | GGUF | IQ2_XXS | 3.35 GB | Download |
| Mistral-Nemo-12B-ArliAI-RPMax-v1.1-IQ3_M.gguf | GGUF | IQ3_M | 5.33 GB | Download |
| Mistral-Nemo-12B-ArliAI-RPMax-v1.1-IQ3_S.gguf | GGUF | IQ3_S | 5.18 GB | Download |
| Mistral-Nemo-12B-ArliAI-RPMax-v1.1-IQ3_XS.gguf | GGUF | IQ3_XS | 4.94 GB | Download |
| Mistral-Nemo-12B-ArliAI-RPMax-v1.1-IQ3_XXS.gguf | GGUF | IQ3_XXS | 4.61 GB | Download |
| Mistral-Nemo-12B-ArliAI-RPMax-v1.1-IQ4_NL.gguf | GGUF | IQ4_NL | 6.61 GB | Download |
| Mistral-Nemo-12B-ArliAI-RPMax-v1.1-IQ4_XS.gguf | GGUF | IQ4_XS | 6.28 GB | Download |
| Mistral-Nemo-12B-ArliAI-RPMax-v1.1-Q2_K.gguf | GGUF | Q2_K | 4.46 GB | Download |
| Mistral-Nemo-12B-ArliAI-RPMax-v1.1-Q2_K_S.gguf | GGUF | Q2_K_S | 4.19 GB | Download |
| Mistral-Nemo-12B-ArliAI-RPMax-v1.1-Q3_K_L.gguf | GGUF | Q3_K_L | 6.11 GB | Download |
| Mistral-Nemo-12B-ArliAI-RPMax-v1.1-Q3_K_M.gguf | GGUF | Q3_K_M | 5.67 GB | Download |
| Mistral-Nemo-12B-ArliAI-RPMax-v1.1-Q3_K_S.gguf | GGUF | Q3_K_S | 5.15 GB | Download |
| Mistral-Nemo-12B-ArliAI-RPMax-v1.1-Q4_0.gguf | GGUF | — | 6.61 GB | Download |
| Mistral-Nemo-12B-ArliAI-RPMax-v1.1-Q4_1.gguf | GGUF | — | 7.26 GB | Download |
| Mistral-Nemo-12B-ArliAI-RPMax-v1.1-Q4_K_M.gguf | GGUF | Q4_K_M | 6.96 GB | Download |
| Mistral-Nemo-12B-ArliAI-RPMax-v1.1-Q4_K_S.gguf | GGUF | Q4_K_S | 6.63 GB | Download |
| Mistral-Nemo-12B-ArliAI-RPMax-v1.1-Q5_0.gguf | GGUF | — | 7.96 GB | Download |
| Mistral-Nemo-12B-ArliAI-RPMax-v1.1-Q5_1.gguf | GGUF | — | 8.61 GB | Download |
| Mistral-Nemo-12B-ArliAI-RPMax-v1.1-Q5_K_M.gguf | GGUF | Q5_K_M | 8.13 GB | Download |
| Mistral-Nemo-12B-ArliAI-RPMax-v1.1-Q5_K_S.gguf | GGUF | Q5_K_S | 7.93 GB | Download |
| Mistral-Nemo-12B-ArliAI-RPMax-v1.1-Q6_K.gguf | GGUF | Q6_K | 9.37 GB | Download |
| Mistral-Nemo-12B-ArliAI-RPMax-v1.1-Q8_0.gguf | GGUF | — | 12.13 GB | Download |
Model Details Live
Metadata Inspector
Normalized metadata (stored in metadata_json)
{
"metadata": {},
"card_data": {
"license": "other",
"language": [
"en"
],
"pipeline_tag": "text-generation",
"inference": false,
"tags": [
"transformers",
"gguf",
"imatrix",
"Mistral-Nemo-12B-ArliAI-RPMax-v1.1"
],
"frontmatter": {
"license": "other",
"language": [
"en"
],
"pipeline_tag": "text-generation",
"inference": "false",
"tags": [
"transformers",
"gguf",
"imatrix",
"Mistral-Nemo-12B-ArliAI-RPMax-v1.1"
]
},
"hero_image_url": "",
"summary": "ArliAI-RPMax-12B-v1.1 is a variant based on Mistral Nemo 12B Instruct 2407. This is arguably the most successful RPMax model due to how Mistral is already very uncensored in the first place. ### Training Details * **Sequence Length**: 8192 * **Training Duration**: Approximately 2 days on 2x3090Ti * **Epochs**: 1 epoch training for minimized repetition sickness * **QLORA**: 64-rank 128-alpha, resulting in ~2% trainable weights * **Learning Rate**: 0.00001 * **Gradient accumulation**: Very low 32 for better learning. ### Suggested Prompt Format Mistral Instruct Prompt Format ### Open LLM Leaderboard Evaluation Results Detailed results can be found here | Metric |Value| |-------------------|----:| |Avg. |20.64| |IFEval (0-Shot) |53.49| |BBH (3-Shot) |24.81| |MATH Lvl 5 (4-Shot)| 9.21| |GPQA (0-shot) | 4.25| |MuSR (0-shot) | 5.56| |MMLU-PRO (5-shot) |26.49|",
"quick_links": [],
"benchmark_table_html": "",
"readme_markdown": "---\nlicense: other\nlanguage:\n- en\npipeline_tag: text-generation\ninference: false\ntags:\n- transformers\n- gguf\n- imatrix\n- Mistral-Nemo-12B-ArliAI-RPMax-v1.1\n---\nQuantizations of https://huggingface.co/ArliAI/Mistral-Nemo-12B-ArliAI-RPMax-v1.1\n\n### Inference Clients/UIs\n* [llama.cpp](https://github.com/ggerganov/llama.cpp)\n* [KoboldCPP](https://github.com/LostRuins/koboldcpp)\n* [ollama](https://github.com/ollama/ollama)\n* [text-generation-webui](https://github.com/oobabooga/text-generation-webui)\n* [jan](https://github.com/janhq/jan)\n* [GPT4All](https://github.com/nomic-ai/gpt4all)\n---\n\n# From original readme\n\nArliAI-RPMax-12B-v1.1 is a variant based on Mistral Nemo 12B Instruct 2407.\n\nThis is arguably the most successful RPMax model due to how Mistral is already very uncensored in the first place.\n\n### Training Details\n\n* **Sequence Length**: 8192\n* **Training Duration**: Approximately 2 days on 2x3090Ti\n* **Epochs**: 1 epoch training for minimized repetition sickness\n* **QLORA**: 64-rank 128-alpha, resulting in ~2% trainable weights\n* **Learning Rate**: 0.00001\n* **Gradient accumulation**: Very low 32 for better learning.\n\n### Suggested Prompt Format\n\nMistral Instruct Prompt Format\n\n### [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)\nDetailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_ArliAI__ArliAI-RPMax-12B-v1.1)\n\n| Metric |Value|\n|-------------------|----:|\n|Avg. |20.64|\n|IFEval (0-Shot) |53.49|\n|BBH (3-Shot) |24.81|\n|MATH Lvl 5 (4-Shot)| 9.21|\n|GPQA (0-shot) | 4.25|\n|MuSR (0-shot) | 5.56|\n|MMLU-PRO (5-shot) |26.49|\n\n",
"related_quantizations": []
},
"tags": [
"transformers",
"gguf",
"imatrix",
"Mistral-Nemo-12B-ArliAI-RPMax-v1.1",
"text-generation",
"en",
"license:other",
"region:us",
"conversational"
],
"likes": 0,
"downloads": 228,
"gated": false,
"private": false,
"last_modified": "2025-02-27T02:56:29.000Z",
"created_at": "2025-02-27T00:55:35.000Z",
"pipeline_tag": "text-generation",
"library_name": "transformers"
}
Source payload excerpt (from Hugging Face API)
{
"_id": "67bfb807e3df937c41e0f08b",
"id": "duyntnet/Mistral-Nemo-12B-ArliAI-RPMax-v1.1-imatrix-GGUF",
"modelId": "duyntnet/Mistral-Nemo-12B-ArliAI-RPMax-v1.1-imatrix-GGUF",
"sha": "8accb14e298f79944f89face7dd8a707b4897281",
"createdAt": "2025-02-27T00:55:35.000Z",
"lastModified": "2025-02-27T02:56:29.000Z",
"author": "duyntnet",
"downloads": 228,
"likes": 0,
"gated": false,
"private": false,
"pipeline_tag": "text-generation",
"library_name": "transformers",
"siblings_count": 29
}