Model Intelligence Sheet
quant-cartel/llama-3.05-nemotron-tenyxchat-storybreaker-70b-imat-gguf overview
Quantized with love from fp16. Original model author: Envoid Importance Matrix calculated using groups_merged.txt 88 chunks n_ctx=512 Calculation uses f16 precision model weights Original model README here and below: # Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B is a 40/60 SLERP Merge of Envoid/Llama-3-TenyxChat-DaybreakStorywriter-70B onto nvidia/Llama-3.1-Nemotron-70B-Instruct-HF utilizing the following config:
Downloads
112
Likes
1
Pipeline
text-generation
Library
—
Visibility
Public
Access
Open
Repository Files & Downloads
23 files detected
Direct downloads for all repository files
| File | Type | Quantization | Size | Link |
|---|---|---|---|---|
| Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-IQ1_M.gguf | GGUF | IQ1_M | 15.60 GB | Download |
| Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-IQ1_S.gguf | GGUF | IQ1_S | 14.29 GB | Download |
| Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-IQ2_M.gguf | GGUF | IQ2_M | 22.46 GB | Download |
| Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-IQ2_S.gguf | GGUF | IQ2_S | 20.71 GB | Download |
| Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-IQ2_XS.gguf | GGUF | IQ2_XS | 19.69 GB | Download |
| Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-IQ2_XXS.gguf | GGUF | IQ2_XXS | 17.79 GB | Download |
| Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-IQ3_M.gguf | GGUF | IQ3_M | 29.74 GB | Download |
| Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-IQ3_S.gguf | GGUF | IQ3_S | 28.79 GB | Download |
| Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-IQ3_XS.gguf | GGUF | IQ3_XS | 27.29 GB | Download |
| Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-IQ3_XXS.gguf | GGUF | IQ3_XXS | 25.58 GB | Download |
| Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-IQ4_XS.gguf | GGUF | IQ4_XS | 35.30 GB | Download |
| Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-Q2_K.gguf | GGUF | Q2_K | 24.56 GB | Download |
| Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-Q3_K_L.gguf | GGUF | Q3_K_L | 34.59 GB | Download |
| Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-Q3_K_M.gguf | GGUF | Q3_K_M | 31.91 GB | Download |
| Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-Q3_K_S.gguf | GGUF | Q3_K_S | 28.79 GB | Download |
| Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-Q4_K_M.gguf | GGUF | Q4_K_M | 39.60 GB | Download |
| Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-Q4_K_S.gguf | GGUF | Q4_K_S | 37.58 GB | Download |
| Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-Q5_K_M.gguf | GGUF | Q5_K_M | 46.52 GB | Download |
| Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-Q5_K_S.gguf | GGUF | Q5_K_S | 45.32 GB | Download |
| Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-Q6_K-00001-of-00002.gguf | GGUF | Q6_K | 46.43 GB | Download |
| Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-Q6_K-00002-of-00002.gguf | GGUF | Q6_K | 7.49 GB | Download |
| Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-Q8_0-00001-of-00002.gguf | GGUF | — | 46.35 GB | Download |
| Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-Q8_0-00002-of-00002.gguf | GGUF | — | 23.48 GB | Download |
Model Details Live
Metadata Inspector
Normalized metadata (stored in metadata_json)
{
"metadata": {},
"card_data": {
"license": "cc-by-nc-4.0",
"base_model_relation": "quantized",
"quantized_by": "Quant-Cartel",
"base_model": "Envoid/Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B",
"pipeline_tag": "text-generation",
"tags": [
"iMat",
"GGUF",
"merge"
],
"frontmatter": {
"license": "cc-by-nc-4.0",
"base_model_relation": "quantized",
"quantized_by": "Quant-Cartel",
"base_model": "Envoid/Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B",
"pipeline_tag": "text-generation",
"tags": [
"iMat",
"GGUF",
"merge"
]
},
"hero_image_url": "https://files.catbox.moe/07cjw5.jpg",
"summary": "Quantized with love from fp16. Original model author: Envoid * Importance Matrix calculated using groups_merged.txt * 88 chunks * n_ctx=512 * Calculation uses f16 precision model weights Original model README here and below:  # Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B is a 40/60 SLERP Merge of Envoid/Llama-3-TenyxChat-DaybreakStorywriter-70B onto nvidia/Llama-3.1-Nemotron-70B-Instruct-HF utilizing the following config: `` models: merge_method: slerp base_model: ./nvidia_Llama-3.1-Nemotron-70B-Instruct-HF parameters: t: dtype: bfloat16 ``",
"quick_links": [],
"benchmark_table_html": "",
"readme_markdown": "---\nlicense: cc-by-nc-4.0\nbase_model_relation: quantized\nquantized_by: Quant-Cartel\nbase_model: Envoid/Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B\npipeline_tag: text-generation\ntags:\n- iMat\n- GGUF\n- merge\n---\n```\n e88 88e d8 \n d888 888b 8888 8888 ,\"Y88b 888 8e d88 \nC8888 8888D 8888 8888 \"8\" 888 888 88b d88888 \n Y888 888P Y888 888P ,ee 888 888 888 888 \n \"88 88\" \"88 88\" \"88 888 888 888 888 \n b \n 8b, \n \n e88'Y88 d8 888 \n d888 'Y ,\"Y88b 888,8, d88 ,e e, 888 \nC8888 \"8\" 888 888 \" d88888 d88 88b 888 \n Y888 ,d ,ee 888 888 888 888 , 888 \n \"88,d88 \"88 888 888 888 \"YeeP\" 888 \n \nPROUDLY PRESENTS \n```\n# Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-GGUF\n\nQuantized with love from fp16.\n\nOriginal model author: [Envoid](https://huggingface.co/Envoid/)\n\n* Importance Matrix calculated using [groups_merged.txt](https://github.com/ggerganov/llama.cpp/discussions/5263#discussioncomment-8395384)\n* 88 chunks\n* n_ctx=512\n* Calculation uses f16 precision model weights\n\nOriginal model README [here](https://huggingface.co/Envoid/Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B) and below:\n\n\n\n# Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B\n\nis a 40/60 SLERP Merge of [Envoid/Llama-3-TenyxChat-DaybreakStorywriter-70B](https://huggingface.co/Envoid/Llama-3-TenyxChat-DaybreakStorywriter-70B?not-for-all-audiences=true) onto [nvidia/Llama-3.1-Nemotron-70B-Instruct-HF](https://huggingface.co/nvidia/Llama-3.1-Nemotron-70B-Instruct-HF) utilizing the following config:\n```\nmodels:\n - model: ./Envoid_Llama-3-TenyxChat-DaybreakStorywriter-70B\n - model: ./nvidia_Llama-3.1-Nemotron-70B-Instruct-HF\nmerge_method: slerp\nbase_model: ./nvidia_Llama-3.1-Nemotron-70B-Instruct-HF\nparameters:\n t:\n - value: 0.4\ndtype: bfloat16\n```\n## Caution: As is always the case with SLERP merges there may be edge cases inwhich certain unintended model behaviors emerge. So always use with caution.\n\nThe 'sloppiness' of Nemotron seems to be somewhat reigned in (but still exists) while maintaining its personable assistant personality and safety (In assistant mode it will still prompt you with a warning before producing sensitive content).\n\nOverall it provides a solid option for RP and creative writing while still functioning as an assistant model, if desired. If used to continue a roleplay it will generally follow the ongoing cadence of the conversation.\n\n### It utilizes the Llama 3 prompt format.\n",
"related_quantizations": []
},
"tags": [
"gguf",
"iMat",
"GGUF",
"merge",
"text-generation",
"base_model:Envoid/Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B",
"base_model:quantized:Envoid/Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B",
"license:cc-by-nc-4.0",
"endpoints_compatible",
"region:us",
"imatrix",
"conversational"
],
"likes": 1,
"downloads": 112,
"gated": false,
"private": false,
"last_modified": "2024-10-19T10:29:27.000Z",
"created_at": "2024-10-17T15:58:33.000Z",
"pipeline_tag": "text-generation",
"library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
"_id": "67113429cc054ebaac22ac78",
"id": "Quant-Cartel/Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-GGUF",
"modelId": "Quant-Cartel/Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-GGUF",
"sha": "7754210a2144773eacf199b2806362c7fe86b9c6",
"createdAt": "2024-10-17T15:58:33.000Z",
"lastModified": "2024-10-19T10:29:27.000Z",
"author": "Quant-Cartel",
"downloads": 112,
"likes": 1,
"gated": false,
"private": false,
"pipeline_tag": "text-generation",
"library_name": "",
"siblings_count": 25
}