GraySoft
Projects Models About FAQ Contact Download guIDE →
Model Intelligence Sheet

quant-cartel/llama-3.05-nemotron-tenyxchat-storybreaker-70b-imat-gguf overview

Quantized with love from fp16. Original model author: Envoid Importance Matrix calculated using groups_merged.txt 88 chunks n_ctx=512 Calculation uses f16 precision model weights Original model README here and below: # Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B is a 40/60 SLERP Merge of Envoid/Llama-3-TenyxChat-DaybreakStorywriter-70B onto nvidia/Llama-3.1-Nemotron-70B-Instruct-HF utilizing the following config:

ggufiMatGGUFmergetext-generationbase_model:Envoid/Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70Bbase_model:quantized:Envoid/Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70Blicense:cc-by-nc-4.0endpoints_compatibleregion:usimatrixconversational
quant-cartel/llama-3.05-nemotron-tenyxchat-storybreaker-70b-imat-gguf visual
Downloads
112
Likes
1
Pipeline
text-generation
Library
Visibility
Public
Access
Open

Repository Files & Downloads

23 files detected
Direct downloads for all repository files
FileTypeQuantizationSizeLink
Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-IQ1_M.gguf GGUF IQ1_M 15.60 GB Download
Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-IQ1_S.gguf GGUF IQ1_S 14.29 GB Download
Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-IQ2_M.gguf GGUF IQ2_M 22.46 GB Download
Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-IQ2_S.gguf GGUF IQ2_S 20.71 GB Download
Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-IQ2_XS.gguf GGUF IQ2_XS 19.69 GB Download
Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-IQ2_XXS.gguf GGUF IQ2_XXS 17.79 GB Download
Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-IQ3_M.gguf GGUF IQ3_M 29.74 GB Download
Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-IQ3_S.gguf GGUF IQ3_S 28.79 GB Download
Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-IQ3_XS.gguf GGUF IQ3_XS 27.29 GB Download
Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-IQ3_XXS.gguf GGUF IQ3_XXS 25.58 GB Download
Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-IQ4_XS.gguf GGUF IQ4_XS 35.30 GB Download
Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-Q2_K.gguf GGUF Q2_K 24.56 GB Download
Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-Q3_K_L.gguf GGUF Q3_K_L 34.59 GB Download
Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-Q3_K_M.gguf GGUF Q3_K_M 31.91 GB Download
Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-Q3_K_S.gguf GGUF Q3_K_S 28.79 GB Download
Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-Q4_K_M.gguf GGUF Q4_K_M 39.60 GB Download
Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-Q4_K_S.gguf GGUF Q4_K_S 37.58 GB Download
Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-Q5_K_M.gguf GGUF Q5_K_M 46.52 GB Download
Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-Q5_K_S.gguf GGUF Q5_K_S 45.32 GB Download
Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-Q6_K-00001-of-00002.gguf GGUF Q6_K 46.43 GB Download
Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-Q6_K-00002-of-00002.gguf GGUF Q6_K 7.49 GB Download
Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-Q8_0-00001-of-00002.gguf GGUF 46.35 GB Download
Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-Q8_0-00002-of-00002.gguf GGUF 23.48 GB Download

Model Details Live

Model Slug
quant-cartel/llama-3.05-nemotron-tenyxchat-storybreaker-70b-imat-gguf
Author
Quant-Cartel
Pipeline Task
text-generation
Library
Created
2024-10-17
Last Modified
2024-10-19
Gated
No
Private
No
HF SHA
7754210a2144773eacf199b2806362c7fe86b9c6
License
cc-by-nc-4.0
Language
Unknown
Base Model
Envoid/Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "license": "cc-by-nc-4.0",
    "base_model_relation": "quantized",
    "quantized_by": "Quant-Cartel",
    "base_model": "Envoid/Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B",
    "pipeline_tag": "text-generation",
    "tags": [
      "iMat",
      "GGUF",
      "merge"
    ],
    "frontmatter": {
      "license": "cc-by-nc-4.0",
      "base_model_relation": "quantized",
      "quantized_by": "Quant-Cartel",
      "base_model": "Envoid/Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B",
      "pipeline_tag": "text-generation",
      "tags": [
        "iMat",
        "GGUF",
        "merge"
      ]
    },
    "hero_image_url": "https://files.catbox.moe/07cjw5.jpg",
    "summary": "Quantized with love from fp16. Original model author: Envoid * Importance Matrix calculated using groups_merged.txt *  88 chunks *  n_ctx=512 *  Calculation uses f16 precision model weights Original model README here and below: ![](https://files.catbox.moe/07cjw5.jpg) # Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B is a 40/60 SLERP Merge of Envoid/Llama-3-TenyxChat-DaybreakStorywriter-70B onto nvidia/Llama-3.1-Nemotron-70B-Instruct-HF utilizing the following config: `` models: merge_method: slerp base_model: ./nvidia_Llama-3.1-Nemotron-70B-Instruct-HF parameters: t: dtype: bfloat16 ``",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: cc-by-nc-4.0\nbase_model_relation: quantized\nquantized_by: Quant-Cartel\nbase_model: Envoid/Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B\npipeline_tag: text-generation\ntags:\n- iMat\n- GGUF\n- merge\n---\n```\n  e88 88e                               d8     \n d888 888b  8888 8888  ,\"Y88b 888 8e   d88     \nC8888 8888D 8888 8888 \"8\" 888 888 88b d88888   \n Y888 888P  Y888 888P ,ee 888 888 888  888     \n  \"88 88\"    \"88 88\"  \"88 888 888 888  888     \n      b                                        \n      8b,                                      \n \n  e88'Y88                  d8           888    \n d888  'Y  ,\"Y88b 888,8,  d88    ,e e,  888    \nC8888     \"8\" 888 888 \"  d88888 d88 88b 888    \n Y888  ,d ,ee 888 888     888   888   , 888    \n  \"88,d88 \"88 888 888     888    \"YeeP\" 888    \n                                               \nPROUDLY PRESENTS         \n```\n# Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-GGUF\n\nQuantized with love from fp16.\n\nOriginal model author: [Envoid](https://huggingface.co/Envoid/)\n\n* Importance Matrix calculated using [groups_merged.txt](https://github.com/ggerganov/llama.cpp/discussions/5263#discussioncomment-8395384)\n*  88 chunks\n*  n_ctx=512\n*  Calculation uses f16 precision model weights\n\nOriginal model README [here](https://huggingface.co/Envoid/Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B) and below:\n\n![](https://files.catbox.moe/07cjw5.jpg)\n\n# Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B\n\nis a 40/60 SLERP Merge of [Envoid/Llama-3-TenyxChat-DaybreakStorywriter-70B](https://huggingface.co/Envoid/Llama-3-TenyxChat-DaybreakStorywriter-70B?not-for-all-audiences=true) onto [nvidia/Llama-3.1-Nemotron-70B-Instruct-HF](https://huggingface.co/nvidia/Llama-3.1-Nemotron-70B-Instruct-HF) utilizing the following config:\n```\nmodels:\n  - model: ./Envoid_Llama-3-TenyxChat-DaybreakStorywriter-70B\n  - model: ./nvidia_Llama-3.1-Nemotron-70B-Instruct-HF\nmerge_method: slerp\nbase_model: ./nvidia_Llama-3.1-Nemotron-70B-Instruct-HF\nparameters:\n  t:\n    - value: 0.4\ndtype: bfloat16\n```\n## Caution: As is always the case with SLERP merges there may be edge cases inwhich certain unintended model behaviors emerge. So always use with caution.\n\nThe 'sloppiness' of Nemotron seems to be somewhat reigned in (but still exists) while maintaining its personable assistant personality and safety (In assistant mode it will still prompt you with a warning before producing sensitive content).\n\nOverall it provides a solid option for RP and creative writing while still functioning as an assistant model, if desired. If used to continue a roleplay it will generally follow the ongoing cadence of the conversation.\n\n### It utilizes the Llama 3 prompt format.\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "iMat",
    "GGUF",
    "merge",
    "text-generation",
    "base_model:Envoid/Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B",
    "base_model:quantized:Envoid/Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B",
    "license:cc-by-nc-4.0",
    "endpoints_compatible",
    "region:us",
    "imatrix",
    "conversational"
  ],
  "likes": 1,
  "downloads": 112,
  "gated": false,
  "private": false,
  "last_modified": "2024-10-19T10:29:27.000Z",
  "created_at": "2024-10-17T15:58:33.000Z",
  "pipeline_tag": "text-generation",
  "library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
  "_id": "67113429cc054ebaac22ac78",
  "id": "Quant-Cartel/Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-GGUF",
  "modelId": "Quant-Cartel/Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-GGUF",
  "sha": "7754210a2144773eacf199b2806362c7fe86b9c6",
  "createdAt": "2024-10-17T15:58:33.000Z",
  "lastModified": "2024-10-19T10:29:27.000Z",
  "author": "Quant-Cartel",
  "downloads": 112,
  "likes": 1,
  "gated": false,
  "private": false,
  "pipeline_tag": "text-generation",
  "library_name": "",
  "siblings_count": 25
}