GraySoft
Projects Models About FAQ Contact Download guIDE →
Model Intelligence Sheet

ycros/bagelmisterytour-v2-8x7b-gguf overview

BagelMIsteryTour-v2-8x7B These are GGUF quantized versions of BagelMIsteryTour-v2-8x7B Bagel, Mixtral Instruct, with extra spices. Give it a taste. Works with Alpaca prompt formats, though the Mistral format should also work. !image/jpeg I started experimenting around seeing if I could improve or fix some of Bagel's problems. Totally inspired by seeing how well Doctor-Shotgun's Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss worked (which is a LimaRP tune on top of base Mixtral, and then merged with Mixtral Instruct) - I decided to try some merges of Bagel with Mixtral Instruct as a result. Somehow I ended up here, Bagel, Mixtral Instruct, a little bit of LimaRP, a little bit of Sao10K's Sensualize. So far in my testing it's working very well, and while it seems fairly unaligned on a lot of stuff, it's maybe a little too aligned on a few specific things (which I think comes from Sensualize) - so that's something to play with in the future, or maybe try to DPO out. I've been running (temp last) minP 0.1, dynatemp 0.5-4, rep pen 1.07, rep range 1024. I've been testing Alpaca style Instruction/Response, and Instruction/Input/Response and those seem to work well, I expect Mistral's prompt format would also work well. You may need to add a stopping string on "{{char}}:" for RPs because it can sometimes duplicate those out in responses and waffle on. Seems to hold up and not fall apart at long contexts like Bagel and some other Mixtral tunes seem to, definitely doesn't seem prone to loopyness either. Can be pushed into extravagant prose if the scene/setting calls for it. Version 2: lowered the mix of Sensualize. This is a merge of pre-trained language models created using mergekit.

ggufmergekitmergearxiv:2311.03099arxiv:2306.01708base_model:DS-Archive/limarp-zloss-mixtral-8x7b-qlorabase_model:merge:DS-Archive/limarp-zloss-mixtral-8x7b-qlorabase_model:Sao10K/Sensualize-Mixtral-bf16base_model:merge:Sao10K/Sensualize-Mixtral-bf16base_model:jondurbin/bagel-dpo-8x7b-v0.2base_model:merge:jondurbin/bagel-dpo-8x7b-v0.2base_model:mistralai/Mixtral-8x7B-Instruct-v0.1base_model:merge:mistralai/Mixtral-8x7B-Instruct-v0.1base_model:mistralai/Mixtral-8x7B-v0.1base_model:merge:mistralai/Mixtral-8x7B-v0.1license:cc-by-nc-4.0endpoints_compatibleregion:us
ycros/bagelmisterytour-v2-8x7b-gguf visual
Downloads
271
Likes
28
Pipeline
Library
Visibility
Public
Access
Open

Repository Files & Downloads

16 files detected
Direct downloads for all repository files
FileTypeQuantizationSizeLink
BagelMIsteryTour-v2-8x7B.IQ1_S.imx.gguf GGUF IQ1_S 8.88 GB Download
BagelMIsteryTour-v2-8x7B.IQ2_XS.imx.gguf GGUF IQ2_XS 12.73 GB Download
BagelMIsteryTour-v2-8x7B.IQ2_XXS.imx.gguf GGUF IQ2_XXS 11.44 GB Download
BagelMIsteryTour-v2-8x7B.IQ3_XXS.imx.gguf GGUF IQ3_XXS 17.05 GB Download
BagelMIsteryTour-v2-8x7B.IQ4_NL.imx.gguf GGUF IQ4_NL 24.69 GB Download
BagelMIsteryTour-v2-8x7B.Q3_K_L.imx.gguf GGUF Q3_K_L 22.51 GB Download
BagelMIsteryTour-v2-8x7B.Q3_K_M.imx.gguf GGUF Q3_K_M 20.99 GB Download
BagelMIsteryTour-v2-8x7B.Q3_K_S.imx.gguf GGUF Q3_K_S 19.03 GB Download
BagelMIsteryTour-v2-8x7B.Q3_K_XS.imx.gguf GGUF Q3_K_XS 17.75 GB Download
BagelMIsteryTour-v2-8x7B.Q4_K_M.imx.gguf GGUF Q4_K_M 26.49 GB Download
BagelMIsteryTour-v2-8x7B.Q4_K_S.imx.gguf GGUF Q4_K_S 24.91 GB Download
BagelMIsteryTour-v2-8x7B.Q5_1.imx.gguf GGUF 32.71 GB Download
BagelMIsteryTour-v2-8x7B.Q5_K_M.imx.gguf GGUF Q5_K_M 30.94 GB Download
BagelMIsteryTour-v2-8x7B.Q5_K_S.imx.gguf GGUF Q5_K_S 30.01 GB Download
BagelMIsteryTour-v2-8x7B.Q6_K.imx.gguf GGUF Q6_K 35.74 GB Download
BagelMIsteryTour-v2-8x7B.Q8_0.gguf GGUF 46.22 GB Download

Model Details Live

Model Slug
ycros/bagelmisterytour-v2-8x7b-gguf
Author
ycros
Pipeline Task
Library
Created
2024-01-19
Last Modified
2024-04-03
Gated
No
Private
No
HF SHA
aaacd96614f4a03b0fffa1438473efd7b617bcbf
License
cc-by-nc-4.0
Language
Unknown
Base Model
mistralai/Mixtral-8x7B-v0.1, jondurbin/bagel-dpo-8x7b-v0.2, Sao10K/Sensualize-Mixtral-bf16, mistralai/Mixtral-8x7B-v0.1, Doctor-Shotgun/limarp-zloss-mixtral-8x7b-qlora, mistralai/Mixtral-8x7B-Instruct-v0.1

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "base_model": [
      "mistralai/Mixtral-8x7B-v0.1",
      "jondurbin/bagel-dpo-8x7b-v0.2",
      "Sao10K/Sensualize-Mixtral-bf16",
      "mistralai/Mixtral-8x7B-v0.1",
      "Doctor-Shotgun/limarp-zloss-mixtral-8x7b-qlora",
      "mistralai/Mixtral-8x7B-Instruct-v0.1"
    ],
    "tags": [
      "mergekit",
      "merge"
    ],
    "license": "cc-by-nc-4.0",
    "frontmatter": {
      "base_model": [
        "mistralai/Mixtral-8x7B-v0.1",
        "jondurbin/bagel-dpo-8x7b-v0.2",
        "Sao10K/Sensualize-Mixtral-bf16",
        "mistralai/Mixtral-8x7B-v0.1",
        "Doctor-Shotgun/limarp-zloss-mixtral-8x7b-qlora",
        "mistralai/Mixtral-8x7B-Instruct-v0.1"
      ],
      "tags": [
        "mergekit",
        "merge"
      ],
      "license": "cc-by-nc-4.0"
    },
    "hero_image_url": "https://cdn-uploads.huggingface.co/production/uploads/63044fa07373aacccd8a7c53/lxNMzXo_dq_JCP9YyUyaw.jpeg",
    "summary": "# BagelMIsteryTour-v2-8x7B These are GGUF quantized versions of BagelMIsteryTour-v2-8x7B Bagel, Mixtral Instruct, with extra spices. Give it a taste. Works with Alpaca prompt formats, though the Mistral format should also work. !image/jpeg I started experimenting around seeing if I could improve or fix some of Bagel's problems. Totally inspired by seeing how well Doctor-Shotgun's Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss worked (which is a LimaRP tune on top of base Mixtral, and then merged with Mixtral Instruct) - I decided to try some merges of Bagel with Mixtral Instruct as a result. Somehow I ended up here, Bagel, Mixtral Instruct, a little bit of LimaRP, a little bit of Sao10K's Sensualize. So far in my testing it's working very well, and while it seems fairly unaligned on a lot of stuff, it's maybe a little too aligned on a few specific things (which I think comes from Sensualize) - so that's something to play with in the future, or maybe try to DPO out. I've been running (temp last) minP 0.1, dynatemp 0.5-4, rep pen 1.07, rep range 1024. I've been testing Alpaca style Instruction/Response, and Instruction/Input/Response and those seem to work well, I expect Mistral's prompt format would also work well. You may need to add a stopping string on \"{{char}}:\" for RPs because it can sometimes duplicate those out in responses and waffle on. Seems to hold up and not fall apart at long contexts like Bagel and some other Mixtral tunes seem to, definitely doesn't seem prone to loopyness either. Can be pushed into extravagant prose if the scene/setting calls for it. __Version 2:__ lowered the mix of Sensualize. This is a merge of pre-trained language models created using mergekit.",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nbase_model:\n- mistralai/Mixtral-8x7B-v0.1\n- jondurbin/bagel-dpo-8x7b-v0.2\n- Sao10K/Sensualize-Mixtral-bf16\n- mistralai/Mixtral-8x7B-v0.1\n- Doctor-Shotgun/limarp-zloss-mixtral-8x7b-qlora\n- mistralai/Mixtral-8x7B-Instruct-v0.1\ntags:\n- mergekit\n- merge\nlicense: cc-by-nc-4.0\n\n---\n# BagelMIsteryTour-v2-8x7B\n\nThese are GGUF quantized versions of [BagelMIsteryTour-v2-8x7B](https://huggingface.co/ycros/BagelMIsteryTour-v2-8x7B)\n\nBagel, Mixtral Instruct, with extra spices. Give it a taste. Works with Alpaca prompt formats, though the Mistral format should also work.\n\n![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/63044fa07373aacccd8a7c53/lxNMzXo_dq_JCP9YyUyaw.jpeg)\n\nI started experimenting around seeing if I could improve or fix some of Bagel's problems. Totally inspired by seeing how well Doctor-Shotgun's Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss worked (which is a LimaRP tune on top of base Mixtral, and then merged with Mixtral Instruct) - I decided to try some merges of Bagel with Mixtral Instruct as a result.\n\nSomehow I ended up here, Bagel, Mixtral Instruct, a little bit of LimaRP, a little bit of Sao10K's Sensualize. So far in my testing it's working very well, and while it seems fairly unaligned on a lot of stuff, it's maybe a little too aligned on a few specific things (which I think comes from Sensualize) - so that's something to play with in the future, or maybe try to DPO out.\n\nI've been running (temp last) minP 0.1, dynatemp 0.5-4, rep pen 1.07, rep range 1024. I've been testing Alpaca style Instruction/Response, and Instruction/Input/Response and those seem to work well, I expect Mistral's prompt format would also work well. You may need to add a stopping string on \"{{char}}:\" for RPs because it can sometimes duplicate those out in responses and waffle on. Seems to hold up and not fall apart at long contexts like Bagel and some other Mixtral tunes seem to, definitely doesn't seem prone to loopyness either. Can be pushed into extravagant prose if the scene/setting calls for it.\n\n__Version 2:__ lowered the mix of Sensualize.\n\nThis is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).\n\n## Merge Details\n### Merge Method\n\nThis model was merged using the [DARE](https://arxiv.org/abs/2311.03099) [TIES](https://arxiv.org/abs/2306.01708) merge method using [mistralai/Mixtral-8x7B-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-v0.1) as a base.\n\n### Models Merged\n\nThe following models were included in the merge:\n* [jondurbin/bagel-dpo-8x7b-v0.2](https://huggingface.co/jondurbin/bagel-dpo-8x7b-v0.2)\n* [Sao10K/Sensualize-Mixtral-bf16](https://huggingface.co/Sao10K/Sensualize-Mixtral-bf16)\n* [mistralai/Mixtral-8x7B-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-v0.1) + [Doctor-Shotgun/limarp-zloss-mixtral-8x7b-qlora](https://huggingface.co/Doctor-Shotgun/limarp-zloss-mixtral-8x7b-qlora)\n* [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1)\n\n### Configuration\n\nThe following YAML configuration was used to produce this model:\n\n```yaml\nbase_model: mistralai/Mixtral-8x7B-v0.1\nmodels:\n  - model: mistralai/Mixtral-8x7B-v0.1+Doctor-Shotgun/limarp-zloss-mixtral-8x7b-qlora\n    parameters:\n      density: 0.5\n      weight: 0.2\n  - model: Sao10K/Sensualize-Mixtral-bf16\n    parameters:\n      density: 0.5\n      weight: 0.1\n  - model: mistralai/Mixtral-8x7B-Instruct-v0.1\n    parameters:\n      density: 0.6\n      weight: 1.0\n  - model: jondurbin/bagel-dpo-8x7b-v0.2\n    parameters:\n      density: 0.6\n      weight: 0.5\nmerge_method: dare_ties\ndtype: bfloat16\n\n\n```\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "mergekit",
    "merge",
    "arxiv:2311.03099",
    "arxiv:2306.01708",
    "base_model:DS-Archive/limarp-zloss-mixtral-8x7b-qlora",
    "base_model:merge:DS-Archive/limarp-zloss-mixtral-8x7b-qlora",
    "base_model:Sao10K/Sensualize-Mixtral-bf16",
    "base_model:merge:Sao10K/Sensualize-Mixtral-bf16",
    "base_model:jondurbin/bagel-dpo-8x7b-v0.2",
    "base_model:merge:jondurbin/bagel-dpo-8x7b-v0.2",
    "base_model:mistralai/Mixtral-8x7B-Instruct-v0.1",
    "base_model:merge:mistralai/Mixtral-8x7B-Instruct-v0.1",
    "base_model:mistralai/Mixtral-8x7B-v0.1",
    "base_model:merge:mistralai/Mixtral-8x7B-v0.1",
    "license:cc-by-nc-4.0",
    "endpoints_compatible",
    "region:us"
  ],
  "likes": 28,
  "downloads": 271,
  "gated": false,
  "private": false,
  "last_modified": "2024-04-03T01:01:56.000Z",
  "created_at": "2024-01-19T05:06:05.000Z",
  "pipeline_tag": "",
  "library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
  "_id": "65aa033d0f54e50344da8c67",
  "id": "ycros/BagelMIsteryTour-v2-8x7B-GGUF",
  "modelId": "ycros/BagelMIsteryTour-v2-8x7B-GGUF",
  "sha": "aaacd96614f4a03b0fffa1438473efd7b617bcbf",
  "createdAt": "2024-01-19T05:06:05.000Z",
  "lastModified": "2024-04-03T01:01:56.000Z",
  "author": "ycros",
  "downloads": 271,
  "likes": 28,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 19
}