duyntnet/thespice-7b-ft-v0.3.1-imatrix-gguf Q5_0 GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.
Model Intelligence Sheet
duyntnet/thespice-7b-ft-v0.3.1-imatrix-gguf overview
The latest TheSpice, dipped in Mama Liz's LimaRP Oil. I've focused on making the model more flexible and provide a more unique experience. I'm still working on cleaning up my dataset, but I've shrunken it down a lot to focus on a "less is more" approach. This is ultimate a return to form of the way I used to train Thespis, with more of a focus on a small hand edited dataset.
Downloads
133
Likes
0
Pipeline
text-generation
Library
transformers
Visibility
Public
Access
Open
Repository Files & Downloads
32 files detected
Direct downloads for all repository files
| File | Type | Quantization | Size | Link |
|---|---|---|---|---|
| TheSpice-7b-FT-v0.3.1-IQ1_M.gguf | GGUF | IQ1_M | 1.63 GB | Download |
| TheSpice-7b-FT-v0.3.1-IQ1_S.gguf | GGUF | IQ1_S | 1.50 GB | Download |
| TheSpice-7b-FT-v0.3.1-IQ2_M.gguf | GGUF | IQ2_M | 2.33 GB | Download |
| TheSpice-7b-FT-v0.3.1-IQ2_S.gguf | GGUF | IQ2_S | 2.15 GB | Download |
| TheSpice-7b-FT-v0.3.1-IQ2_XS.gguf | GGUF | IQ2_XS | 2.05 GB | Download |
| TheSpice-7b-FT-v0.3.1-IQ2_XXS.gguf | GGUF | IQ2_XXS | 1.85 GB | Download |
| TheSpice-7b-FT-v0.3.1-IQ3_M.gguf | GGUF | IQ3_M | 3.06 GB | Download |
| TheSpice-7b-FT-v0.3.1-IQ3_S.gguf | GGUF | IQ3_S | 2.96 GB | Download |
| TheSpice-7b-FT-v0.3.1-IQ3_XS.gguf | GGUF | IQ3_XS | 2.81 GB | Download |
| TheSpice-7b-FT-v0.3.1-IQ3_XXS.gguf | GGUF | IQ3_XXS | 2.63 GB | Download |
| TheSpice-7b-FT-v0.3.1-IQ4_NL.gguf | GGUF | IQ4_NL | 3.84 GB | Download |
| TheSpice-7b-FT-v0.3.1-IQ4_XS.gguf | GGUF | IQ4_XS | 3.64 GB | Download |
| TheSpice-7b-FT-v0.3.1-Q2_K.gguf | GGUF | Q2_K | 2.53 GB | Download |
| TheSpice-7b-FT-v0.3.1-Q2_K_S.gguf | GGUF | Q2_K_S | 2.36 GB | Download |
| TheSpice-7b-FT-v0.3.1-Q2_K_X.gguf | GGUF | Q2_K_X | 2.65 GB | Download |
| TheSpice-7b-FT-v0.3.1-Q3_K_L.gguf | GGUF | Q3_K_L | 3.56 GB | Download |
| TheSpice-7b-FT-v0.3.1-Q3_K_M.gguf | GGUF | Q3_K_M | 3.28 GB | Download |
| TheSpice-7b-FT-v0.3.1-Q3_K_S.gguf | GGUF | Q3_K_S | 2.95 GB | Download |
| TheSpice-7b-FT-v0.3.1-Q4_0.gguf | GGUF | — | 3.84 GB | Download |
| TheSpice-7b-FT-v0.3.1-Q4_1.gguf | GGUF | — | 4.24 GB | Download |
| TheSpice-7b-FT-v0.3.1-Q4_K_M.gguf | GGUF | Q4_K_M | 4.07 GB | Download |
| TheSpice-7b-FT-v0.3.1-Q4_K_M_X.gguf | GGUF | Q4_K_M_X | 4.16 GB | Download |
| TheSpice-7b-FT-v0.3.1-Q4_K_S.gguf | GGUF | Q4_K_S | 3.86 GB | Download |
| TheSpice-7b-FT-v0.3.1-Q5_0.gguf | GGUF | — | 4.67 GB | Download |
| TheSpice-7b-FT-v0.3.1-Q5_1.gguf | GGUF | — | 5.07 GB | Download |
| TheSpice-7b-FT-v0.3.1-Q5_K_M.gguf | GGUF | Q5_K_M | 4.78 GB | Download |
| TheSpice-7b-FT-v0.3.1-Q5_K_M_X.gguf | GGUF | Q5_K_M_X | 4.85 GB | Download |
| TheSpice-7b-FT-v0.3.1-Q5_K_S.gguf | GGUF | Q5_K_S | 4.65 GB | Download |
| TheSpice-7b-FT-v0.3.1-Q6_K.gguf | GGUF | Q6_K | 5.53 GB | Download |
| TheSpice-7b-FT-v0.3.1-Q6_K_X.gguf | GGUF | Q6_K_X | 5.82 GB | Download |
| TheSpice-7b-FT-v0.3.1-Q8_0.gguf | GGUF | — | 7.17 GB | Download |
| TheSpice-7b-FT-v0.3.1-Q8_0_X.gguf | GGUF | — | 7.40 GB | Download |
Model Details Live
Metadata Inspector
Normalized metadata (stored in metadata_json)
{
"metadata": {},
"card_data": {
"license": "other",
"language": [
"en"
],
"pipeline_tag": "text-generation",
"inference": false,
"tags": [
"transformers",
"gguf",
"imatrix",
"TheSpice-7b-FT-v0.3.1"
],
"frontmatter": {
"license": "other",
"language": [
"en"
],
"pipeline_tag": "text-generation",
"inference": "false",
"tags": [
"transformers",
"gguf",
"imatrix",
"TheSpice-7b-FT-v0.3.1"
]
},
"hero_image_url": "https://cdn-uploads.huggingface.co/production/uploads/64dd7cda3d6b954bf7cdd922/59vi4VWP2d0bCbsW2eU8h.png",
"summary": "The latest TheSpice, dipped in Mama Liz's LimaRP Oil. I've focused on making the model more flexible and provide a more unique experience. I'm still working on cleaning up my dataset, but I've shrunken it down a lot to focus on a \"less is more\" approach. This is ultimate a return to form of the way I used to train Thespis, with more of a focus on a small hand edited dataset.",
"quick_links": [],
"benchmark_table_html": "",
"readme_markdown": "---\nlicense: other\nlanguage:\n- en\npipeline_tag: text-generation\ninference: false\ntags:\n- transformers\n- gguf\n- imatrix\n- TheSpice-7b-FT-v0.3.1\n---\nQuantizations of https://huggingface.co/cgato/TheSpice-7b-FT-v0.3.1\n\n### Experiment\n\nQuants **ending in \"_X\"** are experimental quants. These quants are the same as normal quants, but their token embedding weights are set to Q8_0 except for Q6_K and Q8_0 which are set to F16. The change will make these experimental quants larger but ***in theory***, should result in improved performance.\n\nList of experimental quants: \n* Q2_K_X\n* Q4_K_M_X\n* Q5_K_M_X\n* Q6_K_X\n* Q8_0_X\n\n---\n\n### Inference Clients/UIs\n* [llama.cpp](https://github.com/ggerganov/llama.cpp)\n* [JanAI](https://github.com/janhq/jan)\n* [KoboldCPP](https://github.com/LostRuins/koboldcpp)\n* [text-generation-webui](https://github.com/oobabooga/text-generation-webui)\n* [ollama](https://github.com/ollama/ollama)\n\n---\n\n# From original readme\n\nThe latest TheSpice, dipped in Mama Liz's LimaRP Oil.\nI've focused on making the model more flexible and provide a more unique experience. \nI'm still working on cleaning up my dataset, but I've shrunken it down a lot to focus on a \"less is more\" approach.\nThis is ultimate a return to form of the way I used to train Thespis, with more of a focus on a small hand edited dataset.\n\n\n## Datasets Used\n\n* Dolphin\n* Ultrachat\n* Capybara\n* Augmental\n* ToxicQA\n* Yahoo Answers\n* Airoboros 3.1\n* LimaRP\n\n## Features ( Examples from 0.1.1 because I'm too lazy to take new screenshots. Its tested tho. )\n\nNarration\n\nIf you request information on objects or characters in the scene, the model will narrate it to you. Most of the time, without moving the story forward.\n\n## Prompt Format: Chat ( The default Ooba template and Silly Tavern Template )\n\n\n\nIf you're using Ooba in verbose mode as a server, you can check if you're console is logging something that looks like this. \n\n\n```\n{System Prompt}\n\nUsername: {Input}\nBotName: {Response}\nUsername: {Input}\nBotName: {Response}\n\n```",
"related_quantizations": []
},
"tags": [
"transformers",
"gguf",
"imatrix",
"TheSpice-7b-FT-v0.3.1",
"text-generation",
"en",
"license:other",
"region:us"
],
"likes": 0,
"downloads": 133,
"gated": false,
"private": false,
"last_modified": "2024-07-04T21:48:47.000Z",
"created_at": "2024-07-04T18:31:47.000Z",
"pipeline_tag": "text-generation",
"library_name": "transformers"
}
Source payload excerpt (from Hugging Face API)
{
"_id": "6686ea9361ae8422f619f3e7",
"id": "duyntnet/TheSpice-7b-FT-v0.3.1-imatrix-GGUF",
"modelId": "duyntnet/TheSpice-7b-FT-v0.3.1-imatrix-GGUF",
"sha": "0855c19ed2c467ca74b9c9221ea08ebf1bb1346d",
"createdAt": "2024-07-04T18:31:47.000Z",
"lastModified": "2024-07-04T21:48:47.000Z",
"author": "duyntnet",
"downloads": 133,
"likes": 0,
"gated": false,
"private": false,
"pipeline_tag": "text-generation",
"library_name": "transformers",
"siblings_count": 34
}