GraySoft
Model Intelligence Sheet

richarderkhov/sao10k_-_l3-8b-stheno-v3.2-gguf overview

Quantization made by Richard Erkhov.

Github: https://github.com/RichardErkhov
Discord: https://discord.gg/pvy7H8DZMG
Request more models: https://github.com/RichardErkhov/quant_request

L3-8B-Stheno-v3.2 - GGUF
- Model creator: https://huggingface.co/Sao10K/
- Original model: https://huggingface.co/Sao10K/L3-8B-Stheno-v3.2/

| Name | Quant method | Size |
| ---- | ---- | ---- |
| L3-8B-Stheno-v3.2.Q2_K.gguf | Q2_K | 2.96GB |
| L3-8B-Stheno-v3.2.IQ3_XS.gguf | IQ3_XS | 3.28GB |
| L3-8B-Stheno-v3.2.IQ3_S.gguf | IQ3_S | 3.43GB |
| L3-8B-Stheno-v3.2.Q3_K_S.gguf | Q3_K_S | 3.41GB |
| L3-8B-Stheno-v3.2.IQ3_M.gguf | IQ3_M | 3.52GB |
| L3-8B-Stheno-v3.2.Q3_K.gguf | Q3_K | 3.74GB |
| L3-8B-Stheno-v3.2.Q3_K_M.gguf | Q3_K_M | 3.74GB |
| L3-8B-Stheno-v3.2.Q3_K_L.gguf | Q3_K_L | 4.03GB |
| L3-8B-Stheno-v3.2.IQ4_XS.gguf | IQ4_XS | 4.18GB |
| L3-8B-Stheno-v3.2.Q4_0.gguf | Q4_0 | 4.34GB |
| L3-8B-Stheno-v3.2.IQ4_NL.gguf | IQ4_NL | 4.38GB |
| L3-8B-Stheno-v3.2.Q4_K_S.gguf | Q4_K_S | 4.37GB |
| L3-8B-Stheno-v3.2.Q4_K.gguf | Q4_K | 4.58GB |
| L3-8B-Stheno-v3.2.Q4_K_M.gguf | Q4_K_M | 4.58GB |
| L3-8B-Stheno-v3.2.Q4_1.gguf | Q4_1 | 4.78GB |
| L3-8B-Stheno-v3.2.Q5_0.gguf | Q5_0 | 5.21GB |
| L3-8B-Stheno-v3.2.Q5_K_S.gguf | Q5_K_S | 5.21GB |
| L3-8B-Stheno-v3.2.Q5_K.gguf | Q5_K | 5.34GB |
| L3-8B-Stheno-v3.2.Q5_K_M.gguf | Q5_K_M | 5.34GB |
| L3-8B-Stheno-v3.2.Q5_1.gguf | Q5_1 | 5.65GB |
| L3-8B-Stheno-v3.2.Q6_K.gguf | Q6_K | 6.14GB |
| L3-8B-Stheno-v3.2.Q8_0.gguf | Q8_0 | 7.95GB |

Original model description:

---
license: cc-by-nc-4.0
language:
- en
datasets:
- Gryphe/Opus-WritingPrompts
- Sao10K/Claude-3-Opus-Instruct-15K
- Sao10K/Short-Storygen-v2
- Sao10K/c2-Logs-Filtered
---

*Just message me on Discord if you want to host this privately for a service or something. We can talk.*

*Training used 1x H100 SXM for a total of about 24 hours over multiple runs.*

Support me here if you're interested:
Ko-fi: https://ko-fi.com/sao10k
*wink* Euryale v2?

If not, that's fine too. Feedback would be nice.

Contact me on Discord: `sao10k` // Just ping me in the KoboldAI Discord, I'll respond faster.

Art by navy_(navy.blue) - Danbooru (https://danbooru.donmai.us/posts/3214477)

---

Stheno-v3.2-Zeta

I have done test runs with multiple variations of the model, merged back to its base at various weights and with different training runs too, and this sixth iteration is the one I like most.

Changes compared to v3.1
- Included a mix of SFW and NSFW storywriting data, thanks to Gryphe (https://huggingface.co/datasets/Gryphe/Opus-WritingPrompts)
- Included more instruct / assistant-style data
- Further cleaned up roleplaying samples from c2 Logs -> A few terrible, really bad samples escaped heavy filtering. A manual pass fixed it.
- Hyperparameter tinkering for training, resulting in lower loss levels.

Testing Notes - Compared to v3.1
- Handles SFW / NSFW separately better. Not as overly excessive with NSFW now. Kinda balanced.
- Better at storywriting / narration.
- Better at assistant-type tasks.
- Better multi-turn coherency -> Reduced issues?
- Slightly less creative? A worthy tradeoff. Still creative.
- Better prompt / instruction adherence.

---

**Recommended Samplers:**
- Temperature - 1.12-1.22
- Min-P - 0.075
- Top-K - 50
- Repetition Penalty - 1.1

**Stopping Strings:**
- `\n\n{{User}}` (or equivalent, depending on frontend)
- `<|eot_id|>`
- `<|end_of_text|>`

**Prompting Template - Llama-3-Instruct**

`<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\n{system_prompt}<|eot_id|><|start_header_id|>user<|end_header_id|>\n\n{input}<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\n{output}<|eot_id|>`

**Basic Roleplay System Prompt**

You are an expert actor that can fully immerse yourself into any role given. You do not break character for any reason, even if someone tries addressing you as an AI or language model.
Currently your role is {{char}}, which is described in detail below. As {{char}}, continue the exchange with {{user}}.

---
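The README's Llama-3-Instruct template and recommended sampler values (both preserved in the `readme_markdown` field of the metadata further down) can be turned into a small helper. The following is a minimal sketch, not the author's tooling: the function names and the dictionary keys are assumptions chosen for illustration, and the key names would need to be mapped to whatever your frontend or runtime expects.

```python
# Special tokens taken from the Llama-3-Instruct template in the README.
BOS = "<|begin_of_text|>"
EOT = "<|eot_id|>"

# Sampler values recommended in the README. The key names are an assumption
# (llama.cpp-style naming); rename them to match your frontend.
RECOMMENDED_SAMPLERS = {
    "temperature": 1.12,   # README gives a range of 1.12-1.22
    "min_p": 0.075,
    "top_k": 50,
    "repeat_penalty": 1.1,
}

# Token-level stopping strings from the README; the frontend-specific
# "\n\n{{User}}" entry is left out because it depends on the user's name.
STOP_STRINGS = ["<|eot_id|>", "<|end_of_text|>"]


def header(role: str) -> str:
    """Return the header that opens a turn for the given role."""
    return f"<|start_header_id|>{role}<|end_header_id|>\n\n"


def build_prompt(system_prompt: str, user_input: str) -> str:
    """Format one system turn and one user turn, leaving the assistant turn open."""
    return (
        BOS
        + header("system") + system_prompt + EOT
        + header("user") + user_input + EOT
        + header("assistant")
    )


if __name__ == "__main__":
    print(build_prompt("You are a helpful narrator.", "Describe the harbor at dawn."))
```

The prompt ends with an open assistant header so that the model generates the `{output}` part itself and stops at `<|eot_id|>`.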

Tags: gguf, endpoints_compatible, region:us, conversational
richarderkhov/sao10k_-_l3-8b-stheno-v3.2-gguf visual
Downloads
579
Likes
3
Pipeline
(not set)
Library
(not set)
Visibility
Public
Access
Open

Repository Files & Downloads

22 files detected
Direct downloads for all repository files
| File | Type | Quantization | Size |
| ---- | ---- | ---- | ---- |
| L3-8B-Stheno-v3.2.IQ3_M.gguf | GGUF | IQ3_M | 3.52 GB |
| L3-8B-Stheno-v3.2.IQ3_S.gguf | GGUF | IQ3_S | 3.43 GB |
| L3-8B-Stheno-v3.2.IQ3_XS.gguf | GGUF | IQ3_XS | 3.28 GB |
| L3-8B-Stheno-v3.2.IQ4_NL.gguf | GGUF | IQ4_NL | 4.38 GB |
| L3-8B-Stheno-v3.2.IQ4_XS.gguf | GGUF | IQ4_XS | 4.18 GB |
| L3-8B-Stheno-v3.2.Q2_K.gguf | GGUF | Q2_K | 2.96 GB |
| L3-8B-Stheno-v3.2.Q3_K.gguf | GGUF | Q3_K | 3.74 GB |
| L3-8B-Stheno-v3.2.Q3_K_L.gguf | GGUF | Q3_K_L | 4.03 GB |
| L3-8B-Stheno-v3.2.Q3_K_M.gguf | GGUF | Q3_K_M | 3.74 GB |
| L3-8B-Stheno-v3.2.Q3_K_S.gguf | GGUF | Q3_K_S | 3.41 GB |
| L3-8B-Stheno-v3.2.Q4_0.gguf | GGUF | Q4_0 | 4.34 GB |
| L3-8B-Stheno-v3.2.Q4_1.gguf | GGUF | Q4_1 | 4.78 GB |
| L3-8B-Stheno-v3.2.Q4_K.gguf | GGUF | Q4_K | 4.58 GB |
| L3-8B-Stheno-v3.2.Q4_K_M.gguf | GGUF | Q4_K_M | 4.58 GB |
| L3-8B-Stheno-v3.2.Q4_K_S.gguf | GGUF | Q4_K_S | 4.37 GB |
| L3-8B-Stheno-v3.2.Q5_0.gguf | GGUF | Q5_0 | 5.21 GB |
| L3-8B-Stheno-v3.2.Q5_1.gguf | GGUF | Q5_1 | 5.65 GB |
| L3-8B-Stheno-v3.2.Q5_K.gguf | GGUF | Q5_K | 5.34 GB |
| L3-8B-Stheno-v3.2.Q5_K_M.gguf | GGUF | Q5_K_M | 5.34 GB |
| L3-8B-Stheno-v3.2.Q5_K_S.gguf | GGUF | Q5_K_S | 5.21 GB |
| L3-8B-Stheno-v3.2.Q6_K.gguf | GGUF | Q6_K | 6.14 GB |
| L3-8B-Stheno-v3.2.Q8_0.gguf | GGUF | Q8_0 | 7.95 GB |
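With 22 quantizations to choose from, one practical question is which file fits a given memory budget. The sketch below copies the sizes from the file list above and picks the largest file that fits; the function names and the `headroom_gb` parameter are hypothetical, and file size is only a rough proxy for actual memory use, which also depends on context length and runtime overhead.

```python
# Quantization name -> file size in GB, copied from the file list above.
QUANT_SIZES_GB = {
    "IQ3_M": 3.52, "IQ3_S": 3.43, "IQ3_XS": 3.28, "IQ4_NL": 4.38,
    "IQ4_XS": 4.18, "Q2_K": 2.96, "Q3_K": 3.74, "Q3_K_L": 4.03,
    "Q3_K_M": 3.74, "Q3_K_S": 3.41, "Q4_0": 4.34, "Q4_1": 4.78,
    "Q4_K": 4.58, "Q4_K_M": 4.58, "Q4_K_S": 4.37, "Q5_0": 5.21,
    "Q5_1": 5.65, "Q5_K": 5.34, "Q5_K_M": 5.34, "Q5_K_S": 5.21,
    "Q6_K": 6.14, "Q8_0": 7.95,
}


def filename_for(quant: str) -> str:
    """Build the repository filename for a quantization name."""
    return f"L3-8B-Stheno-v3.2.{quant}.gguf"


def largest_quant_within(budget_gb: float, headroom_gb: float = 1.0):
    """Return the largest quantization whose file fits in budget minus headroom.

    headroom_gb is an assumed allowance for context and runtime overhead.
    Returns None if nothing fits. Ties in size are broken by name.
    """
    limit = budget_gb - headroom_gb
    fitting = [(size, name) for name, size in QUANT_SIZES_GB.items() if size <= limit]
    if not fitting:
        return None
    return max(fitting)[1]


if __name__ == "__main__":
    choice = largest_quant_within(8.0)
    print(choice, filename_for(choice))
```

For example, an 8 GB budget with the default 1 GB headroom leaves 7 GB for the file, which rules out Q8_0 (7.95 GB) and selects Q6_K (6.14 GB).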

Model Details

Model Slug
richarderkhov/sao10k_-_l3-8b-stheno-v3.2-gguf
Author
RichardErkhov
Pipeline Task
(not set)
Library
(not set)
Created
2024-06-25
Last Modified
2024-06-25
Gated
No
Private
No
HF SHA
08dc467d797897d2a22cb97bba4aa330f5119921
License
Unknown
Language
Unknown
Base Model
Unknown

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "frontmatter": {},
    "hero_image_url": "https://huggingface.co/Sao10K/L3-8B-Stheno-v3.2/resolve/main/Stheno.png?",
    "summary": "Quantization made by Richard Erkhov. Github Discord Request more models L3-8B-Stheno-v3.2 - GGUF | Name | Quant method | Size | | ---- | ---- | ---- | | L3-8B-Stheno-v3.2.Q2_K.gguf | Q2_K | 2.96GB | | L3-8B-Stheno-v3.2.IQ3_XS.gguf | IQ3_XS | 3.28GB | | L3-8B-Stheno-v3.2.IQ3_S.gguf | IQ3_S | 3.43GB | | L3-8B-Stheno-v3.2.Q3_K_S.gguf | Q3_K_S | 3.41GB | | L3-8B-Stheno-v3.2.IQ3_M.gguf | IQ3_M | 3.52GB | | L3-8B-Stheno-v3.2.Q3_K.gguf | Q3_K | 3.74GB | | L3-8B-Stheno-v3.2.Q3_K_M.gguf | Q3_K_M | 3.74GB | | L3-8B-Stheno-v3.2.Q3_K_L.gguf | Q3_K_L | 4.03GB | | L3-8B-Stheno-v3.2.IQ4_XS.gguf | IQ4_XS | 4.18GB | | L3-8B-Stheno-v3.2.Q4_0.gguf | Q4_0 | 4.34GB | | L3-8B-Stheno-v3.2.IQ4_NL.gguf | IQ4_NL | 4.38GB | | L3-8B-Stheno-v3.2.Q4_K_S.gguf | Q4_K_S | 4.37GB | | L3-8B-Stheno-v3.2.Q4_K.gguf | Q4_K | 4.58GB | | L3-8B-Stheno-v3.2.Q4_K_M.gguf | Q4_K_M | 4.58GB | | L3-8B-Stheno-v3.2.Q4_1.gguf | Q4_1 | 4.78GB | | L3-8B-Stheno-v3.2.Q5_0.gguf | Q5_0 | 5.21GB | | L3-8B-Stheno-v3.2.Q5_K_S.gguf | Q5_K_S | 5.21GB | | L3-8B-Stheno-v3.2.Q5_K.gguf | Q5_K | 5.34GB | | L3-8B-Stheno-v3.2.Q5_K_M.gguf | Q5_K_M | 5.34GB | | L3-8B-Stheno-v3.2.Q5_1.gguf | Q5_1 | 5.65GB | | L3-8B-Stheno-v3.2.Q6_K.gguf | Q6_K | 6.14GB | | L3-8B-Stheno-v3.2.Q8_0.gguf | Q8_0 | 7.95GB | Original model description: --- license: cc-by-nc-4.0 language: datasets: --- *Just message me on discord if you want to host this privately for a service or something. We can talk.* *Train used 1x H100 SXM for like a total of 24 Hours over multiple runs.* Support me here if you're interested: Ko-fi: https://ko-fi.com/sao10k  *wink* Euryale v2? If not, that's fine too. Feedback would be nice. Contact Me in Discord: sao10k // Just ping me in the KoboldAI discord, I'll respond faster. 
Art by navy_(navy.blue) - Danbooru --- !Stheno Stheno-v3.2-Zeta I have done a test run with multiple variations of the models, merged back to its base at various weights, different training runs too, and this Sixth iteration is the one I like most. Changes compared to v3.1 \\- Included a mix of SFW and NSFW Storywriting Data, thanks to Gryphe \\- Included More Instruct / Assistant-Style Data \\- Further cleaned up Roleplaying Samples from c2 Logs -> A few terrible, really bad samples escaped heavy filtering. Manual pass fixed it. \\- Hyperparameter tinkering for training, resulting in lower loss levels. Testing Notes - Compared to v3.1 \\- Handles SFW / NSFW seperately better. Not as overly excessive with NSFW now. Kinda balanced. \\- Better at Storywriting / Narration. \\- Better at Assistant-type Tasks. \\- Better Multi-Turn Coherency -> Reduced Issues? \\- Slightly less creative? A worthy tradeoff. Still creative. \\- Better prompt / instruction adherence. --- **Recommended Samplers:** `` Temperature - 1.12-1.22 Min-P - 0.075 Top-K - 50 Repetition Penalty - 1.1 ` **Stopping Strings:** ` \\n\\n{{User}} # Or Equivalent, depending on Frontend   ` **Prompting Template - Llama-3-Instruct** ` system {system_prompt}user {input}assistant {output} ` **Basic Roleplay System Prompt** ` You are an expert actor that can fully immerse yourself into any role given. You do not break character for any reason, even if someone tries addressing you as an AI or language model. Currently your role is {{char}}, which is described in detail below. As {{char}}, continue the exchange with {{user}}. `` ---",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "Quantization made by Richard Erkhov.\n\n[Github](https://github.com/RichardErkhov)\n\n[Discord](https://discord.gg/pvy7H8DZMG)\n\n[Request more models](https://github.com/RichardErkhov/quant_request)\n\n\nL3-8B-Stheno-v3.2 - GGUF\n- Model creator: https://huggingface.co/Sao10K/\n- Original model: https://huggingface.co/Sao10K/L3-8B-Stheno-v3.2/\n\n\n| Name | Quant method | Size |\n| ---- | ---- | ---- |\n| [L3-8B-Stheno-v3.2.Q2_K.gguf](https://huggingface.co/RichardErkhov/Sao10K_-_L3-8B-Stheno-v3.2-gguf/blob/main/L3-8B-Stheno-v3.2.Q2_K.gguf) | Q2_K | 2.96GB |\n| [L3-8B-Stheno-v3.2.IQ3_XS.gguf](https://huggingface.co/RichardErkhov/Sao10K_-_L3-8B-Stheno-v3.2-gguf/blob/main/L3-8B-Stheno-v3.2.IQ3_XS.gguf) | IQ3_XS | 3.28GB |\n| [L3-8B-Stheno-v3.2.IQ3_S.gguf](https://huggingface.co/RichardErkhov/Sao10K_-_L3-8B-Stheno-v3.2-gguf/blob/main/L3-8B-Stheno-v3.2.IQ3_S.gguf) | IQ3_S | 3.43GB |\n| [L3-8B-Stheno-v3.2.Q3_K_S.gguf](https://huggingface.co/RichardErkhov/Sao10K_-_L3-8B-Stheno-v3.2-gguf/blob/main/L3-8B-Stheno-v3.2.Q3_K_S.gguf) | Q3_K_S | 3.41GB |\n| [L3-8B-Stheno-v3.2.IQ3_M.gguf](https://huggingface.co/RichardErkhov/Sao10K_-_L3-8B-Stheno-v3.2-gguf/blob/main/L3-8B-Stheno-v3.2.IQ3_M.gguf) | IQ3_M | 3.52GB |\n| [L3-8B-Stheno-v3.2.Q3_K.gguf](https://huggingface.co/RichardErkhov/Sao10K_-_L3-8B-Stheno-v3.2-gguf/blob/main/L3-8B-Stheno-v3.2.Q3_K.gguf) | Q3_K | 3.74GB |\n| [L3-8B-Stheno-v3.2.Q3_K_M.gguf](https://huggingface.co/RichardErkhov/Sao10K_-_L3-8B-Stheno-v3.2-gguf/blob/main/L3-8B-Stheno-v3.2.Q3_K_M.gguf) | Q3_K_M | 3.74GB |\n| [L3-8B-Stheno-v3.2.Q3_K_L.gguf](https://huggingface.co/RichardErkhov/Sao10K_-_L3-8B-Stheno-v3.2-gguf/blob/main/L3-8B-Stheno-v3.2.Q3_K_L.gguf) | Q3_K_L | 4.03GB |\n| [L3-8B-Stheno-v3.2.IQ4_XS.gguf](https://huggingface.co/RichardErkhov/Sao10K_-_L3-8B-Stheno-v3.2-gguf/blob/main/L3-8B-Stheno-v3.2.IQ4_XS.gguf) | IQ4_XS | 4.18GB |\n| 
[L3-8B-Stheno-v3.2.Q4_0.gguf](https://huggingface.co/RichardErkhov/Sao10K_-_L3-8B-Stheno-v3.2-gguf/blob/main/L3-8B-Stheno-v3.2.Q4_0.gguf) | Q4_0 | 4.34GB |\n| [L3-8B-Stheno-v3.2.IQ4_NL.gguf](https://huggingface.co/RichardErkhov/Sao10K_-_L3-8B-Stheno-v3.2-gguf/blob/main/L3-8B-Stheno-v3.2.IQ4_NL.gguf) | IQ4_NL | 4.38GB |\n| [L3-8B-Stheno-v3.2.Q4_K_S.gguf](https://huggingface.co/RichardErkhov/Sao10K_-_L3-8B-Stheno-v3.2-gguf/blob/main/L3-8B-Stheno-v3.2.Q4_K_S.gguf) | Q4_K_S | 4.37GB |\n| [L3-8B-Stheno-v3.2.Q4_K.gguf](https://huggingface.co/RichardErkhov/Sao10K_-_L3-8B-Stheno-v3.2-gguf/blob/main/L3-8B-Stheno-v3.2.Q4_K.gguf) | Q4_K | 4.58GB |\n| [L3-8B-Stheno-v3.2.Q4_K_M.gguf](https://huggingface.co/RichardErkhov/Sao10K_-_L3-8B-Stheno-v3.2-gguf/blob/main/L3-8B-Stheno-v3.2.Q4_K_M.gguf) | Q4_K_M | 4.58GB |\n| [L3-8B-Stheno-v3.2.Q4_1.gguf](https://huggingface.co/RichardErkhov/Sao10K_-_L3-8B-Stheno-v3.2-gguf/blob/main/L3-8B-Stheno-v3.2.Q4_1.gguf) | Q4_1 | 4.78GB |\n| [L3-8B-Stheno-v3.2.Q5_0.gguf](https://huggingface.co/RichardErkhov/Sao10K_-_L3-8B-Stheno-v3.2-gguf/blob/main/L3-8B-Stheno-v3.2.Q5_0.gguf) | Q5_0 | 5.21GB |\n| [L3-8B-Stheno-v3.2.Q5_K_S.gguf](https://huggingface.co/RichardErkhov/Sao10K_-_L3-8B-Stheno-v3.2-gguf/blob/main/L3-8B-Stheno-v3.2.Q5_K_S.gguf) | Q5_K_S | 5.21GB |\n| [L3-8B-Stheno-v3.2.Q5_K.gguf](https://huggingface.co/RichardErkhov/Sao10K_-_L3-8B-Stheno-v3.2-gguf/blob/main/L3-8B-Stheno-v3.2.Q5_K.gguf) | Q5_K | 5.34GB |\n| [L3-8B-Stheno-v3.2.Q5_K_M.gguf](https://huggingface.co/RichardErkhov/Sao10K_-_L3-8B-Stheno-v3.2-gguf/blob/main/L3-8B-Stheno-v3.2.Q5_K_M.gguf) | Q5_K_M | 5.34GB |\n| [L3-8B-Stheno-v3.2.Q5_1.gguf](https://huggingface.co/RichardErkhov/Sao10K_-_L3-8B-Stheno-v3.2-gguf/blob/main/L3-8B-Stheno-v3.2.Q5_1.gguf) | Q5_1 | 5.65GB |\n| [L3-8B-Stheno-v3.2.Q6_K.gguf](https://huggingface.co/RichardErkhov/Sao10K_-_L3-8B-Stheno-v3.2-gguf/blob/main/L3-8B-Stheno-v3.2.Q6_K.gguf) | Q6_K | 6.14GB |\n| 
[L3-8B-Stheno-v3.2.Q8_0.gguf](https://huggingface.co/RichardErkhov/Sao10K_-_L3-8B-Stheno-v3.2-gguf/blob/main/L3-8B-Stheno-v3.2.Q8_0.gguf) | Q8_0 | 7.95GB |\n\n\n\n\nOriginal model description:\n---\nlicense: cc-by-nc-4.0\nlanguage:\n- en\ndatasets:\n- Gryphe/Opus-WritingPrompts\n- Sao10K/Claude-3-Opus-Instruct-15K\n- Sao10K/Short-Storygen-v2\n- Sao10K/c2-Logs-Filtered\n---\n\n*Just message me on discord if you want to host this privately for a service or something. We can talk.*\n\n*Train used 1x H100 SXM for like a total of 24 Hours over multiple runs.*\n\nSupport me here if you're interested:\n<br>Ko-fi: https://ko-fi.com/sao10k\n<br> *wink* Euryale v2?\n\nIf not, that's fine too. Feedback would be nice.\n\nContact Me in Discord:\n<br>`sao10k` // `Just ping me in the KoboldAI discord, I'll respond faster.`\n\n`Art by navy_(navy.blue)` - [Danbooru](https://danbooru.donmai.us/posts/3214477)\n\n---\n\n![Stheno](https://huggingface.co/Sao10K/L3-8B-Stheno-v3.2/resolve/main/Stheno.png?)\n\nStheno-v3.2-Zeta\n\nI have done a test run with multiple variations of the models, merged back to its base at various weights, different training runs too, and this Sixth iteration is the one I like most.\n\n\nChanges compared to v3.1\n<br>\\- Included a mix of SFW and NSFW Storywriting Data, thanks to [Gryphe](https://huggingface.co/datasets/Gryphe/Opus-WritingPrompts)\n<br>\\- Included More Instruct / Assistant-Style Data\n<br>\\- Further cleaned up Roleplaying Samples from c2 Logs -> A few terrible, really bad samples escaped heavy filtering. Manual pass fixed it.\n<br>\\- Hyperparameter tinkering for training, resulting in lower loss levels.\n\n\nTesting Notes - Compared to v3.1\n<br>\\- Handles SFW / NSFW seperately better. Not as overly excessive with NSFW now. Kinda balanced.\n<br>\\- Better at Storywriting / Narration.\n<br>\\- Better at Assistant-type Tasks.\n<br>\\- Better Multi-Turn Coherency -> Reduced Issues?\n<br>\\- Slightly less creative? A worthy tradeoff. 
Still creative.\n<br>\\- Better prompt / instruction adherence.\n\n---\n\n**Recommended Samplers:**\n\n```\nTemperature - 1.12-1.22\nMin-P - 0.075\nTop-K - 50\nRepetition Penalty - 1.1\n```\n\n**Stopping Strings:**\n\n```\n\\n\\n{{User}} # Or Equivalent, depending on Frontend\n<|eot_id|>\n<|end_of_text|>\n```\n\n**Prompting Template - Llama-3-Instruct**\n\n```\n<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\n{system_prompt}<|eot_id|><|start_header_id|>user<|end_header_id|>\n\n{input}<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\n{output}<|eot_id|>\n```\n\n**Basic Roleplay System Prompt**\n```\nYou are an expert actor that can fully immerse yourself into any role given. You do not break character for any reason, even if someone tries addressing you as an AI or language model.\nCurrently your role is {{char}}, which is described in detail below. As {{char}}, continue the exchange with {{user}}.\n```\n\n---\n\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 3,
  "downloads": 579,
  "gated": false,
  "private": false,
  "last_modified": "2024-06-25T08:22:02.000Z",
  "created_at": "2024-06-25T04:05:03.000Z",
  "pipeline_tag": "",
  "library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
  "_id": "667a41ef060322abb2b2c14b",
  "id": "RichardErkhov/Sao10K_-_L3-8B-Stheno-v3.2-gguf",
  "modelId": "RichardErkhov/Sao10K_-_L3-8B-Stheno-v3.2-gguf",
  "sha": "08dc467d797897d2a22cb97bba4aa330f5119921",
  "createdAt": "2024-06-25T04:05:03.000Z",
  "lastModified": "2024-06-25T08:22:02.000Z",
  "author": "RichardErkhov",
  "downloads": 579,
  "likes": 3,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 24
}
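The payload excerpt above is plain JSON, so the fields shown under Model Details can be derived from it directly. As a minimal sketch (the variable and function names are illustrative, and the payload is inlined rather than fetched), the following parses the two timestamps and splits the repository id into owner and name using only the standard library.

```python
import json
from datetime import datetime

# Subset of the payload excerpt shown above, inlined for a self-contained example.
PAYLOAD = """
{
  "id": "RichardErkhov/Sao10K_-_L3-8B-Stheno-v3.2-gguf",
  "sha": "08dc467d797897d2a22cb97bba4aa330f5119921",
  "createdAt": "2024-06-25T04:05:03.000Z",
  "lastModified": "2024-06-25T08:22:02.000Z",
  "downloads": 579,
  "likes": 3,
  "siblings_count": 24
}
"""


def parse_timestamp(value: str) -> datetime:
    """Parse the UTC timestamp format used in the payload (trailing 'Z')."""
    return datetime.strptime(value, "%Y-%m-%dT%H:%M:%S.%fZ")


payload = json.loads(PAYLOAD)
created = parse_timestamp(payload["createdAt"])
modified = parse_timestamp(payload["lastModified"])

# Time between repository creation and the last recorded modification.
elapsed = modified - created

# The id has the form "<owner>/<repository name>".
owner, repo = payload["id"].split("/", 1)

if __name__ == "__main__":
    print(f"{owner} / {repo}")
    print(f"created {created.date()}, last modified {elapsed} later")
```

Both timestamps fall on 2024-06-25, which is why the Created and Last Modified fields in Model Details show the same date even though the times differ.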