GraySoft
Projects Models About FAQ Contact Download guIDE →

backyardai/llama-3.1-8b-stheno-v3.4-gguf v3.4.Q8_0 GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

backyardai/llama-3.1-8b-stheno-v3.4-gguf overview

*

ggufendataset:Setiaku/Stheno-v3.4-Instructdataset:Setiaku/Stheno-3.4-Creative-2base_model:Sao10K/Llama-3.1-8B-Stheno-v3.4base_model:quantized:Sao10K/Llama-3.1-8B-Stheno-v3.4license:cc-by-nc-4.0endpoints_compatibleregion:usconversational
backyardai/llama-3.1-8b-stheno-v3.4-gguf visual
Downloads
149
Likes
3
Pipeline
Library
Visibility
Public
Access
Open

Repository Files & Downloads

21 files detected
Direct downloads for all repository files
FileTypeQuantizationSizeLink
Llama-3.1-8B-Stheno-v3.4.F32.gguf GGUF F32 29.92 GB Download
Llama-3.1-8B-Stheno-v3.4.IQ1_M.gguf GGUF IQ1_M 2.01 GB Download
Llama-3.1-8B-Stheno-v3.4.IQ1_S.gguf GGUF IQ1_S 1.88 GB Download
Llama-3.1-8B-Stheno-v3.4.IQ2_M.gguf GGUF IQ2_M 2.75 GB Download
Llama-3.1-8B-Stheno-v3.4.IQ2_S.gguf GGUF IQ2_S 2.57 GB Download
Llama-3.1-8B-Stheno-v3.4.IQ2_XS.gguf GGUF IQ2_XS 2.43 GB Download
Llama-3.1-8B-Stheno-v3.4.IQ2_XXS.gguf GGUF IQ2_XXS 2.23 GB Download
Llama-3.1-8B-Stheno-v3.4.IQ3_M.gguf GGUF IQ3_M 3.52 GB Download
Llama-3.1-8B-Stheno-v3.4.IQ3_S.gguf GGUF IQ3_S 3.43 GB Download
Llama-3.1-8B-Stheno-v3.4.IQ3_XS.gguf GGUF IQ3_XS 3.28 GB Download
Llama-3.1-8B-Stheno-v3.4.IQ3_XXS.gguf GGUF IQ3_XXS 3.05 GB Download
Llama-3.1-8B-Stheno-v3.4.IQ4_XS.gguf GGUF IQ4_XS 4.14 GB Download
Llama-3.1-8B-Stheno-v3.4.Q3_K_L.gguf GGUF Q3_K_L 4.03 GB Download
Llama-3.1-8B-Stheno-v3.4.Q3_K_M.gguf GGUF Q3_K_M 3.74 GB Download
Llama-3.1-8B-Stheno-v3.4.Q3_K_S.gguf GGUF Q3_K_S 3.41 GB Download
Llama-3.1-8B-Stheno-v3.4.Q4_K_M.gguf GGUF Q4_K_M 4.58 GB Download
Llama-3.1-8B-Stheno-v3.4.Q4_K_S.gguf GGUF Q4_K_S 4.37 GB Download
Llama-3.1-8B-Stheno-v3.4.Q5_K_M.gguf GGUF Q5_K_M 5.34 GB Download
Llama-3.1-8B-Stheno-v3.4.Q5_K_S.gguf GGUF Q5_K_S 5.21 GB Download
Llama-3.1-8B-Stheno-v3.4.Q6_K.gguf GGUF Q6_K 6.14 GB Download
Llama-3.1-8B-Stheno-v3.4.Q8_0.gguf GGUF 7.95 GB Download

Model Details Live

Model Slug
backyardai/llama-3.1-8b-stheno-v3.4-gguf
Author
backyardai
Pipeline Task
Library
Created
2024-08-29
Last Modified
2024-08-29
Gated
No
Private
No
HF SHA
23d8cfcbc1df39d79ee26e794940ee91a007f93d
License
cc-by-nc-4.0
Language
en
Base Model
Sao10K/Llama-3.1-8B-Stheno-v3.4

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "base_model": "Sao10K/Llama-3.1-8B-Stheno-v3.4",
    "datasets": [
      "Setiaku/Stheno-v3.4-Instruct",
      "Setiaku/Stheno-3.4-Creative-2"
    ],
    "language": [
      "en"
    ],
    "license": "cc-by-nc-4.0",
    "model_name": "Llama-3.1-8B-Stheno-v3.4-GGUF",
    "quantized_by": "brooketh",
    "parameter_count": 8030261312,
    "frontmatter": {
      "base_model": "Sao10K/Llama-3.1-8B-Stheno-v3.4",
      "datasets": [
        "Setiaku/Stheno-v3.4-Instruct",
        "Setiaku/Stheno-3.4-Creative-2"
      ],
      "language": [
        "en"
      ],
      "license": "cc-by-nc-4.0",
      "model_name": "Llama-3.1-8B-Stheno-v3.4-GGUF",
      "quantized_by": "brooketh",
      "parameter_count": "8030261312"
    },
    "hero_image_url": "BackyardAI_Banner.png",
    "summary": "***",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nbase_model: Sao10K/Llama-3.1-8B-Stheno-v3.4\ndatasets:\n- Setiaku/Stheno-v3.4-Instruct\n- Setiaku/Stheno-3.4-Creative-2\nlanguage:\n- en\nlicense: cc-by-nc-4.0\nmodel_name: Llama-3.1-8B-Stheno-v3.4-GGUF\nquantized_by: brooketh\nparameter_count: 8030261312\n---\n<img src=\"BackyardAI_Banner.png\" alt=\"Backyard.ai\" style=\"height: 90px; min-width: 32px; display: block; margin: auto;\">\n\n**<p style=\"text-align: center;\">The official library of GGUF format models for use in the local AI chat app, Backyard AI.</p>**\n\n<p style=\"text-align: center;\"><a href=\"https://backyard.ai/\">Download Backyard AI here to get started.</a></p>\n\n<p style=\"text-align: center;\"><a href=\"https://www.reddit.com/r/LLM_Quants/\">Request Additional models at r/LLM_Quants.</a></p>\n\n***\n# Llama 3.1 Stheno V3.4 8B\n- **Creator:** [Sao10K](https://huggingface.co/Sao10K/)\n- **Original:** [Llama 3.1 Stheno V3.4 8B](https://huggingface.co/Sao10K/Llama-3.1-8B-Stheno-v3.4)\n- **Date Created:** 2024-08-19\n- **Trained Context:** 131072 tokens\n- **Description:** Version 3.4 of the popular Stheno series of Llama-3.1-based roleplay models. Finetuned in two passes: first, over a multi-turn Conversational-Instruct dataaset; second, over a Creative Writing/Roleplay dataset along with some Creative-based Instruct datasets. Source data contains a mixture of human and Claude data.\n***\n## What is a GGUF?\nGGUF is a large language model (LLM) format that can be split between CPU and GPU. GGUFs are compatible with applications based on llama.cpp, such as Backyard AI. Where other model formats require higher end GPUs with ample VRAM, GGUFs can be efficiently run on a wider variety of hardware.\nGGUF models are quantized to reduce resource usage, with a tradeoff of reduced coherence at lower quantizations. Quantization reduces the precision of the model weights by changing the number of bits used for each weight.\n\n***\n<img src=\"BackyardAI_Logo.png\" alt=\"Backyard.ai\" style=\"height: 75px; min-width: 32px; display: block; horizontal align: left;\">\n\n## Backyard AI\n- Free, local AI chat application.\n- One-click installation on Mac and PC.\n- Automatically use GPU for maximum speed.\n- Built-in model manager.\n- High-quality character hub.\n- Zero-config desktop-to-mobile tethering.\nBackyard AI makes it easy to start chatting with AI using your own characters or one of the many found in the built-in character hub. The model manager helps you find the latest and greatest models without worrying about whether it's the correct format. Backyard AI supports advanced features such as lorebooks, author's note, text formatting, custom context size, sampler settings, grammars, local TTS, cloud inference, and tethering, all implemented in a way that is straightforward and reliable.\n**Join us on [Discord](https://discord.gg/SyNN2vC9tQ)**\n***",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "en",
    "dataset:Setiaku/Stheno-v3.4-Instruct",
    "dataset:Setiaku/Stheno-3.4-Creative-2",
    "base_model:Sao10K/Llama-3.1-8B-Stheno-v3.4",
    "base_model:quantized:Sao10K/Llama-3.1-8B-Stheno-v3.4",
    "license:cc-by-nc-4.0",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 3,
  "downloads": 149,
  "gated": false,
  "private": false,
  "last_modified": "2024-08-29T01:33:20.000Z",
  "created_at": "2024-08-29T01:16:17.000Z",
  "pipeline_tag": "",
  "library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
  "_id": "66cfcbe14c3b13931ee21741",
  "id": "backyardai/Llama-3.1-8B-Stheno-v3.4-GGUF",
  "modelId": "backyardai/Llama-3.1-8B-Stheno-v3.4-GGUF",
  "sha": "23d8cfcbc1df39d79ee26e794940ee91a007f93d",
  "createdAt": "2024-08-29T01:16:17.000Z",
  "lastModified": "2024-08-29T01:33:20.000Z",
  "author": "backyardai",
  "downloads": 149,
  "likes": 3,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 26
}