backyardai/llama-3.1-8b-stheno-v3.4-gguf v3.4.Q8_0 GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

backyardai/llama-3.1-8b-stheno-v3.4-gguf overview

ggufendataset:Setiaku/Stheno-v3.4-Instructdataset:Setiaku/Stheno-3.4-Creative-2base_model:Sao10K/Llama-3.1-8B-Stheno-v3.4base_model:quantized:Sao10K/Llama-3.1-8B-Stheno-v3.4license:cc-by-nc-4.0endpoints_compatibleregion:usconversational

backyardai/llama-3.1-8b-stheno-v3.4-gguf visual

Downloads

149

Likes

Pipeline

—

Library

—

Visibility

Public

Access

Open

Repository Files & Downloads

21 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
Llama-3.1-8B-Stheno-v3.4.F32.gguf	GGUF	F32	29.92 GB	Download
Llama-3.1-8B-Stheno-v3.4.IQ1_M.gguf	GGUF	IQ1_M	2.01 GB	Download
Llama-3.1-8B-Stheno-v3.4.IQ1_S.gguf	GGUF	IQ1_S	1.88 GB	Download
Llama-3.1-8B-Stheno-v3.4.IQ2_M.gguf	GGUF	IQ2_M	2.75 GB	Download
Llama-3.1-8B-Stheno-v3.4.IQ2_S.gguf	GGUF	IQ2_S	2.57 GB	Download
Llama-3.1-8B-Stheno-v3.4.IQ2_XS.gguf	GGUF	IQ2_XS	2.43 GB	Download
Llama-3.1-8B-Stheno-v3.4.IQ2_XXS.gguf	GGUF	IQ2_XXS	2.23 GB	Download
Llama-3.1-8B-Stheno-v3.4.IQ3_M.gguf	GGUF	IQ3_M	3.52 GB	Download
Llama-3.1-8B-Stheno-v3.4.IQ3_S.gguf	GGUF	IQ3_S	3.43 GB	Download
Llama-3.1-8B-Stheno-v3.4.IQ3_XS.gguf	GGUF	IQ3_XS	3.28 GB	Download
Llama-3.1-8B-Stheno-v3.4.IQ3_XXS.gguf	GGUF	IQ3_XXS	3.05 GB	Download
Llama-3.1-8B-Stheno-v3.4.IQ4_XS.gguf	GGUF	IQ4_XS	4.14 GB	Download
Llama-3.1-8B-Stheno-v3.4.Q3_K_L.gguf	GGUF	Q3_K_L	4.03 GB	Download
Llama-3.1-8B-Stheno-v3.4.Q3_K_M.gguf	GGUF	Q3_K_M	3.74 GB	Download
Llama-3.1-8B-Stheno-v3.4.Q3_K_S.gguf	GGUF	Q3_K_S	3.41 GB	Download
Llama-3.1-8B-Stheno-v3.4.Q4_K_M.gguf	GGUF	Q4_K_M	4.58 GB	Download
Llama-3.1-8B-Stheno-v3.4.Q4_K_S.gguf	GGUF	Q4_K_S	4.37 GB	Download
Llama-3.1-8B-Stheno-v3.4.Q5_K_M.gguf	GGUF	Q5_K_M	5.34 GB	Download
Llama-3.1-8B-Stheno-v3.4.Q5_K_S.gguf	GGUF	Q5_K_S	5.21 GB	Download
Llama-3.1-8B-Stheno-v3.4.Q6_K.gguf	GGUF	Q6_K	6.14 GB	Download
Llama-3.1-8B-Stheno-v3.4.Q8_0.gguf	GGUF	—	7.95 GB	Download

Model Details Live

Model Slug

backyardai/llama-3.1-8b-stheno-v3.4-gguf

Author

backyardai

Pipeline Task

—

Library

—

Created

2024-08-29

Last Modified

2024-08-29

Gated

Private

HF SHA

23d8cfcbc1df39d79ee26e794940ee91a007f93d

License

cc-by-nc-4.0

Language

Base Model

Sao10K/Llama-3.1-8B-Stheno-v3.4

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "base_model": "Sao10K/Llama-3.1-8B-Stheno-v3.4",
    "datasets": [
      "Setiaku/Stheno-v3.4-Instruct",
      "Setiaku/Stheno-3.4-Creative-2"
    ],
    "language": [
      "en"
    ],
    "license": "cc-by-nc-4.0",
    "model_name": "Llama-3.1-8B-Stheno-v3.4-GGUF",
    "quantized_by": "brooketh",
    "parameter_count": 8030261312,
    "frontmatter": {
      "base_model": "Sao10K/Llama-3.1-8B-Stheno-v3.4",
      "datasets": [
        "Setiaku/Stheno-v3.4-Instruct",
        "Setiaku/Stheno-3.4-Creative-2"
      ],
      "language": [
        "en"
      ],
      "license": "cc-by-nc-4.0",
      "model_name": "Llama-3.1-8B-Stheno-v3.4-GGUF",
      "quantized_by": "brooketh",
      "parameter_count": "8030261312"
    },
    "hero_image_url": "BackyardAI_Banner.png",
    "summary": "***",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nbase_model: Sao10K/Llama-3.1-8B-Stheno-v3.4\ndatasets:\n- Setiaku/Stheno-v3.4-Instruct\n- Setiaku/Stheno-3.4-Creative-2\nlanguage:\n- en\nlicense: cc-by-nc-4.0\nmodel_name: Llama-3.1-8B-Stheno-v3.4-GGUF\nquantized_by: brooketh\nparameter_count: 8030261312\n---\n<img src=\"BackyardAI_Banner.png\" alt=\"Backyard.ai\" style=\"height: 90px; min-width: 32px; display: block; margin: auto;\">\n\n**<p style=\"text-align: center;\">The official library of GGUF format models for use in the local AI chat app, Backyard AI.</p>**\n\n<p style=\"text-align: center;\"><a href=\"https://backyard.ai/\">Download Backyard AI here to get started.</a></p>\n\n<p style=\"text-align: center;\"><a href=\"https://www.reddit.com/r/LLM_Quants/\">Request Additional models at r/LLM_Quants.</a></p>\n\n***\n# Llama 3.1 Stheno V3.4 8B\n- **Creator:** [Sao10K](https://huggingface.co/Sao10K/)\n- **Original:** [Llama 3.1 Stheno V3.4 8B](https://huggingface.co/Sao10K/Llama-3.1-8B-Stheno-v3.4)\n- **Date Created:** 2024-08-19\n- **Trained Context:** 131072 tokens\n- **Description:** Version 3.4 of the popular Stheno series of Llama-3.1-based roleplay models. Finetuned in two passes: first, over a multi-turn Conversational-Instruct dataaset; second, over a Creative Writing/Roleplay dataset along with some Creative-based Instruct datasets. Source data contains a mixture of human and Claude data.\n***\n## What is a GGUF?\nGGUF is a large language model (LLM) format that can be split between CPU and GPU. GGUFs are compatible with applications based on llama.cpp, such as Backyard AI. Where other model formats require higher end GPUs with ample VRAM, GGUFs can be efficiently run on a wider variety of hardware.\nGGUF models are quantized to reduce resource usage, with a tradeoff of reduced coherence at lower quantizations. Quantization reduces the precision of the model weights by changing the number of bits used for each weight.\n\n***\n<img src=\"BackyardAI_Logo.png\" alt=\"Backyard.ai\" style=\"height: 75px; min-width: 32px; display: block; horizontal align: left;\">\n\n## Backyard AI\n- Free, local AI chat application.\n- One-click installation on Mac and PC.\n- Automatically use GPU for maximum speed.\n- Built-in model manager.\n- High-quality character hub.\n- Zero-config desktop-to-mobile tethering.\nBackyard AI makes it easy to start chatting with AI using your own characters or one of the many found in the built-in character hub. The model manager helps you find the latest and greatest models without worrying about whether it's the correct format. Backyard AI supports advanced features such as lorebooks, author's note, text formatting, custom context size, sampler settings, grammars, local TTS, cloud inference, and tethering, all implemented in a way that is straightforward and reliable.\n**Join us on [Discord](https://discord.gg/SyNN2vC9tQ)**\n***",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "en",
    "dataset:Setiaku/Stheno-v3.4-Instruct",
    "dataset:Setiaku/Stheno-3.4-Creative-2",
    "base_model:Sao10K/Llama-3.1-8B-Stheno-v3.4",
    "base_model:quantized:Sao10K/Llama-3.1-8B-Stheno-v3.4",
    "license:cc-by-nc-4.0",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 3,
  "downloads": 149,
  "gated": false,
  "private": false,
  "last_modified": "2024-08-29T01:33:20.000Z",
  "created_at": "2024-08-29T01:16:17.000Z",
  "pipeline_tag": "",
  "library_name": ""
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "66cfcbe14c3b13931ee21741",
  "id": "backyardai/Llama-3.1-8B-Stheno-v3.4-GGUF",
  "modelId": "backyardai/Llama-3.1-8B-Stheno-v3.4-GGUF",
  "sha": "23d8cfcbc1df39d79ee26e794940ee91a007f93d",
  "createdAt": "2024-08-29T01:16:17.000Z",
  "lastModified": "2024-08-29T01:33:20.000Z",
  "author": "backyardai",
  "downloads": 149,
  "likes": 3,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 26
}

backyardai/llama-3.1-8b-stheno-v3.4-gguf overview

Repository Files & Downloads

Model Details Live

Metadata Inspector

More models in this shard