GraySoft
Model Intelligence Sheet

tsunemoto/tinyllama-1.1b-chat-v0.6-x8-moe-gguf overview

This is a GGUF quantization of TinyLlama-1.1B-Chat-v0.6-x8-MoE.

Tags: gguf, GGUF, en, endpoints_compatible, region:us, conversational
Downloads: 174
Likes: 6
Pipeline: (not set)
Library: (not set)
Visibility: Public
Access: Open

Repository Files & Downloads

14 files detected
Direct downloads for all repository files
File | Type | Quantization | Size
tinyllama-1.1b-chat-v0.6-x8-moe.Q2_K.gguf | GGUF | Q2_K | 2.01 GB
tinyllama-1.1b-chat-v0.6-x8-moe.Q3_K_L.gguf | GGUF | Q3_K_L | 2.62 GB
tinyllama-1.1b-chat-v0.6-x8-moe.Q3_K_M.gguf | GGUF | Q3_K_M | 2.61 GB
tinyllama-1.1b-chat-v0.6-x8-moe.Q3_K_S.gguf | GGUF | Q3_K_S | 2.60 GB
tinyllama-1.1b-chat-v0.6-x8-moe.Q4_0.gguf | GGUF | Q4_0 | 3.39 GB
tinyllama-1.1b-chat-v0.6-x8-moe.Q4_1.gguf | GGUF | Q4_1 | 3.76 GB
tinyllama-1.1b-chat-v0.6-x8-moe.Q4_K_M.gguf | GGUF | Q4_K_M | 3.39 GB
tinyllama-1.1b-chat-v0.6-x8-moe.Q4_K_S.gguf | GGUF | Q4_K_S | 3.39 GB
tinyllama-1.1b-chat-v0.6-x8-moe.Q5_0.gguf | GGUF | Q5_0 | 4.13 GB
tinyllama-1.1b-chat-v0.6-x8-moe.Q5_1.gguf | GGUF | Q5_1 | 4.50 GB
tinyllama-1.1b-chat-v0.6-x8-moe.Q5_K_M.gguf | GGUF | Q5_K_M | 4.13 GB
tinyllama-1.1b-chat-v0.6-x8-moe.Q5_K_S.gguf | GGUF | Q5_K_S | 4.13 GB
tinyllama-1.1b-chat-v0.6-x8-moe.Q6_K.gguf | GGUF | Q6_K | 4.91 GB
tinyllama-1.1b-chat-v0.6-x8-moe.Q8_0.gguf | GGUF | Q8_0 | 6.36 GB
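
The listed sizes can be used to choose a quantization that fits a given memory budget. The following is a minimal sketch and not part of the original page: the size mapping copies the figures from the file list, `pick_quant` and `resolve_url` are hypothetical helper names, and the URL layout assumes the usual Hugging Face `resolve/main` download path on the default `main` branch. File size is only a lower bound on the memory a runtime needs, since context buffers add overhead.

```python
# Sizes in GB, copied from the repository file list, keyed by quantization label.
QUANT_SIZES_GB = {
    "Q2_K": 2.01, "Q3_K_S": 2.60, "Q3_K_M": 2.61, "Q3_K_L": 2.62,
    "Q4_0": 3.39, "Q4_K_S": 3.39, "Q4_K_M": 3.39, "Q4_1": 3.76,
    "Q5_0": 4.13, "Q5_K_S": 4.13, "Q5_K_M": 4.13, "Q5_1": 4.50,
    "Q6_K": 4.91, "Q8_0": 6.36,
}

# Repository id as reported in the Hugging Face API payload.
REPO_ID = "tsunemoto/TinyLlama-1.1B-Chat-v0.6-x8-MoE-GGUF"
FILE_STEM = "tinyllama-1.1b-chat-v0.6-x8-moe"


def pick_quant(budget_gb):
    """Return the largest listed quantization whose file fits the budget, or None.

    Ties on size are broken alphabetically by label, which is an arbitrary choice.
    """
    fitting = [(size, name) for name, size in QUANT_SIZES_GB.items() if size <= budget_gb]
    if not fitting:
        return None
    return max(fitting)[1]


def resolve_url(quant):
    """Build a download URL, assuming the standard resolve/main layout."""
    filename = f"{FILE_STEM}.{quant}.gguf"
    return f"https://huggingface.co/{REPO_ID}/resolve/main/{filename}"


if __name__ == "__main__":
    choice = pick_quant(4.0)
    print(choice, resolve_url(choice))
```

With a 4.0 GB budget the sketch selects `Q4_1` (3.76 GB), the largest file under the limit. Whether a larger legacy quantization is preferable to a smaller K-quant is a separate quality question that file size alone does not answer.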

Model Details

Model Slug: tsunemoto/tinyllama-1.1b-chat-v0.6-x8-moe-gguf
Author: tsunemoto
Pipeline Task: (not set)
Library: (not set)
Created: 2023-12-29
Last Modified: 2023-12-29
Gated: No
Private: No
HF SHA: 879c7960a2b3df6f27e2148260c549695f8758a4
License: Unknown
Language: en
Base Model: Unknown

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "title": "TinyLlama-1.1B-Chat-v0.6-x8-MoE Quantized in GGUF",
    "tags": [
      "GGUF"
    ],
    "language": "en",
    "frontmatter": {
      "title": "\"TinyLlama-1.1B-Chat-v0.6-x8-MoE Quantized in GGUF\"",
      "tags": [
        "GGUF"
      ],
      "language": "en"
    },
    "hero_image_url": "https://i.postimg.cc/MGwhtFfF/tsune-fixed.png",
    "summary": "This is a GGUF quantization of TinyLlama-1.1B-Chat-v0.6-x8-MoE.",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\ntitle: \"TinyLlama-1.1B-Chat-v0.6-x8-MoE Quantized in GGUF\"\ntags:\n  - GGUF\nlanguage: en\n---\n![Image description](https://i.postimg.cc/MGwhtFfF/tsune-fixed.png)\n\n# Tsunemoto GGUF's of TinyLlama-1.1B-Chat-v0.6-x8-MoE\n\nThis is a GGUF quantization of TinyLlama-1.1B-Chat-v0.6-x8-MoE.\n\n## Original Repo Link:\n[Original Repository](https://huggingface.co/dillfrescott/TinyLlama-1.1B-Chat-v0.6-x8-MoE)\n\n## Original Model Card:\n---\n\n![image/png](https://cdn-uploads.huggingface.co/production/uploads/6215ce9abfcb3893344dd0a2/EPZcVwZOvnWl9mfoC1zMC.png)\n\nx8 MoE of https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v0.6",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "GGUF",
    "en",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 6,
  "downloads": 174,
  "gated": false,
  "private": false,
  "last_modified": "2023-12-29T18:08:55.000Z",
  "created_at": "2023-12-29T17:50:16.000Z",
  "pipeline_tag": "",
  "library_name": ""
}
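
Two details of the normalized metadata are easy to check programmatically: the top-level `tags` list contains both `gguf` and `GGUF`, and the two timestamps are ISO 8601 strings in UTC. The sketch below is an illustration only, not something the page itself runs. It deduplicates tags case-insensitively and computes the gap between creation and last modification. The function names are hypothetical, and the inline dictionary is a trimmed copy of the fields shown above.

```python
from datetime import datetime, timezone

# Trimmed copy of the normalized metadata shown above.
metadata = {
    "tags": ["gguf", "GGUF", "en", "endpoints_compatible", "region:us", "conversational"],
    "last_modified": "2023-12-29T18:08:55.000Z",
    "created_at": "2023-12-29T17:50:16.000Z",
}


def dedupe_tags(tags):
    """Drop tags that differ only by case, keeping the first spelling seen."""
    seen = set()
    unique = []
    for tag in tags:
        key = tag.lower()
        if key not in seen:
            seen.add(key)
            unique.append(tag)
    return unique


def parse_timestamp(value):
    """Parse a timestamp such as 2023-12-29T17:50:16.000Z as an aware UTC datetime."""
    parsed = datetime.strptime(value, "%Y-%m-%dT%H:%M:%S.%fZ")
    return parsed.replace(tzinfo=timezone.utc)


def seconds_between(meta):
    """Seconds elapsed from created_at to last_modified."""
    delta = parse_timestamp(meta["last_modified"]) - parse_timestamp(meta["created_at"])
    return int(delta.total_seconds())


if __name__ == "__main__":
    print(dedupe_tags(metadata["tags"]))
    print(seconds_between(metadata))
```

For the values above, the gap is 1119 seconds, so the repository was last modified under 19 minutes after it was created.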
Source payload excerpt (from Hugging Face API)
{
  "_id": "658f06d8ac02633c0d54b805",
  "id": "tsunemoto/TinyLlama-1.1B-Chat-v0.6-x8-MoE-GGUF",
  "modelId": "tsunemoto/TinyLlama-1.1B-Chat-v0.6-x8-MoE-GGUF",
  "sha": "879c7960a2b3df6f27e2148260c549695f8758a4",
  "createdAt": "2023-12-29T17:50:16.000Z",
  "lastModified": "2023-12-29T18:08:55.000Z",
  "author": "tsunemoto",
  "downloads": 174,
  "likes": 6,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 16
}