GraySoft
Projects Models About FAQ Contact Download guIDE →

duyntnet/llama-3.1-supernova-lite-imatrix-gguf Q3_K_L GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

duyntnet/llama-3.1-supernova-lite-imatrix-gguf overview

Llama-3.1-SuperNova-Lite is an 8B parameter model developed by Arcee.ai, based on the Llama-3.1-8B-Instruct architecture. It is a distilled version of the larger Llama-3.1-405B-Instruct model, leveraging offline logits extracted from the 405B parameter variant. This 8B variation of Llama-3.1-SuperNova maintains high performance while offering exceptional instruction-following capabilities and domain-specific adaptability. The model was trained using a state-of-the-art distillation pipeline and an instruction dataset generated with EvolKit, ensuring accuracy and efficiency across a wide range of tasks. For more information on its training, visit blog.arcee.ai. Llama-3.1-SuperNova-Lite excels in both benchmark performance and real-world applications, providing the power of large-scale models in a more compact, efficient form ideal for organizations seeking high performance with reduced resource requirements.

transformersggufimatrixLlama-3.1-SuperNova-Litetext-generationenlicense:otherregion:usconversational
duyntnet/llama-3.1-supernova-lite-imatrix-gguf visual
Downloads
97
Likes
0
Pipeline
text-generation
Library
transformers
Visibility
Public
Access
Open

Repository Files & Downloads

27 files detected
Direct downloads for all repository files
FileTypeQuantizationSizeLink
Llama-3.1-SuperNova-Lite-IQ1_M.gguf GGUF IQ1_M 2.01 GB Download
Llama-3.1-SuperNova-Lite-IQ1_S.gguf GGUF IQ1_S 1.88 GB Download
Llama-3.1-SuperNova-Lite-IQ2_M.gguf GGUF IQ2_M 2.75 GB Download
Llama-3.1-SuperNova-Lite-IQ2_S.gguf GGUF IQ2_S 2.57 GB Download
Llama-3.1-SuperNova-Lite-IQ2_XS.gguf GGUF IQ2_XS 2.43 GB Download
Llama-3.1-SuperNova-Lite-IQ2_XXS.gguf GGUF IQ2_XXS 2.23 GB Download
Llama-3.1-SuperNova-Lite-IQ3_M.gguf GGUF IQ3_M 3.52 GB Download
Llama-3.1-SuperNova-Lite-IQ3_S.gguf GGUF IQ3_S 3.43 GB Download
Llama-3.1-SuperNova-Lite-IQ3_XS.gguf GGUF IQ3_XS 3.28 GB Download
Llama-3.1-SuperNova-Lite-IQ3_XXS.gguf GGUF IQ3_XXS 3.05 GB Download
Llama-3.1-SuperNova-Lite-IQ4_NL.gguf GGUF IQ4_NL 4.36 GB Download
Llama-3.1-SuperNova-Lite-IQ4_XS.gguf GGUF IQ4_XS 4.14 GB Download
Llama-3.1-SuperNova-Lite-Q2_K.gguf GGUF Q2_K 2.96 GB Download
Llama-3.1-SuperNova-Lite-Q2_K_S.gguf GGUF Q2_K_S 2.78 GB Download
Llama-3.1-SuperNova-Lite-Q3_K_L.gguf GGUF Q3_K_L 4.03 GB Download
Llama-3.1-SuperNova-Lite-Q3_K_M.gguf GGUF Q3_K_M 3.74 GB Download
Llama-3.1-SuperNova-Lite-Q3_K_S.gguf GGUF Q3_K_S 3.41 GB Download
Llama-3.1-SuperNova-Lite-Q4_0.gguf GGUF 4.35 GB Download
Llama-3.1-SuperNova-Lite-Q4_1.gguf GGUF 4.78 GB Download
Llama-3.1-SuperNova-Lite-Q4_K_M.gguf GGUF Q4_K_M 4.58 GB Download
Llama-3.1-SuperNova-Lite-Q4_K_S.gguf GGUF Q4_K_S 4.37 GB Download
Llama-3.1-SuperNova-Lite-Q5_0.gguf GGUF 5.23 GB Download
Llama-3.1-SuperNova-Lite-Q5_1.gguf GGUF 5.65 GB Download
Llama-3.1-SuperNova-Lite-Q5_K_M.gguf GGUF Q5_K_M 5.34 GB Download
Llama-3.1-SuperNova-Lite-Q5_K_S.gguf GGUF Q5_K_S 5.21 GB Download
Llama-3.1-SuperNova-Lite-Q6_K.gguf GGUF Q6_K 6.14 GB Download
Llama-3.1-SuperNova-Lite-Q8_0.gguf GGUF 7.95 GB Download

Model Details Live

Model Slug
duyntnet/llama-3.1-supernova-lite-imatrix-gguf
Author
duyntnet
Pipeline Task
text-generation
Library
transformers
Created
2024-10-03
Last Modified
2024-10-03
Gated
No
Private
No
HF SHA
bd2e48eef64e149c1696d47a85cdc0e888ebb080
License
other
Language
en
Base Model
Unknown

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "license": "other",
    "language": [
      "en"
    ],
    "pipeline_tag": "text-generation",
    "inference": false,
    "tags": [
      "transformers",
      "gguf",
      "imatrix",
      "Llama-3.1-SuperNova-Lite"
    ],
    "frontmatter": {
      "license": "other",
      "language": [
        "en"
      ],
      "pipeline_tag": "text-generation",
      "inference": "false",
      "tags": [
        "transformers",
        "gguf",
        "imatrix",
        "Llama-3.1-SuperNova-Lite"
      ]
    },
    "hero_image_url": "",
    "summary": "Llama-3.1-SuperNova-Lite is an 8B parameter model developed by Arcee.ai, based on the Llama-3.1-8B-Instruct architecture. It is a distilled version of the larger Llama-3.1-405B-Instruct model, leveraging offline logits extracted from the 405B parameter variant. This 8B variation of Llama-3.1-SuperNova maintains high performance while offering exceptional instruction-following capabilities and domain-specific adaptability. The model was trained using a state-of-the-art distillation pipeline and an instruction dataset generated with EvolKit, ensuring accuracy and efficiency across a wide range of tasks. For more information on its training, visit blog.arcee.ai. Llama-3.1-SuperNova-Lite excels in both benchmark performance and real-world applications, providing the power of large-scale models in a more compact, efficient form ideal for organizations seeking high performance with reduced resource requirements.",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: other\nlanguage:\n- en\npipeline_tag: text-generation\ninference: false\ntags:\n- transformers\n- gguf\n- imatrix\n- Llama-3.1-SuperNova-Lite\n---\nQuantizations of https://huggingface.co/arcee-ai/Llama-3.1-SuperNova-Lite\n\n\n### Inference Clients/UIs\n* [llama.cpp](https://github.com/ggerganov/llama.cpp)\n* [KoboldCPP](https://github.com/LostRuins/koboldcpp)\n* [text-generation-webui](https://github.com/oobabooga/text-generation-webui)\n* [ollama](https://github.com/ollama/ollama)\n\n\n---\n\n# From original readme\n\nLlama-3.1-SuperNova-Lite is an 8B parameter model developed by Arcee.ai, based on the Llama-3.1-8B-Instruct architecture. It is a distilled version of the larger Llama-3.1-405B-Instruct model, leveraging offline logits extracted from the 405B parameter variant. This 8B variation of Llama-3.1-SuperNova maintains high performance while offering exceptional instruction-following capabilities and domain-specific adaptability. \n\nThe model was trained using a state-of-the-art distillation pipeline and an instruction dataset generated with [EvolKit](https://github.com/arcee-ai/EvolKit), ensuring accuracy and efficiency across a wide range of tasks. For more information on its training, visit blog.arcee.ai. \n\nLlama-3.1-SuperNova-Lite excels in both benchmark performance and real-world applications, providing the power of large-scale models in a more compact, efficient form ideal for organizations seeking high performance with reduced resource requirements.",
    "related_quantizations": []
  },
  "tags": [
    "transformers",
    "gguf",
    "imatrix",
    "Llama-3.1-SuperNova-Lite",
    "text-generation",
    "en",
    "license:other",
    "region:us",
    "conversational"
  ],
  "likes": 0,
  "downloads": 97,
  "gated": false,
  "private": false,
  "last_modified": "2024-10-03T17:30:08.000Z",
  "created_at": "2024-10-03T15:11:30.000Z",
  "pipeline_tag": "text-generation",
  "library_name": "transformers"
}
Source payload excerpt (from Hugging Face API)
{
  "_id": "66feb422c1a87ccc721c0ce6",
  "id": "duyntnet/Llama-3.1-SuperNova-Lite-imatrix-GGUF",
  "modelId": "duyntnet/Llama-3.1-SuperNova-Lite-imatrix-GGUF",
  "sha": "bd2e48eef64e149c1696d47a85cdc0e888ebb080",
  "createdAt": "2024-10-03T15:11:30.000Z",
  "lastModified": "2024-10-03T17:30:08.000Z",
  "author": "duyntnet",
  "downloads": 97,
  "likes": 0,
  "gated": false,
  "private": false,
  "pipeline_tag": "text-generation",
  "library_name": "transformers",
  "siblings_count": 29
}