duyntnet/llama-3.1-supernova-lite-imatrix-gguf is indexed on GraySoft with repository links, GGUF quant files (including Q3_K_L), and Hugging Face metadata. Use this page to pick a local model for guIDE or other runtimes. See related models in the same shard below.
duyntnet/llama-3.1-supernova-lite-imatrix-gguf overview
Llama-3.1-SuperNova-Lite is an 8B parameter model developed by Arcee.ai, based on the Llama-3.1-8B-Instruct architecture. It is a distilled version of the larger Llama-3.1-405B-Instruct model, leveraging offline logits extracted from the 405B parameter variant. This 8B variation of Llama-3.1-SuperNova maintains high performance while offering exceptional instruction-following capabilities and domain-specific adaptability.
The model was trained using a state-of-the-art distillation pipeline and an instruction dataset generated with EvolKit, ensuring accuracy and efficiency across a wide range of tasks. For more information on its training, visit blog.arcee.ai.
Llama-3.1-SuperNova-Lite excels in both benchmark performance and real-world applications, providing the power of large-scale models in a more compact, efficient form ideal for organizations seeking high performance with reduced resource requirements.
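The overview mentions distillation from offline teacher logits. As a rough illustration of that general idea, and not a description of Arcee's actual pipeline, the sketch below computes a KL-divergence loss between a teacher and a student distribution for a single token. The function names, the temperature parameter, and the toy logits are all assumptions made for this example.

```python
import math

def softmax(logits, temperature=1.0):
    """Convert raw logits to probabilities, with optional temperature scaling."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(teacher_logits, student_logits, temperature=1.0):
    """KL(teacher || student) over one token's vocabulary distribution."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

# Toy three-token vocabulary; a real teacher would supply stored ("offline") logits.
teacher = [2.0, 1.0, 0.1]
matching_student = [2.0, 1.0, 0.1]
poor_student = [0.1, 1.0, 2.0]

print(distillation_loss(teacher, matching_student))
print(distillation_loss(teacher, poor_student))
```

The loss is zero when the student reproduces the teacher's distribution and grows as the two diverge, which is the signal a distillation run would minimize.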
Repository Files & Downloads
| File | Type | Quantization | Size | Link |
|---|---|---|---|---|
| Llama-3.1-SuperNova-Lite-IQ1_M.gguf | GGUF | IQ1_M | 2.01 GB | Download |
| Llama-3.1-SuperNova-Lite-IQ1_S.gguf | GGUF | IQ1_S | 1.88 GB | Download |
| Llama-3.1-SuperNova-Lite-IQ2_M.gguf | GGUF | IQ2_M | 2.75 GB | Download |
| Llama-3.1-SuperNova-Lite-IQ2_S.gguf | GGUF | IQ2_S | 2.57 GB | Download |
| Llama-3.1-SuperNova-Lite-IQ2_XS.gguf | GGUF | IQ2_XS | 2.43 GB | Download |
| Llama-3.1-SuperNova-Lite-IQ2_XXS.gguf | GGUF | IQ2_XXS | 2.23 GB | Download |
| Llama-3.1-SuperNova-Lite-IQ3_M.gguf | GGUF | IQ3_M | 3.52 GB | Download |
| Llama-3.1-SuperNova-Lite-IQ3_S.gguf | GGUF | IQ3_S | 3.43 GB | Download |
| Llama-3.1-SuperNova-Lite-IQ3_XS.gguf | GGUF | IQ3_XS | 3.28 GB | Download |
| Llama-3.1-SuperNova-Lite-IQ3_XXS.gguf | GGUF | IQ3_XXS | 3.05 GB | Download |
| Llama-3.1-SuperNova-Lite-IQ4_NL.gguf | GGUF | IQ4_NL | 4.36 GB | Download |
| Llama-3.1-SuperNova-Lite-IQ4_XS.gguf | GGUF | IQ4_XS | 4.14 GB | Download |
| Llama-3.1-SuperNova-Lite-Q2_K.gguf | GGUF | Q2_K | 2.96 GB | Download |
| Llama-3.1-SuperNova-Lite-Q2_K_S.gguf | GGUF | Q2_K_S | 2.78 GB | Download |
| Llama-3.1-SuperNova-Lite-Q3_K_L.gguf | GGUF | Q3_K_L | 4.03 GB | Download |
| Llama-3.1-SuperNova-Lite-Q3_K_M.gguf | GGUF | Q3_K_M | 3.74 GB | Download |
| Llama-3.1-SuperNova-Lite-Q3_K_S.gguf | GGUF | Q3_K_S | 3.41 GB | Download |
| Llama-3.1-SuperNova-Lite-Q4_0.gguf | GGUF | Q4_0 | 4.35 GB | Download |
| Llama-3.1-SuperNova-Lite-Q4_1.gguf | GGUF | Q4_1 | 4.78 GB | Download |
| Llama-3.1-SuperNova-Lite-Q4_K_M.gguf | GGUF | Q4_K_M | 4.58 GB | Download |
| Llama-3.1-SuperNova-Lite-Q4_K_S.gguf | GGUF | Q4_K_S | 4.37 GB | Download |
| Llama-3.1-SuperNova-Lite-Q5_0.gguf | GGUF | Q5_0 | 5.23 GB | Download |
| Llama-3.1-SuperNova-Lite-Q5_1.gguf | GGUF | Q5_1 | 5.65 GB | Download |
| Llama-3.1-SuperNova-Lite-Q5_K_M.gguf | GGUF | Q5_K_M | 5.34 GB | Download |
| Llama-3.1-SuperNova-Lite-Q5_K_S.gguf | GGUF | Q5_K_S | 5.21 GB | Download |
| Llama-3.1-SuperNova-Lite-Q6_K.gguf | GGUF | Q6_K | 6.14 GB | Download |
| Llama-3.1-SuperNova-Lite-Q8_0.gguf | GGUF | Q8_0 | 7.95 GB | Download |
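One way to use the table above is to pick the largest file that fits in available memory. The sketch below does that for a handful of rows copied from the table. The `overhead_gb` headroom (for context cache and runtime buffers) is a hypothetical rule of thumb chosen for illustration, not a measured figure, and `pick_quant` is a name introduced here.

```python
# File sizes in GB, copied from a subset of the rows in the table above.
QUANT_SIZES_GB = {
    "IQ2_M": 2.75,
    "Q3_K_L": 4.03,
    "Q4_K_M": 4.58,
    "Q5_K_M": 5.34,
    "Q6_K": 6.14,
    "Q8_0": 7.95,
}

def pick_quant(memory_budget_gb, overhead_gb=1.5, sizes=QUANT_SIZES_GB):
    """Return the largest quant whose file fits in the budget minus overhead, or None."""
    usable = memory_budget_gb - overhead_gb  # overhead_gb is an assumed headroom
    fitting = {name: size for name, size in sizes.items() if size <= usable}
    if not fitting:
        return None
    return max(fitting, key=fitting.get)

for budget in (8.0, 6.0, 3.0):
    print(budget, "GB ->", pick_quant(budget))
```

Larger files generally preserve more of the original model's quality, so choosing the biggest quant that fits is a common starting point; actual memory use depends on the runtime and context length.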
Model Details
Metadata Inspector
Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "license": "other",
    "language": [
      "en"
    ],
    "pipeline_tag": "text-generation",
    "inference": false,
    "tags": [
      "transformers",
      "gguf",
      "imatrix",
      "Llama-3.1-SuperNova-Lite"
    ],
    "frontmatter": {
      "license": "other",
      "language": [
        "en"
      ],
      "pipeline_tag": "text-generation",
      "inference": "false",
      "tags": [
        "transformers",
        "gguf",
        "imatrix",
        "Llama-3.1-SuperNova-Lite"
      ]
    },
    "hero_image_url": "",
    "summary": "Llama-3.1-SuperNova-Lite is an 8B parameter model developed by Arcee.ai, based on the Llama-3.1-8B-Instruct architecture. It is a distilled version of the larger Llama-3.1-405B-Instruct model, leveraging offline logits extracted from the 405B parameter variant. This 8B variation of Llama-3.1-SuperNova maintains high performance while offering exceptional instruction-following capabilities and domain-specific adaptability. The model was trained using a state-of-the-art distillation pipeline and an instruction dataset generated with EvolKit, ensuring accuracy and efficiency across a wide range of tasks. For more information on its training, visit blog.arcee.ai. Llama-3.1-SuperNova-Lite excels in both benchmark performance and real-world applications, providing the power of large-scale models in a more compact, efficient form ideal for organizations seeking high performance with reduced resource requirements.",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: other\nlanguage:\n- en\npipeline_tag: text-generation\ninference: false\ntags:\n- transformers\n- gguf\n- imatrix\n- Llama-3.1-SuperNova-Lite\n---\nQuantizations of https://huggingface.co/arcee-ai/Llama-3.1-SuperNova-Lite\n\n\n### Inference Clients/UIs\n* [llama.cpp](https://github.com/ggerganov/llama.cpp)\n* [KoboldCPP](https://github.com/LostRuins/koboldcpp)\n* [text-generation-webui](https://github.com/oobabooga/text-generation-webui)\n* [ollama](https://github.com/ollama/ollama)\n\n\n---\n\n# From original readme\n\nLlama-3.1-SuperNova-Lite is an 8B parameter model developed by Arcee.ai, based on the Llama-3.1-8B-Instruct architecture. It is a distilled version of the larger Llama-3.1-405B-Instruct model, leveraging offline logits extracted from the 405B parameter variant. This 8B variation of Llama-3.1-SuperNova maintains high performance while offering exceptional instruction-following capabilities and domain-specific adaptability. \n\nThe model was trained using a state-of-the-art distillation pipeline and an instruction dataset generated with [EvolKit](https://github.com/arcee-ai/EvolKit), ensuring accuracy and efficiency across a wide range of tasks. For more information on its training, visit blog.arcee.ai. \n\nLlama-3.1-SuperNova-Lite excels in both benchmark performance and real-world applications, providing the power of large-scale models in a more compact, efficient form ideal for organizations seeking high performance with reduced resource requirements.",
    "related_quantizations": []
  },
  "tags": [
    "transformers",
    "gguf",
    "imatrix",
    "Llama-3.1-SuperNova-Lite",
    "text-generation",
    "en",
    "license:other",
    "region:us",
    "conversational"
  ],
  "likes": 0,
  "downloads": 97,
  "gated": false,
  "private": false,
  "last_modified": "2024-10-03T17:30:08.000Z",
  "created_at": "2024-10-03T15:11:30.000Z",
  "pipeline_tag": "text-generation",
  "library_name": "transformers"
}
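In the metadata above, the `frontmatter` object holds `"inference": "false"` as a string, while `card_data` holds it as a boolean. The sketch below shows one plausible way such values could be derived from the header of `readme_markdown`. It is a minimal hand-rolled parser for flat keys and lists only, not GraySoft's actual normalizer; `parse_frontmatter` and `normalize_bool` are hypothetical names.

```python
def parse_frontmatter(readme):
    """Parse a small subset of YAML frontmatter: scalar values and flat lists."""
    if not readme.startswith("---\n"):
        return {}
    end = readme.find("\n---", 4)  # closing delimiter of the header block
    if end == -1:
        return {}
    data, current_key = {}, None
    for line in readme[4:end].splitlines():
        if line.startswith("- ") and isinstance(data.get(current_key), list):
            data[current_key].append(line[2:].strip())
        elif ":" in line:
            key, _, value = line.partition(":")
            current_key = key.strip()
            value = value.strip()
            # A key with no inline value is assumed to start a list.
            data[current_key] = value if value else []
    return data

def normalize_bool(value):
    """Turn the strings "true"/"false" into booleans; leave other values alone."""
    if isinstance(value, str) and value.lower() in ("true", "false"):
        return value.lower() == "true"
    return value

# Shortened copy of the header found in readme_markdown above.
sample = (
    "---\nlicense: other\nlanguage:\n- en\npipeline_tag: text-generation\n"
    "inference: false\ntags:\n- transformers\n- gguf\n---\nQuantizations of ..."
)
frontmatter = parse_frontmatter(sample)
print(frontmatter)
print(normalize_bool(frontmatter["inference"]))
```

A full YAML library would be the safer choice for real model cards, which can contain nested structures this sketch does not handle.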
Source payload excerpt (from Hugging Face API)
{
  "_id": "66feb422c1a87ccc721c0ce6",
  "id": "duyntnet/Llama-3.1-SuperNova-Lite-imatrix-GGUF",
  "modelId": "duyntnet/Llama-3.1-SuperNova-Lite-imatrix-GGUF",
  "sha": "bd2e48eef64e149c1696d47a85cdc0e888ebb080",
  "createdAt": "2024-10-03T15:11:30.000Z",
  "lastModified": "2024-10-03T17:30:08.000Z",
  "author": "duyntnet",
  "downloads": 97,
  "likes": 0,
  "gated": false,
  "private": false,
  "pipeline_tag": "text-generation",
  "library_name": "transformers",
  "siblings_count": 29
}
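The `id` and `sha` fields in the payload above are enough to build a direct file URL. The sketch below uses the usual Hugging Face `resolve` path layout; whether the Download links on this page point to the same URLs is an assumption, and `gguf_download_url` is a hypothetical helper name.

```python
from urllib.parse import quote

def gguf_download_url(repo_id, filename, revision="main"):
    """Build a Hugging Face file URL for a repo, revision (branch or commit), and file."""
    return (
        f"https://huggingface.co/{repo_id}/resolve/"
        f"{quote(revision, safe='')}/{quote(filename)}"
    )

repo_id = "duyntnet/Llama-3.1-SuperNova-Lite-imatrix-GGUF"  # "id" from the payload
filename = "Llama-3.1-SuperNova-Lite-Q3_K_L.gguf"           # a file name from the table

print(gguf_download_url(repo_id, filename))
# Pinning to the commit in "sha" keeps the URL stable if the repo changes later.
print(gguf_download_url(repo_id, filename, revision="bd2e48eef64e149c1696d47a85cdc0e888ebb080"))
```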