maziyarpanahi/llama-3-8b-instruct-dpo-v0.3-32k-gguf v0.3.fp16 GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.
Model Intelligence Sheet
maziyarpanahi/llama-3-8b-instruct-dpo-v0.3-32k-gguf overview
MaziyarPanahi/Llama-3-8B-Instruct-DPO-v0.3-GGUF
Downloads
139
Likes
9
Pipeline
text-generation
Library
transformers
Visibility
Public
Access
Open
Repository Files & Downloads
11 files detected
Direct downloads for all repository files
| File | Type | Quantization | Size | Link |
|---|---|---|---|---|
| Llama-3-8B-Instruct-DPO-v0.3.Q2_K.gguf | GGUF | Q2_K | 2.96 GB | Download |
| Llama-3-8B-Instruct-DPO-v0.3.Q3_K_L.gguf | GGUF | Q3_K_L | 4.03 GB | Download |
| Llama-3-8B-Instruct-DPO-v0.3.Q3_K_M.gguf | GGUF | Q3_K_M | 3.74 GB | Download |
| Llama-3-8B-Instruct-DPO-v0.3.Q3_K_S.gguf | GGUF | Q3_K_S | 3.41 GB | Download |
| Llama-3-8B-Instruct-DPO-v0.3.Q4_K_M.gguf | GGUF | Q4_K_M | 4.58 GB | Download |
| Llama-3-8B-Instruct-DPO-v0.3.Q4_K_S.gguf | GGUF | Q4_K_S | 4.37 GB | Download |
| Llama-3-8B-Instruct-DPO-v0.3.Q5_K_M.gguf | GGUF | Q5_K_M | 5.34 GB | Download |
| Llama-3-8B-Instruct-DPO-v0.3.Q5_K_S.gguf | GGUF | Q5_K_S | 5.21 GB | Download |
| Llama-3-8B-Instruct-DPO-v0.3.Q6_K.gguf | GGUF | Q6_K | 6.14 GB | Download |
| Llama-3-8B-Instruct-DPO-v0.3.Q8_0.gguf | GGUF | โ | 7.95 GB | Download |
| Llama-3-8B-Instruct-DPO-v0.3.fp16.gguf | GGUF | โ | 14.97 GB | Download |
Model Details Live
Metadata Inspector
Normalized metadata (stored in metadata_json)
{
"metadata": {},
"card_data": {
"tags": [
"quantized",
"2-bit",
"3-bit",
"4-bit",
"5-bit",
"6-bit",
"8-bit",
"GGUF",
"text-generation",
"llama",
"llama-3",
"text-generation"
],
"model_name": "Llama-3-8B-Instruct-DPO-v0.3-GGUF",
"base_model": "MaziyarPanahi/Llama-3-8B-Instruct-DPO-v0.3",
"inference": false,
"model_creator": "MaziyarPanahi",
"pipeline_tag": "text-generation",
"quantized_by": "MaziyarPanahi",
"frontmatter": {
"tags": [
"quantized",
"2-bit",
"3-bit",
"4-bit",
"5-bit",
"6-bit",
"8-bit",
"GGUF",
"text-generation",
"llama",
"llama-3",
"text-generation"
],
"model_name": "Llama-3-8B-Instruct-DPO-v0.3-GGUF",
"base_model": "MaziyarPanahi/Llama-3-8B-Instruct-DPO-v0.3",
"inference": "false",
"model_creator": "MaziyarPanahi",
"pipeline_tag": "text-generation",
"quantized_by": "MaziyarPanahi"
},
"hero_image_url": "",
"summary": "# MaziyarPanahi/Llama-3-8B-Instruct-DPO-v0.3-GGUF",
"quick_links": [],
"benchmark_table_html": "",
"readme_markdown": "---\ntags:\n- quantized\n- 2-bit\n- 3-bit\n- 4-bit\n- 5-bit\n- 6-bit\n- 8-bit\n- GGUF\n- text-generation\n- llama\n- llama-3\n- text-generation\nmodel_name: Llama-3-8B-Instruct-DPO-v0.3-GGUF\nbase_model: MaziyarPanahi/Llama-3-8B-Instruct-DPO-v0.3\ninference: false\nmodel_creator: MaziyarPanahi\npipeline_tag: text-generation\nquantized_by: MaziyarPanahi\n---\n# [MaziyarPanahi/Llama-3-8B-Instruct-DPO-v0.3-GGUF](https://huggingface.co/MaziyarPanahi/Llama-3-8B-Instruct-DPO-v0.3-GGUF)\n- Model creator: [MaziyarPanahi](https://huggingface.co/MaziyarPanahi)\n- Original model: [MaziyarPanahi/Llama-3-8B-Instruct-DPO-v0.3](https://huggingface.co/MaziyarPanahi/Llama-3-8B-Instruct-DPO-v0.3)\n\n## Description\n[MaziyarPanahi/Llama-3-8B-Instruct-DPO-v0.3-GGUF](https://huggingface.co/MaziyarPanahi/Llama-3-8B-Instruct-DPO-v0.3-GGUF) contains GGUF format model files for [MaziyarPanahi/Llama-3-8B-Instruct-DPO-v0.3](https://huggingface.co/MaziyarPanahi/Llama-3-8B-Instruct-DPO-v0.3).\n\n## Prompt Template\n\nThis model uses `ChatML` prompt template:\n\n```\n<|im_start|>system\n{System}\n<|im_end|>\n<|im_start|>user\n{User}\n<|im_end|>\n<|im_start|>assistant\n{Assistant}\n````\n\n### About GGUF\n\nGGUF is a new format introduced by the llama.cpp team on August 21st 2023. It is a replacement for GGML, which is no longer supported by llama.cpp.\n\nHere is an incomplete list of clients and libraries that are known to support GGUF:\n\n* [llama.cpp](https://github.com/ggerganov/llama.cpp). The source project for GGUF. Offers a CLI and a server option.\n* [llama-cpp-python](https://github.com/abetlen/llama-cpp-python), a Python library with GPU accel, LangChain support, and OpenAI-compatible API server.\n* [LM Studio](https://lmstudio.ai/), an easy-to-use and powerful local GUI for Windows and macOS (Silicon), with GPU acceleration. Linux available, in beta as of 27/11/2023.\n* [text-generation-webui](https://github.com/oobabooga/text-generation-webui), the most widely used web UI, with many features and powerful extensions. Supports GPU acceleration.\n* [KoboldCpp](https://github.com/LostRuins/koboldcpp), a fully featured web UI, with GPU accel across all platforms and GPU architectures. Especially good for story telling.\n* [GPT4All](https://gpt4all.io/index.html), a free and open source local running GUI, supporting Windows, Linux and macOS with full GPU accel.\n* [LoLLMS Web UI](https://github.com/ParisNeo/lollms-webui), a great web UI with many interesting and unique features, including a full model library for easy model selection.\n* [Faraday.dev](https://faraday.dev/), an attractive and easy to use character-based chat GUI for Windows and macOS (both Silicon and Intel), with GPU acceleration.\n* [candle](https://github.com/huggingface/candle), a Rust ML framework with a focus on performance, including GPU support, and ease of use.\n* [ctransformers](https://github.com/marella/ctransformers), a Python library with GPU accel, LangChain support, and OpenAI-compatible AI server. Note, as of time of writing (November 27th 2023), ctransformers has not been updated in a long time and does not support many recent models.\n\n## Special thanks\n\n๐ Special thanks to [Georgi Gerganov](https://github.com/ggerganov) and the whole team working on [llama.cpp](https://github.com/ggerganov/llama.cpp/) for making all of this possible.",
"related_quantizations": []
},
"tags": [
"transformers",
"gguf",
"mistral",
"quantized",
"2-bit",
"3-bit",
"4-bit",
"5-bit",
"6-bit",
"8-bit",
"GGUF",
"text-generation",
"llama",
"llama-3",
"base_model:MaziyarPanahi/Llama-3-8B-Instruct-DPO-v0.3",
"base_model:quantized:MaziyarPanahi/Llama-3-8B-Instruct-DPO-v0.3",
"region:us",
"conversational"
],
"likes": 9,
"downloads": 139,
"gated": false,
"private": false,
"last_modified": "2024-04-25T17:38:23.000Z",
"created_at": "2024-04-25T16:20:10.000Z",
"pipeline_tag": "text-generation",
"library_name": "transformers"
}
Source payload excerpt (from Hugging Face API)
{
"_id": "662a82ba61d1eb767b8548e6",
"id": "MaziyarPanahi/Llama-3-8B-Instruct-DPO-v0.3-32k-GGUF",
"modelId": "MaziyarPanahi/Llama-3-8B-Instruct-DPO-v0.3-32k-GGUF",
"sha": "3e6f26a29c0bf1a7d30a1ae508dda147ff56e924",
"createdAt": "2024-04-25T16:20:10.000Z",
"lastModified": "2024-04-25T17:38:23.000Z",
"author": "MaziyarPanahi",
"downloads": 139,
"likes": 9,
"gated": false,
"private": false,
"pipeline_tag": "text-generation",
"library_name": "transformers",
"siblings_count": 14
}