MaziyarPanahi/zephyr-orpo-141b-A35b-v0.1-GGUF, including its Q5_K_S GGUF quantization, is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.
Model Intelligence Sheet
MaziyarPanahi/zephyr-orpo-141b-A35b-v0.1-GGUF overview
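On April 11th, @HuggingFaceH4 released a fine-tuned model, HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1, based on the Mixtral-8x22B-v0.1 model. It is a Mixture of Experts (MoE) model with 141B total parameters and 35B active parameters. This repository provides GGUF quantizations of that model.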
Downloads: 557
Likes: 29
Pipeline: text-generation
Library: GGUF
Visibility: Public
Access: Open
Repository Files & Downloads
69 files detected
Direct downloads for all repository files
| File | Type | Quantization | Size | Link |
|---|---|---|---|---|
| zephyr-orpo-141b-A35b-v0.1.IQ1_M.gguf | GGUF | IQ1_M | 30.48 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.IQ1_S.gguf | GGUF | IQ1_S | 27.61 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.IQ3_XS-00001-of-00005.gguf | GGUF | IQ3_XS | 12.59 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.IQ3_XS-00002-of-00005.gguf | GGUF | IQ3_XS | 12.56 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.IQ3_XS-00003-of-00005.gguf | GGUF | IQ3_XS | 11.43 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.IQ3_XS-00004-of-00005.gguf | GGUF | IQ3_XS | 12.42 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.IQ3_XS-00005-of-00005.gguf | GGUF | IQ3_XS | 5.23 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.IQ4_XS-00001-of-00005.gguf | GGUF | IQ4_XS | 16.78 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.IQ4_XS-00002-of-00005.gguf | GGUF | IQ4_XS | 16.62 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.IQ4_XS-00003-of-00005.gguf | GGUF | IQ4_XS | 15.08 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.IQ4_XS-00004-of-00005.gguf | GGUF | IQ4_XS | 16.24 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.IQ4_XS-00005-of-00005.gguf | GGUF | IQ4_XS | 6.40 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q2_K-00001-of-00005.gguf | GGUF | Q2_K | 11.03 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q2_K-00002-of-00005.gguf | GGUF | Q2_K | 11.43 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q2_K-00003-of-00005.gguf | GGUF | Q2_K | 10.42 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q2_K-00004-of-00005.gguf | GGUF | Q2_K | 11.19 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q2_K-00005-of-00005.gguf | GGUF | Q2_K | 4.46 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q3_K_L-00001-of-00005.gguf | GGUF | Q3_K_L | 15.40 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q3_K_L-00002-of-00005.gguf | GGUF | Q3_K_L | 15.93 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q3_K_L-00003-of-00005.gguf | GGUF | Q3_K_L | 14.48 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q3_K_L-00004-of-00005.gguf | GGUF | Q3_K_L | 15.62 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q3_K_L-00005-of-00005.gguf | GGUF | Q3_K_L | 6.16 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q3_K_M-00001-of-00005.gguf | GGUF | Q3_K_M | 14.58 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q3_K_M-00002-of-00005.gguf | GGUF | Q3_K_M | 14.82 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q3_K_M-00003-of-00005.gguf | GGUF | Q3_K_M | 13.49 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q3_K_M-00004-of-00005.gguf | GGUF | Q3_K_M | 14.51 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q3_K_M-00005-of-00005.gguf | GGUF | Q3_K_M | 5.74 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q3_K_S-00001-of-00005.gguf | GGUF | Q3_K_S | 13.00 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q3_K_S-00002-of-00005.gguf | GGUF | Q3_K_S | 13.53 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q3_K_S-00003-of-00005.gguf | GGUF | Q3_K_S | 12.29 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q3_K_S-00004-of-00005.gguf | GGUF | Q3_K_S | 13.22 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q3_K_S-00005-of-00005.gguf | GGUF | Q3_K_S | 5.24 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q4_K_M-00001-of-00005.gguf | GGUF | Q4_K_M | 18.61 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q4_K_M-00002-of-00005.gguf | GGUF | Q4_K_M | 18.35 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q4_K_M-00003-of-00005.gguf | GGUF | Q4_K_M | 16.71 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q4_K_M-00004-of-00005.gguf | GGUF | Q4_K_M | 18.32 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q4_K_M-00005-of-00005.gguf | GGUF | Q4_K_M | 7.72 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q4_K_S-00001-of-00005.gguf | GGUF | Q4_K_S | 17.53 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q4_K_S-00002-of-00005.gguf | GGUF | Q4_K_S | 17.57 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q4_K_S-00003-of-00005.gguf | GGUF | Q4_K_S | 15.93 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q4_K_S-00004-of-00005.gguf | GGUF | Q4_K_S | 17.16 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q4_K_S-00005-of-00005.gguf | GGUF | Q4_K_S | 6.75 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q5_K_M-00001-of-00005.gguf | GGUF | Q5_K_M | 21.41 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q5_K_M-00002-of-00005.gguf | GGUF | Q5_K_M | 21.78 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q5_K_M-00003-of-00005.gguf | GGUF | Q5_K_M | 19.76 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q5_K_M-00004-of-00005.gguf | GGUF | Q5_K_M | 21.47 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q5_K_M-00005-of-00005.gguf | GGUF | Q5_K_M | 8.68 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q5_K_S-00001-of-00005.gguf | GGUF | Q5_K_S | Unknown | Download |
| zephyr-orpo-141b-A35b-v0.1.Q5_K_S-00002-of-00005.gguf | GGUF | Q5_K_S | Unknown | Download |
| zephyr-orpo-141b-A35b-v0.1.Q5_K_S-00003-of-00005.gguf | GGUF | Q5_K_S | Unknown | Download |
| zephyr-orpo-141b-A35b-v0.1.Q5_K_S-00004-of-00005.gguf | GGUF | Q5_K_S | Unknown | Download |
| zephyr-orpo-141b-A35b-v0.1.Q5_K_S-00005-of-00005.gguf | GGUF | Q5_K_S | Unknown | Download |
| zephyr-orpo-141b-A35b-v0.1.Q6_K-00001-of-00005.gguf | GGUF | Q6_K | Unknown | Download |
| zephyr-orpo-141b-A35b-v0.1.Q6_K-00002-of-00005.gguf | GGUF | Q6_K | Unknown | Download |
| zephyr-orpo-141b-A35b-v0.1.Q6_K-00003-of-00005.gguf | GGUF | Q6_K | Unknown | Download |
| zephyr-orpo-141b-A35b-v0.1.Q6_K-00004-of-00005.gguf | GGUF | Q6_K | Unknown | Download |
| zephyr-orpo-141b-A35b-v0.1.Q6_K-00005-of-00005.gguf | GGUF | Q6_K | Unknown | Download |
| zephyr-orpo-141b-A35b-v0.1.Q8_0-00001-of-00005.gguf | GGUF | Q8_0 | Unknown | Download |
| zephyr-orpo-141b-A35b-v0.1.Q8_0-00002-of-00005.gguf | GGUF | Q8_0 | Unknown | Download |
| zephyr-orpo-141b-A35b-v0.1.Q8_0-00003-of-00005.gguf | GGUF | Q8_0 | Unknown | Download |
| zephyr-orpo-141b-A35b-v0.1.Q8_0-00004-of-00005.gguf | GGUF | Q8_0 | Unknown | Download |
| zephyr-orpo-141b-A35b-v0.1.Q8_0-00005-of-00005.gguf | GGUF | Q8_0 | Unknown | Download |
| zephyr-orpo-141b-A35b-v0.1.fp16-00001-of-00007.gguf | GGUF | FP16 | Unknown | Download |
| zephyr-orpo-141b-A35b-v0.1.fp16-00002-of-00007.gguf | GGUF | FP16 | Unknown | Download |
| zephyr-orpo-141b-A35b-v0.1.fp16-00003-of-00007.gguf | GGUF | FP16 | Unknown | Download |
| zephyr-orpo-141b-A35b-v0.1.fp16-00004-of-00007.gguf | GGUF | FP16 | Unknown | Download |
| zephyr-orpo-141b-A35b-v0.1.fp16-00005-of-00007.gguf | GGUF | FP16 | Unknown | Download |
| zephyr-orpo-141b-A35b-v0.1.fp16-00006-of-00007.gguf | GGUF | FP16 | Unknown | Download |
| zephyr-orpo-141b-A35b-v0.1.fp16-00007-of-00007.gguf | GGUF | FP16 | Unknown | Download |
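Most quantizations in the table are split into several shards, so the disk space needed for one quantization is the sum of its shard sizes. The following is a minimal sketch, not part of the indexed page, that parses rows in the same format as the table and totals the size per quantization. The names `total_size_by_quant` and `SAMPLE_ROWS` are hypothetical, and only a few rows are copied in as sample input. Quantizations with an `Unknown` size are reported separately instead of being summed.

```python
from collections import defaultdict

# Sample rows copied from the table above; paste more rows to cover other quantizations.
SAMPLE_ROWS = """\
| zephyr-orpo-141b-A35b-v0.1.Q2_K-00001-of-00005.gguf | GGUF | Q2_K | 11.03 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q2_K-00002-of-00005.gguf | GGUF | Q2_K | 11.43 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q2_K-00003-of-00005.gguf | GGUF | Q2_K | 10.42 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q2_K-00004-of-00005.gguf | GGUF | Q2_K | 11.19 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q2_K-00005-of-00005.gguf | GGUF | Q2_K | 4.46 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q4_K_M-00001-of-00005.gguf | GGUF | Q4_K_M | 18.61 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q4_K_M-00002-of-00005.gguf | GGUF | Q4_K_M | 18.35 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q4_K_M-00003-of-00005.gguf | GGUF | Q4_K_M | 16.71 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q4_K_M-00004-of-00005.gguf | GGUF | Q4_K_M | 18.32 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q4_K_M-00005-of-00005.gguf | GGUF | Q4_K_M | 7.72 GB | Download |
| zephyr-orpo-141b-A35b-v0.1.Q5_K_S-00001-of-00005.gguf | GGUF | Q5_K_S | Unknown | Download |
"""


def total_size_by_quant(table_text):
    """Return ({quantization: total GB}, {quantizations with an unknown size})."""
    totals = defaultdict(float)
    unknown = set()
    for line in table_text.splitlines():
        cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
        # Skip the header, the separator row, and anything that is not a file row.
        if len(cells) != 5 or not cells[0].endswith(".gguf"):
            continue
        quant, size = cells[2], cells[3]
        if size.endswith(" GB"):
            totals[quant] += float(size[: -len(" GB")])
        else:
            unknown.add(quant)
    # A quantization with any unknown shard size has no trustworthy total.
    known = {q: round(t, 2) for q, t in totals.items() if q not in unknown}
    return known, unknown


if __name__ == "__main__":
    totals, unknown = total_size_by_quant(SAMPLE_ROWS)
    for quant, gigabytes in sorted(totals.items()):
        print(f"{quant}: {gigabytes:.2f} GB")
    print("size unknown:", sorted(unknown))
```

For the sample rows, the five Q2_K shards add up to 48.53 GB and the five Q4_K_M shards to 79.71 GB. These are file sizes only; memory needed at run time is not stated on this page.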
Model Details
Metadata Inspector
Normalized metadata (stored in metadata_json)
{
"metadata": {},
"card_data": {
"license": "apache-2.0",
"base_model": "HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1",
"tags": [
"orpo",
"GGUF",
"quantized",
"2-bit",
"3-bit",
"4-bit",
"5-bit",
"6-bit",
"8-bit",
"16-bit",
"GGUF",
"mixtral",
"moe"
],
"language": [
"en"
],
"datasets": [
"argilla/distilabel-capybara-dpo-7k-binarized"
],
"inference": false,
"model_creator": "MaziyarPanahi",
"model_name": "zephyr-orpo-141b-A35b-v0.1-GGUF",
"pipeline_tag": "text-generation",
"quantized_by": "MaziyarPanahi",
"library_name": "GGUF",
"frontmatter": {
"license": "apache-2.0",
"base_model": "HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1",
"tags": [
"orpo",
"GGUF",
"quantized",
"2-bit",
"3-bit",
"4-bit",
"5-bit",
"6-bit",
"8-bit",
"16-bit",
"GGUF",
"mixtral",
"moe"
],
"language": [
"en"
],
"datasets": [
"argilla/distilabel-capybara-dpo-7k-binarized"
],
"inference": "false",
"model_creator": "MaziyarPanahi",
"model_name": "zephyr-orpo-141b-A35b-v0.1-GGUF",
"pipeline_tag": "text-generation",
"quantized_by": "MaziyarPanahi",
"library_name": "GGUF"
},
"hero_image_url": "https://huggingface.co/HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1/resolve/main/logo.png",
"summary": "On April 11th, @HuggingFaceH4 released a fine-tuned model called HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1 based on Mixtral-8x22B-v0.1 model.",
"quick_links": [],
"benchmark_table_html": "",
"readme_markdown": "---\nlicense: apache-2.0\nbase_model: HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1\ntags:\n- orpo\n- GGUF\n- quantized\n- 2-bit\n- 3-bit\n- 4-bit\n- 5-bit\n- 6-bit\n- 8-bit\n- 16-bit\n- GGUF\n- mixtral\n- moe\nlanguage:\n- en\ndatasets:\n- argilla/distilabel-capybara-dpo-7k-binarized\ninference: false\nmodel_creator: MaziyarPanahi\nmodel_name: zephyr-orpo-141b-A35b-v0.1-GGUF\npipeline_tag: text-generation\nquantized_by: MaziyarPanahi\nlibrary_name: GGUF\n---\n\n<img src=\"https://huggingface.co/HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1/resolve/main/logo.png\" alt=\"Zephyr 141B Logo\" width=\"400\" style=\"margin-left:'auto' margin-right:'auto' display:'block'\"/>\n\n\n# zephyr-orpo-141b-A35b-v0.1-GGUF\n\nOn April 11th, [@HuggingFaceH4](https://huggingface.co/HuggingFaceH4) released a fine-tuned model called [HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1](https://huggingface.co/HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1) based on `Mixtral-8x22B-v0.1` model.\n\n- **Model type:** A Mixture of Experts (MoE) model with 141B total parameters and 35B active parameters. Fine-tuned on a mix of publicly available, synthetic datasets.\n- **Language(s) (NLP):** Primarily English.\n- **License:** Apache 2.0\n- **Finetuned from model:** [mistral-community/Mixtral-8x22B-v0.1](https://huggingface.co/mistral-community/Mixtral-8x22B-v0.1)\n\n\n## Load sharded model\n\n`llama_load_model_from_file` will detect the number of files and will load additional tensors from the rest of files.\n\n```sh\nllama.cpp/main -m zephyr-orpo-141b-A35b-v0.1.Q2_K-00001-of-00005.gguf --file prompt.txt -n 1024 -e\n```\n\nThe output from `Q2_K` quantized model:\n\n```\n <|system|>\nYou are a helpful assistant.\n<|user|>\nHello, how are you?\n<|assistant|>\nI'm doing great. How can I help you today?\n<|user|>\nShow me how to build a website in 10 simple steps\n<|assistant|>\nOf course, I'd be happy to guide you through the process of building a website in 10 simple steps.\n\n1. 
Determine the purpose of your website: Decide what your website will be used for, whether it's for business, personal use, or a blog.\n\n2. Choose a domain name: Your domain name is the address that people will use to find your website. It should be easy to remember and relevant to your website's purpose.\n\n3. Select a web hosting provider: A web hosting provider will store your website files and make them accessible to users around the world. Research different providers and choose one that fits your needs.\n\n4. Choose a platform: There are many platforms available for building a website, including WordPress, Wix, and Squarespace. Consider your technical skills and the features you need when choosing a platform.\n\n5. Plan your website structure: Decide on the pages you want to include on your website and how they will be organized. This will help guide the design process.\n\n6. Design your website: Use a template or create your own design for your website. Make sure it's visually appealing and easy to navigate.\n\n7. Add content to your website: Write the text and create images or videos for your website. Make sure the content is relevant to your website's purpose and engaging for users.\n\n8. Optimize your website for search engines: Make sure your website is easy to find by using keywords in your content and metadata. This will help your website rank higher in search engine results.\n\n9. Test your website: Test your website to make sure everything is working properly and that there are no broken links or errors.\n\n10. Launch your website: Once everything is working properly, it's time to launch your website. Share the link with your audience and start promoting your website.\n\nI hope this guide helps you build a successful website. Let me know if you have any questions. 
[end of text]\n\nllama_print_timings: load time = 11670.53 ms\nllama_print_timings: sample time = 16.30 ms / 422 runs ( 0.04 ms per token, 25894.34 tokens per second)\nllama_print_timings: prompt eval time = 5084.73 ms / 78 tokens ( 65.19 ms per token, 15.34 tokens per second)\nllama_print_timings: eval time = 279055.53 ms / 421 runs ( 662.84 ms per token, 1.51 tokens per second)\nllama_print_timings: total time = 284314.00 ms / 499 tokens\nLog end\n```\n\nWhat's inside the `prompt.txt`:\n```\n<|system|>\nYou are a helpful assistant.</s>\n<|user|>\nHello, how are you?</s>\n<|assistant|>\nI'm doing great. How can I help you today?</s>\n<|user|>\nShow me how to build a website in 10 simple steps</s>\n<|assistant|>\n```",
"related_quantizations": []
},
"tags": [
"GGUF",
"gguf",
"orpo",
"quantized",
"2-bit",
"3-bit",
"4-bit",
"5-bit",
"6-bit",
"8-bit",
"16-bit",
"mixtral",
"moe",
"text-generation",
"en",
"dataset:argilla/distilabel-capybara-dpo-7k-binarized",
"base_model:HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1",
"base_model:quantized:HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1",
"license:apache-2.0",
"region:us",
"conversational"
],
"likes": 29,
"downloads": 557,
"gated": false,
"private": false,
"last_modified": "2024-04-12T04:01:30.000Z",
"created_at": "2024-04-11T16:38:25.000Z",
"pipeline_tag": "text-generation",
"library_name": "GGUF"
}
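The `readme_markdown` field above shows that `prompt.txt` uses `<|system|>`, `<|user|>`, and `<|assistant|>` tags, with `</s>` closing each finished turn and an open `<|assistant|>` tag at the end. A minimal sketch that builds a prompt in that layout might look like this. `build_prompt` is a hypothetical helper, and the trailing newline after the final tag is an assumption, since the README does not show how the file ends.

```python
def build_prompt(messages):
    """Format (role, content) pairs in the layout shown in the README's prompt.txt."""
    parts = []
    for role, content in messages:
        # Each completed turn is a role tag on its own line, then the text closed by </s>.
        parts.append(f"<|{role}|>\n{content}</s>\n")
    # Leave an open assistant tag so the model writes the next reply.
    # Assumption: the trailing newline matches how prompt.txt ends.
    parts.append("<|assistant|>\n")
    return "".join(parts)


if __name__ == "__main__":
    conversation = [
        ("system", "You are a helpful assistant."),
        ("user", "Hello, how are you?"),
        ("assistant", "I'm doing great. How can I help you today?"),
        ("user", "Show me how to build a website in 10 simple steps"),
    ]
    print(build_prompt(conversation), end="")
```

Writing the result to `prompt.txt` gives a file that can be passed with `--file prompt.txt`, as in the README's `llama.cpp/main` command.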
Source payload excerpt (from Hugging Face API)
{
"_id": "6618120105a66aa36cd2cf5a",
"id": "MaziyarPanahi/zephyr-orpo-141b-A35b-v0.1-GGUF",
"modelId": "MaziyarPanahi/zephyr-orpo-141b-A35b-v0.1-GGUF",
"sha": "8db6013940a7ed35c589cf7f2cb70893e5a15641",
"createdAt": "2024-04-11T16:38:25.000Z",
"lastModified": "2024-04-12T04:01:30.000Z",
"author": "MaziyarPanahi",
"downloads": 557,
"likes": 29,
"gated": false,
"private": false,
"pipeline_tag": "text-generation",
"library_name": "GGUF",
"siblings_count": 72
}
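The README notes that llama.cpp is pointed at the first shard and loads the remaining tensors from the other files, so every shard of the chosen quantization needs to be downloaded. The sketch below lists all shard names from one filename and builds a download URL for each. The helper names are hypothetical, and the URL shape is an assumption modeled on the `/resolve/main/` link that appears in `hero_image_url`; this page does not confirm it for the GGUF files.

```python
import re

REPO_ID = "MaziyarPanahi/zephyr-orpo-141b-A35b-v0.1-GGUF"  # "id" in the payload above

# Matches the "-00001-of-00005.gguf" suffix used by the sharded files in the table.
SHARD_SUFFIX = re.compile(r"-(\d{5})-of-(\d{5})\.gguf$")


def shard_filenames(any_shard):
    """Given one shard filename, list every shard of the same quantization."""
    match = SHARD_SUFFIX.search(any_shard)
    if match is None:
        return [any_shard]  # single-file quantization such as IQ1_S
    total = int(match.group(2))
    stem = any_shard[: match.start()]
    return [f"{stem}-{index:05d}-of-{total:05d}.gguf" for index in range(1, total + 1)]


def resolve_url(repo_id, filename, revision="main"):
    """Assumed download URL, modeled on the /resolve/main/ link in hero_image_url."""
    return f"https://huggingface.co/{repo_id}/resolve/{revision}/{filename}"


if __name__ == "__main__":
    first = "zephyr-orpo-141b-A35b-v0.1.Q2_K-00001-of-00005.gguf"
    for name in shard_filenames(first):
        print(resolve_url(REPO_ID, name))
```

Keeping the shards in one directory with their original names matters, because the loader is given only the first file.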