richarderkhov/huggyllama_-_llama-30b-gguf overview
Quantization made by Richard Erkhov. Github Discord Request more models llama-30b - GGUF | Name | Quant method | Size | | ---- | ---- | ---- | | llama-30b.Q2K.gguf | Q2K | 11.22GB | | llama-30b.IQ3XS.gguf | IQ3XS | 12.4GB | | llama-30b.IQ3S.gguf | IQ3S | 13.1GB | | llama-30b.Q3KS.gguf | Q3KS | 13.1GB | | llama-30b.IQ3M.gguf | IQ3M | 13.86GB | | llama-30b.Q3K.gguf | Q3K | 14.69GB | | llama-30b.Q3KM.gguf | Q3KM | 14.69GB | | llama-30b.Q3KL.gguf | Q3KL | 16.09GB | | llama-30b.IQ4XS.gguf | IQ4XS | 16.28GB | | llama-30b.Q40.gguf | Q40 | 17.1GB | | llama-30b.IQ4NL.gguf | IQ4NL | 17.19GB | | llama-30b.Q4KS.gguf | Q4KS | 17.21GB | | llama-30b.Q4K.gguf | Q4K | 18.27GB | | llama-30b.Q4KM.gguf | Q4KM | 18.27GB | | llama-30b.Q41.gguf | Q41 | 18.98GB | | llama-30b.Q50.gguf | Q50 | 20.86GB | | llama-30b.Q5KS.gguf | Q5KS | 20.86GB | | llama-30b.Q5K.gguf | Q5K | 21.46GB | | llama-30b.Q5KM.gguf | Q5KM | 21.46GB | | llama-30b.Q51.gguf | Q51 | 22.74GB | | llama-30b.Q6K.gguf | Q6K | 24.85GB | | llama-30b.Q80.gguf | Q80 | 32.19GB | Original model description: --- license: other --- This contains the weights for the LLaMA-30b model. This model is under a non-commercial license (see the LICENSE file). You should only use this repository if you have been granted access to the model by filling out this form but either lost your copy of the weights or got some trouble converting them to the Transformers format.
Repository Files & Downloads
| File | Type | Quantization | Size | Link |
|---|---|---|---|---|
| llama-30b.IQ3_M.gguf | GGUF | IQ3_M | 13.86 GB | Download |
| llama-30b.IQ3_S.gguf | GGUF | IQ3_S | 13.10 GB | Download |
| llama-30b.IQ3_XS.gguf | GGUF | IQ3_XS | 12.40 GB | Download |
| llama-30b.IQ4_NL.gguf | GGUF | IQ4_NL | 17.19 GB | Download |
| llama-30b.IQ4_XS.gguf | GGUF | IQ4_XS | 16.28 GB | Download |
| llama-30b.Q2_K.gguf | GGUF | Q2_K | 11.22 GB | Download |
| llama-30b.Q3_K.gguf | GGUF | Q3_K | 14.69 GB | Download |
| llama-30b.Q3_K_L.gguf | GGUF | Q3_K_L | 16.09 GB | Download |
| llama-30b.Q3_K_M.gguf | GGUF | Q3_K_M | 14.69 GB | Download |
| llama-30b.Q3_K_S.gguf | GGUF | Q3_K_S | 13.10 GB | Download |
| llama-30b.Q4_0.gguf | GGUF | — | 17.10 GB | Download |
| llama-30b.Q4_1.gguf | GGUF | — | 18.98 GB | Download |
| llama-30b.Q4_K.gguf | GGUF | Q4_K | 18.27 GB | Download |
| llama-30b.Q4_K_M.gguf | GGUF | Q4_K_M | 18.27 GB | Download |
| llama-30b.Q4_K_S.gguf | GGUF | Q4_K_S | 17.21 GB | Download |
| llama-30b.Q5_0.gguf | GGUF | — | 20.86 GB | Download |
| llama-30b.Q5_1.gguf | GGUF | — | 22.74 GB | Download |
| llama-30b.Q5_K.gguf | GGUF | Q5_K | 21.46 GB | Download |
| llama-30b.Q5_K_M.gguf | GGUF | Q5_K_M | 21.46 GB | Download |
| llama-30b.Q5_K_S.gguf | GGUF | Q5_K_S | 20.86 GB | Download |
| llama-30b.Q6_K.gguf | GGUF | Q6_K | 24.85 GB | Download |
| llama-30b.Q8_0.gguf | GGUF | — | 32.19 GB | Download |
Model Details Live
Metadata Inspector
Normalized metadata (stored in metadata_json)
{
"metadata": {},
"card_data": {
"frontmatter": {},
"hero_image_url": "",
"summary": "Quantization made by Richard Erkhov. Github Discord Request more models llama-30b - GGUF | Name | Quant method | Size | | ---- | ---- | ---- | | llama-30b.Q2_K.gguf | Q2_K | 11.22GB | | llama-30b.IQ3_XS.gguf | IQ3_XS | 12.4GB | | llama-30b.IQ3_S.gguf | IQ3_S | 13.1GB | | llama-30b.Q3_K_S.gguf | Q3_K_S | 13.1GB | | llama-30b.IQ3_M.gguf | IQ3_M | 13.86GB | | llama-30b.Q3_K.gguf | Q3_K | 14.69GB | | llama-30b.Q3_K_M.gguf | Q3_K_M | 14.69GB | | llama-30b.Q3_K_L.gguf | Q3_K_L | 16.09GB | | llama-30b.IQ4_XS.gguf | IQ4_XS | 16.28GB | | llama-30b.Q4_0.gguf | Q4_0 | 17.1GB | | llama-30b.IQ4_NL.gguf | IQ4_NL | 17.19GB | | llama-30b.Q4_K_S.gguf | Q4_K_S | 17.21GB | | llama-30b.Q4_K.gguf | Q4_K | 18.27GB | | llama-30b.Q4_K_M.gguf | Q4_K_M | 18.27GB | | llama-30b.Q4_1.gguf | Q4_1 | 18.98GB | | llama-30b.Q5_0.gguf | Q5_0 | 20.86GB | | llama-30b.Q5_K_S.gguf | Q5_K_S | 20.86GB | | llama-30b.Q5_K.gguf | Q5_K | 21.46GB | | llama-30b.Q5_K_M.gguf | Q5_K_M | 21.46GB | | llama-30b.Q5_1.gguf | Q5_1 | 22.74GB | | llama-30b.Q6_K.gguf | Q6_K | 24.85GB | | llama-30b.Q8_0.gguf | Q8_0 | 32.19GB | Original model description: --- license: other --- This contains the weights for the LLaMA-30b model. This model is under a non-commercial license (see the LICENSE file). You should only use this repository if you have been granted access to the model by filling out this form but either lost your copy of the weights or got some trouble converting them to the Transformers format.",
"quick_links": [],
"benchmark_table_html": "",
"readme_markdown": "Quantization made by Richard Erkhov.\n\n[Github](https://github.com/RichardErkhov)\n\n[Discord](https://discord.gg/pvy7H8DZMG)\n\n[Request more models](https://github.com/RichardErkhov/quant_request)\n\n\nllama-30b - GGUF\n- Model creator: https://huggingface.co/huggyllama/\n- Original model: https://huggingface.co/huggyllama/llama-30b/\n\n\n| Name | Quant method | Size |\n| ---- | ---- | ---- |\n| [llama-30b.Q2_K.gguf](https://huggingface.co/RichardErkhov/huggyllama_-_llama-30b-gguf/blob/main/llama-30b.Q2_K.gguf) | Q2_K | 11.22GB |\n| [llama-30b.IQ3_XS.gguf](https://huggingface.co/RichardErkhov/huggyllama_-_llama-30b-gguf/blob/main/llama-30b.IQ3_XS.gguf) | IQ3_XS | 12.4GB |\n| [llama-30b.IQ3_S.gguf](https://huggingface.co/RichardErkhov/huggyllama_-_llama-30b-gguf/blob/main/llama-30b.IQ3_S.gguf) | IQ3_S | 13.1GB |\n| [llama-30b.Q3_K_S.gguf](https://huggingface.co/RichardErkhov/huggyllama_-_llama-30b-gguf/blob/main/llama-30b.Q3_K_S.gguf) | Q3_K_S | 13.1GB |\n| [llama-30b.IQ3_M.gguf](https://huggingface.co/RichardErkhov/huggyllama_-_llama-30b-gguf/blob/main/llama-30b.IQ3_M.gguf) | IQ3_M | 13.86GB |\n| [llama-30b.Q3_K.gguf](https://huggingface.co/RichardErkhov/huggyllama_-_llama-30b-gguf/blob/main/llama-30b.Q3_K.gguf) | Q3_K | 14.69GB |\n| [llama-30b.Q3_K_M.gguf](https://huggingface.co/RichardErkhov/huggyllama_-_llama-30b-gguf/blob/main/llama-30b.Q3_K_M.gguf) | Q3_K_M | 14.69GB |\n| [llama-30b.Q3_K_L.gguf](https://huggingface.co/RichardErkhov/huggyllama_-_llama-30b-gguf/blob/main/llama-30b.Q3_K_L.gguf) | Q3_K_L | 16.09GB |\n| [llama-30b.IQ4_XS.gguf](https://huggingface.co/RichardErkhov/huggyllama_-_llama-30b-gguf/blob/main/llama-30b.IQ4_XS.gguf) | IQ4_XS | 16.28GB |\n| [llama-30b.Q4_0.gguf](https://huggingface.co/RichardErkhov/huggyllama_-_llama-30b-gguf/blob/main/llama-30b.Q4_0.gguf) | Q4_0 | 17.1GB |\n| [llama-30b.IQ4_NL.gguf](https://huggingface.co/RichardErkhov/huggyllama_-_llama-30b-gguf/blob/main/llama-30b.IQ4_NL.gguf) | IQ4_NL | 17.19GB |\n| [llama-30b.Q4_K_S.gguf](https://huggingface.co/RichardErkhov/huggyllama_-_llama-30b-gguf/blob/main/llama-30b.Q4_K_S.gguf) | Q4_K_S | 17.21GB |\n| [llama-30b.Q4_K.gguf](https://huggingface.co/RichardErkhov/huggyllama_-_llama-30b-gguf/blob/main/llama-30b.Q4_K.gguf) | Q4_K | 18.27GB |\n| [llama-30b.Q4_K_M.gguf](https://huggingface.co/RichardErkhov/huggyllama_-_llama-30b-gguf/blob/main/llama-30b.Q4_K_M.gguf) | Q4_K_M | 18.27GB |\n| [llama-30b.Q4_1.gguf](https://huggingface.co/RichardErkhov/huggyllama_-_llama-30b-gguf/blob/main/llama-30b.Q4_1.gguf) | Q4_1 | 18.98GB |\n| [llama-30b.Q5_0.gguf](https://huggingface.co/RichardErkhov/huggyllama_-_llama-30b-gguf/blob/main/llama-30b.Q5_0.gguf) | Q5_0 | 20.86GB |\n| [llama-30b.Q5_K_S.gguf](https://huggingface.co/RichardErkhov/huggyllama_-_llama-30b-gguf/blob/main/llama-30b.Q5_K_S.gguf) | Q5_K_S | 20.86GB |\n| [llama-30b.Q5_K.gguf](https://huggingface.co/RichardErkhov/huggyllama_-_llama-30b-gguf/blob/main/llama-30b.Q5_K.gguf) | Q5_K | 21.46GB |\n| [llama-30b.Q5_K_M.gguf](https://huggingface.co/RichardErkhov/huggyllama_-_llama-30b-gguf/blob/main/llama-30b.Q5_K_M.gguf) | Q5_K_M | 21.46GB |\n| [llama-30b.Q5_1.gguf](https://huggingface.co/RichardErkhov/huggyllama_-_llama-30b-gguf/blob/main/llama-30b.Q5_1.gguf) | Q5_1 | 22.74GB |\n| [llama-30b.Q6_K.gguf](https://huggingface.co/RichardErkhov/huggyllama_-_llama-30b-gguf/blob/main/llama-30b.Q6_K.gguf) | Q6_K | 24.85GB |\n| [llama-30b.Q8_0.gguf](https://huggingface.co/RichardErkhov/huggyllama_-_llama-30b-gguf/blob/main/llama-30b.Q8_0.gguf) | Q8_0 | 32.19GB |\n\n\n\n\nOriginal model description:\n---\nlicense: other\n---\n\nThis contains the weights for the LLaMA-30b model. This model is under a non-commercial license (see the LICENSE file).\nYou should only use this repository if you have been granted access to the model by filling out [this form](https://docs.google.com/forms/d/e/1FAIpQLSfqNECQnMkycAp2jP4Z9TFX0cGR4uf7b_fBxjY_OjhJILlKGA/viewform?usp=send_form) but either lost your copy of the weights or got some trouble converting them to the Transformers format.\n\n\n\n",
"related_quantizations": []
},
"tags": [
"gguf",
"endpoints_compatible",
"region:us"
],
"likes": 0,
"downloads": 483,
"gated": false,
"private": false,
"last_modified": "2024-07-26T15:03:52.000Z",
"created_at": "2024-07-25T18:01:26.000Z",
"pipeline_tag": "",
"library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
"_id": "66a292f63ea52a727b0fa2f1",
"id": "RichardErkhov/huggyllama_-_llama-30b-gguf",
"modelId": "RichardErkhov/huggyllama_-_llama-30b-gguf",
"sha": "33154cbc215a34898b2e473911b02b525e8a267b",
"createdAt": "2024-07-25T18:01:26.000Z",
"lastModified": "2024-07-26T15:03:52.000Z",
"author": "RichardErkhov",
"downloads": 483,
"likes": 0,
"gated": false,
"private": false,
"pipeline_tag": "",
"library_name": "",
"siblings_count": 24
}