GraySoft
Projects Models About FAQ Contact Download guIDE →
Model Intelligence Sheet

richarderkhov/junchengxie_-_llama-2-13b-chat-hf-gpt-4-80k-gguf overview

Quantization made by Richard Erkhov. Github Discord Request more models Llama-2-13b-chat-hf-gpt-4-80k - GGUF | Name | Quant method | Size | | ---- | ---- | ---- | | Llama-2-13b-chat-hf-gpt-4-80k.Q2K.gguf | Q2K | 4.52GB | | Llama-2-13b-chat-hf-gpt-4-80k.IQ3XS.gguf | IQ3XS | 4.99GB | | Llama-2-13b-chat-hf-gpt-4-80k.IQ3S.gguf | IQ3S | 5.27GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q3KS.gguf | Q3KS | 5.27GB | | Llama-2-13b-chat-hf-gpt-4-80k.IQ3M.gguf | IQ3M | 5.57GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q3K.gguf | Q3K | 5.9GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q3KM.gguf | Q3KM | 5.9GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q3KL.gguf | Q3KL | 6.45GB | | Llama-2-13b-chat-hf-gpt-4-80k.IQ4XS.gguf | IQ4XS | 6.54GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q40.gguf | Q40 | 6.86GB | | Llama-2-13b-chat-hf-gpt-4-80k.IQ4NL.gguf | IQ4NL | 6.9GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q4KS.gguf | Q4KS | 6.91GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q4K.gguf | Q4K | 7.33GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q4KM.gguf | Q4KM | 7.33GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q41.gguf | Q41 | 7.61GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q50.gguf | Q50 | 8.36GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q5KS.gguf | Q5KS | 8.36GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q5K.gguf | Q5K | 8.6GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q5KM.gguf | Q5KM | 8.6GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q51.gguf | Q51 | 9.1GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q6K.gguf | Q6K | 9.95GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q80.gguf | Q80 | 12.88GB | Original model description: --- license: apache-2.0 ---

ggufendpoints_compatibleregion:usconversational
richarderkhov/junchengxie_-_llama-2-13b-chat-hf-gpt-4-80k-gguf visual
Downloads
81
Likes
1
Pipeline
Library
Visibility
Public
Access
Open

Repository Files & Downloads

22 files detected
Direct downloads for all repository files
FileTypeQuantizationSizeLink
Llama-2-13b-chat-hf-gpt-4-80k.IQ3_M.gguf GGUF IQ3_M 5.57 GB Download
Llama-2-13b-chat-hf-gpt-4-80k.IQ3_S.gguf GGUF IQ3_S 5.27 GB Download
Llama-2-13b-chat-hf-gpt-4-80k.IQ3_XS.gguf GGUF IQ3_XS 4.99 GB Download
Llama-2-13b-chat-hf-gpt-4-80k.IQ4_NL.gguf GGUF IQ4_NL 6.90 GB Download
Llama-2-13b-chat-hf-gpt-4-80k.IQ4_XS.gguf GGUF IQ4_XS 6.54 GB Download
Llama-2-13b-chat-hf-gpt-4-80k.Q2_K.gguf GGUF Q2_K 4.52 GB Download
Llama-2-13b-chat-hf-gpt-4-80k.Q3_K.gguf GGUF Q3_K 5.90 GB Download
Llama-2-13b-chat-hf-gpt-4-80k.Q3_K_L.gguf GGUF Q3_K_L 6.45 GB Download
Llama-2-13b-chat-hf-gpt-4-80k.Q3_K_M.gguf GGUF Q3_K_M 5.90 GB Download
Llama-2-13b-chat-hf-gpt-4-80k.Q3_K_S.gguf GGUF Q3_K_S 5.27 GB Download
Llama-2-13b-chat-hf-gpt-4-80k.Q4_0.gguf GGUF 6.86 GB Download
Llama-2-13b-chat-hf-gpt-4-80k.Q4_1.gguf GGUF 7.61 GB Download
Llama-2-13b-chat-hf-gpt-4-80k.Q4_K.gguf GGUF Q4_K 7.33 GB Download
Llama-2-13b-chat-hf-gpt-4-80k.Q4_K_M.gguf GGUF Q4_K_M 7.33 GB Download
Llama-2-13b-chat-hf-gpt-4-80k.Q4_K_S.gguf GGUF Q4_K_S 6.91 GB Download
Llama-2-13b-chat-hf-gpt-4-80k.Q5_0.gguf GGUF 8.36 GB Download
Llama-2-13b-chat-hf-gpt-4-80k.Q5_1.gguf GGUF 9.10 GB Download
Llama-2-13b-chat-hf-gpt-4-80k.Q5_K.gguf GGUF Q5_K 8.60 GB Download
Llama-2-13b-chat-hf-gpt-4-80k.Q5_K_M.gguf GGUF Q5_K_M 8.60 GB Download
Llama-2-13b-chat-hf-gpt-4-80k.Q5_K_S.gguf GGUF Q5_K_S 8.36 GB Download
Llama-2-13b-chat-hf-gpt-4-80k.Q6_K.gguf GGUF Q6_K 9.95 GB Download
Llama-2-13b-chat-hf-gpt-4-80k.Q8_0.gguf GGUF 12.88 GB Download

Model Details Live

Model Slug
richarderkhov/junchengxie_-_llama-2-13b-chat-hf-gpt-4-80k-gguf
Author
RichardErkhov
Pipeline Task
Library
Created
2024-08-09
Last Modified
2024-08-09
Gated
No
Private
No
HF SHA
1e45f3526c2c6da31ac44a6d15eef6217a9f73c2
License
Unknown
Language
Unknown
Base Model
Unknown

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "frontmatter": {},
    "hero_image_url": "",
    "summary": "Quantization made by Richard Erkhov. Github Discord Request more models Llama-2-13b-chat-hf-gpt-4-80k - GGUF | Name | Quant method | Size | | ---- | ---- | ---- | | Llama-2-13b-chat-hf-gpt-4-80k.Q2_K.gguf | Q2_K | 4.52GB | | Llama-2-13b-chat-hf-gpt-4-80k.IQ3_XS.gguf | IQ3_XS | 4.99GB | | Llama-2-13b-chat-hf-gpt-4-80k.IQ3_S.gguf | IQ3_S | 5.27GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q3_K_S.gguf | Q3_K_S | 5.27GB | | Llama-2-13b-chat-hf-gpt-4-80k.IQ3_M.gguf | IQ3_M | 5.57GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q3_K.gguf | Q3_K | 5.9GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q3_K_M.gguf | Q3_K_M | 5.9GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q3_K_L.gguf | Q3_K_L | 6.45GB | | Llama-2-13b-chat-hf-gpt-4-80k.IQ4_XS.gguf | IQ4_XS | 6.54GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q4_0.gguf | Q4_0 | 6.86GB | | Llama-2-13b-chat-hf-gpt-4-80k.IQ4_NL.gguf | IQ4_NL | 6.9GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q4_K_S.gguf | Q4_K_S | 6.91GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q4_K.gguf | Q4_K | 7.33GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q4_K_M.gguf | Q4_K_M | 7.33GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q4_1.gguf | Q4_1 | 7.61GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q5_0.gguf | Q5_0 | 8.36GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q5_K_S.gguf | Q5_K_S | 8.36GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q5_K.gguf | Q5_K | 8.6GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q5_K_M.gguf | Q5_K_M | 8.6GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q5_1.gguf | Q5_1 | 9.1GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q6_K.gguf | Q6_K | 9.95GB | | Llama-2-13b-chat-hf-gpt-4-80k.Q8_0.gguf | Q8_0 | 12.88GB | Original model description: --- license: apache-2.0 ---",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "Quantization made by Richard Erkhov.\n\n[Github](https://github.com/RichardErkhov)\n\n[Discord](https://discord.gg/pvy7H8DZMG)\n\n[Request more models](https://github.com/RichardErkhov/quant_request)\n\n\nLlama-2-13b-chat-hf-gpt-4-80k - GGUF\n- Model creator: https://huggingface.co/JunchengXie/\n- Original model: https://huggingface.co/JunchengXie/Llama-2-13b-chat-hf-gpt-4-80k/\n\n\n| Name | Quant method | Size |\n| ---- | ---- | ---- |\n| [Llama-2-13b-chat-hf-gpt-4-80k.Q2_K.gguf](https://huggingface.co/RichardErkhov/JunchengXie_-_Llama-2-13b-chat-hf-gpt-4-80k-gguf/blob/main/Llama-2-13b-chat-hf-gpt-4-80k.Q2_K.gguf) | Q2_K | 4.52GB |\n| [Llama-2-13b-chat-hf-gpt-4-80k.IQ3_XS.gguf](https://huggingface.co/RichardErkhov/JunchengXie_-_Llama-2-13b-chat-hf-gpt-4-80k-gguf/blob/main/Llama-2-13b-chat-hf-gpt-4-80k.IQ3_XS.gguf) | IQ3_XS | 4.99GB |\n| [Llama-2-13b-chat-hf-gpt-4-80k.IQ3_S.gguf](https://huggingface.co/RichardErkhov/JunchengXie_-_Llama-2-13b-chat-hf-gpt-4-80k-gguf/blob/main/Llama-2-13b-chat-hf-gpt-4-80k.IQ3_S.gguf) | IQ3_S | 5.27GB |\n| [Llama-2-13b-chat-hf-gpt-4-80k.Q3_K_S.gguf](https://huggingface.co/RichardErkhov/JunchengXie_-_Llama-2-13b-chat-hf-gpt-4-80k-gguf/blob/main/Llama-2-13b-chat-hf-gpt-4-80k.Q3_K_S.gguf) | Q3_K_S | 5.27GB |\n| [Llama-2-13b-chat-hf-gpt-4-80k.IQ3_M.gguf](https://huggingface.co/RichardErkhov/JunchengXie_-_Llama-2-13b-chat-hf-gpt-4-80k-gguf/blob/main/Llama-2-13b-chat-hf-gpt-4-80k.IQ3_M.gguf) | IQ3_M | 5.57GB |\n| [Llama-2-13b-chat-hf-gpt-4-80k.Q3_K.gguf](https://huggingface.co/RichardErkhov/JunchengXie_-_Llama-2-13b-chat-hf-gpt-4-80k-gguf/blob/main/Llama-2-13b-chat-hf-gpt-4-80k.Q3_K.gguf) | Q3_K | 5.9GB |\n| [Llama-2-13b-chat-hf-gpt-4-80k.Q3_K_M.gguf](https://huggingface.co/RichardErkhov/JunchengXie_-_Llama-2-13b-chat-hf-gpt-4-80k-gguf/blob/main/Llama-2-13b-chat-hf-gpt-4-80k.Q3_K_M.gguf) | Q3_K_M | 5.9GB |\n| [Llama-2-13b-chat-hf-gpt-4-80k.Q3_K_L.gguf](https://huggingface.co/RichardErkhov/JunchengXie_-_Llama-2-13b-chat-hf-gpt-4-80k-gguf/blob/main/Llama-2-13b-chat-hf-gpt-4-80k.Q3_K_L.gguf) | Q3_K_L | 6.45GB |\n| [Llama-2-13b-chat-hf-gpt-4-80k.IQ4_XS.gguf](https://huggingface.co/RichardErkhov/JunchengXie_-_Llama-2-13b-chat-hf-gpt-4-80k-gguf/blob/main/Llama-2-13b-chat-hf-gpt-4-80k.IQ4_XS.gguf) | IQ4_XS | 6.54GB |\n| [Llama-2-13b-chat-hf-gpt-4-80k.Q4_0.gguf](https://huggingface.co/RichardErkhov/JunchengXie_-_Llama-2-13b-chat-hf-gpt-4-80k-gguf/blob/main/Llama-2-13b-chat-hf-gpt-4-80k.Q4_0.gguf) | Q4_0 | 6.86GB |\n| [Llama-2-13b-chat-hf-gpt-4-80k.IQ4_NL.gguf](https://huggingface.co/RichardErkhov/JunchengXie_-_Llama-2-13b-chat-hf-gpt-4-80k-gguf/blob/main/Llama-2-13b-chat-hf-gpt-4-80k.IQ4_NL.gguf) | IQ4_NL | 6.9GB |\n| [Llama-2-13b-chat-hf-gpt-4-80k.Q4_K_S.gguf](https://huggingface.co/RichardErkhov/JunchengXie_-_Llama-2-13b-chat-hf-gpt-4-80k-gguf/blob/main/Llama-2-13b-chat-hf-gpt-4-80k.Q4_K_S.gguf) | Q4_K_S | 6.91GB |\n| [Llama-2-13b-chat-hf-gpt-4-80k.Q4_K.gguf](https://huggingface.co/RichardErkhov/JunchengXie_-_Llama-2-13b-chat-hf-gpt-4-80k-gguf/blob/main/Llama-2-13b-chat-hf-gpt-4-80k.Q4_K.gguf) | Q4_K | 7.33GB |\n| [Llama-2-13b-chat-hf-gpt-4-80k.Q4_K_M.gguf](https://huggingface.co/RichardErkhov/JunchengXie_-_Llama-2-13b-chat-hf-gpt-4-80k-gguf/blob/main/Llama-2-13b-chat-hf-gpt-4-80k.Q4_K_M.gguf) | Q4_K_M | 7.33GB |\n| [Llama-2-13b-chat-hf-gpt-4-80k.Q4_1.gguf](https://huggingface.co/RichardErkhov/JunchengXie_-_Llama-2-13b-chat-hf-gpt-4-80k-gguf/blob/main/Llama-2-13b-chat-hf-gpt-4-80k.Q4_1.gguf) | Q4_1 | 7.61GB |\n| [Llama-2-13b-chat-hf-gpt-4-80k.Q5_0.gguf](https://huggingface.co/RichardErkhov/JunchengXie_-_Llama-2-13b-chat-hf-gpt-4-80k-gguf/blob/main/Llama-2-13b-chat-hf-gpt-4-80k.Q5_0.gguf) | Q5_0 | 8.36GB |\n| [Llama-2-13b-chat-hf-gpt-4-80k.Q5_K_S.gguf](https://huggingface.co/RichardErkhov/JunchengXie_-_Llama-2-13b-chat-hf-gpt-4-80k-gguf/blob/main/Llama-2-13b-chat-hf-gpt-4-80k.Q5_K_S.gguf) | Q5_K_S | 8.36GB |\n| [Llama-2-13b-chat-hf-gpt-4-80k.Q5_K.gguf](https://huggingface.co/RichardErkhov/JunchengXie_-_Llama-2-13b-chat-hf-gpt-4-80k-gguf/blob/main/Llama-2-13b-chat-hf-gpt-4-80k.Q5_K.gguf) | Q5_K | 8.6GB |\n| [Llama-2-13b-chat-hf-gpt-4-80k.Q5_K_M.gguf](https://huggingface.co/RichardErkhov/JunchengXie_-_Llama-2-13b-chat-hf-gpt-4-80k-gguf/blob/main/Llama-2-13b-chat-hf-gpt-4-80k.Q5_K_M.gguf) | Q5_K_M | 8.6GB |\n| [Llama-2-13b-chat-hf-gpt-4-80k.Q5_1.gguf](https://huggingface.co/RichardErkhov/JunchengXie_-_Llama-2-13b-chat-hf-gpt-4-80k-gguf/blob/main/Llama-2-13b-chat-hf-gpt-4-80k.Q5_1.gguf) | Q5_1 | 9.1GB |\n| [Llama-2-13b-chat-hf-gpt-4-80k.Q6_K.gguf](https://huggingface.co/RichardErkhov/JunchengXie_-_Llama-2-13b-chat-hf-gpt-4-80k-gguf/blob/main/Llama-2-13b-chat-hf-gpt-4-80k.Q6_K.gguf) | Q6_K | 9.95GB |\n| [Llama-2-13b-chat-hf-gpt-4-80k.Q8_0.gguf](https://huggingface.co/RichardErkhov/JunchengXie_-_Llama-2-13b-chat-hf-gpt-4-80k-gguf/blob/main/Llama-2-13b-chat-hf-gpt-4-80k.Q8_0.gguf) | Q8_0 | 12.88GB |\n\n\n\n\nOriginal model description:\n---\nlicense: apache-2.0\n---\n\n## Description\nThis model is finetuned on the distillation data from GPT-4.\nThe base model is meta-llama/Llama-2-13b-chat-hf\n## Usage\nThe model has a query format as in llama-2.\n```\n<s> [INST] <<SYS>>\nYou are a helpful, respectful and honest assistant. Always answer as helpfully as possible, while being safe.  Your answers should not include any harmful, unethical, racist, sexist, toxic, dangerous, or illegal content. Please ensure that your responses are socially unbiased and positive in nature.\n\nIf a question does not make any sense, or is not factually coherent, explain why instead of answering something not correct. If you don't know the answer to a question, please don't share false information.\n<</SYS>>\n\n{query} [/INST]  \n```\n\n\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 1,
  "downloads": 81,
  "gated": false,
  "private": false,
  "last_modified": "2024-08-09T08:04:28.000Z",
  "created_at": "2024-08-09T04:48:43.000Z",
  "pipeline_tag": "",
  "library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
  "_id": "66b59fab18d95c926cc50f53",
  "id": "RichardErkhov/JunchengXie_-_Llama-2-13b-chat-hf-gpt-4-80k-gguf",
  "modelId": "RichardErkhov/JunchengXie_-_Llama-2-13b-chat-hf-gpt-4-80k-gguf",
  "sha": "1e45f3526c2c6da31ac44a6d15eef6217a9f73c2",
  "createdAt": "2024-08-09T04:48:43.000Z",
  "lastModified": "2024-08-09T08:04:28.000Z",
  "author": "RichardErkhov",
  "downloads": 81,
  "likes": 1,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 24
}