duyntnet/vicuna-7b-v1.5-16k-imatrix-gguf IQ3_S GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.
Model Intelligence Sheet
duyntnet/vicuna-7b-v1.5-16k-imatrix-gguf overview
Vicuna is a chat assistant trained by fine-tuning Llama 2 on user-shared conversations collected from ShareGPT. ### Model Sources
Downloads
472
Likes
0
Pipeline
text-generation
Library
transformers
Visibility
Public
Access
Open
Repository Files & Downloads
27 files detected
Direct downloads for all repository files
| File | Type | Quantization | Size | Link |
|---|---|---|---|---|
| vicuna-7b-v1.5-16k-IQ1_M.gguf | GGUF | IQ1_M | 1.54 GB | Download |
| vicuna-7b-v1.5-16k-IQ1_S.gguf | GGUF | IQ1_S | 1.42 GB | Download |
| vicuna-7b-v1.5-16k-IQ2_M.gguf | GGUF | IQ2_M | 2.20 GB | Download |
| vicuna-7b-v1.5-16k-IQ2_S.gguf | GGUF | IQ2_S | 2.05 GB | Download |
| vicuna-7b-v1.5-16k-IQ2_XS.gguf | GGUF | IQ2_XS | 1.90 GB | Download |
| vicuna-7b-v1.5-16k-IQ2_XXS.gguf | GGUF | IQ2_XXS | 1.73 GB | Download |
| vicuna-7b-v1.5-16k-IQ3_M.gguf | GGUF | IQ3_M | 2.90 GB | Download |
| vicuna-7b-v1.5-16k-IQ3_S.gguf | GGUF | IQ3_S | 2.75 GB | Download |
| vicuna-7b-v1.5-16k-IQ3_XS.gguf | GGUF | IQ3_XS | 2.60 GB | Download |
| vicuna-7b-v1.5-16k-IQ3_XXS.gguf | GGUF | IQ3_XXS | 2.41 GB | Download |
| vicuna-7b-v1.5-16k-IQ4_NL.gguf | GGUF | IQ4_NL | 3.56 GB | Download |
| vicuna-7b-v1.5-16k-IQ4_XS.gguf | GGUF | IQ4_XS | 3.37 GB | Download |
| vicuna-7b-v1.5-16k-Q2_K.gguf | GGUF | Q2_K | 2.36 GB | Download |
| vicuna-7b-v1.5-16k-Q2_K_S.gguf | GGUF | Q2_K_S | 2.16 GB | Download |
| vicuna-7b-v1.5-16k-Q3_K_L.gguf | GGUF | Q3_K_L | 3.35 GB | Download |
| vicuna-7b-v1.5-16k-Q3_K_M.gguf | GGUF | Q3_K_M | 3.07 GB | Download |
| vicuna-7b-v1.5-16k-Q3_K_S.gguf | GGUF | Q3_K_S | 2.75 GB | Download |
| vicuna-7b-v1.5-16k-Q4_0.gguf | GGUF | — | 3.57 GB | Download |
| vicuna-7b-v1.5-16k-Q4_1.gguf | GGUF | — | 3.95 GB | Download |
| vicuna-7b-v1.5-16k-Q4_K_M.gguf | GGUF | Q4_K_M | 3.80 GB | Download |
| vicuna-7b-v1.5-16k-Q4_K_S.gguf | GGUF | Q4_K_S | 3.59 GB | Download |
| vicuna-7b-v1.5-16k-Q5_0.gguf | GGUF | — | 4.34 GB | Download |
| vicuna-7b-v1.5-16k-Q5_1.gguf | GGUF | — | 4.72 GB | Download |
| vicuna-7b-v1.5-16k-Q5_K_M.gguf | GGUF | Q5_K_M | 4.45 GB | Download |
| vicuna-7b-v1.5-16k-Q5_K_S.gguf | GGUF | Q5_K_S | 4.33 GB | Download |
| vicuna-7b-v1.5-16k-Q6_K.gguf | GGUF | Q6_K | 5.15 GB | Download |
| vicuna-7b-v1.5-16k-Q8_0.gguf | GGUF | — | 6.67 GB | Download |
Model Details Live
Metadata Inspector
Normalized metadata (stored in metadata_json)
{
"metadata": {},
"card_data": {
"license": "other",
"language": [
"en"
],
"pipeline_tag": "text-generation",
"inference": false,
"tags": [
"transformers",
"gguf",
"imatrix",
"vicuna-7b-v1.5-16k"
],
"frontmatter": {
"license": "other",
"language": [
"en"
],
"pipeline_tag": "text-generation",
"inference": "false",
"tags": [
"transformers",
"gguf",
"imatrix",
"vicuna-7b-v1.5-16k"
]
},
"hero_image_url": "",
"summary": "Vicuna is a chat assistant trained by fine-tuning Llama 2 on user-shared conversations collected from ShareGPT. ### Model Sources",
"quick_links": [],
"benchmark_table_html": "",
"readme_markdown": "---\nlicense: other\nlanguage:\n- en\npipeline_tag: text-generation\ninference: false\ntags:\n- transformers\n- gguf\n- imatrix\n- vicuna-7b-v1.5-16k\n---\nQuantizations of https://huggingface.co/lmsys/vicuna-7b-v1.5-16k\n\n\n### Inference Clients/UIs\n* [llama.cpp](https://github.com/ggerganov/llama.cpp)\n* [KoboldCPP](https://github.com/LostRuins/koboldcpp)\n* [ollama](https://github.com/ollama/ollama)\n* [text-generation-webui](https://github.com/oobabooga/text-generation-webui)\n* [GPT4All](https://github.com/nomic-ai/gpt4all)\n* [jan](https://github.com/janhq/jan)\n---\n\n# From original readme\n\nVicuna is a chat assistant trained by fine-tuning Llama 2 on user-shared conversations collected from ShareGPT.\n\n- **Developed by:** [LMSYS](https://lmsys.org/)\n- **Model type:** An auto-regressive language model based on the transformer architecture\n- **License:** Llama 2 Community License Agreement\t\n- **Finetuned from model:** [Llama 2](https://arxiv.org/abs/2307.09288)\n\n### Model Sources\n\n- **Repository:** https://github.com/lm-sys/FastChat\n- **Blog:** https://lmsys.org/blog/2023-03-30-vicuna/\n- **Paper:** https://arxiv.org/abs/2306.05685\n- **Demo:** https://chat.lmsys.org/\n\n## Uses\n\nThe primary use of Vicuna is research on large language models and chatbots.\nThe primary intended users of the model are researchers and hobbyists in natural language processing, machine learning, and artificial intelligence.\n\n## How to Get Started with the Model\n\n- Command line interface: https://github.com/lm-sys/FastChat#vicuna-weights\n- APIs (OpenAI API, Huggingface API): https://github.com/lm-sys/FastChat/tree/main#api ",
"related_quantizations": []
},
"tags": [
"transformers",
"gguf",
"imatrix",
"vicuna-7b-v1.5-16k",
"text-generation",
"en",
"arxiv:2307.09288",
"arxiv:2306.05685",
"license:other",
"region:us"
],
"likes": 0,
"downloads": 472,
"gated": false,
"private": false,
"last_modified": "2024-11-12T04:35:27.000Z",
"created_at": "2024-11-12T00:15:18.000Z",
"pipeline_tag": "text-generation",
"library_name": "transformers"
}
Source payload excerpt (from Hugging Face API)
{
"_id": "67329e1624b316be878e6b42",
"id": "duyntnet/vicuna-7b-v1.5-16k-imatrix-GGUF",
"modelId": "duyntnet/vicuna-7b-v1.5-16k-imatrix-GGUF",
"sha": "373fc286c5b4731c81957a89aa201102bab66bd4",
"createdAt": "2024-11-12T00:15:18.000Z",
"lastModified": "2024-11-12T04:35:27.000Z",
"author": "duyntnet",
"downloads": 472,
"likes": 0,
"gated": false,
"private": false,
"pipeline_tag": "text-generation",
"library_name": "transformers",
"siblings_count": 29
}