duyntnet/vicuna-7b-v1.5-16k-imatrix-gguf (IQ3_S GGUF) is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. Use this page to pick a local model for guIDE or other runtimes. Related models in the same shard are listed below.

Model Intelligence Sheet

duyntnet/vicuna-7b-v1.5-16k-imatrix-gguf overview

Vicuna is a chat assistant trained by fine-tuning Llama 2 on user-shared conversations collected from ShareGPT.

transformers, gguf, imatrix, vicuna-7b-v1.5-16k, text-generation, en, arxiv:2307.09288, arxiv:2306.05685, license:other, region:us

Downloads

472

Likes

0

Pipeline

text-generation

Library

transformers

Visibility

Public

Access

Open

Repository Files & Downloads

27 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
vicuna-7b-v1.5-16k-IQ1_M.gguf	GGUF	IQ1_M	1.54 GB	Download
vicuna-7b-v1.5-16k-IQ1_S.gguf	GGUF	IQ1_S	1.42 GB	Download
vicuna-7b-v1.5-16k-IQ2_M.gguf	GGUF	IQ2_M	2.20 GB	Download
vicuna-7b-v1.5-16k-IQ2_S.gguf	GGUF	IQ2_S	2.05 GB	Download
vicuna-7b-v1.5-16k-IQ2_XS.gguf	GGUF	IQ2_XS	1.90 GB	Download
vicuna-7b-v1.5-16k-IQ2_XXS.gguf	GGUF	IQ2_XXS	1.73 GB	Download
vicuna-7b-v1.5-16k-IQ3_M.gguf	GGUF	IQ3_M	2.90 GB	Download
vicuna-7b-v1.5-16k-IQ3_S.gguf	GGUF	IQ3_S	2.75 GB	Download
vicuna-7b-v1.5-16k-IQ3_XS.gguf	GGUF	IQ3_XS	2.60 GB	Download
vicuna-7b-v1.5-16k-IQ3_XXS.gguf	GGUF	IQ3_XXS	2.41 GB	Download
vicuna-7b-v1.5-16k-IQ4_NL.gguf	GGUF	IQ4_NL	3.56 GB	Download
vicuna-7b-v1.5-16k-IQ4_XS.gguf	GGUF	IQ4_XS	3.37 GB	Download
vicuna-7b-v1.5-16k-Q2_K.gguf	GGUF	Q2_K	2.36 GB	Download
vicuna-7b-v1.5-16k-Q2_K_S.gguf	GGUF	Q2_K_S	2.16 GB	Download
vicuna-7b-v1.5-16k-Q3_K_L.gguf	GGUF	Q3_K_L	3.35 GB	Download
vicuna-7b-v1.5-16k-Q3_K_M.gguf	GGUF	Q3_K_M	3.07 GB	Download
vicuna-7b-v1.5-16k-Q3_K_S.gguf	GGUF	Q3_K_S	2.75 GB	Download
vicuna-7b-v1.5-16k-Q4_0.gguf	GGUF	Q4_0	3.57 GB	Download
vicuna-7b-v1.5-16k-Q4_1.gguf	GGUF	Q4_1	3.95 GB	Download
vicuna-7b-v1.5-16k-Q4_K_M.gguf	GGUF	Q4_K_M	3.80 GB	Download
vicuna-7b-v1.5-16k-Q4_K_S.gguf	GGUF	Q4_K_S	3.59 GB	Download
vicuna-7b-v1.5-16k-Q5_0.gguf	GGUF	Q5_0	4.34 GB	Download
vicuna-7b-v1.5-16k-Q5_1.gguf	GGUF	Q5_1	4.72 GB	Download
vicuna-7b-v1.5-16k-Q5_K_M.gguf	GGUF	Q5_K_M	4.45 GB	Download
vicuna-7b-v1.5-16k-Q5_K_S.gguf	GGUF	Q5_K_S	4.33 GB	Download
vicuna-7b-v1.5-16k-Q6_K.gguf	GGUF	Q6_K	5.15 GB	Download
vicuna-7b-v1.5-16k-Q8_0.gguf	GGUF	Q8_0	6.67 GB	Download
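
One way to use the table above is to pick the largest quantization that fits a given size budget. The sketch below is only an illustration: the sizes are copied from a subset of the rows, and the helper names `largest_quant_within` and `filename_for` are hypothetical, not part of any library. File size is a lower bound on memory use; a runtime typically needs extra room for the context cache, so treat the budget as approximate.

```python
# File sizes in GB, copied from a subset of the rows in the table above.
QUANT_SIZES_GB = {
    "IQ1_S": 1.42,
    "IQ2_XS": 1.90,
    "IQ3_S": 2.75,
    "Q3_K_M": 3.07,
    "Q4_K_M": 3.80,
    "Q5_K_M": 4.45,
    "Q6_K": 5.15,
    "Q8_0": 6.67,
}


def largest_quant_within(budget_gb, sizes=QUANT_SIZES_GB):
    """Return the name of the largest file that fits in budget_gb, or None."""
    fitting = {name: size for name, size in sizes.items() if size <= budget_gb}
    if not fitting:
        return None
    # The largest file that still fits is used here as a rough proxy for quality.
    return max(fitting, key=fitting.get)


def filename_for(quant):
    """Build the file name using the naming pattern seen in the table."""
    return f"vicuna-7b-v1.5-16k-{quant}.gguf"


choice = largest_quant_within(4.0)
print(choice, filename_for(choice))
```

Larger files are assumed here to preserve more of the original model's quality, which is a common rule of thumb rather than a guarantee for every quantization type.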

Model Details Live

Model Slug

duyntnet/vicuna-7b-v1.5-16k-imatrix-gguf

Author

duyntnet

Pipeline Task

text-generation

Library

transformers

Created

2024-11-12

Last Modified

2024-11-12

Gated

No

Private

No

HF SHA

373fc286c5b4731c81957a89aa201102bab66bd4

License

other

Language

en

Base Model

lmsys/vicuna-7b-v1.5-16k

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "license": "other",
    "language": [
      "en"
    ],
    "pipeline_tag": "text-generation",
    "inference": false,
    "tags": [
      "transformers",
      "gguf",
      "imatrix",
      "vicuna-7b-v1.5-16k"
    ],
    "frontmatter": {
      "license": "other",
      "language": [
        "en"
      ],
      "pipeline_tag": "text-generation",
      "inference": "false",
      "tags": [
        "transformers",
        "gguf",
        "imatrix",
        "vicuna-7b-v1.5-16k"
      ]
    },
    "hero_image_url": "",
    "summary": "Vicuna is a chat assistant trained by fine-tuning Llama 2 on user-shared conversations collected from ShareGPT. ### Model Sources",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: other\nlanguage:\n- en\npipeline_tag: text-generation\ninference: false\ntags:\n- transformers\n- gguf\n- imatrix\n- vicuna-7b-v1.5-16k\n---\nQuantizations of https://huggingface.co/lmsys/vicuna-7b-v1.5-16k\n\n\n### Inference Clients/UIs\n* [llama.cpp](https://github.com/ggerganov/llama.cpp)\n* [KoboldCPP](https://github.com/LostRuins/koboldcpp)\n* [ollama](https://github.com/ollama/ollama)\n* [text-generation-webui](https://github.com/oobabooga/text-generation-webui)\n* [GPT4All](https://github.com/nomic-ai/gpt4all)\n* [jan](https://github.com/janhq/jan)\n---\n\n# From original readme\n\nVicuna is a chat assistant trained by fine-tuning Llama 2 on user-shared conversations collected from ShareGPT.\n\n- **Developed by:** [LMSYS](https://lmsys.org/)\n- **Model type:** An auto-regressive language model based on the transformer architecture\n- **License:** Llama 2 Community License Agreement\t\n- **Finetuned from model:** [Llama 2](https://arxiv.org/abs/2307.09288)\n\n### Model Sources\n\n- **Repository:** https://github.com/lm-sys/FastChat\n- **Blog:** https://lmsys.org/blog/2023-03-30-vicuna/\n- **Paper:** https://arxiv.org/abs/2306.05685\n- **Demo:** https://chat.lmsys.org/\n\n## Uses\n\nThe primary use of Vicuna is research on large language models and chatbots.\nThe primary intended users of the model are researchers and hobbyists in natural language processing, machine learning, and artificial intelligence.\n\n## How to Get Started with the Model\n\n- Command line interface: https://github.com/lm-sys/FastChat#vicuna-weights\n- APIs (OpenAI API, Huggingface API): https://github.com/lm-sys/FastChat/tree/main#api  ",
    "related_quantizations": []
  },
  "tags": [
    "transformers",
    "gguf",
    "imatrix",
    "vicuna-7b-v1.5-16k",
    "text-generation",
    "en",
    "arxiv:2307.09288",
    "arxiv:2306.05685",
    "license:other",
    "region:us"
  ],
  "likes": 0,
  "downloads": 472,
  "gated": false,
  "private": false,
  "last_modified": "2024-11-12T04:35:27.000Z",
  "created_at": "2024-11-12T00:15:18.000Z",
  "pipeline_tag": "text-generation",
  "library_name": "transformers"
}
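
In the normalized metadata above, `inference` appears as the boolean `false` under `card_data` but as the string `"false"` under `card_data.frontmatter`. A consumer that reads both may want to coerce them to one type. The following is a minimal sketch of that idea; `coerce_bool` is a hypothetical helper, and the embedded JSON is a trimmed copy of the record, not the full payload.

```python
import json


def coerce_bool(value):
    """Map JSON booleans and the strings "true"/"false" to a Python bool."""
    if isinstance(value, bool):
        return value
    if isinstance(value, str) and value.strip().lower() in ("true", "false"):
        return value.strip().lower() == "true"
    raise ValueError(f"not a boolean-like value: {value!r}")


# Trimmed copy of the normalized record shown above.
record = json.loads("""
{
  "card_data": {
    "inference": false,
    "frontmatter": {"inference": "false"}
  }
}
""")

card = record["card_data"]
top_level = coerce_bool(card["inference"])
from_frontmatter = coerce_bool(card["frontmatter"]["inference"])
print(top_level == from_frontmatter)
```

Raising on anything that is not clearly boolean-like is a deliberate choice in this sketch, so that an unexpected value surfaces early instead of being silently treated as false.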

Source payload excerpt (from Hugging Face API)

{
  "_id": "67329e1624b316be878e6b42",
  "id": "duyntnet/vicuna-7b-v1.5-16k-imatrix-GGUF",
  "modelId": "duyntnet/vicuna-7b-v1.5-16k-imatrix-GGUF",
  "sha": "373fc286c5b4731c81957a89aa201102bab66bd4",
  "createdAt": "2024-11-12T00:15:18.000Z",
  "lastModified": "2024-11-12T04:35:27.000Z",
  "author": "duyntnet",
  "downloads": 472,
  "likes": 0,
  "gated": false,
  "private": false,
  "pipeline_tag": "text-generation",
  "library_name": "transformers",
  "siblings_count": 29
}
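
The payload fields can be combined into a direct file URL and a time span between creation and last modification. The sketch below assumes the file sits at the repository root on the `main` branch and uses the Hugging Face `resolve` URL pattern; `download_url` and `parse_hf_time` are hypothetical helper names. Note that the payload `id` ends in `-GGUF`, while the slug shown elsewhere on this page is lowercased.

```python
from datetime import datetime, timezone

# Values copied from the payload excerpt above.
payload = {
    "id": "duyntnet/vicuna-7b-v1.5-16k-imatrix-GGUF",
    "createdAt": "2024-11-12T00:15:18.000Z",
    "lastModified": "2024-11-12T04:35:27.000Z",
}


def parse_hf_time(stamp):
    """Parse a timestamp such as 2024-11-12T00:15:18.000Z as UTC."""
    parsed = datetime.strptime(stamp, "%Y-%m-%dT%H:%M:%S.%fZ")
    return parsed.replace(tzinfo=timezone.utc)


def download_url(repo_id, filename, revision="main"):
    """Build a direct URL, assuming the file is at the repository root."""
    return f"https://huggingface.co/{repo_id}/resolve/{revision}/{filename}"


created = parse_hf_time(payload["createdAt"])
modified = parse_hf_time(payload["lastModified"])
elapsed = modified - created

url = download_url(payload["id"], "vicuna-7b-v1.5-16k-IQ3_S.gguf")
print(elapsed, url)
```

`strptime` with an explicit format is used instead of `datetime.fromisoformat` because older Python versions do not accept the trailing `Z`.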

More models in this shard