GraySoft
Projects Models About FAQ Contact Download guIDE →

duyntnet/vicuna-7b-v1.5-16k-imatrix-gguf IQ3_S GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

duyntnet/vicuna-7b-v1.5-16k-imatrix-gguf overview

Vicuna is a chat assistant trained by fine-tuning Llama 2 on user-shared conversations collected from ShareGPT. ### Model Sources

transformersggufimatrixvicuna-7b-v1.5-16ktext-generationenarxiv:2307.09288arxiv:2306.05685license:otherregion:us
duyntnet/vicuna-7b-v1.5-16k-imatrix-gguf visual
Downloads
472
Likes
0
Pipeline
text-generation
Library
transformers
Visibility
Public
Access
Open

Repository Files & Downloads

27 files detected
Direct downloads for all repository files
FileTypeQuantizationSizeLink
vicuna-7b-v1.5-16k-IQ1_M.gguf GGUF IQ1_M 1.54 GB Download
vicuna-7b-v1.5-16k-IQ1_S.gguf GGUF IQ1_S 1.42 GB Download
vicuna-7b-v1.5-16k-IQ2_M.gguf GGUF IQ2_M 2.20 GB Download
vicuna-7b-v1.5-16k-IQ2_S.gguf GGUF IQ2_S 2.05 GB Download
vicuna-7b-v1.5-16k-IQ2_XS.gguf GGUF IQ2_XS 1.90 GB Download
vicuna-7b-v1.5-16k-IQ2_XXS.gguf GGUF IQ2_XXS 1.73 GB Download
vicuna-7b-v1.5-16k-IQ3_M.gguf GGUF IQ3_M 2.90 GB Download
vicuna-7b-v1.5-16k-IQ3_S.gguf GGUF IQ3_S 2.75 GB Download
vicuna-7b-v1.5-16k-IQ3_XS.gguf GGUF IQ3_XS 2.60 GB Download
vicuna-7b-v1.5-16k-IQ3_XXS.gguf GGUF IQ3_XXS 2.41 GB Download
vicuna-7b-v1.5-16k-IQ4_NL.gguf GGUF IQ4_NL 3.56 GB Download
vicuna-7b-v1.5-16k-IQ4_XS.gguf GGUF IQ4_XS 3.37 GB Download
vicuna-7b-v1.5-16k-Q2_K.gguf GGUF Q2_K 2.36 GB Download
vicuna-7b-v1.5-16k-Q2_K_S.gguf GGUF Q2_K_S 2.16 GB Download
vicuna-7b-v1.5-16k-Q3_K_L.gguf GGUF Q3_K_L 3.35 GB Download
vicuna-7b-v1.5-16k-Q3_K_M.gguf GGUF Q3_K_M 3.07 GB Download
vicuna-7b-v1.5-16k-Q3_K_S.gguf GGUF Q3_K_S 2.75 GB Download
vicuna-7b-v1.5-16k-Q4_0.gguf GGUF 3.57 GB Download
vicuna-7b-v1.5-16k-Q4_1.gguf GGUF 3.95 GB Download
vicuna-7b-v1.5-16k-Q4_K_M.gguf GGUF Q4_K_M 3.80 GB Download
vicuna-7b-v1.5-16k-Q4_K_S.gguf GGUF Q4_K_S 3.59 GB Download
vicuna-7b-v1.5-16k-Q5_0.gguf GGUF 4.34 GB Download
vicuna-7b-v1.5-16k-Q5_1.gguf GGUF 4.72 GB Download
vicuna-7b-v1.5-16k-Q5_K_M.gguf GGUF Q5_K_M 4.45 GB Download
vicuna-7b-v1.5-16k-Q5_K_S.gguf GGUF Q5_K_S 4.33 GB Download
vicuna-7b-v1.5-16k-Q6_K.gguf GGUF Q6_K 5.15 GB Download
vicuna-7b-v1.5-16k-Q8_0.gguf GGUF 6.67 GB Download

Model Details Live

Model Slug
duyntnet/vicuna-7b-v1.5-16k-imatrix-gguf
Author
duyntnet
Pipeline Task
text-generation
Library
transformers
Created
2024-11-12
Last Modified
2024-11-12
Gated
No
Private
No
HF SHA
373fc286c5b4731c81957a89aa201102bab66bd4
License
other
Language
en
Base Model
Unknown

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "license": "other",
    "language": [
      "en"
    ],
    "pipeline_tag": "text-generation",
    "inference": false,
    "tags": [
      "transformers",
      "gguf",
      "imatrix",
      "vicuna-7b-v1.5-16k"
    ],
    "frontmatter": {
      "license": "other",
      "language": [
        "en"
      ],
      "pipeline_tag": "text-generation",
      "inference": "false",
      "tags": [
        "transformers",
        "gguf",
        "imatrix",
        "vicuna-7b-v1.5-16k"
      ]
    },
    "hero_image_url": "",
    "summary": "Vicuna is a chat assistant trained by fine-tuning Llama 2 on user-shared conversations collected from ShareGPT. ### Model Sources",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: other\nlanguage:\n- en\npipeline_tag: text-generation\ninference: false\ntags:\n- transformers\n- gguf\n- imatrix\n- vicuna-7b-v1.5-16k\n---\nQuantizations of https://huggingface.co/lmsys/vicuna-7b-v1.5-16k\n\n\n### Inference Clients/UIs\n* [llama.cpp](https://github.com/ggerganov/llama.cpp)\n* [KoboldCPP](https://github.com/LostRuins/koboldcpp)\n* [ollama](https://github.com/ollama/ollama)\n* [text-generation-webui](https://github.com/oobabooga/text-generation-webui)\n* [GPT4All](https://github.com/nomic-ai/gpt4all)\n* [jan](https://github.com/janhq/jan)\n---\n\n# From original readme\n\nVicuna is a chat assistant trained by fine-tuning Llama 2 on user-shared conversations collected from ShareGPT.\n\n- **Developed by:** [LMSYS](https://lmsys.org/)\n- **Model type:** An auto-regressive language model based on the transformer architecture\n- **License:** Llama 2 Community License Agreement\t\n- **Finetuned from model:** [Llama 2](https://arxiv.org/abs/2307.09288)\n\n### Model Sources\n\n- **Repository:** https://github.com/lm-sys/FastChat\n- **Blog:** https://lmsys.org/blog/2023-03-30-vicuna/\n- **Paper:** https://arxiv.org/abs/2306.05685\n- **Demo:** https://chat.lmsys.org/\n\n## Uses\n\nThe primary use of Vicuna is research on large language models and chatbots.\nThe primary intended users of the model are researchers and hobbyists in natural language processing, machine learning, and artificial intelligence.\n\n## How to Get Started with the Model\n\n- Command line interface: https://github.com/lm-sys/FastChat#vicuna-weights\n- APIs (OpenAI API, Huggingface API): https://github.com/lm-sys/FastChat/tree/main#api  ",
    "related_quantizations": []
  },
  "tags": [
    "transformers",
    "gguf",
    "imatrix",
    "vicuna-7b-v1.5-16k",
    "text-generation",
    "en",
    "arxiv:2307.09288",
    "arxiv:2306.05685",
    "license:other",
    "region:us"
  ],
  "likes": 0,
  "downloads": 472,
  "gated": false,
  "private": false,
  "last_modified": "2024-11-12T04:35:27.000Z",
  "created_at": "2024-11-12T00:15:18.000Z",
  "pipeline_tag": "text-generation",
  "library_name": "transformers"
}
Source payload excerpt (from Hugging Face API)
{
  "_id": "67329e1624b316be878e6b42",
  "id": "duyntnet/vicuna-7b-v1.5-16k-imatrix-GGUF",
  "modelId": "duyntnet/vicuna-7b-v1.5-16k-imatrix-GGUF",
  "sha": "373fc286c5b4731c81957a89aa201102bab66bd4",
  "createdAt": "2024-11-12T00:15:18.000Z",
  "lastModified": "2024-11-12T04:35:27.000Z",
  "author": "duyntnet",
  "downloads": 472,
  "likes": 0,
  "gated": false,
  "private": false,
  "pipeline_tag": "text-generation",
  "library_name": "transformers",
  "siblings_count": 29
}