duyntnet/solar-10.7b-instruct-v1.0-imatrix-gguf IQ3_XXS GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

duyntnet/solar-10.7b-instruct-v1.0-imatrix-gguf overview

Usage Instructions This model has been fine-tuned primarily for single-turn conversation, making it less suitable for multi-turn conversations such as chat. ### Version Make sure you have the correct version of the transformers library installed: ### Loading the Model Use the following Python code to load the model: ### Conducting Single-Turn Conversation Below is an example of the output.

transformersggufimatrixSOLAR-10.7B-Instruct-v1.0text-generationenlicense:otherregion:usconversational

duyntnet/solar-10.7b-instruct-v1.0-imatrix-gguf visual

Downloads

Likes

Pipeline

text-generation

Library

transformers

Visibility

Public

Access

Open

Repository Files & Downloads

27 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
SOLAR-10.7B-Instruct-v1.0-IQ1_M.gguf	GGUF	IQ1_M	2.39 GB	Download
SOLAR-10.7B-Instruct-v1.0-IQ1_S.gguf	GGUF	IQ1_S	2.19 GB	Download
SOLAR-10.7B-Instruct-v1.0-IQ2_M.gguf	GGUF	IQ2_M	3.42 GB	Download
SOLAR-10.7B-Instruct-v1.0-IQ2_S.gguf	GGUF	IQ2_S	3.16 GB	Download
SOLAR-10.7B-Instruct-v1.0-IQ2_XS.gguf	GGUF	IQ2_XS	3.01 GB	Download
SOLAR-10.7B-Instruct-v1.0-IQ2_XXS.gguf	GGUF	IQ2_XXS	2.72 GB	Download
SOLAR-10.7B-Instruct-v1.0-IQ3_M.gguf	GGUF	IQ3_M	4.51 GB	Download
SOLAR-10.7B-Instruct-v1.0-IQ3_S.gguf	GGUF	IQ3_S	4.37 GB	Download
SOLAR-10.7B-Instruct-v1.0-IQ3_XS.gguf	GGUF	IQ3_XS	4.14 GB	Download
SOLAR-10.7B-Instruct-v1.0-IQ3_XXS.gguf	GGUF	IQ3_XXS	3.88 GB	Download
SOLAR-10.7B-Instruct-v1.0-IQ4_NL.gguf	GGUF	IQ4_NL	5.68 GB	Download
SOLAR-10.7B-Instruct-v1.0-IQ4_XS.gguf	GGUF	IQ4_XS	5.38 GB	Download
SOLAR-10.7B-Instruct-v1.0-Q2_K.gguf	GGUF	Q2_K	3.73 GB	Download
SOLAR-10.7B-Instruct-v1.0-Q2_K_S.gguf	GGUF	Q2_K_S	3.46 GB	Download
SOLAR-10.7B-Instruct-v1.0-Q3_K_L.gguf	GGUF	Q3_K_L	5.26 GB	Download
SOLAR-10.7B-Instruct-v1.0-Q3_K_M.gguf	GGUF	Q3_K_M	4.84 GB	Download
SOLAR-10.7B-Instruct-v1.0-Q3_K_S.gguf	GGUF	Q3_K_S	4.34 GB	Download
SOLAR-10.7B-Instruct-v1.0-Q4_0.gguf	GGUF	—	5.68 GB	Download
SOLAR-10.7B-Instruct-v1.0-Q4_1.gguf	GGUF	—	6.27 GB	Download
SOLAR-10.7B-Instruct-v1.0-Q4_K_M.gguf	GGUF	Q4_K_M	6.02 GB	Download
SOLAR-10.7B-Instruct-v1.0-Q4_K_S.gguf	GGUF	Q4_K_S	5.70 GB	Download
SOLAR-10.7B-Instruct-v1.0-Q5_0.gguf	GGUF	—	6.91 GB	Download
SOLAR-10.7B-Instruct-v1.0-Q5_1.gguf	GGUF	—	7.51 GB	Download
SOLAR-10.7B-Instruct-v1.0-Q5_K_M.gguf	GGUF	Q5_K_M	7.08 GB	Download
SOLAR-10.7B-Instruct-v1.0-Q5_K_S.gguf	GGUF	Q5_K_S	6.89 GB	Download
SOLAR-10.7B-Instruct-v1.0-Q6_K.gguf	GGUF	Q6_K	8.20 GB	Download
SOLAR-10.7B-Instruct-v1.0-Q8_0.gguf	GGUF	—	10.62 GB	Download

Model Details Live

Model Slug

duyntnet/solar-10.7b-instruct-v1.0-imatrix-gguf

Author

duyntnet

Pipeline Task

text-generation

Library

transformers

Created

2024-05-04

Last Modified

2024-05-04

Gated

Private

HF SHA

ea19334578bb3989499c8582a6f055ee33e60a59

License

other

Language

Base Model

Unknown

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "license": "other",
    "language": [
      "en"
    ],
    "pipeline_tag": "text-generation",
    "inference": false,
    "tags": [
      "transformers",
      "gguf",
      "imatrix",
      "SOLAR-10.7B-Instruct-v1.0"
    ],
    "frontmatter": {
      "license": "other",
      "language": [
        "en"
      ],
      "pipeline_tag": "text-generation",
      "inference": "false",
      "tags": [
        "transformers",
        "gguf",
        "imatrix",
        "SOLAR-10.7B-Instruct-v1.0"
      ]
    },
    "hero_image_url": "",
    "summary": "# **Usage Instructions** This model has been fine-tuned primarily for single-turn conversation, making it less suitable for multi-turn conversations such as chat. ### **Version** Make sure you have the correct version of the transformers library installed: ``sh pip install transformers==4.35.2 ` ### **Loading the Model** Use the following Python code to load the model: `python import torch from transformers import AutoModelForCausalLM, AutoTokenizer tokenizer = AutoTokenizer.from_pretrained(\"Upstage/SOLAR-10.7B-Instruct-v1.0\") model = AutoModelForCausalLM.from_pretrained( \"Upstage/SOLAR-10.7B-Instruct-v1.0\", device_map=\"auto\", torch_dtype=torch.float16, ) ` ### **Conducting Single-Turn Conversation** `python conversation = [ {'role': 'user', 'content': 'Hello?'} ] prompt = tokenizer.apply_chat_template(conversation, tokenize=False, add_generation_prompt=True) inputs = tokenizer(prompt, return_tensors=\"pt\").to(model.device) outputs = model.generate(**inputs, use_cache=True, max_length=4096) output_text = tokenizer.decode(outputs[0]) print(output_text) ` Below is an example of the output. `  ### User: Hello? ### Assistant: Hello, how can I assist you today? Please feel free to ask any questions or request help with a specific task. ``",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: other\nlanguage:\n- en\npipeline_tag: text-generation\ninference: false\ntags:\n- transformers\n- gguf\n- imatrix\n- SOLAR-10.7B-Instruct-v1.0\n---\nQuantizations of https://huggingface.co/upstage/SOLAR-10.7B-Instruct-v1.0\n\n# From original readme\n\n# **Usage Instructions**\n\nThis model has been fine-tuned primarily for single-turn conversation, making it less suitable for multi-turn conversations such as chat.\n\n### **Version**\n\nMake sure you have the correct version of the transformers library installed:\n\n```sh\npip install transformers==4.35.2\n```\n\n### **Loading the Model**\n\nUse the following Python code to load the model:\n\n```python\nimport torch\nfrom transformers import AutoModelForCausalLM, AutoTokenizer\n\ntokenizer = AutoTokenizer.from_pretrained(\"Upstage/SOLAR-10.7B-Instruct-v1.0\")\nmodel = AutoModelForCausalLM.from_pretrained(\n    \"Upstage/SOLAR-10.7B-Instruct-v1.0\",\n    device_map=\"auto\",\n    torch_dtype=torch.float16,\n)\n```\n\n### **Conducting Single-Turn Conversation**\n\n```python\nconversation = [ {'role': 'user', 'content': 'Hello?'} ] \n\nprompt = tokenizer.apply_chat_template(conversation, tokenize=False, add_generation_prompt=True)\n\ninputs = tokenizer(prompt, return_tensors=\"pt\").to(model.device) \noutputs = model.generate(**inputs, use_cache=True, max_length=4096)\noutput_text = tokenizer.decode(outputs[0]) \nprint(output_text)\n```\n\nBelow is an example of the output.\n```\n<s> ### User:\nHello?\n\n### Assistant:\nHello, how can I assist you today? Please feel free to ask any questions or request help with a specific task.</s>\n```",
    "related_quantizations": []
  },
  "tags": [
    "transformers",
    "gguf",
    "imatrix",
    "SOLAR-10.7B-Instruct-v1.0",
    "text-generation",
    "en",
    "license:other",
    "region:us",
    "conversational"
  ],
  "likes": 0,
  "downloads": 98,
  "gated": false,
  "private": false,
  "last_modified": "2024-05-04T14:30:03.000Z",
  "created_at": "2024-05-04T11:46:39.000Z",
  "pipeline_tag": "text-generation",
  "library_name": "transformers"
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "6636201f20663edd59c71675",
  "id": "duyntnet/SOLAR-10.7B-Instruct-v1.0-imatrix-GGUF",
  "modelId": "duyntnet/SOLAR-10.7B-Instruct-v1.0-imatrix-GGUF",
  "sha": "ea19334578bb3989499c8582a6f055ee33e60a59",
  "createdAt": "2024-05-04T11:46:39.000Z",
  "lastModified": "2024-05-04T14:30:03.000Z",
  "author": "duyntnet",
  "downloads": 98,
  "likes": 0,
  "gated": false,
  "private": false,
  "pipeline_tag": "text-generation",
  "library_name": "transformers",
  "siblings_count": 29
}

duyntnet/solar-10.7b-instruct-v1.0-imatrix-gguf overview

Repository Files & Downloads

Model Details Live

Metadata Inspector

More models in this shard