GraySoft
Projects Models About FAQ Contact Download guIDE →
Model Intelligence Sheet

richarderkhov/scandukuri_-_llama3-8b-stargate-m1-gguf overview

This repository contains the iteration 1 meta-llama/Meta-Llama-3-8B-Instruct model from an additional experiment for STaR-GATE: Teaching Language Models to Ask Clarifying Questions. Note that this experiment is an extension and is not yet included in the most recent revision of the linked preprint. The weights contained in this repository are represented by the blue line in the left-side win-rate graph below. Note that this repository contains the weights for iteration t=1, i.e. only one iteration of self-improvement. When prompting language models to complete a task, users often leave important aspects unsaid. While asking questions could resolve this ambiguity (GATE; Li et al., 2023), models often struggle to ask good questions. We explore a language model's ability to self-improve (STaR; Zelikman et al., 2022) by rewarding the model for generating useful questions-a simple method we dub STaR-GATE. We generate a synthetic dataset of 25,500 unique persona-task prompts to simulate conversations between a pretrained language model-the Questioner-and a Roleplayer whose preferences are unknown to the Questioner. By asking questions, the Questioner elicits preferences from the Roleplayer. The Questioner is iteratively finetuned on questions that increase the probability of high-quality responses to the task, which are generated by an Oracle with access to the Roleplayer's latent preferences. After two iterations of self-improvement, the Questioner asks better questions, allowing it to generate responses that are preferred over responses from the initial model on 72% of tasks. Our results indicate that teaching a language model to ask better questions leads to better personalized responses. # Usage Reference the paper appendix sections (Figure 14: Questioner Elicitation Prompt) and (Figure 17: Questioner Win-Rate Response Prompt.) to see how you can prompt the model for elicitation or for final responses. All code and data for the project can be found here.

ggufarxiv:2403.19154endpoints_compatibleregion:usconversational
richarderkhov/scandukuri_-_llama3-8b-stargate-m1-gguf visual
Downloads
267
Likes
0
Pipeline
Library
Visibility
Public
Access
Open

Repository Files & Downloads

22 files detected
Direct downloads for all repository files
FileTypeQuantizationSizeLink
llama3-8b-stargate-m1.IQ3_M.gguf GGUF IQ3_M 3.52 GB Download
llama3-8b-stargate-m1.IQ3_S.gguf GGUF IQ3_S 3.43 GB Download
llama3-8b-stargate-m1.IQ3_XS.gguf GGUF IQ3_XS 3.28 GB Download
llama3-8b-stargate-m1.IQ4_NL.gguf GGUF IQ4_NL 4.38 GB Download
llama3-8b-stargate-m1.IQ4_XS.gguf GGUF IQ4_XS 4.18 GB Download
llama3-8b-stargate-m1.Q2_K.gguf GGUF Q2_K 2.96 GB Download
llama3-8b-stargate-m1.Q3_K.gguf GGUF Q3_K 3.74 GB Download
llama3-8b-stargate-m1.Q3_K_L.gguf GGUF Q3_K_L 4.03 GB Download
llama3-8b-stargate-m1.Q3_K_M.gguf GGUF Q3_K_M 3.74 GB Download
llama3-8b-stargate-m1.Q3_K_S.gguf GGUF Q3_K_S 3.41 GB Download
llama3-8b-stargate-m1.Q4_0.gguf GGUF 4.34 GB Download
llama3-8b-stargate-m1.Q4_1.gguf GGUF 4.78 GB Download
llama3-8b-stargate-m1.Q4_K.gguf GGUF Q4_K 2.38 GB Download
llama3-8b-stargate-m1.Q4_K_M.gguf GGUF Q4_K_M 4.58 GB Download
llama3-8b-stargate-m1.Q4_K_S.gguf GGUF Q4_K_S 4.37 GB Download
llama3-8b-stargate-m1.Q5_0.gguf GGUF 5.21 GB Download
llama3-8b-stargate-m1.Q5_1.gguf GGUF 5.65 GB Download
llama3-8b-stargate-m1.Q5_K.gguf GGUF Q5_K 5.34 GB Download
llama3-8b-stargate-m1.Q5_K_M.gguf GGUF Q5_K_M 5.34 GB Download
llama3-8b-stargate-m1.Q5_K_S.gguf GGUF Q5_K_S 5.21 GB Download
llama3-8b-stargate-m1.Q6_K.gguf GGUF Q6_K 6.14 GB Download
llama3-8b-stargate-m1.Q8_0.gguf GGUF 7.95 GB Download

Model Details Live

Model Slug
richarderkhov/scandukuri_-_llama3-8b-stargate-m1-gguf
Author
RichardErkhov
Pipeline Task
Library
Created
2024-08-23
Last Modified
2024-08-24
Gated
No
Private
No
HF SHA
df823d742a1ac2fd0c6fcc91866bc5603ae041dc
License
Unknown
Language
Unknown
Base Model
Unknown

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "frontmatter": {},
    "hero_image_url": "https://github.com/scandukuri/assistant-gate/assets/87667591/7c2fe82a-04e8-4779-ab8d-c2476724ac69",
    "summary": "This repository contains the *iteration 1* meta-llama/Meta-Llama-3-8B-Instruct model from an additional experiment for STaR-GATE: Teaching Language Models to Ask Clarifying Questions. Note that this experiment is an extension and is **not yet included in the most recent revision of the linked preprint**. The weights contained in this repository are represented by the blue line in the left-side win-rate graph below. Note that this repository contains the weights for iteration *t=1*, i.e. only one iteration of self-improvement. When prompting language models to complete a task, users often leave important aspects unsaid. While asking questions could resolve this ambiguity (GATE; Li et al., 2023), models often struggle to ask good questions. We explore a language model's ability to self-improve (STaR; Zelikman et al., 2022) by rewarding the model for generating useful questions-a simple method we dub STaR-GATE. We generate a synthetic dataset of 25,500 unique persona-task prompts to simulate conversations between a pretrained language model-the Questioner-and a Roleplayer whose preferences are unknown to the Questioner. By asking questions, the Questioner elicits preferences from the Roleplayer. The Questioner is iteratively finetuned on questions that increase the probability of high-quality responses to the task, which are generated by an Oracle with access to the Roleplayer's latent preferences. After two iterations of self-improvement, the Questioner asks better questions, allowing it to generate responses that are preferred over responses from the initial model on 72% of tasks. Our results indicate that teaching a language model to ask better questions leads to better personalized responses.      # Usage Reference the paper appendix sections ``A.5.2` (**Figure 14:** Questioner Elicitation Prompt) and `A.6.2`` (**Figure 17:** Questioner Win-Rate Response Prompt.) to see how you can prompt the model for elicitation or for final responses. All code and data for the project can be found here.",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "Quantization made by Richard Erkhov.\n\n[Github](https://github.com/RichardErkhov)\n\n[Discord](https://discord.gg/pvy7H8DZMG)\n\n[Request more models](https://github.com/RichardErkhov/quant_request)\n\n\nllama3-8b-stargate-m1 - GGUF\n- Model creator: https://huggingface.co/scandukuri/\n- Original model: https://huggingface.co/scandukuri/llama3-8b-stargate-m1/\n\n\n| Name | Quant method | Size |\n| ---- | ---- | ---- |\n| [llama3-8b-stargate-m1.Q2_K.gguf](https://huggingface.co/RichardErkhov/scandukuri_-_llama3-8b-stargate-m1-gguf/blob/main/llama3-8b-stargate-m1.Q2_K.gguf) | Q2_K | 2.96GB |\n| [llama3-8b-stargate-m1.IQ3_XS.gguf](https://huggingface.co/RichardErkhov/scandukuri_-_llama3-8b-stargate-m1-gguf/blob/main/llama3-8b-stargate-m1.IQ3_XS.gguf) | IQ3_XS | 3.28GB |\n| [llama3-8b-stargate-m1.IQ3_S.gguf](https://huggingface.co/RichardErkhov/scandukuri_-_llama3-8b-stargate-m1-gguf/blob/main/llama3-8b-stargate-m1.IQ3_S.gguf) | IQ3_S | 3.43GB |\n| [llama3-8b-stargate-m1.Q3_K_S.gguf](https://huggingface.co/RichardErkhov/scandukuri_-_llama3-8b-stargate-m1-gguf/blob/main/llama3-8b-stargate-m1.Q3_K_S.gguf) | Q3_K_S | 3.41GB |\n| [llama3-8b-stargate-m1.IQ3_M.gguf](https://huggingface.co/RichardErkhov/scandukuri_-_llama3-8b-stargate-m1-gguf/blob/main/llama3-8b-stargate-m1.IQ3_M.gguf) | IQ3_M | 3.52GB |\n| [llama3-8b-stargate-m1.Q3_K.gguf](https://huggingface.co/RichardErkhov/scandukuri_-_llama3-8b-stargate-m1-gguf/blob/main/llama3-8b-stargate-m1.Q3_K.gguf) | Q3_K | 3.74GB |\n| [llama3-8b-stargate-m1.Q3_K_M.gguf](https://huggingface.co/RichardErkhov/scandukuri_-_llama3-8b-stargate-m1-gguf/blob/main/llama3-8b-stargate-m1.Q3_K_M.gguf) | Q3_K_M | 3.74GB |\n| [llama3-8b-stargate-m1.Q3_K_L.gguf](https://huggingface.co/RichardErkhov/scandukuri_-_llama3-8b-stargate-m1-gguf/blob/main/llama3-8b-stargate-m1.Q3_K_L.gguf) | Q3_K_L | 4.03GB |\n| [llama3-8b-stargate-m1.IQ4_XS.gguf](https://huggingface.co/RichardErkhov/scandukuri_-_llama3-8b-stargate-m1-gguf/blob/main/llama3-8b-stargate-m1.IQ4_XS.gguf) | IQ4_XS | 4.18GB |\n| [llama3-8b-stargate-m1.Q4_0.gguf](https://huggingface.co/RichardErkhov/scandukuri_-_llama3-8b-stargate-m1-gguf/blob/main/llama3-8b-stargate-m1.Q4_0.gguf) | Q4_0 | 4.34GB |\n| [llama3-8b-stargate-m1.IQ4_NL.gguf](https://huggingface.co/RichardErkhov/scandukuri_-_llama3-8b-stargate-m1-gguf/blob/main/llama3-8b-stargate-m1.IQ4_NL.gguf) | IQ4_NL | 4.38GB |\n| [llama3-8b-stargate-m1.Q4_K_S.gguf](https://huggingface.co/RichardErkhov/scandukuri_-_llama3-8b-stargate-m1-gguf/blob/main/llama3-8b-stargate-m1.Q4_K_S.gguf) | Q4_K_S | 4.37GB |\n| [llama3-8b-stargate-m1.Q4_K.gguf](https://huggingface.co/RichardErkhov/scandukuri_-_llama3-8b-stargate-m1-gguf/blob/main/llama3-8b-stargate-m1.Q4_K.gguf) | Q4_K | 2.38GB |\n| [llama3-8b-stargate-m1.Q4_K_M.gguf](https://huggingface.co/RichardErkhov/scandukuri_-_llama3-8b-stargate-m1-gguf/blob/main/llama3-8b-stargate-m1.Q4_K_M.gguf) | Q4_K_M | 4.58GB |\n| [llama3-8b-stargate-m1.Q4_1.gguf](https://huggingface.co/RichardErkhov/scandukuri_-_llama3-8b-stargate-m1-gguf/blob/main/llama3-8b-stargate-m1.Q4_1.gguf) | Q4_1 | 4.78GB |\n| [llama3-8b-stargate-m1.Q5_0.gguf](https://huggingface.co/RichardErkhov/scandukuri_-_llama3-8b-stargate-m1-gguf/blob/main/llama3-8b-stargate-m1.Q5_0.gguf) | Q5_0 | 5.21GB |\n| [llama3-8b-stargate-m1.Q5_K_S.gguf](https://huggingface.co/RichardErkhov/scandukuri_-_llama3-8b-stargate-m1-gguf/blob/main/llama3-8b-stargate-m1.Q5_K_S.gguf) | Q5_K_S | 5.21GB |\n| [llama3-8b-stargate-m1.Q5_K.gguf](https://huggingface.co/RichardErkhov/scandukuri_-_llama3-8b-stargate-m1-gguf/blob/main/llama3-8b-stargate-m1.Q5_K.gguf) | Q5_K | 5.34GB |\n| [llama3-8b-stargate-m1.Q5_K_M.gguf](https://huggingface.co/RichardErkhov/scandukuri_-_llama3-8b-stargate-m1-gguf/blob/main/llama3-8b-stargate-m1.Q5_K_M.gguf) | Q5_K_M | 5.34GB |\n| [llama3-8b-stargate-m1.Q5_1.gguf](https://huggingface.co/RichardErkhov/scandukuri_-_llama3-8b-stargate-m1-gguf/blob/main/llama3-8b-stargate-m1.Q5_1.gguf) | Q5_1 | 5.65GB |\n| [llama3-8b-stargate-m1.Q6_K.gguf](https://huggingface.co/RichardErkhov/scandukuri_-_llama3-8b-stargate-m1-gguf/blob/main/llama3-8b-stargate-m1.Q6_K.gguf) | Q6_K | 6.14GB |\n| [llama3-8b-stargate-m1.Q8_0.gguf](https://huggingface.co/RichardErkhov/scandukuri_-_llama3-8b-stargate-m1-gguf/blob/main/llama3-8b-stargate-m1.Q8_0.gguf) | Q8_0 | 7.95GB |\n\n\n\n\nOriginal model description:\n---\nlicense: mit\n---\n\n# STaR-GATE\n\nThis repository contains the *iteration 1* [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) model from an additional experiment for [STaR-GATE: Teaching Language Models to Ask Clarifying Questions](https://arxiv.org/abs/2403.19154). Note that this experiment is an extension and is **not yet included in the most recent revision of the linked preprint**. The weights contained in this repository are represented by the <span style=\"color:#2EA5E7\">blue</span> line in the left-side win-rate graph below. Note that this repository contains the weights for iteration *t=1*, i.e. only one iteration of self-improvement.\n\nWhen prompting language models to complete a task, users often leave important aspects unsaid. While asking questions could resolve this ambiguity (GATE; Li et al., 2023), models often struggle to ask good questions. We explore a language model's ability to self-improve (STaR; Zelikman et al., 2022) by rewarding the model for generating useful questions-a simple method we dub STaR-GATE. We generate a synthetic dataset of 25,500 unique persona-task prompts to simulate conversations between a pretrained language model-the Questioner-and a Roleplayer whose preferences are unknown to the Questioner. By asking questions, the Questioner elicits preferences from the Roleplayer. The Questioner is iteratively finetuned on questions that increase the probability of high-quality responses to the task, which are generated by an Oracle with access to the Roleplayer's latent preferences. After two iterations of self-improvement, the Questioner asks better questions, allowing it to generate responses that are preferred over responses from the initial model on 72% of tasks. Our results indicate that teaching a language model to ask better questions leads to better personalized responses.\n\n<p align=\"center\">\n  <br>\n  <img src=\"https://github.com/scandukuri/assistant-gate/assets/87667591/7c2fe82a-04e8-4779-ab8d-c2476724ac69\" alt=\"fig_3\">\n  <br><br>\n</p>\n\n# Usage\n\nReference the [paper](https://arxiv.org/abs/2403.19154) appendix sections ```A.5.2``` (**Figure 14:** Questioner Elicitation Prompt) and ```A.6.2``` (**Figure 17:** Questioner Win-Rate Response Prompt.) to see how you can prompt the model for elicitation or for final responses. All code and data for the project can be found [here](https://github.com/scandukuri/assistant-gate).\n\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "arxiv:2403.19154",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 0,
  "downloads": 267,
  "gated": false,
  "private": false,
  "last_modified": "2024-08-24T06:20:15.000Z",
  "created_at": "2024-08-23T18:31:01.000Z",
  "pipeline_tag": "",
  "library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
  "_id": "66c8d565c218946abecd9eae",
  "id": "RichardErkhov/scandukuri_-_llama3-8b-stargate-m1-gguf",
  "modelId": "RichardErkhov/scandukuri_-_llama3-8b-stargate-m1-gguf",
  "sha": "df823d742a1ac2fd0c6fcc91866bc5603ae041dc",
  "createdAt": "2024-08-23T18:31:01.000Z",
  "lastModified": "2024-08-24T06:20:15.000Z",
  "author": "RichardErkhov",
  "downloads": 267,
  "likes": 0,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 24
}