GraySoft
Projects Models About FAQ Contact Download guIDE →
Model Intelligence Sheet

richarderkhov/chargoddard_-_prometheus-2-llama-3-8b-gguf overview

Replication of prometheus-7b-v2.0 using Llama 3 8B Instruct as a base model. As in their paper, two different models were trained on their preference and feedback datasets then linearly merged at equal weight. Training hyperparameters: 1 epoch Learning rate 1e-5 Effective batch size 128 Cosine annealing * ~5% warmup Uses Llama 3 Instruct prompt format and the same prompts as prometheus-7b-v2.0. See that readme for info. # Citations

ggufarxiv:2310.08491arxiv:2405.01535endpoints_compatibleregion:usconversational
richarderkhov/chargoddard_-_prometheus-2-llama-3-8b-gguf visual
Downloads
152
Likes
0
Pipeline
Library
Visibility
Public
Access
Open

Repository Files & Downloads

22 files detected
Direct downloads for all repository files
FileTypeQuantizationSizeLink
prometheus-2-llama-3-8b.IQ3_M.gguf GGUF IQ3_M 3.52 GB Download
prometheus-2-llama-3-8b.IQ3_S.gguf GGUF IQ3_S 3.43 GB Download
prometheus-2-llama-3-8b.IQ3_XS.gguf GGUF IQ3_XS 3.28 GB Download
prometheus-2-llama-3-8b.IQ4_NL.gguf GGUF IQ4_NL 4.38 GB Download
prometheus-2-llama-3-8b.IQ4_XS.gguf GGUF IQ4_XS 4.18 GB Download
prometheus-2-llama-3-8b.Q2_K.gguf GGUF Q2_K 2.96 GB Download
prometheus-2-llama-3-8b.Q3_K.gguf GGUF Q3_K 3.74 GB Download
prometheus-2-llama-3-8b.Q3_K_L.gguf GGUF Q3_K_L 4.03 GB Download
prometheus-2-llama-3-8b.Q3_K_M.gguf GGUF Q3_K_M 3.74 GB Download
prometheus-2-llama-3-8b.Q3_K_S.gguf GGUF Q3_K_S 3.41 GB Download
prometheus-2-llama-3-8b.Q4_0.gguf GGUF 4.34 GB Download
prometheus-2-llama-3-8b.Q4_1.gguf GGUF 4.78 GB Download
prometheus-2-llama-3-8b.Q4_K.gguf GGUF Q4_K 4.58 GB Download
prometheus-2-llama-3-8b.Q4_K_M.gguf GGUF Q4_K_M 4.58 GB Download
prometheus-2-llama-3-8b.Q4_K_S.gguf GGUF Q4_K_S 4.37 GB Download
prometheus-2-llama-3-8b.Q5_0.gguf GGUF 5.21 GB Download
prometheus-2-llama-3-8b.Q5_1.gguf GGUF 5.65 GB Download
prometheus-2-llama-3-8b.Q5_K.gguf GGUF Q5_K 5.34 GB Download
prometheus-2-llama-3-8b.Q5_K_M.gguf GGUF Q5_K_M 5.34 GB Download
prometheus-2-llama-3-8b.Q5_K_S.gguf GGUF Q5_K_S 5.21 GB Download
prometheus-2-llama-3-8b.Q6_K.gguf GGUF Q6_K 6.14 GB Download
prometheus-2-llama-3-8b.Q8_0.gguf GGUF 7.95 GB Download

Model Details Live

Model Slug
richarderkhov/chargoddard_-_prometheus-2-llama-3-8b-gguf
Author
RichardErkhov
Pipeline Task
Library
Created
2024-08-21
Last Modified
2024-08-21
Gated
No
Private
No
HF SHA
f3a1e9e166f260516b6b1ccce9f732c6f37e8873
License
Unknown
Language
Unknown
Base Model
Unknown

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "frontmatter": {},
    "hero_image_url": "",
    "summary": "Replication of prometheus-7b-v2.0 using Llama 3 8B Instruct as a base model. As in their paper, two different models were trained on their preference and feedback datasets then linearly merged at equal weight. Training hyperparameters: * 1 epoch * Learning rate 1e-5 * Effective batch size 128 * Cosine annealing * ~5% warmup Uses Llama 3 Instruct prompt format and the same prompts as prometheus-7b-v2.0. See that readme for info. # Citations ``bibtex @misc{kim2023prometheus, title={Prometheus: Inducing Fine-grained Evaluation Capability in Language Models}, author={Seungone Kim and Jamin Shin and Yejin Cho and Joel Jang and Shayne Longpre and Hwaran Lee and Sangdoo Yun and Seongjin Shin and Sungdong Kim and James Thorne and Minjoon Seo}, year={2023}, eprint={2310.08491}, archivePrefix={arXiv}, primaryClass={cs.CL} } ` `bibtex @misc{kim2024prometheus, title={Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models}, author={Seungone Kim and Juyoung Suk and Shayne Longpre and Bill Yuchen Lin and Jamin Shin and Sean Welleck and Graham Neubig and Moontae Lee and Kyungjae Lee and Minjoon Seo}, year={2024}, eprint={2405.01535}, archivePrefix={arXiv}, primaryClass={cs.CL} } ``",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "Quantization made by Richard Erkhov.\n\n[Github](https://github.com/RichardErkhov)\n\n[Discord](https://discord.gg/pvy7H8DZMG)\n\n[Request more models](https://github.com/RichardErkhov/quant_request)\n\n\nprometheus-2-llama-3-8b - GGUF\n- Model creator: https://huggingface.co/chargoddard/\n- Original model: https://huggingface.co/chargoddard/prometheus-2-llama-3-8b/\n\n\n| Name | Quant method | Size |\n| ---- | ---- | ---- |\n| [prometheus-2-llama-3-8b.Q2_K.gguf](https://huggingface.co/RichardErkhov/chargoddard_-_prometheus-2-llama-3-8b-gguf/blob/main/prometheus-2-llama-3-8b.Q2_K.gguf) | Q2_K | 2.96GB |\n| [prometheus-2-llama-3-8b.IQ3_XS.gguf](https://huggingface.co/RichardErkhov/chargoddard_-_prometheus-2-llama-3-8b-gguf/blob/main/prometheus-2-llama-3-8b.IQ3_XS.gguf) | IQ3_XS | 3.28GB |\n| [prometheus-2-llama-3-8b.IQ3_S.gguf](https://huggingface.co/RichardErkhov/chargoddard_-_prometheus-2-llama-3-8b-gguf/blob/main/prometheus-2-llama-3-8b.IQ3_S.gguf) | IQ3_S | 3.43GB |\n| [prometheus-2-llama-3-8b.Q3_K_S.gguf](https://huggingface.co/RichardErkhov/chargoddard_-_prometheus-2-llama-3-8b-gguf/blob/main/prometheus-2-llama-3-8b.Q3_K_S.gguf) | Q3_K_S | 3.41GB |\n| [prometheus-2-llama-3-8b.IQ3_M.gguf](https://huggingface.co/RichardErkhov/chargoddard_-_prometheus-2-llama-3-8b-gguf/blob/main/prometheus-2-llama-3-8b.IQ3_M.gguf) | IQ3_M | 3.52GB |\n| [prometheus-2-llama-3-8b.Q3_K.gguf](https://huggingface.co/RichardErkhov/chargoddard_-_prometheus-2-llama-3-8b-gguf/blob/main/prometheus-2-llama-3-8b.Q3_K.gguf) | Q3_K | 3.74GB |\n| [prometheus-2-llama-3-8b.Q3_K_M.gguf](https://huggingface.co/RichardErkhov/chargoddard_-_prometheus-2-llama-3-8b-gguf/blob/main/prometheus-2-llama-3-8b.Q3_K_M.gguf) | Q3_K_M | 3.74GB |\n| [prometheus-2-llama-3-8b.Q3_K_L.gguf](https://huggingface.co/RichardErkhov/chargoddard_-_prometheus-2-llama-3-8b-gguf/blob/main/prometheus-2-llama-3-8b.Q3_K_L.gguf) | Q3_K_L | 4.03GB |\n| [prometheus-2-llama-3-8b.IQ4_XS.gguf](https://huggingface.co/RichardErkhov/chargoddard_-_prometheus-2-llama-3-8b-gguf/blob/main/prometheus-2-llama-3-8b.IQ4_XS.gguf) | IQ4_XS | 4.18GB |\n| [prometheus-2-llama-3-8b.Q4_0.gguf](https://huggingface.co/RichardErkhov/chargoddard_-_prometheus-2-llama-3-8b-gguf/blob/main/prometheus-2-llama-3-8b.Q4_0.gguf) | Q4_0 | 4.34GB |\n| [prometheus-2-llama-3-8b.IQ4_NL.gguf](https://huggingface.co/RichardErkhov/chargoddard_-_prometheus-2-llama-3-8b-gguf/blob/main/prometheus-2-llama-3-8b.IQ4_NL.gguf) | IQ4_NL | 4.38GB |\n| [prometheus-2-llama-3-8b.Q4_K_S.gguf](https://huggingface.co/RichardErkhov/chargoddard_-_prometheus-2-llama-3-8b-gguf/blob/main/prometheus-2-llama-3-8b.Q4_K_S.gguf) | Q4_K_S | 4.37GB |\n| [prometheus-2-llama-3-8b.Q4_K.gguf](https://huggingface.co/RichardErkhov/chargoddard_-_prometheus-2-llama-3-8b-gguf/blob/main/prometheus-2-llama-3-8b.Q4_K.gguf) | Q4_K | 4.58GB |\n| [prometheus-2-llama-3-8b.Q4_K_M.gguf](https://huggingface.co/RichardErkhov/chargoddard_-_prometheus-2-llama-3-8b-gguf/blob/main/prometheus-2-llama-3-8b.Q4_K_M.gguf) | Q4_K_M | 4.58GB |\n| [prometheus-2-llama-3-8b.Q4_1.gguf](https://huggingface.co/RichardErkhov/chargoddard_-_prometheus-2-llama-3-8b-gguf/blob/main/prometheus-2-llama-3-8b.Q4_1.gguf) | Q4_1 | 4.78GB |\n| [prometheus-2-llama-3-8b.Q5_0.gguf](https://huggingface.co/RichardErkhov/chargoddard_-_prometheus-2-llama-3-8b-gguf/blob/main/prometheus-2-llama-3-8b.Q5_0.gguf) | Q5_0 | 5.21GB |\n| [prometheus-2-llama-3-8b.Q5_K_S.gguf](https://huggingface.co/RichardErkhov/chargoddard_-_prometheus-2-llama-3-8b-gguf/blob/main/prometheus-2-llama-3-8b.Q5_K_S.gguf) | Q5_K_S | 5.21GB |\n| [prometheus-2-llama-3-8b.Q5_K.gguf](https://huggingface.co/RichardErkhov/chargoddard_-_prometheus-2-llama-3-8b-gguf/blob/main/prometheus-2-llama-3-8b.Q5_K.gguf) | Q5_K | 5.34GB |\n| [prometheus-2-llama-3-8b.Q5_K_M.gguf](https://huggingface.co/RichardErkhov/chargoddard_-_prometheus-2-llama-3-8b-gguf/blob/main/prometheus-2-llama-3-8b.Q5_K_M.gguf) | Q5_K_M | 5.34GB |\n| [prometheus-2-llama-3-8b.Q5_1.gguf](https://huggingface.co/RichardErkhov/chargoddard_-_prometheus-2-llama-3-8b-gguf/blob/main/prometheus-2-llama-3-8b.Q5_1.gguf) | Q5_1 | 5.65GB |\n| [prometheus-2-llama-3-8b.Q6_K.gguf](https://huggingface.co/RichardErkhov/chargoddard_-_prometheus-2-llama-3-8b-gguf/blob/main/prometheus-2-llama-3-8b.Q6_K.gguf) | Q6_K | 6.14GB |\n| [prometheus-2-llama-3-8b.Q8_0.gguf](https://huggingface.co/RichardErkhov/chargoddard_-_prometheus-2-llama-3-8b-gguf/blob/main/prometheus-2-llama-3-8b.Q8_0.gguf) | Q8_0 | 7.95GB |\n\n\n\n\nOriginal model description:\n---\nbase_model:\n- chargoddard/prometheus-llama-3-8b-preference\n- chargoddard/prometheus-llama-3-8b-absolute\nlibrary_name: transformers\ntags:\n- mergekit\n- merge\nlicense: apache-2.0\ndatasets:\n- prometheus-eval/Preference-Collection\n- prometheus-eval/Feedback-Collection\nlanguage:\n- en\n---\n# prometheus-2-llama-3-8b\n\nReplication of [prometheus-7b-v2.0](https://huggingface.co/prometheus-eval/prometheus-7b-v2.0) using [Llama 3 8B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) as a base model.\n\nAs in their paper, two different models were trained on their preference and feedback datasets then linearly merged at equal weight.\n\nTraining hyperparameters:\n* 1 epoch\n* Learning rate 1e-5\n* Effective batch size 128\n* Cosine annealing\n* ~5% warmup\n\n\nUses Llama 3 Instruct prompt format and the same prompts as prometheus-7b-v2.0. See that readme for info.\n\n\n# Citations\n\n\n```bibtex\n@misc{kim2023prometheus,\n    title={Prometheus: Inducing Fine-grained Evaluation Capability in Language Models},\n    author={Seungone Kim and Jamin Shin and Yejin Cho and Joel Jang and Shayne Longpre and Hwaran Lee and Sangdoo Yun and Seongjin Shin and Sungdong Kim and James Thorne and Minjoon Seo},\n    year={2023},\n    eprint={2310.08491},\n    archivePrefix={arXiv},\n    primaryClass={cs.CL}\n}\n```\n```bibtex\n@misc{kim2024prometheus,\n    title={Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models},\n    author={Seungone Kim and Juyoung Suk and Shayne Longpre and Bill Yuchen Lin and Jamin Shin and Sean Welleck and Graham Neubig and Moontae Lee and Kyungjae Lee and Minjoon Seo},\n    year={2024},\n    eprint={2405.01535},\n    archivePrefix={arXiv},\n    primaryClass={cs.CL}\n}\n```\n\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "arxiv:2310.08491",
    "arxiv:2405.01535",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 0,
  "downloads": 152,
  "gated": false,
  "private": false,
  "last_modified": "2024-08-21T11:32:57.000Z",
  "created_at": "2024-08-21T09:45:11.000Z",
  "pipeline_tag": "",
  "library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
  "_id": "66c5b72746525a0007ce24be",
  "id": "RichardErkhov/chargoddard_-_prometheus-2-llama-3-8b-gguf",
  "modelId": "RichardErkhov/chargoddard_-_prometheus-2-llama-3-8b-gguf",
  "sha": "f3a1e9e166f260516b6b1ccce9f732c6f37e8873",
  "createdAt": "2024-08-21T09:45:11.000Z",
  "lastModified": "2024-08-21T11:32:57.000Z",
  "author": "RichardErkhov",
  "downloads": 152,
  "likes": 0,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 24
}