GraySoft
Projects Models About FAQ Contact Download guIDE →
Model Intelligence Sheet

richarderkhov/aalf_-_gemma-2-27b-it-simpo-37k-gguf overview

Comprehensive model page for richarderkhov/aalf-gemma-2-27b-it-simpo-37k-gguf

ggufarxiv:2405.14734arxiv:2310.01377endpoints_compatibleregion:usconversational
richarderkhov/aalf_-_gemma-2-27b-it-simpo-37k-gguf visual
Downloads
103
Likes
0
Pipeline
Library
Visibility
Public
Access
Open

Repository Files & Downloads

22 files detected
Direct downloads for all repository files
FileTypeQuantizationSizeLink
gemma-2-27b-it-SimPO-37K.IQ3_M.gguf GGUF IQ3_M 11.60 GB Download
gemma-2-27b-it-SimPO-37K.IQ3_S.gguf GGUF IQ3_S 11.33 GB Download
gemma-2-27b-it-SimPO-37K.IQ3_XS.gguf GGUF IQ3_XS 10.76 GB Download
gemma-2-27b-it-SimPO-37K.IQ4_NL.gguf GGUF IQ4_NL 14.65 GB Download
gemma-2-27b-it-SimPO-37K.IQ4_XS.gguf GGUF IQ4_XS 13.92 GB Download
gemma-2-27b-it-SimPO-37K.Q2_K.gguf GGUF Q2_K 9.73 GB Download
gemma-2-27b-it-SimPO-37K.Q3_K.gguf GGUF Q3_K 12.50 GB Download
gemma-2-27b-it-SimPO-37K.Q3_K_L.gguf GGUF Q3_K_L 13.52 GB Download
gemma-2-27b-it-SimPO-37K.Q3_K_M.gguf GGUF Q3_K_M 12.50 GB Download
gemma-2-27b-it-SimPO-37K.Q3_K_S.gguf GGUF Q3_K_S 11.33 GB Download
gemma-2-27b-it-SimPO-37K.Q4_0.gguf GGUF 14.56 GB Download
gemma-2-27b-it-SimPO-37K.Q4_1.gguf GGUF 16.07 GB Download
gemma-2-27b-it-SimPO-37K.Q4_K.gguf GGUF Q4_K 15.50 GB Download
gemma-2-27b-it-SimPO-37K.Q4_K_M.gguf GGUF Q4_K_M 15.50 GB Download
gemma-2-27b-it-SimPO-37K.Q4_K_S.gguf GGUF Q4_K_S 14.66 GB Download
gemma-2-27b-it-SimPO-37K.Q5_0.gguf GGUF 17.59 GB Download
gemma-2-27b-it-SimPO-37K.Q5_1.gguf GGUF 19.10 GB Download
gemma-2-27b-it-SimPO-37K.Q5_K.gguf GGUF Q5_K 18.08 GB Download
gemma-2-27b-it-SimPO-37K.Q5_K_M.gguf GGUF Q5_K_M 18.08 GB Download
gemma-2-27b-it-SimPO-37K.Q5_K_S.gguf GGUF Q5_K_S 17.59 GB Download
gemma-2-27b-it-SimPO-37K.Q6_K.gguf GGUF Q6_K 20.81 GB Download
gemma-2-27b-it-SimPO-37K.Q8_0.gguf GGUF 26.95 GB Download

Model Details Live

Model Slug
richarderkhov/aalf_-_gemma-2-27b-it-simpo-37k-gguf
Author
RichardErkhov
Pipeline Task
Library
Created
2024-09-17
Last Modified
2024-09-18
Gated
No
Private
No
HF SHA
3c3cce221fb8828e0f8a32ed32762526a32cd3ab
License
Unknown
Language
Unknown
Base Model
Unknown

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "frontmatter": {},
    "hero_image_url": "",
    "summary": "",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "Quantization made by Richard Erkhov.\n\n[Github](https://github.com/RichardErkhov)\n\n[Discord](https://discord.gg/pvy7H8DZMG)\n\n[Request more models](https://github.com/RichardErkhov/quant_request)\n\n\ngemma-2-27b-it-SimPO-37K - GGUF\n- Model creator: https://huggingface.co/AALF/\n- Original model: https://huggingface.co/AALF/gemma-2-27b-it-SimPO-37K/\n\n\n| Name | Quant method | Size |\n| ---- | ---- | ---- |\n| [gemma-2-27b-it-SimPO-37K.Q2_K.gguf](https://huggingface.co/RichardErkhov/AALF_-_gemma-2-27b-it-SimPO-37K-gguf/blob/main/gemma-2-27b-it-SimPO-37K.Q2_K.gguf) | Q2_K | 9.73GB |\n| [gemma-2-27b-it-SimPO-37K.IQ3_XS.gguf](https://huggingface.co/RichardErkhov/AALF_-_gemma-2-27b-it-SimPO-37K-gguf/blob/main/gemma-2-27b-it-SimPO-37K.IQ3_XS.gguf) | IQ3_XS | 10.76GB |\n| [gemma-2-27b-it-SimPO-37K.IQ3_S.gguf](https://huggingface.co/RichardErkhov/AALF_-_gemma-2-27b-it-SimPO-37K-gguf/blob/main/gemma-2-27b-it-SimPO-37K.IQ3_S.gguf) | IQ3_S | 11.33GB |\n| [gemma-2-27b-it-SimPO-37K.Q3_K_S.gguf](https://huggingface.co/RichardErkhov/AALF_-_gemma-2-27b-it-SimPO-37K-gguf/blob/main/gemma-2-27b-it-SimPO-37K.Q3_K_S.gguf) | Q3_K_S | 11.33GB |\n| [gemma-2-27b-it-SimPO-37K.IQ3_M.gguf](https://huggingface.co/RichardErkhov/AALF_-_gemma-2-27b-it-SimPO-37K-gguf/blob/main/gemma-2-27b-it-SimPO-37K.IQ3_M.gguf) | IQ3_M | 11.6GB |\n| [gemma-2-27b-it-SimPO-37K.Q3_K.gguf](https://huggingface.co/RichardErkhov/AALF_-_gemma-2-27b-it-SimPO-37K-gguf/blob/main/gemma-2-27b-it-SimPO-37K.Q3_K.gguf) | Q3_K | 12.5GB |\n| [gemma-2-27b-it-SimPO-37K.Q3_K_M.gguf](https://huggingface.co/RichardErkhov/AALF_-_gemma-2-27b-it-SimPO-37K-gguf/blob/main/gemma-2-27b-it-SimPO-37K.Q3_K_M.gguf) | Q3_K_M | 12.5GB |\n| [gemma-2-27b-it-SimPO-37K.Q3_K_L.gguf](https://huggingface.co/RichardErkhov/AALF_-_gemma-2-27b-it-SimPO-37K-gguf/blob/main/gemma-2-27b-it-SimPO-37K.Q3_K_L.gguf) | Q3_K_L | 13.52GB |\n| [gemma-2-27b-it-SimPO-37K.IQ4_XS.gguf](https://huggingface.co/RichardErkhov/AALF_-_gemma-2-27b-it-SimPO-37K-gguf/blob/main/gemma-2-27b-it-SimPO-37K.IQ4_XS.gguf) | IQ4_XS | 13.92GB |\n| [gemma-2-27b-it-SimPO-37K.Q4_0.gguf](https://huggingface.co/RichardErkhov/AALF_-_gemma-2-27b-it-SimPO-37K-gguf/blob/main/gemma-2-27b-it-SimPO-37K.Q4_0.gguf) | Q4_0 | 14.56GB |\n| [gemma-2-27b-it-SimPO-37K.IQ4_NL.gguf](https://huggingface.co/RichardErkhov/AALF_-_gemma-2-27b-it-SimPO-37K-gguf/blob/main/gemma-2-27b-it-SimPO-37K.IQ4_NL.gguf) | IQ4_NL | 14.65GB |\n| [gemma-2-27b-it-SimPO-37K.Q4_K_S.gguf](https://huggingface.co/RichardErkhov/AALF_-_gemma-2-27b-it-SimPO-37K-gguf/blob/main/gemma-2-27b-it-SimPO-37K.Q4_K_S.gguf) | Q4_K_S | 14.66GB |\n| [gemma-2-27b-it-SimPO-37K.Q4_K.gguf](https://huggingface.co/RichardErkhov/AALF_-_gemma-2-27b-it-SimPO-37K-gguf/blob/main/gemma-2-27b-it-SimPO-37K.Q4_K.gguf) | Q4_K | 15.5GB |\n| [gemma-2-27b-it-SimPO-37K.Q4_K_M.gguf](https://huggingface.co/RichardErkhov/AALF_-_gemma-2-27b-it-SimPO-37K-gguf/blob/main/gemma-2-27b-it-SimPO-37K.Q4_K_M.gguf) | Q4_K_M | 15.5GB |\n| [gemma-2-27b-it-SimPO-37K.Q4_1.gguf](https://huggingface.co/RichardErkhov/AALF_-_gemma-2-27b-it-SimPO-37K-gguf/blob/main/gemma-2-27b-it-SimPO-37K.Q4_1.gguf) | Q4_1 | 16.07GB |\n| [gemma-2-27b-it-SimPO-37K.Q5_0.gguf](https://huggingface.co/RichardErkhov/AALF_-_gemma-2-27b-it-SimPO-37K-gguf/blob/main/gemma-2-27b-it-SimPO-37K.Q5_0.gguf) | Q5_0 | 17.59GB |\n| [gemma-2-27b-it-SimPO-37K.Q5_K_S.gguf](https://huggingface.co/RichardErkhov/AALF_-_gemma-2-27b-it-SimPO-37K-gguf/blob/main/gemma-2-27b-it-SimPO-37K.Q5_K_S.gguf) | Q5_K_S | 17.59GB |\n| [gemma-2-27b-it-SimPO-37K.Q5_K.gguf](https://huggingface.co/RichardErkhov/AALF_-_gemma-2-27b-it-SimPO-37K-gguf/blob/main/gemma-2-27b-it-SimPO-37K.Q5_K.gguf) | Q5_K | 18.08GB |\n| [gemma-2-27b-it-SimPO-37K.Q5_K_M.gguf](https://huggingface.co/RichardErkhov/AALF_-_gemma-2-27b-it-SimPO-37K-gguf/blob/main/gemma-2-27b-it-SimPO-37K.Q5_K_M.gguf) | Q5_K_M | 18.08GB |\n| [gemma-2-27b-it-SimPO-37K.Q5_1.gguf](https://huggingface.co/RichardErkhov/AALF_-_gemma-2-27b-it-SimPO-37K-gguf/blob/main/gemma-2-27b-it-SimPO-37K.Q5_1.gguf) | Q5_1 | 19.1GB |\n| [gemma-2-27b-it-SimPO-37K.Q6_K.gguf](https://huggingface.co/RichardErkhov/AALF_-_gemma-2-27b-it-SimPO-37K-gguf/blob/main/gemma-2-27b-it-SimPO-37K.Q6_K.gguf) | Q6_K | 20.81GB |\n| [gemma-2-27b-it-SimPO-37K.Q8_0.gguf](https://huggingface.co/RichardErkhov/AALF_-_gemma-2-27b-it-SimPO-37K-gguf/blob/main/gemma-2-27b-it-SimPO-37K.Q8_0.gguf) | Q8_0 | 26.95GB |\n\n\n\n\nOriginal model description:\n---\nlicense: gemma\nlibrary_name: transformers\npipeline_tag: text-generation\nbase_model: google/gemma-2-27b-it\ntags:\n- alignment-handbook\n- generated_from_trainer\n---\n\n# gemma-2-27b-it-SimPO-37K Model Card\n\n## Implementation Details\nWe first followed the [SimPO](https://github.com/princeton-nlp/SimPO) framework to apply [On-Policy Preference Data Generation](https://github.com/princeton-nlp/SimPO/tree/main/on_policy_data_gen) on the [HuggingFaceH4/ultrafeedback_binarized](https://huggingface.co/datasets/HuggingFaceH4/ultrafeedback_binarized) dataset using the [google/gemma-2-27b-it](https://huggingface.co/google/gemma-2-27b-it) model. We then selected prompts where the chosen reward was at least 0.01 higher than the rejected reward, resulting in 37,040 training data points.\n\nModel training was conducted using 8x80G A800 GPUs, leveraging the [alignment-handbook](https://github.com/huggingface/alignment-handbook) library. We used `deepspeed_zero_stage3` with optimizer offloading to the CPU. The `SimPOTrainer` arguments were as follows:\n\n```bash\n# SimPOTrainer arguments\nbf16: true\nbeta: 10\ngamma_beta_ratio: 0.5\ngradient_accumulation_steps: 8\ngradient_checkpointing: true\ngradient_checkpointing_kwargs:\n  use_reentrant: true\nhub_model_id: simpo-exps\nlearning_rate: 8.0e-7\nlog_level: info\nlogging_steps: 1\nlr_scheduler_type: cosine\nmax_length: 2048\nmax_prompt_length: 1800\nnum_train_epochs: 1\noptim: adamw_torch\noutput_dir: outputs/gemma-2-27b-it-SimPO\nrun_name: gemma-2-27b-it-SimPO\nper_device_train_batch_size: 2\npush_to_hub: false\nsave_strategy: \"steps\"\nsave_steps: 100\nsave_total_limit: 20\nseed: 42\nwarmup_ratio: 0.1\nsave_only_model: true\n```\n\n## Citation\n\ngemma model:\n```\n@article{gemma_2024,\n    title={Gemma},\n    url={https://www.kaggle.com/m/3301},\n    DOI={10.34740/KAGGLE/M/3301},\n    publisher={Kaggle},\n    author={Gemma Team},\n    year={2024}\n}\n```\n\nSimPO paper:\n```\n@article{meng2024simpo,\n  title={{SimPO}: Simple preference optimization with a reference-free reward},\n  author={Meng, Yu and Xia, Mengzhou and Chen, Danqi},\n  journal={arXiv preprint arXiv:2405.14734},\n  year={2024}\n}\n```\n\nUltraFeedback paper:\n```\n@article{cui2023ultrafeedback,\n  title={{UltraFeedback}: Boosting language models with high-quality feedback},\n  author={Cui, Ganqu and Yuan, Lifan and Ding, Ning and Yao, Guanming and Zhu, Wei and Ni, Yuan and Xie, Guotong and Liu, Zhiyuan and Sun, Maosong},\n  journal={arXiv preprint arXiv:2310.01377},\n  year={2023}\n}\n```\n\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "arxiv:2405.14734",
    "arxiv:2310.01377",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 0,
  "downloads": 103,
  "gated": false,
  "private": false,
  "last_modified": "2024-09-18T03:56:58.000Z",
  "created_at": "2024-09-17T15:52:52.000Z",
  "pipeline_tag": "",
  "library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
  "_id": "66e9a5d4ffe696289baeb2a2",
  "id": "RichardErkhov/AALF_-_gemma-2-27b-it-SimPO-37K-gguf",
  "modelId": "RichardErkhov/AALF_-_gemma-2-27b-it-SimPO-37K-gguf",
  "sha": "3c3cce221fb8828e0f8a32ed32762526a32cd3ab",
  "createdAt": "2024-09-17T15:52:52.000Z",
  "lastModified": "2024-09-18T03:56:58.000Z",
  "author": "RichardErkhov",
  "downloads": 103,
  "likes": 0,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 24
}