GraySoft
Model Intelligence Sheet

richarderkhov/charlesli_-_openelm-1_1b-cpo-gguf overview

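This repository holds GGUF quantizations, made by Richard Erkhov, of CharlesLi/OpenELM-1_1B-CPO. According to the original model card, the model was trained from scratch on an unknown dataset and reaches a loss of 2.1904, an NLL loss of 1.1719, and a reward accuracy of 0.5918 on the evaluation set.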

Tags: gguf, endpoints_compatible, region:us, conversational
Downloads: 273
Likes: 0
Pipeline: (not set)
Library: (not set)
Visibility: Public
Access: Open

Repository Files & Downloads

22 files detected
Direct downloads for all repository files
File                            Type  Quantization  Size
OpenELM-1_1B-CPO.IQ3_M.gguf     GGUF  IQ3_M         497.10 MB
OpenELM-1_1B-CPO.IQ3_S.gguf     GGUF  IQ3_S         468.05 MB
OpenELM-1_1B-CPO.IQ3_XS.gguf    GGUF  IQ3_XS        450.85 MB
OpenELM-1_1B-CPO.IQ4_NL.gguf    GGUF  IQ4_NL        597.45 MB
OpenELM-1_1B-CPO.IQ4_XS.gguf    GGUF  IQ4_XS        567.46 MB
OpenELM-1_1B-CPO.Q2_K.gguf      GGUF  Q2_K          403.99 MB
OpenELM-1_1B-CPO.Q3_K.gguf      GGUF  Q3_K          529.83 MB
OpenELM-1_1B-CPO.Q3_K_L.gguf    GGUF  Q3_K_L        571.64 MB
OpenELM-1_1B-CPO.Q3_K_M.gguf    GGUF  Q3_K_M        529.83 MB
OpenELM-1_1B-CPO.Q3_K_S.gguf    GGUF  Q3_K_S        468.05 MB
OpenELM-1_1B-CPO.Q4_0.gguf      GGUF  Q4_0          596.51 MB
OpenELM-1_1B-CPO.Q4_1.gguf      GGUF  Q4_1          656.97 MB
OpenELM-1_1B-CPO.Q4_K.gguf      GGUF  Q4_K          646.65 MB
OpenELM-1_1B-CPO.Q4_K_M.gguf    GGUF  Q4_K_M        646.65 MB
OpenELM-1_1B-CPO.Q4_K_S.gguf    GGUF  Q4_K_S        597.45 MB
OpenELM-1_1B-CPO.Q5_0.gguf      GGUF  Q5_0          717.42 MB
OpenELM-1_1B-CPO.Q5_1.gguf      GGUF  Q5_1          777.87 MB
OpenELM-1_1B-CPO.Q5_K.gguf      GGUF  Q5_K          751.92 MB
OpenELM-1_1B-CPO.Q5_K_M.gguf    GGUF  Q5_K_M        751.92 MB
OpenELM-1_1B-CPO.Q5_K_S.gguf    GGUF  Q5_K_S        717.42 MB
OpenELM-1_1B-CPO.Q6_K.gguf      GGUF  Q6_K          845.88 MB
OpenELM-1_1B-CPO.Q8_0.gguf      GGUF  Q8_0          1.07 GB
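
The file names above follow the pattern OpenELM-1_1B-CPO.<quantization>.gguf, so a direct-download URL can be derived from the quantization name alone. The following is a minimal sketch, assuming the files live on the `main` revision of the Hugging Face repository named in the README; the helper name `gguf_download_url` is illustrative and not part of any library.

```python
from urllib.parse import quote

# Repository id as it appears in the README links and the API payload.
REPO_ID = "RichardErkhov/CharlesLi_-_OpenELM-1_1B-CPO-gguf"


def gguf_download_url(quant: str, repo_id: str = REPO_ID, revision: str = "main") -> str:
    """Build a direct-download URL for one quantization (hypothetical helper)."""
    filename = f"OpenELM-1_1B-CPO.{quant}.gguf"
    # "resolve" serves the raw file; the README links use "blob",
    # which opens the file's web page instead.
    return f"https://huggingface.co/{repo_id}/resolve/{revision}/{quote(filename)}"


print(gguf_download_url("Q4_K_M"))
```

Building the URL from the quantization name keeps the list of files in one place; the revision parameter is an assumption and could instead be pinned to the commit SHA shown under Model Details.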

Model Details

Model Slug: richarderkhov/charlesli_-_openelm-1_1b-cpo-gguf
Author: RichardErkhov
Pipeline Task: (not set)
Library: (not set)
Created: 2025-04-01
Last Modified: 2025-04-01
Gated: No
Private: No
HF SHA: 13689c90f4a2331751fe162d3391aa5d5158b300
License: Unknown
Language: Unknown
Base Model: Unknown

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "frontmatter": {},
    "hero_image_url": "",
    "summary": "This model was trained from scratch on an unknown dataset. It achieves the following results on the evaluation set:",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "Quantization made by Richard Erkhov.\n\n[Github](https://github.com/RichardErkhov)\n\n[Discord](https://discord.gg/pvy7H8DZMG)\n\n[Request more models](https://github.com/RichardErkhov/quant_request)\n\n\nOpenELM-1_1B-CPO - GGUF\n- Model creator: https://huggingface.co/CharlesLi/\n- Original model: https://huggingface.co/CharlesLi/OpenELM-1_1B-CPO/\n\n\n| Name | Quant method | Size |\n| ---- | ---- | ---- |\n| [OpenELM-1_1B-CPO.Q2_K.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-CPO-gguf/blob/main/OpenELM-1_1B-CPO.Q2_K.gguf) | Q2_K | 0.39GB |\n| [OpenELM-1_1B-CPO.IQ3_XS.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-CPO-gguf/blob/main/OpenELM-1_1B-CPO.IQ3_XS.gguf) | IQ3_XS | 0.44GB |\n| [OpenELM-1_1B-CPO.IQ3_S.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-CPO-gguf/blob/main/OpenELM-1_1B-CPO.IQ3_S.gguf) | IQ3_S | 0.46GB |\n| [OpenELM-1_1B-CPO.Q3_K_S.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-CPO-gguf/blob/main/OpenELM-1_1B-CPO.Q3_K_S.gguf) | Q3_K_S | 0.46GB |\n| [OpenELM-1_1B-CPO.IQ3_M.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-CPO-gguf/blob/main/OpenELM-1_1B-CPO.IQ3_M.gguf) | IQ3_M | 0.49GB |\n| [OpenELM-1_1B-CPO.Q3_K.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-CPO-gguf/blob/main/OpenELM-1_1B-CPO.Q3_K.gguf) | Q3_K | 0.52GB |\n| [OpenELM-1_1B-CPO.Q3_K_M.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-CPO-gguf/blob/main/OpenELM-1_1B-CPO.Q3_K_M.gguf) | Q3_K_M | 0.52GB |\n| [OpenELM-1_1B-CPO.Q3_K_L.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-CPO-gguf/blob/main/OpenELM-1_1B-CPO.Q3_K_L.gguf) | Q3_K_L | 0.56GB |\n| [OpenELM-1_1B-CPO.IQ4_XS.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-CPO-gguf/blob/main/OpenELM-1_1B-CPO.IQ4_XS.gguf) | IQ4_XS | 0.55GB |\n| 
[OpenELM-1_1B-CPO.Q4_0.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-CPO-gguf/blob/main/OpenELM-1_1B-CPO.Q4_0.gguf) | Q4_0 | 0.58GB |\n| [OpenELM-1_1B-CPO.IQ4_NL.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-CPO-gguf/blob/main/OpenELM-1_1B-CPO.IQ4_NL.gguf) | IQ4_NL | 0.58GB |\n| [OpenELM-1_1B-CPO.Q4_K_S.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-CPO-gguf/blob/main/OpenELM-1_1B-CPO.Q4_K_S.gguf) | Q4_K_S | 0.58GB |\n| [OpenELM-1_1B-CPO.Q4_K.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-CPO-gguf/blob/main/OpenELM-1_1B-CPO.Q4_K.gguf) | Q4_K | 0.63GB |\n| [OpenELM-1_1B-CPO.Q4_K_M.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-CPO-gguf/blob/main/OpenELM-1_1B-CPO.Q4_K_M.gguf) | Q4_K_M | 0.63GB |\n| [OpenELM-1_1B-CPO.Q4_1.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-CPO-gguf/blob/main/OpenELM-1_1B-CPO.Q4_1.gguf) | Q4_1 | 0.64GB |\n| [OpenELM-1_1B-CPO.Q5_0.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-CPO-gguf/blob/main/OpenELM-1_1B-CPO.Q5_0.gguf) | Q5_0 | 0.7GB |\n| [OpenELM-1_1B-CPO.Q5_K_S.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-CPO-gguf/blob/main/OpenELM-1_1B-CPO.Q5_K_S.gguf) | Q5_K_S | 0.7GB |\n| [OpenELM-1_1B-CPO.Q5_K.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-CPO-gguf/blob/main/OpenELM-1_1B-CPO.Q5_K.gguf) | Q5_K | 0.73GB |\n| [OpenELM-1_1B-CPO.Q5_K_M.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-CPO-gguf/blob/main/OpenELM-1_1B-CPO.Q5_K_M.gguf) | Q5_K_M | 0.73GB |\n| [OpenELM-1_1B-CPO.Q5_1.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-CPO-gguf/blob/main/OpenELM-1_1B-CPO.Q5_1.gguf) | Q5_1 | 0.76GB |\n| [OpenELM-1_1B-CPO.Q6_K.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-CPO-gguf/blob/main/OpenELM-1_1B-CPO.Q6_K.gguf) | Q6_K | 0.83GB |\n| 
[OpenELM-1_1B-CPO.Q8_0.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-CPO-gguf/blob/main/OpenELM-1_1B-CPO.Q8_0.gguf) | Q8_0 | 1.07GB |\n\n\n\n\nOriginal model description:\n---\nlibrary_name: transformers\ntags:\n- trl\n- cpo\n- alignment-handbook\n- generated_from_trainer\nmodel-index:\n- name: OpenELM-1_1B-CPO\n  results: []\n---\n\n<!-- This model card has been generated automatically according to the information the Trainer had access to. You\nshould probably proofread and complete it, then remove this comment. -->\n\n# OpenELM-1_1B-CPO\n\nThis model was trained from scratch on an unknown dataset.\nIt achieves the following results on the evaluation set:\n- Logits/chosen: -8.875\n- Logits/rejected: -7.5312\n- Logps/chosen: -364.0\n- Logps/rejected: -444.0\n- Loss: 2.1904\n- Nll Loss: 1.1719\n- Rewards/accuracies: 0.5918\n- Rewards/chosen: -3.6406\n- Rewards/margins: 0.8008\n- Rewards/rejected: -4.4375\n\n## Model description\n\nMore information needed\n\n## Intended uses & limitations\n\nMore information needed\n\n## Training and evaluation data\n\nMore information needed\n\n## Training procedure\n\n### Training hyperparameters\n\nThe following hyperparameters were used during training:\n- learning_rate: 5e-05\n- train_batch_size: 8\n- eval_batch_size: 16\n- seed: 42\n- distributed_type: multi-GPU\n- num_devices: 4\n- gradient_accumulation_steps: 2\n- total_train_batch_size: 64\n- total_eval_batch_size: 64\n- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08\n- lr_scheduler_type: cosine\n- lr_scheduler_warmup_ratio: 0.1\n- num_epochs: 3\n\n### Training results\n\n| Training Loss | Epoch  | Step | Logits/chosen | Logits/rejected | Logps/chosen | Logps/rejected | Validation Loss | Nll Loss | Rewards/accuracies | Rewards/chosen | Rewards/margins | Rewards/rejected 
|\n|:-------------:|:------:|:----:|:-------------:|:---------------:|:------------:|:--------------:|:---------------:|:--------:|:------------------:|:--------------:|:---------------:|:----------------:|\n| 2.4271        | 0.1047 | 100  | -12.3125      | -12.125         | -336.0       | -328.0         | 2.2959          | 1.0859   | 0.4980             | -3.3594        | -0.0850         | -3.2812          |\n| 2.2538        | 0.2093 | 200  | -9.875        | -9.5            | -338.0       | -346.0         | 2.1836          | 1.0938   | 0.5234             | -3.3906        | 0.0640          | -3.4531          |\n| 2.1253        | 0.3140 | 300  | -11.4375      | -11.0           | -346.0       | -360.0         | 2.1307          | 1.1172   | 0.5176             | -3.4531        | 0.1416          | -3.5938          |\n| 2.0609        | 0.4186 | 400  | -11.125       | -10.625         | -332.0       | -344.0         | 2.1359          | 1.0703   | 0.5293             | -3.3281        | 0.1187          | -3.4375          |\n| 2.1905        | 0.5233 | 500  | -9.3125       | -8.5            | -338.0       | -352.0         | 2.1286          | 1.0859   | 0.5254             | -3.375         | 0.1357          | -3.5156          |\n| 2.1304        | 0.6279 | 600  | -10.625       | -9.625          | -360.0       | -398.0         | 2.1410          | 1.1562   | 0.5723             | -3.6094        | 0.3672          | -3.9688          |\n| 2.2554        | 0.7326 | 700  | -9.6875       | -8.5625         | -374.0       | -416.0         | 2.1848          | 1.2031   | 0.5664             | -3.7344        | 0.4258          | -4.1562          |\n| 2.0796        | 0.8373 | 800  | -7.8438       | -7.0312         | -346.0       | -374.0         | 2.1224          | 1.1172   | 0.5469             | -3.4531        | 0.2852          | -3.75            |\n| 2.1021        | 0.9419 | 900  | -6.2812       | -5.2812         | -350.0       | -390.0         | 2.1099          | 1.1328   | 0.5723             | 
-3.5           | 0.4062          | -3.9062          |\n| 1.5182        | 1.0471 | 1000 | -10.625       | -9.375          | -350.0       | -386.0         | 2.1662          | 1.125    | 0.5664             | -3.5           | 0.3633          | -3.8594          |\n| 1.4917        | 1.1518 | 1100 | -7.875        | -6.4688         | -356.0       | -400.0         | 2.1588          | 1.1484   | 0.5703             | -3.5625        | 0.4395          | -4.0             |\n| 1.5219        | 1.2564 | 1200 | -7.7812       | -6.6562         | -364.0       | -420.0         | 2.1449          | 1.1719   | 0.5938             | -3.625         | 0.5586          | -4.1875          |\n| 1.5292        | 1.3611 | 1300 | -8.875        | -7.75           | -354.0       | -402.0         | 2.1489          | 1.1406   | 0.5742             | -3.5312        | 0.4785          | -4.0             |\n| 1.4257        | 1.4657 | 1400 | -9.25         | -7.7188         | -358.0       | -410.0         | 2.1193          | 1.1562   | 0.5801             | -3.5781        | 0.5156          | -4.0938          |\n| 1.4366        | 1.5704 | 1500 | -8.9375       | -7.6875         | -358.0       | -416.0         | 2.0983          | 1.1562   | 0.5898             | -3.5938        | 0.5586          | -4.1562          |\n| 1.5246        | 1.6750 | 1600 | -6.9062       | -5.4688         | -358.0       | -420.0         | 2.1191          | 1.1562   | 0.5938             | -3.5781        | 0.625           | -4.2188          |\n| 1.4534        | 1.7797 | 1700 | -10.0625      | -9.0625         | -348.0       | -404.0         | 2.0829          | 1.1172   | 0.5762             | -3.4688        | 0.5625          | -4.0312          |\n| 1.4551        | 1.8844 | 1800 | -8.1875       | -6.8438         | -356.0       | -416.0         | 2.1033          | 1.1484   | 0.5898             | -3.5625        | 0.6016          | -4.1562          |\n| 1.4969        | 1.9890 | 1900 | -9.3125       | -8.125          | -354.0       | -412.0         | 
2.1046          | 1.1406   | 0.5762             | -3.5312        | 0.5938          | -4.125           |\n| 0.9984        | 2.0937 | 2000 | -9.1875       | -7.9375         | -364.0       | -428.0         | 2.1806          | 1.1719   | 0.5781             | -3.6406        | 0.6367          | -4.2812          |\n| 0.9885        | 2.1983 | 2100 | -8.6875       | -7.4062         | -370.0       | -448.0         | 2.1927          | 1.1875   | 0.5801             | -3.6875        | 0.7930          | -4.5             |\n| 0.9814        | 2.3030 | 2200 | -8.8125       | -7.5            | -362.0       | -436.0         | 2.1867          | 1.1719   | 0.5742             | -3.625         | 0.7266          | -4.3438          |\n| 0.9844        | 2.4076 | 2300 | -8.375        | -7.125          | -368.0       | -452.0         | 2.1905          | 1.1875   | 0.5996             | -3.6875        | 0.8438          | -4.5312          |\n| 0.9931        | 2.5123 | 2400 | -8.6875       | -7.375          | -364.0       | -442.0         | 2.1843          | 1.1719   | 0.5820             | -3.6406        | 0.7930          | -4.4375          |\n| 0.9537        | 2.6170 | 2500 | -8.8125       | -7.5            | -364.0       | -446.0         | 2.1907          | 1.1719   | 0.5898             | -3.6406        | 0.8125          | -4.4688          |\n| 0.9512        | 2.7216 | 2600 | -8.8125       | -7.5            | -364.0       | -446.0         | 2.1918          | 1.1719   | 0.5898             | -3.6406        | 0.8086          | -4.4375          |\n| 0.9604        | 2.8263 | 2700 | -8.875        | -7.5312         | -364.0       | -442.0         | 2.1906          | 1.1719   | 0.5879             | -3.6406        | 0.7969          | -4.4375          |\n| 1.0208        | 2.9309 | 2800 | -8.875        | -7.5312         | -364.0       | -444.0         | 2.1904          | 1.1719   | 0.5918             | -3.6406        | 0.8008          | -4.4375          |\n\n\n### Framework versions\n\n- Transformers 
4.44.2\n- Pytorch 2.3.0\n- Datasets 3.0.0\n- Tokenizers 0.19.1\n\n\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 0,
  "downloads": 273,
  "gated": false,
  "private": false,
  "last_modified": "2025-04-01T13:38:16.000Z",
  "created_at": "2025-04-01T13:23:25.000Z",
  "pipeline_tag": "",
  "library_name": ""
}
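
The README embedded in the metadata above lists train_batch_size 8, eval_batch_size 16, num_devices 4, gradient_accumulation_steps 2, and totals of 64 for both training and evaluation. A minimal sketch of that arithmetic follows, assuming the usual convention that the total batch size is the per-device size multiplied by the device count and the accumulation steps; the function name `effective_batch_size` is illustrative.

```python
def effective_batch_size(per_device: int, num_devices: int, grad_accum_steps: int = 1) -> int:
    """Examples contributing to one optimizer step (or one eval pass step)."""
    return per_device * num_devices * grad_accum_steps


# Values taken from the "Training hyperparameters" list in the README.
train_total = effective_batch_size(per_device=8, num_devices=4, grad_accum_steps=2)
# Evaluation does not accumulate gradients, so only the device count multiplies in.
eval_total = effective_batch_size(per_device=16, num_devices=4)

print(train_total, eval_total)  # 64 64
```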
Source payload excerpt (from Hugging Face API)
{
  "_id": "67ebe8cd4f4e3d8f0106184c",
  "id": "RichardErkhov/CharlesLi_-_OpenELM-1_1B-CPO-gguf",
  "modelId": "RichardErkhov/CharlesLi_-_OpenELM-1_1B-CPO-gguf",
  "sha": "13689c90f4a2331751fe162d3391aa5d5158b300",
  "createdAt": "2025-04-01T13:23:25.000Z",
  "lastModified": "2025-04-01T13:38:16.000Z",
  "author": "RichardErkhov",
  "downloads": 273,
  "likes": 0,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 24
}
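
The createdAt and lastModified fields in the payload are UTC timestamps with millisecond precision. The sketch below, which assumes that exact string format, parses both and computes how long after creation the repository was last modified; `parse_hf_timestamp` is a hypothetical helper name.

```python
from datetime import datetime, timezone


def parse_hf_timestamp(value: str) -> datetime:
    """Parse a timestamp such as '2025-04-01T13:23:25.000Z' as UTC."""
    parsed = datetime.strptime(value, "%Y-%m-%dT%H:%M:%S.%fZ")
    return parsed.replace(tzinfo=timezone.utc)


# Only the two timestamp fields from the payload excerpt are needed here.
payload = {
    "createdAt": "2025-04-01T13:23:25.000Z",
    "lastModified": "2025-04-01T13:38:16.000Z",
}

elapsed = parse_hf_timestamp(payload["lastModified"]) - parse_hf_timestamp(payload["createdAt"])
print(elapsed)  # 0:14:51
```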