Model Intelligence Sheet
RichardErkhov/CharlesLi_-_OpenELM-1_1B-DPO-full-1-5-gguf overview
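GGUF quantizations, made by Richard Erkhov, of CharlesLi/OpenELM-1_1B-DPO-full-1-5. The original, automatically generated model card states that the model was trained from scratch on an unknown dataset and reports these final results on the evaluation set: loss 1.1836, rewards/accuracies 0.7227, rewards/margins 3.625.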
Downloads: 287
Likes: 0
Pipeline: not set
Library: not set
Visibility: Public
Access: Open
Repository Files & Downloads
22 files detected
Direct downloads for all repository files
| File | Type | Quantization | Size | Link |
|---|---|---|---|---|
| OpenELM-1_1B-DPO-full-1-5.IQ3_M.gguf | GGUF | IQ3_M | 497.10 MB | Download |
| OpenELM-1_1B-DPO-full-1-5.IQ3_S.gguf | GGUF | IQ3_S | 468.05 MB | Download |
| OpenELM-1_1B-DPO-full-1-5.IQ3_XS.gguf | GGUF | IQ3_XS | 450.85 MB | Download |
| OpenELM-1_1B-DPO-full-1-5.IQ4_NL.gguf | GGUF | IQ4_NL | 597.45 MB | Download |
| OpenELM-1_1B-DPO-full-1-5.IQ4_XS.gguf | GGUF | IQ4_XS | 567.46 MB | Download |
| OpenELM-1_1B-DPO-full-1-5.Q2_K.gguf | GGUF | Q2_K | 403.99 MB | Download |
| OpenELM-1_1B-DPO-full-1-5.Q3_K.gguf | GGUF | Q3_K | 529.83 MB | Download |
| OpenELM-1_1B-DPO-full-1-5.Q3_K_L.gguf | GGUF | Q3_K_L | 571.64 MB | Download |
| OpenELM-1_1B-DPO-full-1-5.Q3_K_M.gguf | GGUF | Q3_K_M | 529.83 MB | Download |
| OpenELM-1_1B-DPO-full-1-5.Q3_K_S.gguf | GGUF | Q3_K_S | 468.05 MB | Download |
| OpenELM-1_1B-DPO-full-1-5.Q4_0.gguf | GGUF | Q4_0 | 596.51 MB | Download |
| OpenELM-1_1B-DPO-full-1-5.Q4_1.gguf | GGUF | Q4_1 | 656.97 MB | Download |
| OpenELM-1_1B-DPO-full-1-5.Q4_K.gguf | GGUF | Q4_K | 646.65 MB | Download |
| OpenELM-1_1B-DPO-full-1-5.Q4_K_M.gguf | GGUF | Q4_K_M | 646.65 MB | Download |
| OpenELM-1_1B-DPO-full-1-5.Q4_K_S.gguf | GGUF | Q4_K_S | 597.45 MB | Download |
| OpenELM-1_1B-DPO-full-1-5.Q5_0.gguf | GGUF | Q5_0 | 717.42 MB | Download |
| OpenELM-1_1B-DPO-full-1-5.Q5_1.gguf | GGUF | Q5_1 | 777.87 MB | Download |
| OpenELM-1_1B-DPO-full-1-5.Q5_K.gguf | GGUF | Q5_K | 751.92 MB | Download |
| OpenELM-1_1B-DPO-full-1-5.Q5_K_M.gguf | GGUF | Q5_K_M | 751.92 MB | Download |
| OpenELM-1_1B-DPO-full-1-5.Q5_K_S.gguf | GGUF | Q5_K_S | 717.42 MB | Download |
| OpenELM-1_1B-DPO-full-1-5.Q6_K.gguf | GGUF | Q6_K | 845.88 MB | Download |
| OpenELM-1_1B-DPO-full-1-5.Q8_0.gguf | GGUF | Q8_0 | 1.07 GB | Download |
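The sizes in this table look like binary megabytes, while the quantizer's README (embedded in the metadata further down) lists the same files in GB to two decimals. The following is a minimal sketch, assuming a divisor of 1024, that checks the two listings agree for a few rows; the helper name `mib_to_gib` is illustrative and not part of any tool.

```python
def mib_to_gib(size_mib: float) -> float:
    """Convert binary megabytes to binary gigabytes, rounded to two decimals."""
    return round(size_mib / 1024, 2)


# Sizes copied from the table above (assumed to be MiB) paired with
# the GB figures the README reports for the same quantizations.
rows = {
    "Q2_K": (403.99, 0.39),
    "Q4_K_M": (646.65, 0.63),
    "Q6_K": (845.88, 0.83),
}

for quant, (table_mib, readme_gb) in rows.items():
    converted = mib_to_gib(table_mib)
    status = "matches" if converted == readme_gb else "differs from"
    print(f"{quant}: {table_mib} MB -> {converted} GB ({status} README)")
```

Under that assumption the three sampled rows line up with the README values, which suggests the two listings differ only in units and rounding.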
Model Details
Metadata Inspector
Normalized metadata (stored in metadata_json)
{
"metadata": {},
"card_data": {
"frontmatter": {},
"hero_image_url": "",
"summary": "This model was trained from scratch on an unknown dataset. It achieves the following results on the evaluation set:",
"quick_links": [],
"benchmark_table_html": "",
"readme_markdown": "Quantization made by Richard Erkhov.\n\n[Github](https://github.com/RichardErkhov)\n\n[Discord](https://discord.gg/pvy7H8DZMG)\n\n[Request more models](https://github.com/RichardErkhov/quant_request)\n\n\nOpenELM-1_1B-DPO-full-1-5 - GGUF\n- Model creator: https://huggingface.co/CharlesLi/\n- Original model: https://huggingface.co/CharlesLi/OpenELM-1_1B-DPO-full-1-5/\n\n\n| Name | Quant method | Size |\n| ---- | ---- | ---- |\n| [OpenELM-1_1B-DPO-full-1-5.Q2_K.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-DPO-full-1-5-gguf/blob/main/OpenELM-1_1B-DPO-full-1-5.Q2_K.gguf) | Q2_K | 0.39GB |\n| [OpenELM-1_1B-DPO-full-1-5.IQ3_XS.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-DPO-full-1-5-gguf/blob/main/OpenELM-1_1B-DPO-full-1-5.IQ3_XS.gguf) | IQ3_XS | 0.44GB |\n| [OpenELM-1_1B-DPO-full-1-5.IQ3_S.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-DPO-full-1-5-gguf/blob/main/OpenELM-1_1B-DPO-full-1-5.IQ3_S.gguf) | IQ3_S | 0.46GB |\n| [OpenELM-1_1B-DPO-full-1-5.Q3_K_S.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-DPO-full-1-5-gguf/blob/main/OpenELM-1_1B-DPO-full-1-5.Q3_K_S.gguf) | Q3_K_S | 0.46GB |\n| [OpenELM-1_1B-DPO-full-1-5.IQ3_M.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-DPO-full-1-5-gguf/blob/main/OpenELM-1_1B-DPO-full-1-5.IQ3_M.gguf) | IQ3_M | 0.49GB |\n| [OpenELM-1_1B-DPO-full-1-5.Q3_K.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-DPO-full-1-5-gguf/blob/main/OpenELM-1_1B-DPO-full-1-5.Q3_K.gguf) | Q3_K | 0.52GB |\n| [OpenELM-1_1B-DPO-full-1-5.Q3_K_M.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-DPO-full-1-5-gguf/blob/main/OpenELM-1_1B-DPO-full-1-5.Q3_K_M.gguf) | Q3_K_M | 0.52GB |\n| [OpenELM-1_1B-DPO-full-1-5.Q3_K_L.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-DPO-full-1-5-gguf/blob/main/OpenELM-1_1B-DPO-full-1-5.Q3_K_L.gguf) | Q3_K_L | 0.56GB |\n| 
[OpenELM-1_1B-DPO-full-1-5.IQ4_XS.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-DPO-full-1-5-gguf/blob/main/OpenELM-1_1B-DPO-full-1-5.IQ4_XS.gguf) | IQ4_XS | 0.55GB |\n| [OpenELM-1_1B-DPO-full-1-5.Q4_0.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-DPO-full-1-5-gguf/blob/main/OpenELM-1_1B-DPO-full-1-5.Q4_0.gguf) | Q4_0 | 0.58GB |\n| [OpenELM-1_1B-DPO-full-1-5.IQ4_NL.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-DPO-full-1-5-gguf/blob/main/OpenELM-1_1B-DPO-full-1-5.IQ4_NL.gguf) | IQ4_NL | 0.58GB |\n| [OpenELM-1_1B-DPO-full-1-5.Q4_K_S.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-DPO-full-1-5-gguf/blob/main/OpenELM-1_1B-DPO-full-1-5.Q4_K_S.gguf) | Q4_K_S | 0.58GB |\n| [OpenELM-1_1B-DPO-full-1-5.Q4_K.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-DPO-full-1-5-gguf/blob/main/OpenELM-1_1B-DPO-full-1-5.Q4_K.gguf) | Q4_K | 0.63GB |\n| [OpenELM-1_1B-DPO-full-1-5.Q4_K_M.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-DPO-full-1-5-gguf/blob/main/OpenELM-1_1B-DPO-full-1-5.Q4_K_M.gguf) | Q4_K_M | 0.63GB |\n| [OpenELM-1_1B-DPO-full-1-5.Q4_1.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-DPO-full-1-5-gguf/blob/main/OpenELM-1_1B-DPO-full-1-5.Q4_1.gguf) | Q4_1 | 0.64GB |\n| [OpenELM-1_1B-DPO-full-1-5.Q5_0.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-DPO-full-1-5-gguf/blob/main/OpenELM-1_1B-DPO-full-1-5.Q5_0.gguf) | Q5_0 | 0.7GB |\n| [OpenELM-1_1B-DPO-full-1-5.Q5_K_S.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-DPO-full-1-5-gguf/blob/main/OpenELM-1_1B-DPO-full-1-5.Q5_K_S.gguf) | Q5_K_S | 0.7GB |\n| [OpenELM-1_1B-DPO-full-1-5.Q5_K.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-DPO-full-1-5-gguf/blob/main/OpenELM-1_1B-DPO-full-1-5.Q5_K.gguf) | Q5_K | 0.73GB |\n| 
[OpenELM-1_1B-DPO-full-1-5.Q5_K_M.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-DPO-full-1-5-gguf/blob/main/OpenELM-1_1B-DPO-full-1-5.Q5_K_M.gguf) | Q5_K_M | 0.73GB |\n| [OpenELM-1_1B-DPO-full-1-5.Q5_1.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-DPO-full-1-5-gguf/blob/main/OpenELM-1_1B-DPO-full-1-5.Q5_1.gguf) | Q5_1 | 0.76GB |\n| [OpenELM-1_1B-DPO-full-1-5.Q6_K.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-DPO-full-1-5-gguf/blob/main/OpenELM-1_1B-DPO-full-1-5.Q6_K.gguf) | Q6_K | 0.83GB |\n| [OpenELM-1_1B-DPO-full-1-5.Q8_0.gguf](https://huggingface.co/RichardErkhov/CharlesLi_-_OpenELM-1_1B-DPO-full-1-5-gguf/blob/main/OpenELM-1_1B-DPO-full-1-5.Q8_0.gguf) | Q8_0 | 1.07GB |\n\n\n\n\nOriginal model description:\n---\nlibrary_name: transformers\ntags:\n- trl\n- dpo\n- generated_from_trainer\nmodel-index:\n- name: OpenELM-1_1B-DPO-full-1-5\n results: []\n---\n\n<!-- This model card has been generated automatically according to the information the Trainer had access to. You\nshould probably proofread and complete it, then remove this comment. 
-->\n\n# OpenELM-1_1B-DPO-full-1-5\n\nThis model was trained from scratch on an unknown dataset.\nIt achieves the following results on the evaluation set:\n- Loss: 1.1836\n- Rewards/chosen: -14.0\n- Rewards/rejected: -17.625\n- Rewards/accuracies: 0.7227\n- Rewards/margins: 3.625\n- Logps/rejected: -2048.0\n- Logps/chosen: -1720.0\n- Logits/rejected: 4.2812\n- Logits/chosen: 2.625\n\n## Model description\n\nMore information needed\n\n## Intended uses & limitations\n\nMore information needed\n\n## Training and evaluation data\n\nMore information needed\n\n## Training procedure\n\n### Training hyperparameters\n\nThe following hyperparameters were used during training:\n- learning_rate: 5e-05\n- train_batch_size: 8\n- eval_batch_size: 16\n- seed: 42\n- distributed_type: multi-GPU\n- num_devices: 4\n- gradient_accumulation_steps: 2\n- total_train_batch_size: 64\n- total_eval_batch_size: 64\n- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08\n- lr_scheduler_type: cosine\n- lr_scheduler_warmup_ratio: 0.1\n- num_epochs: 5\n\n### Training results\n\n| Training Loss | Epoch | Step | Validation Loss | Rewards/chosen | Rewards/rejected | Rewards/accuracies | Rewards/margins | Logps/rejected | Logps/chosen | Logits/rejected | Logits/chosen |\n|:-------------:|:------:|:----:|:---------------:|:--------------:|:----------------:|:------------------:|:---------------:|:--------------:|:------------:|:---------------:|:-------------:|\n| 0.6268 | 0.1047 | 100 | 0.6449 | -0.4805 | -0.6680 | 0.6406 | 0.1885 | -356.0 | -366.0 | -9.5625 | -10.0 |\n| 0.5924 | 0.2093 | 200 | 0.5985 | -1.2031 | -1.6172 | 0.6875 | 0.4199 | -450.0 | -438.0 | -12.875 | -13.125 |\n| 0.6197 | 0.3140 | 300 | 0.5811 | -1.375 | -1.8438 | 0.7090 | 0.4668 | -474.0 | -456.0 | -11.75 | -12.1875 |\n| 0.5968 | 0.4186 | 400 | 0.5933 | -2.3125 | -2.8438 | 0.6934 | 0.5273 | -572.0 | -548.0 | -8.5625 | -9.25 |\n| 0.5854 | 0.5233 | 500 | 0.5737 | -1.7422 | -2.2812 | 0.6953 | 0.5352 | -516.0 | -492.0 | -7.7188 | 
-8.625 |\n| 0.5524 | 0.6279 | 600 | 0.5768 | -3.0156 | -3.7031 | 0.6914 | 0.6953 | -660.0 | -620.0 | -7.0312 | -7.7188 |\n| 0.5602 | 0.7326 | 700 | 0.5756 | -3.1562 | -3.9062 | 0.7168 | 0.75 | -680.0 | -636.0 | -5.125 | -6.3438 |\n| 0.5581 | 0.8373 | 800 | 0.5854 | -3.3906 | -4.0312 | 0.6914 | 0.6289 | -692.0 | -656.0 | -5.0938 | -5.9688 |\n| 0.5793 | 0.9419 | 900 | 0.5657 | -3.1719 | -3.9062 | 0.7207 | 0.7383 | -680.0 | -636.0 | -3.9531 | -5.0312 |\n| 0.2783 | 1.0466 | 1000 | 0.6053 | -4.75 | -5.875 | 0.7188 | 1.125 | -876.0 | -792.0 | -2.2188 | -3.3594 |\n| 0.2417 | 1.1512 | 1100 | 0.6139 | -4.7812 | -5.8125 | 0.7070 | 1.0469 | -872.0 | -796.0 | -2.3594 | -4.125 |\n| 0.2429 | 1.2559 | 1200 | 0.5897 | -5.7188 | -6.8125 | 0.7227 | 1.0781 | -968.0 | -892.0 | -0.7188 | -2.1719 |\n| 0.2508 | 1.3605 | 1300 | 0.5948 | -5.4062 | -6.4062 | 0.6914 | 1.0 | -928.0 | -860.0 | -0.0104 | -1.5156 |\n| 0.2169 | 1.4652 | 1400 | 0.6104 | -5.7812 | -6.9062 | 0.7031 | 1.1016 | -976.0 | -896.0 | 0.0820 | -1.75 |\n| 0.2107 | 1.5699 | 1500 | 0.6062 | -6.0625 | -7.2812 | 0.6973 | 1.1953 | -1016.0 | -924.0 | -0.4590 | -2.1719 |\n| 0.2472 | 1.6745 | 1600 | 0.6158 | -5.625 | -6.7188 | 0.7070 | 1.1016 | -960.0 | -880.0 | -2.0312 | -3.9688 |\n| 0.2545 | 1.7792 | 1700 | 0.6170 | -6.25 | -7.5 | 0.7031 | 1.25 | -1040.0 | -944.0 | -1.2578 | -3.2031 |\n| 0.2383 | 1.8838 | 1800 | 0.6061 | -5.625 | -6.75 | 0.7012 | 1.1172 | -964.0 | -880.0 | 0.7383 | -1.1328 |\n| 0.2107 | 1.9885 | 1900 | 0.6135 | -6.5 | -7.7812 | 0.7383 | 1.2578 | -1064.0 | -968.0 | 0.3027 | -1.4297 |\n| 0.0186 | 2.0931 | 2000 | 0.7473 | -8.0625 | -9.875 | 0.7090 | 1.8594 | -1280.0 | -1120.0 | 2.2812 | 0.4980 |\n| 0.03 | 2.1978 | 2100 | 0.8345 | -9.9375 | -12.25 | 0.7070 | 2.2812 | -1512.0 | -1312.0 | 3.2031 | 1.5938 |\n| 0.0284 | 2.3025 | 2200 | 0.7741 | -9.1875 | -11.3125 | 0.7012 | 2.0781 | -1416.0 | -1240.0 | 2.7812 | 1.0156 |\n| 0.0352 | 2.4071 | 2300 | 0.7983 | -9.3125 | -11.3125 | 0.7090 | 2.0156 | -1424.0 | -1248.0 | 2.6406 
| 0.9961 |\n| 0.0345 | 2.5118 | 2400 | 0.8249 | -9.8125 | -12.0 | 0.7266 | 2.1719 | -1488.0 | -1304.0 | 3.2656 | 1.5625 |\n| 0.0192 | 2.6164 | 2500 | 0.8865 | -10.25 | -12.5625 | 0.6973 | 2.2969 | -1544.0 | -1344.0 | 3.5938 | 1.9609 |\n| 0.0261 | 2.7211 | 2600 | 0.7963 | -9.1875 | -11.4375 | 0.7129 | 2.25 | -1432.0 | -1240.0 | 2.7031 | 0.8672 |\n| 0.0315 | 2.8257 | 2700 | 0.7619 | -9.0 | -10.9375 | 0.7109 | 1.9766 | -1384.0 | -1216.0 | 2.8594 | 0.8320 |\n| 0.0293 | 2.9304 | 2800 | 0.8241 | -9.75 | -12.0625 | 0.7070 | 2.2656 | -1496.0 | -1296.0 | 3.1719 | 1.3359 |\n| 0.0071 | 3.0351 | 2900 | 0.8609 | -10.0625 | -12.5 | 0.7188 | 2.3906 | -1536.0 | -1328.0 | 3.1719 | 1.3125 |\n| 0.0099 | 3.1397 | 3000 | 0.9558 | -11.5 | -14.1875 | 0.7051 | 2.6875 | -1704.0 | -1472.0 | 3.4062 | 1.6484 |\n| 0.0079 | 3.2444 | 3100 | 0.9341 | -11.125 | -13.75 | 0.7090 | 2.6562 | -1664.0 | -1432.0 | 3.25 | 1.5078 |\n| 0.0104 | 3.3490 | 3200 | 0.9926 | -11.9375 | -14.8125 | 0.7090 | 2.9062 | -1768.0 | -1512.0 | 3.6719 | 1.9922 |\n| 0.0089 | 3.4537 | 3300 | 0.9665 | -11.9375 | -14.8125 | 0.7188 | 2.875 | -1768.0 | -1512.0 | 3.8594 | 2.2656 |\n| 0.0098 | 3.5583 | 3400 | 0.9548 | -11.1875 | -13.875 | 0.7109 | 2.75 | -1680.0 | -1432.0 | 4.0 | 2.3438 |\n| 0.0109 | 3.6630 | 3500 | 1.0670 | -12.5625 | -15.6875 | 0.7168 | 3.1406 | -1856.0 | -1576.0 | 4.1875 | 2.5312 |\n| 0.0081 | 3.7677 | 3600 | 1.0376 | -12.375 | -15.4375 | 0.7188 | 3.0938 | -1832.0 | -1552.0 | 4.125 | 2.4844 |\n| 0.0081 | 3.8723 | 3700 | 1.0725 | -13.0 | -16.25 | 0.7168 | 3.25 | -1912.0 | -1616.0 | 4.1875 | 2.5938 |\n| 0.0041 | 3.9770 | 3800 | 1.1346 | -13.5 | -17.0 | 0.7188 | 3.4688 | -1984.0 | -1672.0 | 4.2188 | 2.5781 |\n| 0.0036 | 4.0816 | 3900 | 1.1589 | -13.8125 | -17.375 | 0.7168 | 3.5156 | -2024.0 | -1696.0 | 4.25 | 2.625 |\n| 0.0016 | 4.1863 | 4000 | 1.1790 | -14.0625 | -17.625 | 0.7168 | 3.5781 | -2048.0 | -1720.0 | 4.2812 | 2.6719 |\n| 0.0037 | 4.2909 | 4100 | 1.1847 | -14.0625 | -17.625 | 0.7168 | 3.6094 | -2064.0 | 
-1728.0 | 4.3125 | 2.6562 |\n| 0.007 | 4.3956 | 4200 | 1.1905 | -14.1875 | -17.75 | 0.7227 | 3.6406 | -2064.0 | -1736.0 | 4.3125 | 2.6719 |\n| 0.0038 | 4.5003 | 4300 | 1.1835 | -14.0625 | -17.75 | 0.7207 | 3.6406 | -2064.0 | -1728.0 | 4.2812 | 2.6406 |\n| 0.0093 | 4.6049 | 4400 | 1.1819 | -14.0625 | -17.625 | 0.7207 | 3.625 | -2048.0 | -1720.0 | 4.2812 | 2.625 |\n| 0.006 | 4.7096 | 4500 | 1.1817 | -14.0 | -17.625 | 0.7227 | 3.6406 | -2048.0 | -1720.0 | 4.2812 | 2.6094 |\n| 0.0037 | 4.8142 | 4600 | 1.1826 | -14.0 | -17.625 | 0.7227 | 3.6406 | -2048.0 | -1720.0 | 4.25 | 2.6094 |\n| 0.0059 | 4.9189 | 4700 | 1.1836 | -14.0 | -17.625 | 0.7227 | 3.625 | -2048.0 | -1720.0 | 4.2812 | 2.625 |\n\n\n### Framework versions\n\n- Transformers 4.44.2\n- Pytorch 2.1.2\n- Datasets 2.18.0\n- Tokenizers 0.19.1\n\n\n",
"related_quantizations": []
},
"tags": [
"gguf",
"endpoints_compatible",
"region:us",
"conversational"
],
"likes": 0,
"downloads": 287,
"gated": false,
"private": false,
"last_modified": "2025-04-01T14:07:13.000Z",
"created_at": "2025-04-01T13:49:33.000Z",
"pipeline_tag": "",
"library_name": ""
}
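Two of the derived figures in the embedded README can be reproduced from the other values it lists. The sketch below is only a sanity check, not the training code: it assumes the total batch sizes are products of per-device size, device count, and accumulation steps, and that the logged reward margin is the chosen reward minus the rejected reward. Variable and function names are illustrative.

```python
# Values copied from the "Training hyperparameters" list in the README.
train_batch_size = 8             # per device
eval_batch_size = 16             # per device
num_devices = 4
gradient_accumulation_steps = 2

# Effective batch sizes implied by those settings.
total_train_batch_size = train_batch_size * num_devices * gradient_accumulation_steps
total_eval_batch_size = eval_batch_size * num_devices

print(total_train_batch_size)  # 64, as listed in the README
print(total_eval_batch_size)   # 64, as listed in the README


def reward_margin(chosen: float, rejected: float) -> float:
    """DPO reward margin: how much higher the chosen reward is than the rejected one."""
    return chosen - rejected


# Final evaluation row: Rewards/chosen = -14.0, Rewards/rejected = -17.625.
final_margin = reward_margin(-14.0, -17.625)
print(final_margin)  # 3.625, the reported Rewards/margins
```

Some intermediate rows of the training table do not match this difference exactly (step 4000 reports 3.5781, while the difference of its logged rewards is 3.5625), presumably because the logged values are rounded or averaged separately.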
Source payload excerpt (from Hugging Face API)
{
"_id": "67ebeeed700f3039d192c944",
"id": "RichardErkhov/CharlesLi_-_OpenELM-1_1B-DPO-full-1-5-gguf",
"modelId": "RichardErkhov/CharlesLi_-_OpenELM-1_1B-DPO-full-1-5-gguf",
"sha": "8bca72b799d3fbd5465a944662ee0b07c214a17c",
"createdAt": "2025-04-01T13:49:33.000Z",
"lastModified": "2025-04-01T14:07:13.000Z",
"author": "RichardErkhov",
"downloads": 287,
"likes": 0,
"gated": false,
"private": false,
"pipeline_tag": "",
"library_name": "",
"siblings_count": 24
}
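The `id` field in this payload, combined with a file name from the repository table, is enough to build a direct download URL. The sketch below assumes the Hub's `/resolve/<revision>/` path layout (the README links use `/blob/main/`, which points at the web viewer instead) and assumes the files live on the `main` branch; `resolve_url` is a hypothetical helper, not a library function.

```python
from urllib.parse import quote

HF_BASE = "https://huggingface.co"


def resolve_url(repo_id: str, filename: str, revision: str = "main") -> str:
    """Build a direct-download URL for one file in a Hub repository.

    Each component is percent-encoded; the slash in repo_id is preserved
    because quote() treats "/" as safe by default.
    """
    return f"{HF_BASE}/{quote(repo_id)}/resolve/{quote(revision)}/{quote(filename)}"


# Repository id taken from the payload above, file name from the table.
repo_id = "RichardErkhov/CharlesLi_-_OpenELM-1_1B-DPO-full-1-5-gguf"
filename = "OpenELM-1_1B-DPO-full-1-5.Q4_K_M.gguf"

url = resolve_url(repo_id, filename)
print(url)
```

If the `huggingface_hub` package is installed, `hf_hub_download(repo_id=repo_id, filename=filename)` is an alternative that also handles caching; the URL form is shown here because it needs no extra dependency.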