afrideva/zephyr-7b-gemma-sft-african-ultrachat-200k-gguf is indexed on GraySoft with repository links, GGUF quantization files, and Hugging Face metadata. Use this page to choose a quantization to run locally in guIDE or another GGUF-compatible runtime. See related models in the same shard below.
Model Intelligence Sheet
afrideva/zephyr-7b-gemma-sft-african-ultrachat-200k-gguf overview
Quantized GGUF model files for zephyr-7b-gemma-sft-african-ultrachat-200k from masakhane
| Field | Value |
|---|---|
| Downloads | 123 |
| Likes | 1 |
| Pipeline | text-generation |
| Library | (not set) |
| Visibility | Public |
| Access | Open |
Repository Files & Downloads
7 files detected
Direct downloads for all repository files
| File | Type | Quantization | Size | Link |
|---|---|---|---|---|
| zephyr-7b-gemma-sft-african-ultrachat-200k.Q2_K.gguf | GGUF | Q2_K | 3.24 GB | Download |
| zephyr-7b-gemma-sft-african-ultrachat-200k.Q3_K_M.gguf | GGUF | Q3_K_M | 4.07 GB | Download |
| zephyr-7b-gemma-sft-african-ultrachat-200k.Q4_K_M.gguf | GGUF | Q4_K_M | 4.96 GB | Download |
| zephyr-7b-gemma-sft-african-ultrachat-200k.Q5_K_M.gguf | GGUF | Q5_K_M | 5.72 GB | Download |
| zephyr-7b-gemma-sft-african-ultrachat-200k.Q6_K.gguf | GGUF | Q6_K | 6.53 GB | Download |
| zephyr-7b-gemma-sft-african-ultrachat-200k.Q8_0.gguf | GGUF | Q8_0 | 8.45 GB | Download |
| zephyr-7b-gemma-sft-african-ultrachat-200k.fp16.gguf | GGUF | FP16 | 15.91 GB | Download |
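As a rough aid for choosing among the files above, the following sketch picks the largest file that fits a given memory budget and builds a download URL for it. The sizes are copied from the table and the labels are the file-name suffixes. The `headroom_gb` allowance for context and runtime overhead is an assumption rather than a measured figure, and the URL follows the usual Hugging Face `resolve/main` pattern instead of a link taken from this page. The helper names `pick_quant` and `download_url` are hypothetical.

```python
from typing import Optional

REPO_ID = "afrideva/zephyr-7b-gemma-sft-african-ultrachat-200k-GGUF"
BASE_NAME = "zephyr-7b-gemma-sft-african-ultrachat-200k"

# (file-name suffix, file size in GB), copied from the table above.
QUANT_SIZES_GB = [
    ("Q2_K", 3.24),
    ("Q3_K_M", 4.07),
    ("Q4_K_M", 4.96),
    ("Q5_K_M", 5.72),
    ("Q6_K", 6.53),
    ("Q8_0", 8.45),
    ("fp16", 15.91),
]


def pick_quant(budget_gb: float, headroom_gb: float = 1.0) -> Optional[str]:
    """Return the largest file whose size plus headroom fits the budget.

    headroom_gb is an assumed allowance for context and runtime overhead.
    Returns None when no file fits.
    """
    fitting = [
        (size, label)
        for label, size in QUANT_SIZES_GB
        if size + headroom_gb <= budget_gb
    ]
    return max(fitting)[1] if fitting else None


def download_url(label: str) -> str:
    """Build a direct URL, assuming the file lives on the repo's main branch."""
    return f"https://huggingface.co/{REPO_ID}/resolve/main/{BASE_NAME}.{label}.gguf"


if __name__ == "__main__":
    choice = pick_quant(budget_gb=8.0)
    if choice is not None:
        print(choice, download_url(choice))
```

File size is only a lower bound on memory use; actual requirements depend on the runtime and the context length, so the headroom value should be adjusted to the target machine.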
Model Details
Metadata Inspector
Normalized metadata (stored in metadata_json)
{
"metadata": {},
"card_data": {
"base_model": "masakhane/zephyr-7b-gemma-sft-african-ultrachat-200k",
"datasets": [
"masakhane/african-ultrachat",
"israel/untrachat_en"
],
"inference": true,
"license": "gemma",
"model-index": [
{
"name": "zephyr-7b-gemma-sft-african-ultrachat-2000k",
"results": []
}
],
"model_creator": "masakhane",
"model_name": "zephyr-7b-gemma-sft-african-ultrachat-200k",
"pipeline_tag": "text-generation",
"quantized_by": "afrideva",
"tags": [
"alignment-handbook",
"trl",
"sft",
"generated_from_trainer",
"trl",
"sft",
"generated_from_trainer",
"gguf",
"ggml",
"quantized"
],
"frontmatter": {
"base_model": "masakhane/zephyr-7b-gemma-sft-african-ultrachat-200k",
"datasets": [
"masakhane/african-ultrachat",
"israel/untrachat_en"
],
"inference": "true",
"license": "gemma",
"model_creator": "masakhane",
"model_name": "zephyr-7b-gemma-sft-african-ultrachat-200k",
"pipeline_tag": "text-generation",
"quantized_by": "afrideva",
"tags": [
"alignment-handbook",
"trl",
"sft",
"generated_from_trainer",
"trl",
"sft",
"generated_from_trainer",
"gguf",
"ggml",
"quantized"
]
},
"hero_image_url": "",
"summary": "Quantized GGUF model files for zephyr-7b-gemma-sft-african-ultrachat-200k from masakhane",
"quick_links": [],
"benchmark_table_html": "",
"readme_markdown": "---\nbase_model: masakhane/zephyr-7b-gemma-sft-african-ultrachat-200k\ndatasets:\n- masakhane/african-ultrachat\n- israel/untrachat_en\ninference: true\nlicense: gemma\nmodel-index:\n- name: zephyr-7b-gemma-sft-african-ultrachat-2000k\n results: []\nmodel_creator: masakhane\nmodel_name: zephyr-7b-gemma-sft-african-ultrachat-200k\npipeline_tag: text-generation\nquantized_by: afrideva\ntags:\n- alignment-handbook\n- trl\n- sft\n- generated_from_trainer\n- trl\n- sft\n- generated_from_trainer\n- gguf\n- ggml\n- quantized\n---\n\n# zephyr-7b-gemma-sft-african-ultrachat-200k-GGUF\n\nQuantized GGUF model files for [zephyr-7b-gemma-sft-african-ultrachat-200k](https://huggingface.co/masakhane/zephyr-7b-gemma-sft-african-ultrachat-200k) from [masakhane](https://huggingface.co/masakhane)\n\n## Original Model Card:\n\n<!-- This model card has been generated automatically according to the information the Trainer had access to. You\nshould probably proofread and complete it, then remove this comment. -->\n\n# zephyr-7b-gemma-sft-african-ultrachat-2000k\n\nThis model is a fine-tuned version of [google/gemma-7b](https://huggingface.co/google/gemma-7b) on the masakhane/african-ultrachat and the israel/untrachat_en datasets.\nIt achieves the following results on the evaluation set:\n- Loss: 1.1549\n\n## Model description\n\nMore information needed\n\n## Intended uses & limitations\n\nMore information needed\n\n## Training and evaluation data\n\nMore information needed\n\n## Training procedure\n\n### Training hyperparameters\n\nThe following hyperparameters were used during training:\n- learning_rate: 1e-05\n- train_batch_size: 1\n- eval_batch_size: 1\n- seed: 42\n- distributed_type: multi-GPU\n- num_devices: 8\n- gradient_accumulation_steps: 2\n- total_train_batch_size: 16\n- total_eval_batch_size: 8\n- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08\n- lr_scheduler_type: cosine\n- lr_scheduler_warmup_ratio: 0.1\n- num_epochs: 3\n\n### Training results\n\n| Training Loss | Epoch | Step | Validation Loss |\n|:-------------:|:-----:|:-----:|:---------------:|\n| 1.0785 | 1.0 | 17748 | 1.2602 |\n| 0.6614 | 2.0 | 35496 | 1.1089 |\n| 0.2983 | 3.0 | 53244 | 1.1549 |\n\n\n### Framework versions\n\n- Transformers 4.39.0.dev0\n- Pytorch 2.2.1+cu121\n- Datasets 2.14.6\n- Tokenizers 0.15.2",
"related_quantizations": []
},
"tags": [
"gguf",
"alignment-handbook",
"trl",
"sft",
"generated_from_trainer",
"ggml",
"quantized",
"text-generation",
"dataset:masakhane/african-ultrachat",
"dataset:israel/untrachat_en",
"base_model:masakhane/zephyr-7b-gemma-sft-african-ultrachat-200k",
"base_model:quantized:masakhane/zephyr-7b-gemma-sft-african-ultrachat-200k",
"license:gemma",
"endpoints_compatible",
"region:us",
"conversational"
],
"likes": 1,
"downloads": 123,
"gated": false,
"private": false,
"last_modified": "2024-05-12T05:28:00.000Z",
"created_at": "2024-05-12T03:39:31.000Z",
"pipeline_tag": "text-generation",
"library_name": ""
}
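The training hyperparameters recorded in the README field above are mutually consistent, which the following sketch checks. It assumes the usual Trainer convention that the effective batch size equals the per-device batch size times the number of devices times the gradient accumulation steps. The estimate of sequences seen per epoch follows from that assumption and is not stated in the model card.

```python
# Values copied from the "Training hyperparameters" list in the README.
train_batch_size = 1
eval_batch_size = 1
num_devices = 8
gradient_accumulation_steps = 2

# Assumed convention: per-device batch * devices * accumulation steps.
total_train = train_batch_size * num_devices * gradient_accumulation_steps
# Evaluation does not accumulate gradients, so only the devices multiply.
total_eval = eval_batch_size * num_devices

# (epoch, step, training loss, validation loss) from "Training results".
results = [
    (1, 17748, 1.0785, 1.2602),
    (2, 35496, 0.6614, 1.1089),
    (3, 53244, 0.2983, 1.1549),
]

steps_per_epoch = results[0][1]
# Rough estimate: one optimizer step consumes one effective batch.
approx_sequences_per_epoch = steps_per_epoch * total_train

# Epoch with the lowest validation loss.
best_epoch = min(results, key=lambda row: row[3])[0]

print(total_train, total_eval)      # matches the reported 16 and 8
print(approx_sequences_per_epoch)
print(best_epoch)
```

In the reported table the validation loss is lowest after epoch 2 (1.1089), while the evaluation loss quoted in the model card (1.1549) is the epoch 3 value.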
Source payload excerpt (from Hugging Face API)
{
"_id": "664039f357090e574255e7c4",
"id": "afrideva/zephyr-7b-gemma-sft-african-ultrachat-200k-GGUF",
"modelId": "afrideva/zephyr-7b-gemma-sft-african-ultrachat-200k-GGUF",
"sha": "6b5fde0970a7ad90010695d8ba5bff3fd1c6c6a9",
"createdAt": "2024-05-12T03:39:31.000Z",
"lastModified": "2024-05-12T05:28:00.000Z",
"author": "afrideva",
"downloads": 123,
"likes": 1,
"gated": false,
"private": false,
"pipeline_tag": "text-generation",
"library_name": "",
"siblings_count": 9
}
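The timestamps and counts in the payload excerpt can be cross-checked with a few lines of standard-library Python. This is a minimal sketch: the JSON literal repeats only three fields from the excerpt, and `GGUF_FILES_LISTED` is the file count shown in the download table. What the remaining sibling files are is not listed on this page, so the code only computes the difference.

```python
import json
from datetime import datetime

# Three fields copied from the payload excerpt above.
payload = json.loads("""
{
  "createdAt": "2024-05-12T03:39:31.000Z",
  "lastModified": "2024-05-12T05:28:00.000Z",
  "siblings_count": 9
}
""")

# Timestamp layout used in the payload (UTC, millisecond precision).
TIMESTAMP_FORMAT = "%Y-%m-%dT%H:%M:%S.%fZ"

# Number of GGUF files shown in the download table.
GGUF_FILES_LISTED = 7


def parse_timestamp(value: str) -> datetime:
    """Parse a timestamp string in the payload's format."""
    return datetime.strptime(value, TIMESTAMP_FORMAT)


# Time between repository creation and the last modification.
elapsed = parse_timestamp(payload["lastModified"]) - parse_timestamp(payload["createdAt"])

# Sibling files that do not appear in the download table.
unlisted_files = payload["siblings_count"] - GGUF_FILES_LISTED

print(elapsed)
print(unlisted_files)
```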