GraySoft
Projects Models About FAQ Contact Download guIDE →

afrideva/zephyr-7b-gemma-sft-african-ultrachat-200k-gguf - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

afrideva/zephyr-7b-gemma-sft-african-ultrachat-200k-gguf overview

Quantized GGUF model files for zephyr-7b-gemma-sft-african-ultrachat-200k from masakhane

ggufalignment-handbooktrlsftgenerated_from_trainerggmlquantizedtext-generationdataset:masakhane/african-ultrachatdataset:israel/untrachat_enbase_model:masakhane/zephyr-7b-gemma-sft-african-ultrachat-200kbase_model:quantized:masakhane/zephyr-7b-gemma-sft-african-ultrachat-200klicense:gemmaendpoints_compatibleregion:usconversational
afrideva/zephyr-7b-gemma-sft-african-ultrachat-200k-gguf visual
Downloads
123
Likes
1
Pipeline
text-generation
Library
Visibility
Public
Access
Open

Repository Files & Downloads

7 files detected
Direct downloads for all repository files
FileTypeQuantizationSizeLink
zephyr-7b-gemma-sft-african-ultrachat-200k.Q2_K.gguf GGUF Q2_K 3.24 GB Download
zephyr-7b-gemma-sft-african-ultrachat-200k.Q3_K_M.gguf GGUF Q3_K_M 4.07 GB Download
zephyr-7b-gemma-sft-african-ultrachat-200k.Q4_K_M.gguf GGUF Q4_K_M 4.96 GB Download
zephyr-7b-gemma-sft-african-ultrachat-200k.Q5_K_M.gguf GGUF Q5_K_M 5.72 GB Download
zephyr-7b-gemma-sft-african-ultrachat-200k.Q6_K.gguf GGUF Q6_K 6.53 GB Download
zephyr-7b-gemma-sft-african-ultrachat-200k.Q8_0.gguf GGUF 8.45 GB Download
zephyr-7b-gemma-sft-african-ultrachat-200k.fp16.gguf GGUF 15.91 GB Download

Model Details Live

Model Slug
afrideva/zephyr-7b-gemma-sft-african-ultrachat-200k-gguf
Author
afrideva
Pipeline Task
text-generation
Library
Created
2024-05-12
Last Modified
2024-05-12
Gated
No
Private
No
HF SHA
6b5fde0970a7ad90010695d8ba5bff3fd1c6c6a9
License
name: zephyr-7b-gemma-sft-african-ultrachat-2000k
Language
Unknown
Base Model
masakhane/zephyr-7b-gemma-sft-african-ultrachat-200k

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "base_model": "masakhane/zephyr-7b-gemma-sft-african-ultrachat-200k",
    "datasets": [
      "masakhane/african-ultrachat",
      "israel/untrachat_en"
    ],
    "inference": true,
    "license": "gemma",
    "model-index": [
      {
        "name": "zephyr-7b-gemma-sft-african-ultrachat-2000k",
        "results": []
      }
    ],
    "model_creator": "masakhane",
    "model_name": "zephyr-7b-gemma-sft-african-ultrachat-200k",
    "pipeline_tag": "text-generation",
    "quantized_by": "afrideva",
    "tags": [
      "alignment-handbook",
      "trl",
      "sft",
      "generated_from_trainer",
      "trl",
      "sft",
      "generated_from_trainer",
      "gguf",
      "ggml",
      "quantized"
    ],
    "frontmatter": {
      "base_model": "masakhane/zephyr-7b-gemma-sft-african-ultrachat-200k",
      "datasets": [
        "masakhane/african-ultrachat",
        "israel/untrachat_en"
      ],
      "inference": "true",
      "license": [
        "name: zephyr-7b-gemma-sft-african-ultrachat-2000k"
      ],
      "model_creator": "masakhane",
      "model_name": "zephyr-7b-gemma-sft-african-ultrachat-200k",
      "pipeline_tag": "text-generation",
      "quantized_by": "afrideva",
      "tags": [
        "alignment-handbook",
        "trl",
        "sft",
        "generated_from_trainer",
        "trl",
        "sft",
        "generated_from_trainer",
        "gguf",
        "ggml",
        "quantized"
      ]
    },
    "hero_image_url": "",
    "summary": "Quantized GGUF model files for zephyr-7b-gemma-sft-african-ultrachat-200k from masakhane",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nbase_model: masakhane/zephyr-7b-gemma-sft-african-ultrachat-200k\ndatasets:\n- masakhane/african-ultrachat\n- israel/untrachat_en\ninference: true\nlicense: gemma\nmodel-index:\n- name: zephyr-7b-gemma-sft-african-ultrachat-2000k\n  results: []\nmodel_creator: masakhane\nmodel_name: zephyr-7b-gemma-sft-african-ultrachat-200k\npipeline_tag: text-generation\nquantized_by: afrideva\ntags:\n- alignment-handbook\n- trl\n- sft\n- generated_from_trainer\n- trl\n- sft\n- generated_from_trainer\n- gguf\n- ggml\n- quantized\n---\n\n# zephyr-7b-gemma-sft-african-ultrachat-200k-GGUF\n\nQuantized GGUF model files for [zephyr-7b-gemma-sft-african-ultrachat-200k](https://huggingface.co/masakhane/zephyr-7b-gemma-sft-african-ultrachat-200k) from [masakhane](https://huggingface.co/masakhane)\n\n## Original Model Card:\n\n<!-- This model card has been generated automatically according to the information the Trainer had access to. You\nshould probably proofread and complete it, then remove this comment. -->\n\n# zephyr-7b-gemma-sft-african-ultrachat-2000k\n\nThis model is a fine-tuned version of [google/gemma-7b](https://huggingface.co/google/gemma-7b) on the masakhane/african-ultrachat and the israel/untrachat_en datasets.\nIt achieves the following results on the evaluation set:\n- Loss: 1.1549\n\n## Model description\n\nMore information needed\n\n## Intended uses & limitations\n\nMore information needed\n\n## Training and evaluation data\n\nMore information needed\n\n## Training procedure\n\n### Training hyperparameters\n\nThe following hyperparameters were used during training:\n- learning_rate: 1e-05\n- train_batch_size: 1\n- eval_batch_size: 1\n- seed: 42\n- distributed_type: multi-GPU\n- num_devices: 8\n- gradient_accumulation_steps: 2\n- total_train_batch_size: 16\n- total_eval_batch_size: 8\n- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08\n- lr_scheduler_type: cosine\n- lr_scheduler_warmup_ratio: 0.1\n- num_epochs: 3\n\n### Training results\n\n| Training Loss | Epoch | Step  | Validation Loss |\n|:-------------:|:-----:|:-----:|:---------------:|\n| 1.0785        | 1.0   | 17748 | 1.2602          |\n| 0.6614        | 2.0   | 35496 | 1.1089          |\n| 0.2983        | 3.0   | 53244 | 1.1549          |\n\n\n### Framework versions\n\n- Transformers 4.39.0.dev0\n- Pytorch 2.2.1+cu121\n- Datasets 2.14.6\n- Tokenizers 0.15.2",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "alignment-handbook",
    "trl",
    "sft",
    "generated_from_trainer",
    "ggml",
    "quantized",
    "text-generation",
    "dataset:masakhane/african-ultrachat",
    "dataset:israel/untrachat_en",
    "base_model:masakhane/zephyr-7b-gemma-sft-african-ultrachat-200k",
    "base_model:quantized:masakhane/zephyr-7b-gemma-sft-african-ultrachat-200k",
    "license:gemma",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 1,
  "downloads": 123,
  "gated": false,
  "private": false,
  "last_modified": "2024-05-12T05:28:00.000Z",
  "created_at": "2024-05-12T03:39:31.000Z",
  "pipeline_tag": "text-generation",
  "library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
  "_id": "664039f357090e574255e7c4",
  "id": "afrideva/zephyr-7b-gemma-sft-african-ultrachat-200k-GGUF",
  "modelId": "afrideva/zephyr-7b-gemma-sft-african-ultrachat-200k-GGUF",
  "sha": "6b5fde0970a7ad90010695d8ba5bff3fd1c6c6a9",
  "createdAt": "2024-05-12T03:39:31.000Z",
  "lastModified": "2024-05-12T05:28:00.000Z",
  "author": "afrideva",
  "downloads": 123,
  "likes": 1,
  "gated": false,
  "private": false,
  "pipeline_tag": "text-generation",
  "library_name": "",
  "siblings_count": 9
}