ewof/koishi-7b-qlora-gguf (Q4_K_S GGUF) is indexed on GraySoft with repository links, GGUF quantization files, and Hugging Face metadata. Use this page to pick a local model for guIDE or other runtimes. See related models in the same shard below.
Model Intelligence Sheet
ewof/koishi-7b-qlora-gguf overview
Comprehensive model page for ewof/koishi-7b-qlora-gguf
| Field | Value |
|---|---|
| Downloads | 105 |
| Likes | 0 |
| Pipeline | not set |
| Library | transformers |
| Visibility | Public |
| Access | Open |
Repository Files & Downloads
13 GGUF files detected (the Hugging Face API payload reports 16 repository files in total)
Direct downloads for all GGUF files
| File | Type | Quantization | Size | Link |
|---|---|---|---|---|
| ggml-model-F16.gguf | GGUF | F16 | 13.49 GB | Download |
| ggml-model-Q2_K.gguf | GGUF | Q2_K | 2.53 GB | Download |
| ggml-model-Q3_K_L.gguf | GGUF | Q3_K_L | 3.56 GB | Download |
| ggml-model-Q3_K_M.gguf | GGUF | Q3_K_M | 3.28 GB | Download |
| ggml-model-Q3_K_S.gguf | GGUF | Q3_K_S | 2.95 GB | Download |
| ggml-model-Q4_0.gguf | GGUF | Q4_0 | 3.83 GB | Download |
| ggml-model-Q4_K_M.gguf | GGUF | Q4_K_M | 4.07 GB | Download |
| ggml-model-Q4_K_S.gguf | GGUF | Q4_K_S | 3.86 GB | Download |
| ggml-model-Q5_0.gguf | GGUF | Q5_0 | 4.65 GB | Download |
| ggml-model-Q5_K_M.gguf | GGUF | Q5_K_M | 4.78 GB | Download |
| ggml-model-Q5_K_S.gguf | GGUF | Q5_K_S | 4.65 GB | Download |
| ggml-model-Q6_K.gguf | GGUF | Q6_K | 5.53 GB | Download |
| ggml-model-Q8_0.gguf | GGUF | Q8_0 | 7.17 GB | Download |
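
One rough way to choose among the files above is to take the largest quantization whose file fits a given memory budget. The sketch below hard-codes the sizes from the table. The `pick_quant` and `download_url` helpers and the 1 GB headroom default are illustrative assumptions, not part of GraySoft or Hugging Face tooling, and real memory use also depends on context length and runtime. The URL follows the usual Hugging Face `resolve/main` pattern and assumes the files sit on the `main` branch.

```python
# File sizes in GB, copied from the table above.
QUANT_SIZES_GB = {
    "F16": 13.49,
    "Q2_K": 2.53,
    "Q3_K_L": 3.56,
    "Q3_K_M": 3.28,
    "Q3_K_S": 2.95,
    "Q4_0": 3.83,
    "Q4_K_M": 4.07,
    "Q4_K_S": 3.86,
    "Q5_0": 4.65,
    "Q5_K_M": 4.78,
    "Q5_K_S": 4.65,
    "Q6_K": 5.53,
    "Q8_0": 7.17,
}

REPO_ID = "ewof/koishi-7b-qlora-gguf"


def pick_quant(budget_gb, headroom_gb=1.0):
    """Return the largest quant whose file fits in the budget, or None.

    headroom_gb is an assumed allowance for context and runtime overhead.
    """
    limit = budget_gb - headroom_gb
    fitting = {name: size for name, size in QUANT_SIZES_GB.items() if size <= limit}
    if not fitting:
        return None
    return max(fitting, key=fitting.get)


def download_url(quant):
    """Build a direct download URL (assumes the files are on the main branch)."""
    return f"https://huggingface.co/{REPO_ID}/resolve/main/ggml-model-{quant}.gguf"
```

For example, `pick_quant(6.0)` leaves 5 GB for the file and selects `Q5_K_M`, and `download_url("Q4_K_S")` gives the link for the Q4_K_S file.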
Model Details
Metadata Inspector
Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "datasets": [
      "ewof/koishi-instruct-metharme"
    ],
    "frontmatter": {
      "datasets": [
        "ewof/koishi-instruct-metharme"
      ]
    },
    "hero_image_url": "",
    "summary": "",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\ndatasets:\n - ewof/koishi-instruct-metharme\n---\n\n## GGUF\n\nlittle endian\n\n\n## Training\n\n[axolotl](https://github.com/OpenAccess-AI-Collective/axolotl) was used for training\non a 6x nvidia a40 gpu cluster.\n\nthe a40 GPU cluster has been graciously provided by [Arc Compute](https://www.arccompute.io/).\n\ntrained on koishi commit 6e675d1 for one epoch\n\n## Base Model\n\nrank 16 lora tune of mistralai/Mistral-7B-v0.1 (all modules, merged)\n\n## Prompting\n\nThe current model version has been trained on prompts using three different roles, which are denoted by the following tokens: `<|system|>`, `<|user|>` and `<|model|>`.\n\nThe `<|system|>` prompt can be used to inject out-of-channel information behind the scenes, while the `<|user|>` prompt should be used to indicate user input. The `<|model|>` token should then be used to indicate that the model should generate a response. These tokens can happen multiple times and be chained up to form a conversation history.",
    "related_quantizations": []
  },
  "tags": [
    "transformers",
    "gguf",
    "mistral",
    "dataset:ewof/koishi-instruct-metharme",
    "endpoints_compatible",
    "region:us"
  ],
  "likes": 0,
  "downloads": 105,
  "gated": false,
  "private": false,
  "last_modified": "2024-04-09T08:26:04.000Z",
  "created_at": "2023-12-15T23:59:02.000Z",
  "pipeline_tag": "",
  "library_name": "transformers"
}
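
The `readme_markdown` field above describes a prompt format with three role tokens, `<|system|>`, `<|user|>` and `<|model|>`, that can be chained into a conversation history. A minimal Python sketch of assembling such a prompt follows. The `build_prompt` helper is hypothetical, and joining tokens and text with no separator is an assumption: the README does not specify whitespace or end-of-turn handling, so verify against the training format before relying on it.

```python
# Role tokens as named in the model README.
ROLE_TOKENS = {
    "system": "<|system|>",
    "user": "<|user|>",
    "model": "<|model|>",
}


def build_prompt(turns):
    """Chain (role, text) turns and end with <|model|> to cue a response.

    Assumption: tokens and text are concatenated without separators.
    """
    parts = []
    for role, text in turns:
        if role not in ROLE_TOKENS:
            raise ValueError(f"unknown role: {role!r}")
        parts.append(ROLE_TOKENS[role] + text)
    # The trailing model token signals that the model should generate next.
    parts.append(ROLE_TOKENS["model"])
    return "".join(parts)


prompt = build_prompt([
    ("system", "Be brief."),
    ("user", "Hi"),
])
```

Earlier model replies can be included as `("model", text)` turns to carry the conversation history forward.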
Source payload excerpt (from Hugging Face API)
{
  "_id": "657ce84617f67d5b8791e40e",
  "id": "ewof/koishi-7b-qlora-gguf",
  "modelId": "ewof/koishi-7b-qlora-gguf",
  "sha": "49ebfcb3f0564519a2b76d90ee74a803e75bf966",
  "createdAt": "2023-12-15T23:59:02.000Z",
  "lastModified": "2024-04-09T08:26:04.000Z",
  "author": "ewof",
  "downloads": 105,
  "likes": 0,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "transformers",
  "siblings_count": 16
}
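
The `createdAt` and `lastModified` fields in the payload are ISO 8601 UTC timestamps. As a small sketch, they can be parsed with Python's standard library to measure the time between repository creation and the last change. The `parse_hf_timestamp` helper name is illustrative, and the format string assumes the millisecond-and-`Z` layout shown in the payload.

```python
from datetime import datetime, timezone


def parse_hf_timestamp(value):
    """Parse a timestamp such as 2023-12-15T23:59:02.000Z into an aware datetime."""
    parsed = datetime.strptime(value, "%Y-%m-%dT%H:%M:%S.%fZ")
    return parsed.replace(tzinfo=timezone.utc)


created = parse_hf_timestamp("2023-12-15T23:59:02.000Z")
modified = parse_hf_timestamp("2024-04-09T08:26:04.000Z")

# Whole days elapsed between creation and last modification.
days_between = (modified - created).days
```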