richarderkhov/kendamarron_-_tokara-0.5b-v0.1-gguf overview
Quantization made by Richard Erkhov. Github Discord Request more models Tokara-0.5B-v0.1 - GGUF | Name | Quant method | Size | | ---- | ---- | ---- | | Tokara-0.5B-v0.1.Q2K.gguf | Q2K | 0.23GB | | Tokara-0.5B-v0.1.IQ3XS.gguf | IQ3XS | 0.24GB | | Tokara-0.5B-v0.1.IQ3S.gguf | IQ3S | 0.25GB | | Tokara-0.5B-v0.1.Q3KS.gguf | Q3KS | 0.25GB | | Tokara-0.5B-v0.1.IQ3M.gguf | IQ3M | 0.26GB | | Tokara-0.5B-v0.1.Q3K.gguf | Q3K | 0.26GB | | Tokara-0.5B-v0.1.Q3KM.gguf | Q3KM | 0.26GB | | Tokara-0.5B-v0.1.Q3KL.gguf | Q3KL | 0.28GB | | Tokara-0.5B-v0.1.IQ4XS.gguf | IQ4XS | 0.28GB | | Tokara-0.5B-v0.1.Q40.gguf | Q40 | 0.29GB | | Tokara-0.5B-v0.1.IQ4NL.gguf | IQ4NL | 0.29GB | | Tokara-0.5B-v0.1.Q4KS.gguf | Q4KS | 0.29GB | | Tokara-0.5B-v0.1.Q4K.gguf | Q4K | 0.3GB | | Tokara-0.5B-v0.1.Q4KM.gguf | Q4KM | 0.3GB | | Tokara-0.5B-v0.1.Q41.gguf | Q41 | 0.3GB | | Tokara-0.5B-v0.1.Q50.gguf | Q50 | 0.32GB | | Tokara-0.5B-v0.1.Q5KS.gguf | Q5KS | 0.32GB | | Tokara-0.5B-v0.1.Q5K.gguf | Q5K | 0.33GB | | Tokara-0.5B-v0.1.Q5KM.gguf | Q5KM | 0.33GB | | Tokara-0.5B-v0.1.Q51.gguf | Q51 | 0.34GB | | Tokara-0.5B-v0.1.Q6K.gguf | Q6K | 0.36GB | | Tokara-0.5B-v0.1.Q80.gguf | Q80 | 0.47GB | Original model description: --- license: other licensename: tongyi-qianwen-research licenselink: https://huggingface.co/Qwen/Qwen1.5-0.5B/blob/main/LICENSE language: pipeline_tag: text-generation datasets: ---
Repository Files & Downloads
| File | Type | Quantization | Size | Link |
|---|---|---|---|---|
| Tokara-0.5B-v0.1.IQ3_M.gguf | GGUF | IQ3_M | 261.66 MB | Download |
| Tokara-0.5B-v0.1.IQ3_S.gguf | GGUF | IQ3_S | 254.18 MB | Download |
| Tokara-0.5B-v0.1.IQ3_XS.gguf | GGUF | IQ3_XS | 247.29 MB | Download |
| Tokara-0.5B-v0.1.IQ4_NL.gguf | GGUF | IQ4_NL | 294.26 MB | Download |
| Tokara-0.5B-v0.1.IQ4_XS.gguf | GGUF | IQ4_XS | 285.33 MB | Download |
| Tokara-0.5B-v0.1.Q2_K.gguf | GGUF | Q2_K | 235.90 MB | Download |
| Tokara-0.5B-v0.1.Q3_K.gguf | GGUF | Q3_K | 269.92 MB | Download |
| Tokara-0.5B-v0.1.Q3_K_L.gguf | GGUF | Q3_K_L | 283.58 MB | Download |
| Tokara-0.5B-v0.1.Q3_K_M.gguf | GGUF | Q3_K_M | 269.92 MB | Download |
| Tokara-0.5B-v0.1.Q3_K_S.gguf | GGUF | Q3_K_S | 254.18 MB | Download |
| Tokara-0.5B-v0.1.Q4_0.gguf | GGUF | — | 293.23 MB | Download |
| Tokara-0.5B-v0.1.Q4_1.gguf | GGUF | — | 311.61 MB | Download |
| Tokara-0.5B-v0.1.Q4_K.gguf | GGUF | Q4_K | 304.83 MB | Download |
| Tokara-0.5B-v0.1.Q4_K_M.gguf | GGUF | Q4_K_M | 304.83 MB | Download |
| Tokara-0.5B-v0.1.Q4_K_S.gguf | GGUF | Q4_K_S | 294.76 MB | Download |
| Tokara-0.5B-v0.1.Q5_0.gguf | GGUF | — | 329.98 MB | Download |
| Tokara-0.5B-v0.1.Q5_1.gguf | GGUF | — | 348.36 MB | Download |
| Tokara-0.5B-v0.1.Q5_K.gguf | GGUF | Q5_K | 335.96 MB | Download |
| Tokara-0.5B-v0.1.Q5_K_M.gguf | GGUF | Q5_K_M | 335.96 MB | Download |
| Tokara-0.5B-v0.1.Q5_K_S.gguf | GGUF | Q5_K_S | 329.98 MB | Download |
| Tokara-0.5B-v0.1.Q6_K.gguf | GGUF | Q6_K | 369.03 MB | Download |
| Tokara-0.5B-v0.1.Q8_0.gguf | GGUF | — | 476.17 MB | Download |
Model Details Live
Metadata Inspector
Normalized metadata (stored in metadata_json)
{
"metadata": {},
"card_data": {
"frontmatter": {},
"hero_image_url": "",
"summary": "Quantization made by Richard Erkhov. Github Discord Request more models Tokara-0.5B-v0.1 - GGUF | Name | Quant method | Size | | ---- | ---- | ---- | | Tokara-0.5B-v0.1.Q2_K.gguf | Q2_K | 0.23GB | | Tokara-0.5B-v0.1.IQ3_XS.gguf | IQ3_XS | 0.24GB | | Tokara-0.5B-v0.1.IQ3_S.gguf | IQ3_S | 0.25GB | | Tokara-0.5B-v0.1.Q3_K_S.gguf | Q3_K_S | 0.25GB | | Tokara-0.5B-v0.1.IQ3_M.gguf | IQ3_M | 0.26GB | | Tokara-0.5B-v0.1.Q3_K.gguf | Q3_K | 0.26GB | | Tokara-0.5B-v0.1.Q3_K_M.gguf | Q3_K_M | 0.26GB | | Tokara-0.5B-v0.1.Q3_K_L.gguf | Q3_K_L | 0.28GB | | Tokara-0.5B-v0.1.IQ4_XS.gguf | IQ4_XS | 0.28GB | | Tokara-0.5B-v0.1.Q4_0.gguf | Q4_0 | 0.29GB | | Tokara-0.5B-v0.1.IQ4_NL.gguf | IQ4_NL | 0.29GB | | Tokara-0.5B-v0.1.Q4_K_S.gguf | Q4_K_S | 0.29GB | | Tokara-0.5B-v0.1.Q4_K.gguf | Q4_K | 0.3GB | | Tokara-0.5B-v0.1.Q4_K_M.gguf | Q4_K_M | 0.3GB | | Tokara-0.5B-v0.1.Q4_1.gguf | Q4_1 | 0.3GB | | Tokara-0.5B-v0.1.Q5_0.gguf | Q5_0 | 0.32GB | | Tokara-0.5B-v0.1.Q5_K_S.gguf | Q5_K_S | 0.32GB | | Tokara-0.5B-v0.1.Q5_K.gguf | Q5_K | 0.33GB | | Tokara-0.5B-v0.1.Q5_K_M.gguf | Q5_K_M | 0.33GB | | Tokara-0.5B-v0.1.Q5_1.gguf | Q5_1 | 0.34GB | | Tokara-0.5B-v0.1.Q6_K.gguf | Q6_K | 0.36GB | | Tokara-0.5B-v0.1.Q8_0.gguf | Q8_0 | 0.47GB | Original model description: --- license: other license_name: tongyi-qianwen-research license_link: https://huggingface.co/Qwen/Qwen1.5-0.5B/blob/main/LICENSE language: pipeline_tag: text-generation datasets: ---",
"quick_links": [],
"benchmark_table_html": "",
"readme_markdown": "Quantization made by Richard Erkhov.\n\n[Github](https://github.com/RichardErkhov)\n\n[Discord](https://discord.gg/pvy7H8DZMG)\n\n[Request more models](https://github.com/RichardErkhov/quant_request)\n\n\nTokara-0.5B-v0.1 - GGUF\n- Model creator: https://huggingface.co/Kendamarron/\n- Original model: https://huggingface.co/Kendamarron/Tokara-0.5B-v0.1/\n\n\n| Name | Quant method | Size |\n| ---- | ---- | ---- |\n| [Tokara-0.5B-v0.1.Q2_K.gguf](https://huggingface.co/RichardErkhov/Kendamarron_-_Tokara-0.5B-v0.1-gguf/blob/main/Tokara-0.5B-v0.1.Q2_K.gguf) | Q2_K | 0.23GB |\n| [Tokara-0.5B-v0.1.IQ3_XS.gguf](https://huggingface.co/RichardErkhov/Kendamarron_-_Tokara-0.5B-v0.1-gguf/blob/main/Tokara-0.5B-v0.1.IQ3_XS.gguf) | IQ3_XS | 0.24GB |\n| [Tokara-0.5B-v0.1.IQ3_S.gguf](https://huggingface.co/RichardErkhov/Kendamarron_-_Tokara-0.5B-v0.1-gguf/blob/main/Tokara-0.5B-v0.1.IQ3_S.gguf) | IQ3_S | 0.25GB |\n| [Tokara-0.5B-v0.1.Q3_K_S.gguf](https://huggingface.co/RichardErkhov/Kendamarron_-_Tokara-0.5B-v0.1-gguf/blob/main/Tokara-0.5B-v0.1.Q3_K_S.gguf) | Q3_K_S | 0.25GB |\n| [Tokara-0.5B-v0.1.IQ3_M.gguf](https://huggingface.co/RichardErkhov/Kendamarron_-_Tokara-0.5B-v0.1-gguf/blob/main/Tokara-0.5B-v0.1.IQ3_M.gguf) | IQ3_M | 0.26GB |\n| [Tokara-0.5B-v0.1.Q3_K.gguf](https://huggingface.co/RichardErkhov/Kendamarron_-_Tokara-0.5B-v0.1-gguf/blob/main/Tokara-0.5B-v0.1.Q3_K.gguf) | Q3_K | 0.26GB |\n| [Tokara-0.5B-v0.1.Q3_K_M.gguf](https://huggingface.co/RichardErkhov/Kendamarron_-_Tokara-0.5B-v0.1-gguf/blob/main/Tokara-0.5B-v0.1.Q3_K_M.gguf) | Q3_K_M | 0.26GB |\n| [Tokara-0.5B-v0.1.Q3_K_L.gguf](https://huggingface.co/RichardErkhov/Kendamarron_-_Tokara-0.5B-v0.1-gguf/blob/main/Tokara-0.5B-v0.1.Q3_K_L.gguf) | Q3_K_L | 0.28GB |\n| [Tokara-0.5B-v0.1.IQ4_XS.gguf](https://huggingface.co/RichardErkhov/Kendamarron_-_Tokara-0.5B-v0.1-gguf/blob/main/Tokara-0.5B-v0.1.IQ4_XS.gguf) | IQ4_XS | 0.28GB |\n| [Tokara-0.5B-v0.1.Q4_0.gguf](https://huggingface.co/RichardErkhov/Kendamarron_-_Tokara-0.5B-v0.1-gguf/blob/main/Tokara-0.5B-v0.1.Q4_0.gguf) | Q4_0 | 0.29GB |\n| [Tokara-0.5B-v0.1.IQ4_NL.gguf](https://huggingface.co/RichardErkhov/Kendamarron_-_Tokara-0.5B-v0.1-gguf/blob/main/Tokara-0.5B-v0.1.IQ4_NL.gguf) | IQ4_NL | 0.29GB |\n| [Tokara-0.5B-v0.1.Q4_K_S.gguf](https://huggingface.co/RichardErkhov/Kendamarron_-_Tokara-0.5B-v0.1-gguf/blob/main/Tokara-0.5B-v0.1.Q4_K_S.gguf) | Q4_K_S | 0.29GB |\n| [Tokara-0.5B-v0.1.Q4_K.gguf](https://huggingface.co/RichardErkhov/Kendamarron_-_Tokara-0.5B-v0.1-gguf/blob/main/Tokara-0.5B-v0.1.Q4_K.gguf) | Q4_K | 0.3GB |\n| [Tokara-0.5B-v0.1.Q4_K_M.gguf](https://huggingface.co/RichardErkhov/Kendamarron_-_Tokara-0.5B-v0.1-gguf/blob/main/Tokara-0.5B-v0.1.Q4_K_M.gguf) | Q4_K_M | 0.3GB |\n| [Tokara-0.5B-v0.1.Q4_1.gguf](https://huggingface.co/RichardErkhov/Kendamarron_-_Tokara-0.5B-v0.1-gguf/blob/main/Tokara-0.5B-v0.1.Q4_1.gguf) | Q4_1 | 0.3GB |\n| [Tokara-0.5B-v0.1.Q5_0.gguf](https://huggingface.co/RichardErkhov/Kendamarron_-_Tokara-0.5B-v0.1-gguf/blob/main/Tokara-0.5B-v0.1.Q5_0.gguf) | Q5_0 | 0.32GB |\n| [Tokara-0.5B-v0.1.Q5_K_S.gguf](https://huggingface.co/RichardErkhov/Kendamarron_-_Tokara-0.5B-v0.1-gguf/blob/main/Tokara-0.5B-v0.1.Q5_K_S.gguf) | Q5_K_S | 0.32GB |\n| [Tokara-0.5B-v0.1.Q5_K.gguf](https://huggingface.co/RichardErkhov/Kendamarron_-_Tokara-0.5B-v0.1-gguf/blob/main/Tokara-0.5B-v0.1.Q5_K.gguf) | Q5_K | 0.33GB |\n| [Tokara-0.5B-v0.1.Q5_K_M.gguf](https://huggingface.co/RichardErkhov/Kendamarron_-_Tokara-0.5B-v0.1-gguf/blob/main/Tokara-0.5B-v0.1.Q5_K_M.gguf) | Q5_K_M | 0.33GB |\n| [Tokara-0.5B-v0.1.Q5_1.gguf](https://huggingface.co/RichardErkhov/Kendamarron_-_Tokara-0.5B-v0.1-gguf/blob/main/Tokara-0.5B-v0.1.Q5_1.gguf) | Q5_1 | 0.34GB |\n| [Tokara-0.5B-v0.1.Q6_K.gguf](https://huggingface.co/RichardErkhov/Kendamarron_-_Tokara-0.5B-v0.1-gguf/blob/main/Tokara-0.5B-v0.1.Q6_K.gguf) | Q6_K | 0.36GB |\n| [Tokara-0.5B-v0.1.Q8_0.gguf](https://huggingface.co/RichardErkhov/Kendamarron_-_Tokara-0.5B-v0.1-gguf/blob/main/Tokara-0.5B-v0.1.Q8_0.gguf) | Q8_0 | 0.47GB |\n\n\n\n\nOriginal model description:\n---\nlicense: other\nlicense_name: tongyi-qianwen-research\nlicense_link: https://huggingface.co/Qwen/Qwen1.5-0.5B/blob/main/LICENSE\nlanguage:\n- ja\n- en\npipeline_tag: text-generation\ndatasets:\n- izumi-lab/wikipedia-ja-20230720\n- oscar-corpus/OSCAR-2301\n- aixsatoshi/cosmopedia-japanese-100k\n- BEE-spoke-data/wikipedia-20230901.en-deduped\n---\n\n## モデルについて\n[Qwen/Qwen1.5-0.5B](https://huggingface.co/Qwen/Qwen1.5-0.5B)を日英データ5Bトークンで継続事前学習したモデルです。\n\nベンチマークのスコアは低下していますが、ベースモデルよりも安定して日本語を出力するようになっています。\n\n詳細は[こちら](https://zenn.dev/kendama/articles/55564e12da6e82)をご覧ください。\n\n## ベンチマーク\n[Stability-AI/lm-evaluation-harness](https://github.com/Stability-AI/lm-evaluation-harness)の3項目で評価\n| モデル | jsquad(1-shot) | jcommonsenseqa(1-shot) | jnli(1-shot) | \n| ---------------------------- | -------------- | ---------------------- | ------------ | \n| Kendamarron/Tokara-0.5B-v0.1 | 26.4295 | 0.2663 | 0.5509 | \n| Qwen/Qwen1.5-0.5B | 31.3597 | 0.2556 | 0.5534 | \n\n## 名前について\n日本の在来馬であるトカラ馬から\n\n```python\nfrom transformers import AutoTokenizer, AutoModelForCausalLM, pipeline\n\nmodel = AutoModelForCausalLM.from_pretrained('Kendamarron/Tokara-0.5B-v0.1')\ntokenizer = AutoTokenizer.from_pretrained('Kendamarron/Tokara-0.5B-v0.1')\n\npipe = pipeline('text-generation', model=model, tokenizer=tokenizer)\n\nprompt = \"大規模言語モデルとは、\"\n\nprint(pipe(prompt, max_length=128, repetition_penalty=1.1, temperature=0.7, top_p=0.95))\n\n```\n\n",
"related_quantizations": []
},
"tags": [
"gguf",
"endpoints_compatible",
"region:us",
"conversational"
],
"likes": 0,
"downloads": 244,
"gated": false,
"private": false,
"last_modified": "2024-07-18T03:51:15.000Z",
"created_at": "2024-07-18T03:43:19.000Z",
"pipeline_tag": "",
"library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
"_id": "66988f5724de09d10ce5dcd4",
"id": "RichardErkhov/Kendamarron_-_Tokara-0.5B-v0.1-gguf",
"modelId": "RichardErkhov/Kendamarron_-_Tokara-0.5B-v0.1-gguf",
"sha": "7d7031588fa1a4b6fe6510ec838e643d291a2c51",
"createdAt": "2024-07-18T03:43:19.000Z",
"lastModified": "2024-07-18T03:51:15.000Z",
"author": "RichardErkhov",
"downloads": 244,
"likes": 0,
"gated": false,
"private": false,
"pipeline_tag": "",
"library_name": "",
"siblings_count": 24
}