GraySoft
Projects Models About FAQ Contact Download guIDE →
Model Intelligence Sheet

richarderkhov/opencsg_-_csg-wukong-1b-chat-v0.1-gguf overview

[OpenCSG Community] [github] [wechat] [Twitter] OpenCSG stands for Converged resources, Software refinement, and Generative LM. The 'C' represents Converged resources, indicating the integration and full utilization of hybrid resources. The 'S' stands for Software refinement, signifying software that is refined by large models. The 'G' represents Generative LM, which denotes widespread, inclusive, and democratized generative large models. The vision of OpenCSG is to empower every industry, every company, and every individual to own their models. We adhere to the principles of openness and open source, making the large model software stack of OpenCSG available to the community. We welcome everyone to use, send feedback, and contribute collaboratively.

ggufendpoints_compatibleregion:us
richarderkhov/opencsg_-_csg-wukong-1b-chat-v0.1-gguf visual
Downloads
282
Likes
0
Pipeline
Library
Visibility
Public
Access
Open

Repository Files & Downloads

22 files detected
Direct downloads for all repository files
FileTypeQuantizationSizeLink
csg-wukong-1B-chat-v0.1.IQ3_M.gguf GGUF IQ3_M 492.28 MB Download
csg-wukong-1B-chat-v0.1.IQ3_S.gguf GGUF IQ3_S 477.67 MB Download
csg-wukong-1B-chat-v0.1.IQ3_XS.gguf GGUF IQ3_XS 455.50 MB Download
csg-wukong-1B-chat-v0.1.IQ4_NL.gguf GGUF IQ4_NL 611.35 MB Download
csg-wukong-1B-chat-v0.1.IQ4_XS.gguf GGUF IQ4_XS 581.56 MB Download
csg-wukong-1B-chat-v0.1.Q2_K.gguf GGUF Q2_K 412.11 MB Download
csg-wukong-1B-chat-v0.1.Q3_K.gguf GGUF Q3_K 523.00 MB Download
csg-wukong-1B-chat-v0.1.Q3_K_L.gguf GGUF Q3_K_L 564.12 MB Download
csg-wukong-1B-chat-v0.1.Q3_K_M.gguf GGUF Q3_K_M 523.00 MB Download
csg-wukong-1B-chat-v0.1.Q3_K_S.gguf GGUF Q3_K_S 476.21 MB Download
csg-wukong-1B-chat-v0.1.Q4_0.gguf GGUF 607.23 MB Download
csg-wukong-1B-chat-v0.1.Q4_1.gguf GGUF 668.89 MB Download
csg-wukong-1B-chat-v0.1.Q4_K.gguf GGUF Q4_K 636.88 MB Download
csg-wukong-1B-chat-v0.1.Q4_K_M.gguf GGUF Q4_K_M 636.88 MB Download
csg-wukong-1B-chat-v0.1.Q4_K_S.gguf GGUF Q4_K_S 610.23 MB Download
csg-wukong-1B-chat-v0.1.Q5_0.gguf GGUF 730.54 MB Download
csg-wukong-1B-chat-v0.1.Q5_1.gguf GGUF 792.20 MB Download
csg-wukong-1B-chat-v0.1.Q5_K.gguf GGUF Q5_K 745.82 MB Download
csg-wukong-1B-chat-v0.1.Q5_K_M.gguf GGUF Q5_K_M 745.82 MB Download
csg-wukong-1B-chat-v0.1.Q5_K_S.gguf GGUF Q5_K_S 730.54 MB Download
csg-wukong-1B-chat-v0.1.Q6_K.gguf GGUF Q6_K 861.56 MB Download
csg-wukong-1B-chat-v0.1.Q8_0.gguf GGUF 1.09 GB Download

Model Details Live

Model Slug
richarderkhov/opencsg_-_csg-wukong-1b-chat-v0.1-gguf
Author
RichardErkhov
Pipeline Task
Library
Created
2024-06-25
Last Modified
2024-06-25
Gated
No
Private
No
HF SHA
038c015b6ffb8f9d06d90dca8ddc2338b49dc203
License
Unknown
Language
Unknown
Base Model
Unknown

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "frontmatter": {},
    "hero_image_url": "./csg-wukong-logo-green.jpg",
    "summary": "[OpenCSG Community]   [github]  [wechat]  [Twitter]   OpenCSG stands for Converged resources, Software refinement, and Generative LM. The 'C' represents Converged resources, indicating the integration and full utilization of hybrid resources. The 'S' stands for Software refinement, signifying software that is refined by large models. The 'G' represents Generative LM, which denotes widespread, inclusive, and democratized generative large models. The vision of OpenCSG is to empower every industry, every company, and every individual to own their models. We adhere to the principles of openness and open source, making the large model software stack of OpenCSG available to the community. We welcome everyone to use, send feedback, and contribute collaboratively.",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "Quantization made by Richard Erkhov.\n\n[Github](https://github.com/RichardErkhov)\n\n[Discord](https://discord.gg/pvy7H8DZMG)\n\n[Request more models](https://github.com/RichardErkhov/quant_request)\n\n\ncsg-wukong-1B-chat-v0.1 - GGUF\n- Model creator: https://huggingface.co/opencsg/\n- Original model: https://huggingface.co/opencsg/csg-wukong-1B-chat-v0.1/\n\n\n| Name | Quant method | Size |\n| ---- | ---- | ---- |\n| [csg-wukong-1B-chat-v0.1.Q2_K.gguf](https://huggingface.co/RichardErkhov/opencsg_-_csg-wukong-1B-chat-v0.1-gguf/blob/main/csg-wukong-1B-chat-v0.1.Q2_K.gguf) | Q2_K | 0.4GB |\n| [csg-wukong-1B-chat-v0.1.IQ3_XS.gguf](https://huggingface.co/RichardErkhov/opencsg_-_csg-wukong-1B-chat-v0.1-gguf/blob/main/csg-wukong-1B-chat-v0.1.IQ3_XS.gguf) | IQ3_XS | 0.44GB |\n| [csg-wukong-1B-chat-v0.1.IQ3_S.gguf](https://huggingface.co/RichardErkhov/opencsg_-_csg-wukong-1B-chat-v0.1-gguf/blob/main/csg-wukong-1B-chat-v0.1.IQ3_S.gguf) | IQ3_S | 0.47GB |\n| [csg-wukong-1B-chat-v0.1.Q3_K_S.gguf](https://huggingface.co/RichardErkhov/opencsg_-_csg-wukong-1B-chat-v0.1-gguf/blob/main/csg-wukong-1B-chat-v0.1.Q3_K_S.gguf) | Q3_K_S | 0.47GB |\n| [csg-wukong-1B-chat-v0.1.IQ3_M.gguf](https://huggingface.co/RichardErkhov/opencsg_-_csg-wukong-1B-chat-v0.1-gguf/blob/main/csg-wukong-1B-chat-v0.1.IQ3_M.gguf) | IQ3_M | 0.48GB |\n| [csg-wukong-1B-chat-v0.1.Q3_K.gguf](https://huggingface.co/RichardErkhov/opencsg_-_csg-wukong-1B-chat-v0.1-gguf/blob/main/csg-wukong-1B-chat-v0.1.Q3_K.gguf) | Q3_K | 0.51GB |\n| [csg-wukong-1B-chat-v0.1.Q3_K_M.gguf](https://huggingface.co/RichardErkhov/opencsg_-_csg-wukong-1B-chat-v0.1-gguf/blob/main/csg-wukong-1B-chat-v0.1.Q3_K_M.gguf) | Q3_K_M | 0.51GB |\n| [csg-wukong-1B-chat-v0.1.Q3_K_L.gguf](https://huggingface.co/RichardErkhov/opencsg_-_csg-wukong-1B-chat-v0.1-gguf/blob/main/csg-wukong-1B-chat-v0.1.Q3_K_L.gguf) | Q3_K_L | 0.55GB |\n| [csg-wukong-1B-chat-v0.1.IQ4_XS.gguf](https://huggingface.co/RichardErkhov/opencsg_-_csg-wukong-1B-chat-v0.1-gguf/blob/main/csg-wukong-1B-chat-v0.1.IQ4_XS.gguf) | IQ4_XS | 0.57GB |\n| [csg-wukong-1B-chat-v0.1.Q4_0.gguf](https://huggingface.co/RichardErkhov/opencsg_-_csg-wukong-1B-chat-v0.1-gguf/blob/main/csg-wukong-1B-chat-v0.1.Q4_0.gguf) | Q4_0 | 0.59GB |\n| [csg-wukong-1B-chat-v0.1.IQ4_NL.gguf](https://huggingface.co/RichardErkhov/opencsg_-_csg-wukong-1B-chat-v0.1-gguf/blob/main/csg-wukong-1B-chat-v0.1.IQ4_NL.gguf) | IQ4_NL | 0.6GB |\n| [csg-wukong-1B-chat-v0.1.Q4_K_S.gguf](https://huggingface.co/RichardErkhov/opencsg_-_csg-wukong-1B-chat-v0.1-gguf/blob/main/csg-wukong-1B-chat-v0.1.Q4_K_S.gguf) | Q4_K_S | 0.6GB |\n| [csg-wukong-1B-chat-v0.1.Q4_K.gguf](https://huggingface.co/RichardErkhov/opencsg_-_csg-wukong-1B-chat-v0.1-gguf/blob/main/csg-wukong-1B-chat-v0.1.Q4_K.gguf) | Q4_K | 0.62GB |\n| [csg-wukong-1B-chat-v0.1.Q4_K_M.gguf](https://huggingface.co/RichardErkhov/opencsg_-_csg-wukong-1B-chat-v0.1-gguf/blob/main/csg-wukong-1B-chat-v0.1.Q4_K_M.gguf) | Q4_K_M | 0.62GB |\n| [csg-wukong-1B-chat-v0.1.Q4_1.gguf](https://huggingface.co/RichardErkhov/opencsg_-_csg-wukong-1B-chat-v0.1-gguf/blob/main/csg-wukong-1B-chat-v0.1.Q4_1.gguf) | Q4_1 | 0.65GB |\n| [csg-wukong-1B-chat-v0.1.Q5_0.gguf](https://huggingface.co/RichardErkhov/opencsg_-_csg-wukong-1B-chat-v0.1-gguf/blob/main/csg-wukong-1B-chat-v0.1.Q5_0.gguf) | Q5_0 | 0.71GB |\n| [csg-wukong-1B-chat-v0.1.Q5_K_S.gguf](https://huggingface.co/RichardErkhov/opencsg_-_csg-wukong-1B-chat-v0.1-gguf/blob/main/csg-wukong-1B-chat-v0.1.Q5_K_S.gguf) | Q5_K_S | 0.71GB |\n| [csg-wukong-1B-chat-v0.1.Q5_K.gguf](https://huggingface.co/RichardErkhov/opencsg_-_csg-wukong-1B-chat-v0.1-gguf/blob/main/csg-wukong-1B-chat-v0.1.Q5_K.gguf) | Q5_K | 0.73GB |\n| [csg-wukong-1B-chat-v0.1.Q5_K_M.gguf](https://huggingface.co/RichardErkhov/opencsg_-_csg-wukong-1B-chat-v0.1-gguf/blob/main/csg-wukong-1B-chat-v0.1.Q5_K_M.gguf) | Q5_K_M | 0.73GB |\n| [csg-wukong-1B-chat-v0.1.Q5_1.gguf](https://huggingface.co/RichardErkhov/opencsg_-_csg-wukong-1B-chat-v0.1-gguf/blob/main/csg-wukong-1B-chat-v0.1.Q5_1.gguf) | Q5_1 | 0.77GB |\n| [csg-wukong-1B-chat-v0.1.Q6_K.gguf](https://huggingface.co/RichardErkhov/opencsg_-_csg-wukong-1B-chat-v0.1-gguf/blob/main/csg-wukong-1B-chat-v0.1.Q6_K.gguf) | Q6_K | 0.84GB |\n| [csg-wukong-1B-chat-v0.1.Q8_0.gguf](https://huggingface.co/RichardErkhov/opencsg_-_csg-wukong-1B-chat-v0.1-gguf/blob/main/csg-wukong-1B-chat-v0.1.Q8_0.gguf) | Q8_0 | 1.09GB |\n\n\n\n\nOriginal model description:\n---\nlanguage:\n- en\npipeline_tag: text-generation\ntags:\n- code\nlicense: apache-2.0\n---\n\n\n\n# **csg-wukong-1B-chat-v0.1**          [[中文]](#chinese)    [[English]](#english)\n\n<a id=\"english\"></a>\n\n<p align=\"center\">\n<img width=\"900px\" alt=\"OpenCSG\" src=\"./csg-wukong-logo-green.jpg\">\n</p>\n\n<p align=\"center\"><a href=\"https://portal.opencsg.com/models\">[OpenCSG Community]</a>   <a href=\"https://github.com/OpenCSGs/Awesome-SLMs\">[github]</a>  <a href=\"https://cdn-uploads.huggingface.co/production/uploads/64c71b27d43e4dee51a8b31a/HU6vz21qKTEmUBCWqCFh9.jpeg\">[wechat]</a>  <a href=\"https://twitter.com/OpenCsg\">[Twitter]</a> </p>\n\n\n</div>\nOpenCSG stands for Converged resources, Software refinement, and Generative LM. The 'C' represents Converged resources, indicating the integration and full utilization of hybrid resources. The 'S' stands for Software refinement, signifying software that is refined by large models. The 'G' represents Generative LM, which denotes widespread, inclusive, and democratized generative large models.\n\nThe vision of OpenCSG is to empower every industry, every company, and every individual to own their models. We adhere to the principles of openness and open source, making the large model software stack of OpenCSG available to the community. We welcome everyone to use, send feedback, and contribute collaboratively.\n\n\n\n\n## Model Description\n\n\n\n\n**csg-wukong-1B-chat-v0.1** was finetuned on csg-wukong-1B\n<br>\n\n![image/png](https://cdn-uploads.huggingface.co/production/uploads/661790397437201d78141856/sZvOqCJY4gOEvVhpmlH_N.png)\n\n## Model Evaluation results\n\nWe submitted csg-wukong-1B on the [open_llm_leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard), and\nthe results show our model ranked the 8th among the ~1.5B pretrained small language models.\n\n\n![image/png](https://cdn-uploads.huggingface.co/production/uploads/661790397437201d78141856/_HRTxL6N0qnNPNt-P8k9k.png)\n\n\n\n# Training\n\n## Hardware\n\n- **GPUs:** 6 V100 \n- **Training time:** 6 hours \n\n## Software\n\n- **Orchestration:** [Deepspeed](https://github.com/OpenCSGs)\n- **Neural networks:** [PyTorch](https://github.com/pytorch/pytorch)\n- **BP16 if applicable:** [apex](https://github.com/NVIDIA/apex)\n\n\n<a id=\"chinese\"></a>\n\n<p>\n\n</p>\n\n# OpenCSG介绍\n\n\n<p align=\"center\">\n<img width=\"300px\" alt=\"OpenCSG\" src=\"https://cdn-uploads.huggingface.co/production/uploads/64c71b27d43e4dee51a8b31a/GwYXPKuEoGCGcMICeW-sb.jpeg\">\n</p>\n\n<p align=\"center\"><a href=\"https://opencsg.com/models\">[OpenCSG 社区]</a>   <a href=\"https://github.com/OpenCSGs/Awesome-SLMs\">[github]</a>  <a href=\"https://cdn-uploads.huggingface.co/production/uploads/64c71b27d43e4dee51a8b31a/HU6vz21qKTEmUBCWqCFh9.jpeg\">[微信]</a>  <a href=\"https://twitter.com/OpenCsg\">[推特]</a> </p>\n\n\n\n</div>\nOpenCSG中 Open是开源开放;C 代表 Converged resources,整合和充分利用的混合异构资源优势,算力降本增效;S 代表 Software refined,重新定义软件的交付方式,通过大模型驱动软件开发,人力降本增效;G 代表 Generative LM,大众化、普惠化和民主化的可商用的开源生成式大模型。\n\nOpenCSG的愿景是让每个行业、每个公司、每个人都拥有自己的模型。 我们坚持开源开放的原则,将OpenCSG的大模型软件栈开源到社区,欢迎使用、反馈和参与共建,欢迎关注。\n\n\n\n## 模型介绍\n\n\n**csg-wukong-1B-chat-v0.1** 在csg-wukong-1B模型上微调而成。\n<br>\n\n\n\n![image/png](https://cdn-uploads.huggingface.co/production/uploads/661790397437201d78141856/YrpSwEsRGdaQj56__8o0U.png)\n\n\n## 模型评测结果\n\n我们把csg-wukong-1B模型提交到[open_llm_leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)榜单上,结果显示我们的模型目前在~1.5B小语言模型中排名第8。\n\n\n![image/png](https://cdn-uploads.huggingface.co/production/uploads/661790397437201d78141856/ZfWZ1Fd7ccKrJVx0okV9z.png)\n\n\n\n# 训练\n\n## 硬件资源\n\n- **GPU数量:** 6 V100 \n- **训练时间:** 6小时\n\n## 软件使用\n\n- **微调训练框架:** [Deepspeed](https://github.com/OpenCSGs)\n- **深度学习框架:** [PyTorch](https://github.com/pytorch/pytorch)\n- **BP16:** [apex](https://github.com/NVIDIA/apex)\n\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "endpoints_compatible",
    "region:us"
  ],
  "likes": 0,
  "downloads": 282,
  "gated": false,
  "private": false,
  "last_modified": "2024-06-25T21:58:12.000Z",
  "created_at": "2024-06-25T21:33:48.000Z",
  "pipeline_tag": "",
  "library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
  "_id": "667b37bc9ccdbeba7c3c671f",
  "id": "RichardErkhov/opencsg_-_csg-wukong-1B-chat-v0.1-gguf",
  "modelId": "RichardErkhov/opencsg_-_csg-wukong-1B-chat-v0.1-gguf",
  "sha": "038c015b6ffb8f9d06d90dca8ddc2338b49dc203",
  "createdAt": "2024-06-25T21:33:48.000Z",
  "lastModified": "2024-06-25T21:58:12.000Z",
  "author": "RichardErkhov",
  "downloads": 282,
  "likes": 0,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 24
}