Model Intelligence Sheet

solaren/qwen3-moe-6x0.6b-3.6b-writing-on-fire-uncensored-q8_0-gguf overview

This model was converted to GGUF format from DavidAU/Qwen3-MOE-6x0.6B-3.6B-Writing-On-Fire-Uncensored using llama.cpp via the ggml.ai's GGUF-my-repo space. Refer to the original model card for more details on the model.

transformersggufprogrammingcode generationcodecodeqwenmoecodingcoderqwen2chatqwenqwen-codermixture of experts6 experts2 active experts40k contextqwen3finetuneqwen3_moecreativeall use casesroleplaymergellama-cppgguf-my-repotext-generationenfrzh

solaren/qwen3-moe-6x0.6b-3.6b-writing-on-fire-uncensored-q8_0-gguf visual

Downloads

485

Likes

Pipeline

text-generation

Library

transformers

Visibility

Public

Access

Open

Repository Files & Downloads

1 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
qwen3-moe-6x0.6b-3.6b-writing-on-fire-uncensored-q8_0.gguf	GGUF	—	1.90 GB	Download

Model Details Live

Model Slug

solaren/qwen3-moe-6x0.6b-3.6b-writing-on-fire-uncensored-q8_0-gguf

Author

Solaren

Pipeline Task

text-generation

Library

transformers

Created

2025-08-27

Last Modified

2025-08-27

Gated

Private

HF SHA

c27c0e3046355671025a8e52bfd4a9d9bd1276dc

License

apache-2.0

Language

en, fr, zh, de

Base Model

DavidAU/Qwen3-MOE-6x0.6B-3.6B-Writing-On-Fire-Uncensored

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "license": "apache-2.0",
    "library_name": "transformers",
    "language": [
      "en",
      "fr",
      "zh",
      "de"
    ],
    "tags": [
      "programming",
      "code generation",
      "code",
      "codeqwen",
      "moe",
      "coding",
      "coder",
      "qwen2",
      "chat",
      "qwen",
      "qwen-coder",
      "mixture of experts",
      "6 experts",
      "2 active experts",
      "40k context",
      "qwen3",
      "finetune",
      "qwen3_moe",
      "creative",
      "all use cases",
      "roleplay",
      "merge",
      "llama-cpp",
      "gguf-my-repo"
    ],
    "base_model": "DavidAU/Qwen3-MOE-6x0.6B-3.6B-Writing-On-Fire-Uncensored",
    "pipeline_tag": "text-generation",
    "frontmatter": {
      "license": "apache-2.0",
      "library_name": "transformers",
      "language": [
        "en",
        "fr",
        "zh",
        "de"
      ],
      "tags": [
        "programming",
        "code generation",
        "code",
        "codeqwen",
        "moe",
        "coding",
        "coder",
        "qwen2",
        "chat",
        "qwen",
        "qwen-coder",
        "mixture of experts",
        "6 experts",
        "2 active experts",
        "40k context",
        "qwen3",
        "finetune",
        "qwen3_moe",
        "creative",
        "all use cases",
        "roleplay",
        "merge",
        "llama-cpp",
        "gguf-my-repo"
      ],
      "base_model": "DavidAU/Qwen3-MOE-6x0.6B-3.6B-Writing-On-Fire-Uncensored",
      "pipeline_tag": "text-generation"
    },
    "hero_image_url": "",
    "summary": "This model was converted to GGUF format from DavidAU/Qwen3-MOE-6x0.6B-3.6B-Writing-On-Fire-Uncensored using llama.cpp via the ggml.ai's GGUF-my-repo space. Refer to the original model card for more details on the model.",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: apache-2.0\nlibrary_name: transformers\nlanguage:\n- en\n- fr\n- zh\n- de\ntags:\n- programming\n- code generation\n- code\n- codeqwen\n- moe\n- coding\n- coder\n- qwen2\n- chat\n- qwen\n- qwen-coder\n- mixture of experts\n- 6 experts\n- 2 active experts\n- 40k context\n- qwen3\n- finetune\n- qwen3_moe\n- creative\n- all use cases\n- roleplay\n- merge\n- llama-cpp\n- gguf-my-repo\nbase_model: DavidAU/Qwen3-MOE-6x0.6B-3.6B-Writing-On-Fire-Uncensored\npipeline_tag: text-generation\n---\n\n# Solaren/Qwen3-MOE-6x0.6B-3.6B-Writing-On-Fire-Uncensored-Q8_0-GGUF\nThis model was converted to GGUF format from [`DavidAU/Qwen3-MOE-6x0.6B-3.6B-Writing-On-Fire-Uncensored`](https://huggingface.co/DavidAU/Qwen3-MOE-6x0.6B-3.6B-Writing-On-Fire-Uncensored) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.\nRefer to the [original model card](https://huggingface.co/DavidAU/Qwen3-MOE-6x0.6B-3.6B-Writing-On-Fire-Uncensored) for more details on the model.\n\n## Use with llama.cpp\nInstall llama.cpp through brew (works on Mac and Linux)\n\n```bash\nbrew install llama.cpp\n\n```\nInvoke the llama.cpp server or the CLI.\n\n### CLI:\n```bash\nllama-cli --hf-repo Solaren/Qwen3-MOE-6x0.6B-3.6B-Writing-On-Fire-Uncensored-Q8_0-GGUF --hf-file qwen3-moe-6x0.6b-3.6b-writing-on-fire-uncensored-q8_0.gguf -p \"The meaning to life and the universe is\"\n```\n\n### Server:\n```bash\nllama-server --hf-repo Solaren/Qwen3-MOE-6x0.6B-3.6B-Writing-On-Fire-Uncensored-Q8_0-GGUF --hf-file qwen3-moe-6x0.6b-3.6b-writing-on-fire-uncensored-q8_0.gguf -c 2048\n```\n\nNote: You can also use this checkpoint directly through the [usage steps](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) listed in the Llama.cpp repo as well.\n\nStep 1: Clone llama.cpp from GitHub.\n```\ngit clone https://github.com/ggerganov/llama.cpp\n```\n\nStep 2: Move into the llama.cpp folder and build it with `LLAMA_CURL=1` flag along with other hardware-specific flags (for ex: LLAMA_CUDA=1 for Nvidia GPUs on Linux).\n```\ncd llama.cpp && LLAMA_CURL=1 make\n```\n\nStep 3: Run inference through the main binary.\n```\n./llama-cli --hf-repo Solaren/Qwen3-MOE-6x0.6B-3.6B-Writing-On-Fire-Uncensored-Q8_0-GGUF --hf-file qwen3-moe-6x0.6b-3.6b-writing-on-fire-uncensored-q8_0.gguf -p \"The meaning to life and the universe is\"\n```\nor \n```\n./llama-server --hf-repo Solaren/Qwen3-MOE-6x0.6B-3.6B-Writing-On-Fire-Uncensored-Q8_0-GGUF --hf-file qwen3-moe-6x0.6b-3.6b-writing-on-fire-uncensored-q8_0.gguf -c 2048\n```\n",
    "related_quantizations": []
  },
  "tags": [
    "transformers",
    "gguf",
    "programming",
    "code generation",
    "code",
    "codeqwen",
    "moe",
    "coding",
    "coder",
    "qwen2",
    "chat",
    "qwen",
    "qwen-coder",
    "mixture of experts",
    "6 experts",
    "2 active experts",
    "40k context",
    "qwen3",
    "finetune",
    "qwen3_moe",
    "creative",
    "all use cases",
    "roleplay",
    "merge",
    "llama-cpp",
    "gguf-my-repo",
    "text-generation",
    "en",
    "fr",
    "zh",
    "de",
    "base_model:DavidAU/Qwen3-MOE-6x0.6B-3.6B-Writing-On-Fire-Uncensored",
    "base_model:quantized:DavidAU/Qwen3-MOE-6x0.6B-3.6B-Writing-On-Fire-Uncensored",
    "license:apache-2.0",
    "endpoints_compatible",
    "region:us"
  ],
  "likes": 8,
  "downloads": 485,
  "gated": false,
  "private": false,
  "last_modified": "2025-08-27T07:29:28.000Z",
  "created_at": "2025-08-27T07:29:18.000Z",
  "pipeline_tag": "text-generation",
  "library_name": "transformers"
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "68aeb3ced494b8cb92910cd3",
  "id": "Solaren/Qwen3-MOE-6x0.6B-3.6B-Writing-On-Fire-Uncensored-Q8_0-GGUF",
  "modelId": "Solaren/Qwen3-MOE-6x0.6B-3.6B-Writing-On-Fire-Uncensored-Q8_0-GGUF",
  "sha": "c27c0e3046355671025a8e52bfd4a9d9bd1276dc",
  "createdAt": "2025-08-27T07:29:18.000Z",
  "lastModified": "2025-08-27T07:29:28.000Z",
  "author": "Solaren",
  "downloads": 485,
  "likes": 8,
  "gated": false,
  "private": false,
  "pipeline_tag": "text-generation",
  "library_name": "transformers",
  "siblings_count": 3
}