Model Intelligence Sheet

wkwong/lunavox-qwen3-tts-gguf overview

This directory contains converted runtime artifacts (ONNX and GGUF) for various Qwen3-TTS model variants. These artifacts are generated from original Hugging Face checkpoints to be used by the LunaVox inference engine.

onnxgguftext-to-speechttsaudiocppquantizedlunavoxlicense:apache-2.0endpoints_compatibleregion:us

Downloads

607

Likes

Pipeline

text-to-speech

Library

—

Visibility

Public

Access

Open

Repository Files & Downloads

2 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
qwen3_tts_predictor.q8_0.gguf	GGUF	—	144.12 MB	Download
qwen3_tts_talker.q5_k.gguf	GGUF	Q5_K	959.63 MB	Download

Model Details Live

Model Slug

wkwong/lunavox-qwen3-tts-gguf

Author

wkwong

Pipeline Task

text-to-speech

Library

—

Created

2025-09-24

Last Modified

2026-03-26

Gated

Private

HF SHA

694d0724541eea9e17e24d49817dc17cc08b7a34

License

apache-2.0

Language

Unknown

Base Model

Qwen/Qwen3-TTS

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "license": "apache-2.0",
    "base_model": "Qwen/Qwen3-TTS",
    "tags": [
      "text-to-speech",
      "tts",
      "audio",
      "cpp",
      "quantized",
      "lunavox",
      "onnx"
    ],
    "frontmatter": {
      "license": "apache-2.0",
      "base_model": "Qwen/Qwen3-TTS",
      "tags": [
        "text-to-speech",
        "tts",
        "audio",
        "cpp",
        "quantized",
        "lunavox",
        "onnx"
      ]
    },
    "hero_image_url": "",
    "summary": "This directory contains converted runtime artifacts (ONNX and GGUF) for various Qwen3-TTS model variants. These artifacts are generated from original Hugging Face checkpoints to be used by the LunaVox inference engine.",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: apache-2.0\nbase_model: Qwen/Qwen3-TTS\ntags:\n- text-to-speech\n- tts\n- audio\n- cpp\n- quantized\n- lunavox\n- onnx\n---\n\n# LunaVox Runtime Models\n\nThis directory contains converted runtime artifacts (ONNX and GGUF) for various Qwen3-TTS model variants. These artifacts are generated from original Hugging Face checkpoints to be used by the LunaVox inference engine.\n\n## Downloading and Setup\n\n### 1. Automatic Source Download in `pull-model`\n`lunavox pull-model` is the only model preparation entrypoint.  \nIf required Hugging Face source weights are missing, CLI prompts in English and downloads after confirmation.\n\n### 2. Model Cache\nOriginal model weights are cached in the standard Hugging Face directory:\n`~/.cache/huggingface/hub/models--Qwen--...`\n\n## Directory Structure\n\nEach model variant subfolder (e.g., `models/base_small/`) typically contains:\n\n- `qwen3_tts_talker.q5_k.gguf`: Quantized Talker model (Llama-based).\n- `qwen3_tts_predictor.q8_0.gguf`: Quantized Predictor model (Llama-based).\n- `qwen3_tts_codec_encoder.fp16.onnx`: Audio Tokenizer (Mimi-based).\n- `qwen3_tts_speaker_encoder.fp16.onnx`: Reference Audio Speaker Encoder.\n- `qwen3_tts_decoder.fp16.onnx`: Audio Decoder (Mimi-based).\n- `embeddings/`: Projected text and codec embeddings.\n- `tokenizer.json`: Hugging Face text tokenizer configuration.\n\n## Available Variants\n- `base`: Qwen3-TTS-12Hz-1.7B-Base\n- `base_small`: Qwen3-TTS-12Hz-0.6B-Base\n- `custom`: Qwen3-TTS-12Hz-1.7B-CustomVoice\n- `custom_small`: Qwen3-TTS-12Hz-0.6B-CustomVoice\n- `design`: Qwen3-TTS-12Hz-1.7B-VoiceDesign\n",
    "related_quantizations": []
  },
  "tags": [
    "onnx",
    "gguf",
    "text-to-speech",
    "tts",
    "audio",
    "cpp",
    "quantized",
    "lunavox",
    "license:apache-2.0",
    "endpoints_compatible",
    "region:us"
  ],
  "likes": 1,
  "downloads": 607,
  "gated": false,
  "private": false,
  "last_modified": "2026-03-26T23:49:44.000Z",
  "created_at": "2025-09-24T17:52:19.000Z",
  "pipeline_tag": "text-to-speech",
  "library_name": ""
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "68d42fd3afe046d61b367f49",
  "id": "wkwong/Lunavox-Qwen3-TTS-GGUF",
  "modelId": "wkwong/Lunavox-Qwen3-TTS-GGUF",
  "sha": "694d0724541eea9e17e24d49817dc17cc08b7a34",
  "createdAt": "2025-09-24T17:52:19.000Z",
  "lastModified": "2026-03-26T23:49:44.000Z",
  "author": "wkwong",
  "downloads": 607,
  "likes": 1,
  "gated": false,
  "private": false,
  "pipeline_tag": "text-to-speech",
  "library_name": "",
  "siblings_count": 123
}