Model Intelligence Sheet
Volko76/Qwen3.5-122B-A10B-UD-IQ4_XS-GGUF-MERGED overview
From unsloth: https://huggingface.co/unsloth/Qwen3.5-122B-A10B-GGUF

I took the UD-IQ4_XS quantization and merged its three shards (0001.gguf, 0002.gguf, 0003.gguf) into a single .gguf file.
I used the llama.cpp gguf-split tool for the merge: https://github.com/ggml-org/llama.cpp/tree/master/tools/gguf-split

This is useful, for example, with vLLM, which does not accept multi-shard GGUF files.

Feel free to check out my website, https://cheapllm.shop, for unlimited free inference of this model during the beta. After the beta, pricing will be $0.02/M input and $0.10/M output, which should make it the cheapest provider by a wide margin.

If you're interested in D&D/RP, you can also check out https://fablia.fr for free D&D/RP experiences.
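The merge step described above can be sketched in Python as a thin wrapper around the llama.cpp tool. This is a minimal sketch, not necessarily the author's exact procedure. It assumes the tool is installed under the name `llama-gguf-split` (older builds ship it as `gguf-split`) and that its `--merge` mode takes the first shard followed by the output path. The helper names and the shard file name in the usage comment are hypothetical, since the card gives only abbreviated shard names.

```python
import shutil
import subprocess
from pathlib import Path


def build_merge_command(first_shard: str, output: str,
                        binary: str = "llama-gguf-split") -> list[str]:
    """Build the argv that asks gguf-split to merge a sharded GGUF.

    Only the first shard is passed; the tool is expected to locate the
    remaining shards next to it.
    """
    return [binary, "--merge", first_shard, output]


def merge_shards(first_shard: str, output: str,
                 binary: str = "llama-gguf-split") -> None:
    """Run the merge, failing early if the tool or the shard is missing."""
    if shutil.which(binary) is None:
        raise FileNotFoundError(f"{binary} not found on PATH")
    if not Path(first_shard).is_file():
        raise FileNotFoundError(f"first shard not found: {first_shard}")
    # check=True raises CalledProcessError if the tool exits non-zero.
    subprocess.run(build_merge_command(first_shard, output, binary), check=True)


# Usage (hypothetical shard name, adjust to the files you downloaded):
# merge_shards("Qwen3.5-122B-A10B-UD-IQ4_XS-00001-of-00003.gguf",
#              "Qwen3.5-122B-A10B-UD-IQ4_XS.gguf")
```

Keeping command construction separate from execution makes it possible to inspect or log the exact invocation before committing to a merge that writes a file of roughly 56 GB.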
Downloads
616
Likes
0
Pipeline
—
Library
—
Visibility
Public
Access
Open
Repository Files & Downloads
1 file detected
Direct downloads for all repository files
| File | Type | Quantization | Size | Link |
|---|---|---|---|---|
| Qwen3.5-122B-A10B-UD-IQ4_XS.gguf | GGUF | IQ4_XS | 56.09 GB | Download |
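Because the link target in the table did not survive as text, the following sketch shows how a direct download URL for the listed file could be built. It assumes the usual Hugging Face `resolve` URL layout and the `main` revision. The function name is hypothetical, while the repository id and file name are the ones given on this page.

```python
from urllib.parse import quote


def gguf_download_url(repo_id: str, filename: str, revision: str = "main") -> str:
    """Build a direct-download URL for one file in a Hugging Face model repo.

    Assumes the standard https://huggingface.co/<repo>/resolve/<rev>/<file> layout.
    """
    return (
        "https://huggingface.co/"
        f"{quote(repo_id)}/resolve/{quote(revision, safe='')}/{quote(filename)}"
    )


url = gguf_download_url(
    "Volko76/Qwen3.5-122B-A10B-UD-IQ4_XS-GGUF-MERGED",
    "Qwen3.5-122B-A10B-UD-IQ4_XS.gguf",
)
print(url)
```

The resulting URL can be passed to any download tool that supports resuming, which is worth having for a single 56.09 GB file.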
Model Details
Metadata Inspector
Normalized metadata (stored in metadata_json)
{
"metadata": {},
"card_data": {
"base_model": [
"unsloth/Qwen3.5-122B-A10B-GGUF",
"Qwen/Qwen3.5-122B-A10B"
],
"frontmatter": {
"base_model": [
"unsloth/Qwen3.5-122B-A10B-GGUF",
"Qwen/Qwen3.5-122B-A10B"
]
},
"hero_image_url": "",
"summary": "from unsloth : https://huggingface.co/unsloth/Qwen3.5-122B-A10B-GGUF I simply took the UD-IQ4_XS and merged all the shards (0001.gguf, 0002.gguf, 0003.gguf) and merged them into one .gguf I used llama.cpp gguf-split tool to merge : https://github.com/ggml-org/llama.cpp/tree/master/tools/gguf-split Usefull for example for vLLM because they don't allow multishards Feel free to check out my website : https://cheapllm.shop for unlimited FREE inference of this model (during the beta, after that the pricing will be $0.02/M input and $0.10/M output so the cheapest provider by a big margin) If you're interested in D&D/RP, you can also check out https://fablia.fr for free D&D/RP experiences",
"quick_links": [],
"benchmark_table_html": "",
"readme_markdown": "---\nbase_model:\n- unsloth/Qwen3.5-122B-A10B-GGUF\n- Qwen/Qwen3.5-122B-A10B\n---\nfrom unsloth : https://huggingface.co/unsloth/Qwen3.5-122B-A10B-GGUF\n\nI simply took the UD-IQ4_XS and merged all the shards (0001.gguf, 0002.gguf, 0003.gguf) and merged them into one .gguf\nI used llama.cpp gguf-split tool to merge : https://github.com/ggml-org/llama.cpp/tree/master/tools/gguf-split\n\nUsefull for example for vLLM because they don't allow multishards\n\nFeel free to check out my website : https://cheapllm.shop for unlimited FREE inference of this model (during the beta, after that the pricing will be $0.02/M input and $0.10/M output so the cheapest provider by a big margin)\n\nIf you're interested in D&D/RP, you can also check out https://fablia.fr for free D&D/RP experiences",
"related_quantizations": []
},
"tags": [
"gguf",
"base_model:Qwen/Qwen3.5-122B-A10B",
"base_model:quantized:Qwen/Qwen3.5-122B-A10B",
"endpoints_compatible",
"region:us",
"imatrix",
"conversational"
],
"likes": 0,
"downloads": 616,
"gated": false,
"private": false,
"last_modified": "2026-03-31T15:08:09.000Z",
"created_at": "2026-03-29T22:25:52.000Z",
"pipeline_tag": "",
"library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
"_id": "69c9a6f0d18d4aed320f7e18",
"id": "Volko76/Qwen3.5-122B-A10B-UD-IQ4_XS-GGUF-MERGED",
"modelId": "Volko76/Qwen3.5-122B-A10B-UD-IQ4_XS-GGUF-MERGED",
"sha": "157147f629a2eadae0b0635e99f449114c3e1041",
"createdAt": "2026-03-29T22:25:52.000Z",
"lastModified": "2026-03-31T15:08:09.000Z",
"author": "Volko76",
"downloads": 616,
"likes": 0,
"gated": false,
"private": false,
"pipeline_tag": "",
"library_name": "",
"siblings_count": 3
}