melvin56/qwen3-8b-abliterated-gguf Q2_K GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.
melvin56/qwen3-8b-abliterated-gguf overview
Original Model : huihui-ai/Qwen3-8B-abliterated Llama.cpp build: 0208355 (5342) I used imatrix to create all these quants using this Dataset. --- | | CPU (AVX2) | CPU (ARM NEON) | Metal | cuBLAS | rocBLAS | SYCL | CLBlast | Vulkan | Kompute | | :------------ | :---------: | :------------: | :---: | :----: | :-----: | :---: | :------: | :----: | :------: | | K-quants | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ 🐢5 | ✅ 🐢5 | ❌ | | I-quants | ✅ 🐢4 | ✅ 🐢4 | ✅ 🐢4 | ✅ | ✅ | Partial¹ | ❌ | ❌ | ❌ |
Repository Files & Downloads
| File | Type | Quantization | Size | Link |
|---|---|---|---|---|
| qwen3-8b-abliterated-BF16.gguf | GGUF | BF16 | 15.26 GB | Download |
| qwen3-8b-abliterated-IQ4_XS.gguf | GGUF | IQ4_XS | 4.25 GB | Download |
| qwen3-8b-abliterated-Q2_K.gguf | GGUF | Q2_K | 3.06 GB | Download |
| qwen3-8b-abliterated-Q3_K_M.gguf | GGUF | Q3_K_M | 3.84 GB | Download |
| qwen3-8b-abliterated-Q4_K_M.gguf | GGUF | Q4_K_M | 4.68 GB | Download |
| qwen3-8b-abliterated-Q5_K_M.gguf | GGUF | Q5_K_M | 5.45 GB | Download |
| qwen3-8b-abliterated-Q6_K.gguf | GGUF | Q6_K | 6.26 GB | Download |
| qwen3-8b-abliterated-Q8_0.gguf | GGUF | — | 8.11 GB | Download |
Model Details Live
Metadata Inspector
Normalized metadata (stored in metadata_json)
{
"metadata": {},
"card_data": {
"license": "apache-2.0",
"license_link": "https://huggingface.co/Qwen/Qwen3-8B/blob/main/LICENSE",
"pipeline_tag": "text-generation",
"base_model": [
"huihui-ai/Qwen3-8B-abliterated"
],
"tags": [
"chat",
"abliterated",
"uncensored"
],
"extra_gated_prompt": "**Usage Warnings**\n\n“**Risk of Sensitive or Controversial Outputs**“: This model’s safety filtering has been significantly reduced, potentially generating sensitive, controversial, or inappropriate content. Users should exercise caution and rigorously review generated outputs.\n“**Not Suitable for All Audiences**:“ Due to limited content filtering, the model’s outputs may be inappropriate for public settings, underage users, or applications requiring high security.\n“**Legal and Ethical Responsibilities**“: Users must ensure their usage complies with local laws and ethical standards. Generated content may carry legal or ethical risks, and users are solely responsible for any consequences.\n“**Research and Experimental Use**“: It is recommended to use this model for research, testing, or controlled environments, avoiding direct use in production or public-facing commercial applications.\n“**Monitoring and Review Recommendations**“: Users are strongly advised to monitor model outputs in real-time and conduct manual reviews when necessary to prevent the dissemination of inappropriate content.\n“**No Default Safety Guarantees**“: Unlike standard models, this model has not undergone rigorous safety optimization. huihui.ai bears no responsibility for any consequences arising from its use.",
"frontmatter": {
"license": "apache-2.0",
"license_link": "https://huggingface.co/Qwen/Qwen3-8B/blob/main/LICENSE",
"pipeline_tag": "text-generation",
"base_model": [
"huihui-ai/Qwen3-8B-abliterated"
],
"tags": [
"chat",
"abliterated",
"uncensored"
],
"extra_gated_prompt": ">-"
},
"hero_image_url": "",
"summary": "Original Model : huihui-ai/Qwen3-8B-abliterated Llama.cpp build: 0208355 (5342) I used imatrix to create all these quants using this Dataset. --- | | CPU (AVX2) | CPU (ARM NEON) | Metal | cuBLAS | rocBLAS | SYCL | CLBlast | Vulkan | Kompute | | :------------ | :---------: | :------------: | :---: | :----: | :-----: | :---: | :------: | :----: | :------: | | K-quants | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ 🐢5 | ✅ 🐢5 | ❌ | | I-quants | ✅ 🐢4 | ✅ 🐢4 | ✅ 🐢4 | ✅ | ✅ | Partial¹ | ❌ | ❌ | ❌ | `` ✅: feature works 🚫: feature does not work ❓: unknown, please contribute if you can test it youself 🐢: feature is slow ¹: IQ3_S and IQ1_S, see #5886 ²: Only with -ngl 0 ³: Inference is 50% slower ⁴: Slower than K-quants of comparable size ⁵: Slower than cuBLAS/rocBLAS on similar cards ⁶: Only q8_0 and iq4_nl ``",
"quick_links": [],
"benchmark_table_html": "",
"readme_markdown": "---\nlicense: apache-2.0\nlicense_link: https://huggingface.co/Qwen/Qwen3-8B/blob/main/LICENSE\npipeline_tag: text-generation\nbase_model:\n- huihui-ai/Qwen3-8B-abliterated\ntags:\n- chat\n- abliterated\n- uncensored\nextra_gated_prompt: >-\n **Usage Warnings**\n\n\n “**Risk of Sensitive or Controversial Outputs**“: This model’s safety\n filtering has been significantly reduced, potentially generating sensitive,\n controversial, or inappropriate content. Users should exercise caution and\n rigorously review generated outputs.\n\n “**Not Suitable for All Audiences**:“ Due to limited content filtering, the\n model’s outputs may be inappropriate for public settings, underage users, or\n applications requiring high security.\n\n “**Legal and Ethical Responsibilities**“: Users must ensure their usage\n complies with local laws and ethical standards. Generated content may carry\n legal or ethical risks, and users are solely responsible for any consequences.\n\n “**Research and Experimental Use**“: It is recommended to use this model for\n research, testing, or controlled environments, avoiding direct use in\n production or public-facing commercial applications.\n\n “**Monitoring and Review Recommendations**“: Users are strongly advised to\n monitor model outputs in real-time and conduct manual reviews when necessary\n to prevent the dissemination of inappropriate content.\n\n “**No Default Safety Guarantees**“: Unlike standard models, this model has not\n undergone rigorous safety optimization. huihui.ai bears no responsibility for\n any consequences arising from its use.\n---\n\n# Melvin56/Qwen3-8B-abliterated-GGUF\n\nOriginal Model : [huihui-ai/Qwen3-8B-abliterated](https://huggingface.co/huihui-ai/Qwen3-8B-abliterated)\n\nLlama.cpp build: 0208355 (5342)\n\nI used imatrix to create all these quants using this [Dataset](https://gist.github.com/tristandruyen/9e207a95c7d75ddf37525d353e00659c/#file-calibration_data_v5_rc-txt).\n\n---\n\n| | CPU (AVX2) | CPU (ARM NEON) | Metal | cuBLAS | rocBLAS | SYCL | CLBlast | Vulkan | Kompute |\n| :------------ | :---------: | :------------: | :---: | :----: | :-----: | :---: | :------: | :----: | :------: |\n| K-quants | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ 🐢5 | ✅ 🐢5 | ❌ |\n| I-quants | ✅ 🐢4 | ✅ 🐢4 | ✅ 🐢4 | ✅ | ✅ | Partial¹ | ❌ | ❌ | ❌ |\n```\n✅: feature works\n🚫: feature does not work\n❓: unknown, please contribute if you can test it youself\n🐢: feature is slow\n¹: IQ3_S and IQ1_S, see #5886\n²: Only with -ngl 0\n³: Inference is 50% slower\n⁴: Slower than K-quants of comparable size\n⁵: Slower than cuBLAS/rocBLAS on similar cards\n⁶: Only q8_0 and iq4_nl\n```",
"related_quantizations": []
},
"tags": [
"gguf",
"chat",
"abliterated",
"uncensored",
"text-generation",
"base_model:huihui-ai/Qwen3-8B-abliterated",
"base_model:quantized:huihui-ai/Qwen3-8B-abliterated",
"license:apache-2.0",
"endpoints_compatible",
"region:us"
],
"likes": 0,
"downloads": 128,
"gated": false,
"private": false,
"last_modified": "2025-05-11T09:05:32.000Z",
"created_at": "2025-05-11T01:17:01.000Z",
"pipeline_tag": "text-generation",
"library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
"_id": "681ffa8df8c808f784a25561",
"id": "Melvin56/Qwen3-8B-abliterated-GGUF",
"modelId": "Melvin56/Qwen3-8B-abliterated-GGUF",
"sha": "6ece045abb3caec6972c0a8a17b1df9c6cbf89e4",
"createdAt": "2025-05-11T01:17:01.000Z",
"lastModified": "2025-05-11T09:05:32.000Z",
"author": "Melvin56",
"downloads": 128,
"likes": 0,
"gated": false,
"private": false,
"pipeline_tag": "text-generation",
"library_name": "",
"siblings_count": 11
}