duyntnet/stanta-lelemon-maid-7b-imatrix-gguf (IQ3_S GGUF) is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. Use this page to pick a local model for guIDE or other runtimes. See related models in the same shard below.
duyntnet/stanta-lelemon-maid-7b-imatrix-gguf overview
# Vision/multimodal capabilities:

If you want to use vision functionality:

* You must use the latest versions of Koboldcpp.

To use the multimodal capabilities of this model and use vision you need to load the specified mmproj file, this can be found inside this model repo.

* You can load the mmproj by using the corresponding section in the interface.

# Open LLM Leaderboard Evaluation Results

Detailed results can be found here: https://huggingface.co/datasets/open-llm-leaderboard/details_Nitral-AI__Stanta-Lelemon-Maid-7B

| Metric | Value |
|---|---:|
| Avg. | 69.79 |
| AI2 Reasoning Challenge (25-Shot) | 67.58 |
| HellaSwag (10-Shot) | 86.03 |
| MMLU (5-Shot) | 64.79 |
| TruthfulQA (0-shot) | 59.58 |
| Winogrande (5-shot) | 79.64 |
| GSM8k (5-shot) | 61.11 |
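The Avg. row in the leaderboard results above appears to be the plain arithmetic mean of the six benchmark scores. A minimal sketch that checks this, with the scores copied from the table (the variable names are illustrative only):

```python
# Benchmark scores copied from the leaderboard table above.
scores = {
    "AI2 Reasoning Challenge (25-Shot)": 67.58,
    "HellaSwag (10-Shot)": 86.03,
    "MMLU (5-Shot)": 64.79,
    "TruthfulQA (0-shot)": 59.58,
    "Winogrande (5-shot)": 79.64,
    "GSM8k (5-shot)": 61.11,
}

# Unweighted mean of the six scores, for comparison with the reported Avg.
average = sum(scores.values()) / len(scores)
print(f"{average:.2f}")
```

The printed value can be compared with the reported Avg. of 69.79.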
Repository Files & Downloads
| File | Type | Quantization | Size | Link |
|---|---|---|---|---|
| Stanta-Lelemon-Maid-7B-IQ1_M.gguf | GGUF | IQ1_M | 1.63 GB | Download |
| Stanta-Lelemon-Maid-7B-IQ1_S.gguf | GGUF | IQ1_S | 1.50 GB | Download |
| Stanta-Lelemon-Maid-7B-IQ2_M.gguf | GGUF | IQ2_M | 2.33 GB | Download |
| Stanta-Lelemon-Maid-7B-IQ2_S.gguf | GGUF | IQ2_S | 2.15 GB | Download |
| Stanta-Lelemon-Maid-7B-IQ2_XS.gguf | GGUF | IQ2_XS | 2.05 GB | Download |
| Stanta-Lelemon-Maid-7B-IQ2_XXS.gguf | GGUF | IQ2_XXS | 1.85 GB | Download |
| Stanta-Lelemon-Maid-7B-IQ3_M.gguf | GGUF | IQ3_M | 3.06 GB | Download |
| Stanta-Lelemon-Maid-7B-IQ3_S.gguf | GGUF | IQ3_S | 2.96 GB | Download |
| Stanta-Lelemon-Maid-7B-IQ3_XS.gguf | GGUF | IQ3_XS | 2.81 GB | Download |
| Stanta-Lelemon-Maid-7B-IQ3_XXS.gguf | GGUF | IQ3_XXS | 2.63 GB | Download |
| Stanta-Lelemon-Maid-7B-IQ4_NL.gguf | GGUF | IQ4_NL | 3.84 GB | Download |
| Stanta-Lelemon-Maid-7B-IQ4_XS.gguf | GGUF | IQ4_XS | 3.64 GB | Download |
| Stanta-Lelemon-Maid-7B-Q2_K.gguf | GGUF | Q2_K | 2.53 GB | Download |
| Stanta-Lelemon-Maid-7B-Q2_K_S.gguf | GGUF | Q2_K_S | 2.36 GB | Download |
| Stanta-Lelemon-Maid-7B-Q3_K_L.gguf | GGUF | Q3_K_L | 3.56 GB | Download |
| Stanta-Lelemon-Maid-7B-Q3_K_M.gguf | GGUF | Q3_K_M | 3.28 GB | Download |
| Stanta-Lelemon-Maid-7B-Q3_K_S.gguf | GGUF | Q3_K_S | 2.95 GB | Download |
| Stanta-Lelemon-Maid-7B-Q4_0.gguf | GGUF | Q4_0 | 3.84 GB | Download |
| Stanta-Lelemon-Maid-7B-Q4_1.gguf | GGUF | Q4_1 | 4.24 GB | Download |
| Stanta-Lelemon-Maid-7B-Q4_K_M.gguf | GGUF | Q4_K_M | 4.07 GB | Download |
| Stanta-Lelemon-Maid-7B-Q4_K_S.gguf | GGUF | Q4_K_S | 3.86 GB | Download |
| Stanta-Lelemon-Maid-7B-Q5_0.gguf | GGUF | Q5_0 | 4.67 GB | Download |
| Stanta-Lelemon-Maid-7B-Q5_1.gguf | GGUF | Q5_1 | 5.07 GB | Download |
| Stanta-Lelemon-Maid-7B-Q5_K_M.gguf | GGUF | Q5_K_M | 4.78 GB | Download |
| Stanta-Lelemon-Maid-7B-Q5_K_S.gguf | GGUF | Q5_K_S | 4.65 GB | Download |
| Stanta-Lelemon-Maid-7B-Q6_K.gguf | GGUF | Q6_K | 5.53 GB | Download |
| Stanta-Lelemon-Maid-7B-Q8_0.gguf | GGUF | Q8_0 | 7.17 GB | Download |
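One rough way to use the table above is to pick the largest file that fits a given size budget, treating file size as a loose proxy for retained precision. The sketch below is illustrative, not part of the repository: the helper names are hypothetical, the sizes are copied from the table, and the URL assumes the usual Hugging Face `resolve/main` pattern with the files on the `main` branch. File size is only a lower bound on memory use, since the runtime also needs room for the context.

```python
REPO_ID = "duyntnet/Stanta-Lelemon-Maid-7B-imatrix-GGUF"

# Quantization -> file size in GB, copied from the table above.
SIZES_GB = {
    "IQ1_S": 1.50, "IQ1_M": 1.63, "IQ2_XXS": 1.85, "IQ2_XS": 2.05,
    "IQ2_S": 2.15, "IQ2_M": 2.33, "Q2_K_S": 2.36, "Q2_K": 2.53,
    "IQ3_XXS": 2.63, "IQ3_XS": 2.81, "Q3_K_S": 2.95, "IQ3_S": 2.96,
    "IQ3_M": 3.06, "Q3_K_M": 3.28, "Q3_K_L": 3.56, "IQ4_XS": 3.64,
    "IQ4_NL": 3.84, "Q4_0": 3.84, "Q4_K_S": 3.86, "Q4_K_M": 4.07,
    "Q4_1": 4.24, "Q5_K_S": 4.65, "Q5_0": 4.67, "Q5_K_M": 4.78,
    "Q5_1": 5.07, "Q6_K": 5.53, "Q8_0": 7.17,
}


def pick_largest_fitting(budget_gb):
    """Return the quantization with the largest file size <= budget_gb, or None."""
    fitting = [(size, name) for name, size in SIZES_GB.items() if size <= budget_gb]
    if not fitting:
        return None
    # max() compares size first; ties fall back to the name, which is arbitrary.
    return max(fitting)[1]


def gguf_filename(quant):
    """Build the file name following the naming scheme used in the table."""
    return f"Stanta-Lelemon-Maid-7B-{quant}.gguf"


def download_url(quant):
    """Assumed direct-download URL (Hugging Face resolve/main pattern)."""
    return f"https://huggingface.co/{REPO_ID}/resolve/main/{gguf_filename(quant)}"


if __name__ == "__main__":
    choice = pick_largest_fitting(3.0)
    print(choice, download_url(choice))
```

With a 3.0 GB budget this selects IQ3_S (2.96 GB), the quantization this page highlights. A stricter budget that leaves headroom for the context would select a smaller file.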
Model Details
Metadata Inspector
Normalized metadata (stored in metadata_json)
{
"metadata": {},
"card_data": {
"license": "other",
"language": [
"en"
],
"pipeline_tag": "text-generation",
"inference": false,
"tags": [
"transformers",
"gguf",
"imatrix",
"Stanta-Lelemon-Maid-7B"
],
"frontmatter": {
"license": "other",
"language": [
"en"
],
"pipeline_tag": "text-generation",
"inference": "false",
"tags": [
"transformers",
"gguf",
"imatrix",
"Stanta-Lelemon-Maid-7B"
]
},
"hero_image_url": "https://cdn-uploads.huggingface.co/production/uploads/642265bc01c62c1e4102dc36/pQ5xEj4JOyNF_dvBkuIa6.png",
"summary": "!image/png # Vision/multimodal capabilities: If you want to use vision functionality: * You must use the latest versions of Koboldcpp. To use the multimodal capabilities of this model and use **vision** you need to load the specified **mmproj** file, this can be found inside this model repo. * You can load the **mmproj** by using the corresponding section in the interface: !image/png # Open LLM Leaderboard Evaluation Results Detailed results can be found here | Metric |Value| |---------------------------------|----:| |Avg. |69.79| |AI2 Reasoning Challenge (25-Shot)|67.58| |HellaSwag (10-Shot) |86.03| |MMLU (5-Shot) |64.79| |TruthfulQA (0-shot) |59.58| |Winogrande (5-shot) |79.64| |GSM8k (5-shot) |61.11|",
"quick_links": [],
"benchmark_table_html": "",
"readme_markdown": "---\nlicense: other\nlanguage:\n- en\npipeline_tag: text-generation\ninference: false\ntags:\n- transformers\n- gguf\n- imatrix\n- Stanta-Lelemon-Maid-7B\n---\n\nQuantizations of https://huggingface.co/ChaoticNeutrals/Stanta-Lelemon-Maid-7B\n\n\n### Open source inference clients/UIs\n* [llama.cpp](https://github.com/ggerganov/llama.cpp)\n* [KoboldCPP](https://github.com/LostRuins/koboldcpp)\n* [text-generation-webui](https://github.com/oobabooga/text-generation-webui)\n* [jan](https://github.com/janhq/jan)\n\n### Closed source inference clients/UIs\n* [LM Studio](https://lmstudio.ai/)\n* [Backyard AI](https://backyard.ai/)\n* More will be added...\n---\n\n# From original readme\n\n\n\n# Vision/multimodal capabilities:\n\n If you want to use vision functionality:\n\n * You must use the latest versions of [Koboldcpp](https://github.com/LostRuins/koboldcpp).\n \nTo use the multimodal capabilities of this model and use **vision** you need to load the specified **mmproj** file, this can be found inside this model repo.\n \n * You can load the **mmproj** by using the corresponding section in the interface:\n\n \n# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)\nDetailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Nitral-AI__Stanta-Lelemon-Maid-7B)\n\n| Metric |Value|\n|---------------------------------|----:|\n|Avg. |69.79|\n|AI2 Reasoning Challenge (25-Shot)|67.58|\n|HellaSwag (10-Shot) |86.03|\n|MMLU (5-Shot) |64.79|\n|TruthfulQA (0-shot) |59.58|\n|Winogrande (5-shot) |79.64|\n|GSM8k (5-shot) |61.11|",
"related_quantizations": []
},
"tags": [
"transformers",
"gguf",
"imatrix",
"Stanta-Lelemon-Maid-7B",
"text-generation",
"en",
"license:other",
"region:us"
],
"likes": 0,
"downloads": 618,
"gated": false,
"private": false,
"last_modified": "2025-06-20T13:05:34.000Z",
"created_at": "2025-06-20T12:05:15.000Z",
"pipeline_tag": "text-generation",
"library_name": "transformers"
}
Source payload excerpt (from Hugging Face API)
{
"_id": "68554e7b3c94d2806c12ba66",
"id": "duyntnet/Stanta-Lelemon-Maid-7B-imatrix-GGUF",
"modelId": "duyntnet/Stanta-Lelemon-Maid-7B-imatrix-GGUF",
"sha": "f9cd193808459592f4c38b073fcfc43e2d3f05d5",
"createdAt": "2025-06-20T12:05:15.000Z",
"lastModified": "2025-06-20T13:05:34.000Z",
"author": "duyntnet",
"downloads": 618,
"likes": 0,
"gated": false,
"private": false,
"pipeline_tag": "text-generation",
"library_name": "transformers",
"siblings_count": 29
}