prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF overview
SAGE-MM-Qwen3-VL-4B-SFT_RL from allenai is a 4B-parameter vision-language model post-trained with reinforcement learning (RL). It starts from the SAGE-MM-Qwen3-VL-4B-SFT checkpoint, which is itself built on Qwen/Qwen3-VL-4B-Instruct, and serves as the core decision-maker in the SAGE (Smart Any-Horizon Agent) system for long-video reasoning. The RL stage is reported to improve performance (e.g., +4.1% over SFT per experiments). The model operates in two stages: Stage-1 analyzes initial frames and metadata to classify a query as single-turn or multi-turn, while Stage-2 iteratively invokes JSON-formatted tools such as web-search, transcribe-speech (ASR), ground-event (temporal localization), extract-video-parts (frames/subclips), and analyze, updating its context until the query is resolved. It is designed for video Q&A beyond fixed horizons (sports, narratives, events) and outputs action strings, so it requires the SAGE GitHub runtime to parse and execute the tool calls. The model is released under Apache 2.0 for research/educational use per Ai2 guidelines, and this repository provides GGUF quantizations of it.
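The two-stage behaviour described above can be pictured as a small dispatch loop. The following is a minimal sketch only, not the SAGE runtime: the tool names are taken from the description, but the JSON action schema (`type`, `tool`, `args`, `text`), the stub handlers, and the `run_agent` and `scripted_policy` helpers are assumptions made up for illustration.

```python
import json
from typing import Callable, Dict, List


def _stub(name: str) -> Callable[[dict], str]:
    """Build a placeholder handler; a real runtime would run the actual tool."""
    def handler(args: dict) -> str:
        return f"[{name} result for args {sorted(args)}]"
    return handler


# Tool names as listed in the model description; handlers are hypothetical stubs.
TOOLS: Dict[str, Callable[[dict], str]] = {
    name: _stub(name)
    for name in ("web-search", "transcribe-speech", "ground-event",
                 "extract-video-parts", "analyze")
}


def run_agent(policy: Callable[[List[str]], str], question: str,
              max_turns: int = 8) -> str:
    """Call the policy until it emits an answer action or the turn budget ends."""
    context = [f"question: {question}"]
    for _ in range(max_turns):
        action = json.loads(policy(context))  # hypothetical action schema
        if action["type"] == "answer":
            return action["text"]
        tool = TOOLS.get(action["tool"])
        if tool is None:
            context.append(f"error: unknown tool {action['tool']}")
            continue
        # Feed the tool output back so the next turn sees the updated context.
        context.append(tool(action.get("args", {})))
    return "no answer within turn budget"


def scripted_policy(context: List[str]) -> str:
    """Stand-in for the model: one tool call, then an answer."""
    if len(context) == 1:
        return json.dumps({"type": "tool", "tool": "transcribe-speech",
                           "args": {"start": 0, "end": 30}})
    return json.dumps({"type": "answer",
                       "text": f"answered after {len(context) - 1} tool call(s)"})


print(run_agent(scripted_policy, "Who scored the first goal?"))
```

In this sketch a single-turn query is simply the case where the policy answers on its first call, and a multi-turn query is any run that goes through one or more tool calls first.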
Repository Files & Downloads
| File | Type | Quantization | Size (binary units: 1 GB = 1024³ bytes) | Link |
|---|---|---|---|---|
| SAGE-MM-Qwen3-VL-4B-SFT_RL.IQ4_XS.gguf | GGUF | IQ4_XS | 2.32 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.Q2_K.gguf | GGUF | Q2_K | 1.67 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.Q3_K_L.gguf | GGUF | Q3_K_L | 2.24 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.Q3_K_M.gguf | GGUF | Q3_K_M | 2.09 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.Q3_K_S.gguf | GGUF | Q3_K_S | 1.91 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.Q4_K_M.gguf | GGUF | Q4_K_M | 2.53 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.Q4_K_S.gguf | GGUF | Q4_K_S | 2.42 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.Q5_K_M.gguf | GGUF | Q5_K_M | 2.94 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.Q5_K_S.gguf | GGUF | Q5_K_S | 2.88 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.Q6_K.gguf | GGUF | Q6_K | 3.38 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.Q8_0.gguf | GGUF | Q8_0 | 4.37 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.f16.gguf | GGUF | F16 | 8.22 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ1_M.gguf | GGUF | IQ1_M | 1.17 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ1_S.gguf | GGUF | IQ1_S | 1.10 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ2_M.gguf | GGUF | IQ2_M | 1.56 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ2_S.gguf | GGUF | IQ2_S | 1.48 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ2_XS.gguf | GGUF | IQ2_XS | 1.38 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ2_XXS.gguf | GGUF | IQ2_XXS | 1.28 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ3_M.gguf | GGUF | IQ3_M | 1.98 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ3_S.gguf | GGUF | IQ3_S | 1.92 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ3_XS.gguf | GGUF | IQ3_XS | 1.85 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ3_XXS.gguf | GGUF | IQ3_XXS | 1.71 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ4_NL.gguf | GGUF | IQ4_NL | 2.42 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ4_XS.gguf | GGUF | IQ4_XS | 2.31 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q2_K.gguf | GGUF | Q2_K | 1.67 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q2_K_S.gguf | GGUF | Q2_K_S | 1.57 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q3_K_L.gguf | GGUF | Q3_K_L | 2.24 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q3_K_M.gguf | GGUF | Q3_K_M | 2.09 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q3_K_S.gguf | GGUF | Q3_K_S | 1.91 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q4_0.gguf | GGUF | Q4_0 | 2.42 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q4_1.gguf | GGUF | Q4_1 | 2.64 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q4_K_M.gguf | GGUF | Q4_K_M | 2.53 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q4_K_S.gguf | GGUF | Q4_K_S | 2.42 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q5_K_M.gguf | GGUF | Q5_K_M | 2.94 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q5_K_S.gguf | GGUF | Q5_K_S | 2.88 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q6_K.gguf | GGUF | Q6_K | 3.38 GB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.imatrix.gguf | GGUF | — | 3.69 MB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.mmproj-Q8_0.gguf | GGUF | Q8_0 | 432.94 MB | Download |
| SAGE-MM-Qwen3-VL-4B-SFT_RL.mmproj-f16.gguf | GGUF | F16 | 797.44 MB | Download |
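The README embedded in the metadata further down lists the same files with slightly larger sizes (for example 2.49 GB for IQ4_XS against 2.32 GB in this table). The ratio matches 1024³/10⁹, which suggests, as an assumption rather than something the page states, that this table reports binary units while the README reports decimal ones. The sketch below checks that arithmetic and shows one possible way to pick a file under a size budget; `gib_to_gb`, `largest_fitting`, and the `SIZES` subset are illustrative names, not part of any tool.

```python
GIB = 1024 ** 3  # bytes in one binary gigabyte


def gib_to_gb(size_gib: float) -> float:
    """Convert a size in binary gigabytes to decimal gigabytes."""
    return size_gib * GIB / 1e9


# A few rows copied from the table above: quantization -> size as listed.
SIZES = {"Q2_K": 1.67, "Q4_K_M": 2.53, "Q6_K": 3.38, "Q8_0": 4.37, "F16": 8.22}


def largest_fitting(sizes: dict, budget: float):
    """Return the largest quantization whose listed size is <= budget, else None."""
    fitting = {quant: size for quant, size in sizes.items() if size <= budget}
    return max(fitting, key=fitting.get) if fitting else None


print(round(gib_to_gb(2.32), 2))    # IQ4_XS size from the table, in decimal GB
print(largest_fitting(SIZES, 4.0))  # largest listed file not exceeding 4.0
```

File size is only a lower bound on memory use. Image or video input with llama.cpp also needs one of the mmproj files loaded next to the main model, and the context cache adds to the total, so the budget passed in should leave headroom.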
Model Details
Metadata Inspector
Normalized metadata (stored in metadata_json)
{
"metadata": {},
"card_data": {
"license": "apache-2.0",
"language": [
"en"
],
"base_model": [
"allenai/SAGE-MM-Qwen3-VL-4B-SFT_RL"
],
"pipeline_tag": "video-text-to-text",
"library_name": "transformers",
"tags": [
"text-generation-inference",
"llama.cpp"
],
"frontmatter": {
"license": "apache-2.0",
"language": [
"en"
],
"base_model": [
"allenai/SAGE-MM-Qwen3-VL-4B-SFT_RL"
],
"pipeline_tag": "video-text-to-text",
"library_name": "transformers",
"tags": [
"text-generation-inference",
"llama.cpp"
]
},
"hero_image_url": "https://www.nethype.de/huggingface_embed/quantpplgraph.png",
"summary": "> The SAGE-MM-Qwen3-VL-4B-SFT_RL from allenai is a 4B-parameter reinforcement learning (RL) post-trained vision-language model, further refined from the SAGE-MM-Qwen3-VL-4B-SFT base via RL on top of Qwen/Qwen3-VL-4B-Instruct, serving as the core decision-maker in the SAGE (Smart Any-Horizon Agent) system for long video reasoning with improved performance (e.g., +4.1% over SFT per experiments) through two-stage operation: Stage-1 analyzes initial frames/metadata to classify queries as single-turn or multi-turn, while Stage-2 iteratively invokes JSON-formatted tools like web-search, transcribe-speech (ASR), ground-event (temporal localization), extract-video-parts (frames/subclips), and analyze for dynamic context updates until resolution. Designed for extended video Q&A beyond fixed horizons—handling sports, narratives, events—it requires the SAGE GitHub runtime for tool parsing/execution and outputs action strings for robust, any-length video understanding under Apache 2.0 for research/educational use per Ai2 guidelines, with GGUF quantizations available.",
"quick_links": [],
"benchmark_table_html": "",
"readme_markdown": "---\nlicense: apache-2.0\nlanguage:\n- en\nbase_model:\n- allenai/SAGE-MM-Qwen3-VL-4B-SFT_RL\npipeline_tag: video-text-to-text\nlibrary_name: transformers\ntags:\n- text-generation-inference\n- llama.cpp\n---\n\n# **SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF**\n\n> The SAGE-MM-Qwen3-VL-4B-SFT_RL from allenai is a 4B-parameter reinforcement learning (RL) post-trained vision-language model, further refined from the SAGE-MM-Qwen3-VL-4B-SFT base via RL on top of Qwen/Qwen3-VL-4B-Instruct, serving as the core decision-maker in the SAGE (Smart Any-Horizon Agent) system for long video reasoning with improved performance (e.g., +4.1% over SFT per experiments) through two-stage operation: Stage-1 analyzes initial frames/metadata to classify queries as single-turn or multi-turn, while Stage-2 iteratively invokes JSON-formatted tools like web-search, transcribe-speech (ASR), ground-event (temporal localization), extract-video-parts (frames/subclips), and analyze for dynamic context updates until resolution. 
Designed for extended video Q&A beyond fixed horizons—handling sports, narratives, events—it requires the SAGE GitHub runtime for tool parsing/execution and outputs action strings for robust, any-length video understanding under Apache 2.0 for research/educational use per Ai2 guidelines, with GGUF quantizations available.\n\n## SAGE-MM-Qwen3-VL-4B-SFT_RL [GGUF]\n\n| File Name | Quant Type | File Size | File Link |\n| - | - | - | - |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.IQ4_XS.gguf | IQ4_XS | 2.49 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.IQ4_XS.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.Q2_K.gguf | Q2_K | 1.8 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.Q2_K.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.Q3_K_L.gguf | Q3_K_L | 2.41 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.Q3_K_L.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.Q3_K_M.gguf | Q3_K_M | 2.24 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.Q3_K_M.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.Q3_K_S.gguf | Q3_K_S | 2.05 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.Q3_K_S.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.Q4_K_M.gguf | Q4_K_M | 2.72 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.Q4_K_M.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.Q4_K_S.gguf | Q4_K_S | 2.6 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.Q4_K_S.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.Q5_K_M.gguf | Q5_K_M | 3.16 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.Q5_K_M.gguf) |\n| 
SAGE-MM-Qwen3-VL-4B-SFT_RL.Q5_K_S.gguf | Q5_K_S | 3.09 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.Q5_K_S.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.Q6_K.gguf | Q6_K | 3.63 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.Q6_K.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.Q8_0.gguf | Q8_0 | 4.69 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.Q8_0.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.f16.gguf | F16 | 8.83 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.f16.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ1_M.gguf | i1-IQ1_M | 1.25 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ1_M.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ1_S.gguf | i1-IQ1_S | 1.18 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ1_S.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ2_M.gguf | i1-IQ2_M | 1.68 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ2_M.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ2_S.gguf | i1-IQ2_S | 1.58 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ2_S.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ2_XS.gguf | i1-IQ2_XS | 1.48 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ2_XS.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ2_XXS.gguf | i1-IQ2_XXS | 1.37 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ2_XXS.gguf) |\n| 
SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ3_M.gguf | i1-IQ3_M | 2.13 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ3_M.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ3_S.gguf | i1-IQ3_S | 2.07 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ3_S.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ3_XS.gguf | i1-IQ3_XS | 1.98 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ3_XS.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ3_XXS.gguf | i1-IQ3_XXS | 1.84 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ3_XXS.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ4_NL.gguf | i1-IQ4_NL | 2.6 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ4_NL.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ4_XS.gguf | i1-IQ4_XS | 2.48 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-IQ4_XS.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q2_K.gguf | i1-Q2_K | 1.8 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q2_K.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q2_K_S.gguf | i1-Q2_K_S | 1.69 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q2_K_S.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q3_K_L.gguf | i1-Q3_K_L | 2.41 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q3_K_L.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q3_K_M.gguf | i1-Q3_K_M | 2.24 GB | 
[Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q3_K_M.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q3_K_S.gguf | i1-Q3_K_S | 2.05 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q3_K_S.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q4_0.gguf | i1-Q4_0 | 2.59 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q4_0.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q4_1.gguf | i1-Q4_1 | 2.84 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q4_1.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q4_K_M.gguf | i1-Q4_K_M | 2.72 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q4_K_M.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q4_K_S.gguf | i1-Q4_K_S | 2.6 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q4_K_S.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q5_K_M.gguf | i1-Q5_K_M | 3.16 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q5_K_M.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q5_K_S.gguf | i1-Q5_K_S | 3.09 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q5_K_S.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q6_K.gguf | i1-Q6_K | 3.63 GB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.i1-Q6_K.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.imatrix.gguf | imatrix | 3.87 MB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.imatrix.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.mmproj-Q8_0.gguf | 
mmproj-Q8_0 | 454 MB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.mmproj-Q8_0.gguf) |\n| SAGE-MM-Qwen3-VL-4B-SFT_RL.mmproj-f16.gguf | mmproj-f16 | 836 MB | [Download](https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.mmproj-f16.gguf) |\n\n## Quants Usage \n\n(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)\n\nHere is a handy graph by ikawrakow comparing some lower-quality quant\ntypes (lower is better):\n\n",
"related_quantizations": []
},
"tags": [
"transformers",
"gguf",
"qwen3_vl",
"text-generation-inference",
"llama.cpp",
"video-text-to-text",
"en",
"base_model:allenai/SAGE-MM-Qwen3-VL-4B-SFT_RL",
"base_model:quantized:allenai/SAGE-MM-Qwen3-VL-4B-SFT_RL",
"license:apache-2.0",
"endpoints_compatible",
"region:us",
"conversational"
],
"likes": 1,
"downloads": 431,
"gated": false,
"private": false,
"last_modified": "2025-12-20T04:52:15.000Z",
"created_at": "2025-12-18T05:36:06.000Z",
"pipeline_tag": "video-text-to-text",
"library_name": "transformers"
}
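The Download links inside `readme_markdown` use `/blob/main/` paths, which on the Hugging Face Hub open the file viewer page rather than the file itself; the raw file is served under `/resolve/<revision>/`. A minimal sketch of that rewrite follows, using only the standard library. The helper name `blob_to_resolve` is an assumption for illustration.

```python
from urllib.parse import urlsplit, urlunsplit


def blob_to_resolve(url: str) -> str:
    """Rewrite a Hub file-viewer URL (/blob/) into its direct-download form (/resolve/)."""
    parts = urlsplit(url)
    segments = parts.path.split("/")
    # Expected path layout: /<owner>/<repo>/blob/<revision>/<file path...>
    if len(segments) < 6 or segments[3] != "blob":
        raise ValueError(f"not a Hugging Face blob URL: {url}")
    segments[3] = "resolve"
    return urlunsplit(parts._replace(path="/".join(segments)))


viewer_url = (
    "https://huggingface.co/prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF"
    "/blob/main/SAGE-MM-Qwen3-VL-4B-SFT_RL.Q4_K_M.gguf"
)
print(blob_to_resolve(viewer_url))
```

Only the fourth path segment is replaced, so a repository or file name that happens to contain the word "blob" is left untouched.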
Source payload excerpt (from Hugging Face API)
{
"_id": "694392c6ebcb83a8188f441c",
"id": "prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF",
"modelId": "prithivMLmods/SAGE-MM-Qwen3-VL-4B-SFT_RL-GGUF",
"sha": "b2b67bb8636bf4371c3f2178b259e2c74dd089d8",
"createdAt": "2025-12-18T05:36:06.000Z",
"lastModified": "2025-12-20T04:52:15.000Z",
"author": "prithivMLmods",
"downloads": 431,
"likes": 1,
"gated": false,
"private": false,
"pipeline_tag": "video-text-to-text",
"library_name": "transformers",
"siblings_count": 42
}