GraySoft
Model Comparison

lmstudio-community/nvidia-nemotron-3-super-120b-a12b-ggufvsunsloth/qwen3-8b-gguf

Side-by-side comparison of lmstudio-community/nvidia-nemotron-3-super-120b-a12b-gguf and unsloth/qwen3-8b-gguf: downloads, license, context length, tasks, and benchmarks.

unsloth/qwen3-8b-gguf

unsloth · text-generation

If you are using llama.cpp, Ollama, Open WebUI etc., you can add /think and /no_think to user prompts or system messages to switch the model's thinking mode from turn to turn. The model will follow the most recent instruction in multi-turn conversations. Here is an example of mu…

Side-by-side Specifications

lmstudio-community/nvidia-nemotron-3-super-120b-a12b-ggufunsloth/qwen3-8b-gguf
Authorlmstudio-communityunsloth
Pipeline Tasktext-generation
Librarytransformers
Downloads29,50650,330
Likes8111
LicenseUnknownUnknown
Context Length
Created2026-03-102025-04-28
Last Modified2026-03-112025-06-08
Tags
ggufbase_model:nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16base_model:quantized:nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16license:otherendpoints_compatibleregion:usconversational
transformersggufqwen3text-generationqwenunslothenarxiv:2309.00071base_model:Qwen/Qwen3-8Bbase_model:quantized:Qwen/Qwen3-8B

View full details: lmstudio-community/nvidia-nemotron-3-super-120b-a12b-gguf · unsloth/qwen3-8b-gguf