Model Comparison

lmstudio-community/nvidia-nemotron-3-super-120b-a12b-ggufvsunsloth/qwen3-8b-gguf

Side-by-side comparison of lmstudio-community/nvidia-nemotron-3-super-120b-a12b-gguf and unsloth/qwen3-8b-gguf: downloads, license, context length, tasks, and benchmarks.

lmstudio-community/nvidia-nemotron-3-super-120b-a12b-gguf

lmstudio-community · —

unsloth/qwen3-8b-gguf

unsloth · text-generation

If you are using llama.cpp, Ollama, Open WebUI etc., you can add /think and /no_think to user prompts or system messages to switch the model's thinking mode from turn to turn. The model will follow the most recent instruction in multi-turn conversations. Here is an example of mu…

Side-by-side Specifications

	lmstudio-community/nvidia-nemotron-3-super-120b-a12b-gguf	unsloth/qwen3-8b-gguf
Author	lmstudio-community	unsloth
Pipeline Task	—	text-generation
Library	—	transformers
Downloads	29,506	50,330
Likes	8	111
License	Unknown	Unknown
Context Length	—	—
Created	2026-03-10	2025-04-28
Last Modified	2026-03-11	2025-06-08
Tags	ggufbase_model:nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16base_model:quantized:nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16license:otherendpoints_compatibleregion:usconversational	transformersggufqwen3text-generationqwenunslothenarxiv:2309.00071base_model:Qwen/Qwen3-8Bbase_model:quantized:Qwen/Qwen3-8B

View full details: lmstudio-community/nvidia-nemotron-3-super-120b-a12b-gguf · unsloth/qwen3-8b-gguf