mradermacher/nemotron-cascade-2-30b-a3b-i1-ggufvsunsloth/qwen3-1.7b-gguf
Side-by-side comparison of mradermacher/nemotron-cascade-2-30b-a3b-i1-gguf and unsloth/qwen3-1.7b-gguf: downloads, license, context length, tasks, and benchmarks.
mradermacher/nemotron-cascade-2-30b-a3b-i1-gguf
## About weighted/imatrix quants of https://huggingface.co/nvidia/Nemotron-Cascade-2-30B-A3B ***For a convenient overview and download list, visit our model page for this model.*** static quants are available at https://huggingface.co/mradermacher/Nemotron-Cascade-2-30B-A3B-GGUF
unsloth/qwen3-1.7b-gguf
If you are using llama.cpp, Ollama, Open WebUI etc., you can add /think and /no_think to user prompts or system messages to switch the model's thinking mode from turn to turn. The model will follow the most recent instruction in multi-turn conversations. Here is an example of mu…
Side-by-side Specifications
| mradermacher/nemotron-cascade-2-30b-a3b-i1-gguf | unsloth/qwen3-1.7b-gguf | |
|---|---|---|
| Author | mradermacher | unsloth |
| Pipeline Task | — | text-generation |
| Library | transformers | transformers |
| Downloads | 23,972 | 22,966 |
| Likes | 26 | 66 |
| License | Unknown | Unknown |
| Context Length | — | — |
| Created | 2026-03-20 | 2025-04-28 |
| Last Modified | 2026-03-20 | 2025-06-08 |
| Tags |
View full details: mradermacher/nemotron-cascade-2-30b-a3b-i1-gguf · unsloth/qwen3-1.7b-gguf