Model Intelligence Sheet
ronaldcmz/Nemotron-Cascade-8B-Thinking-Claude-4.5-Opus-High-Reasoning-Distill-GGUF overview
Nemotron Cascade 8B Thinking Claude 4.5 Opus High Reasoning Distill GGUF This model was finetuned and converted to GGUF format using Unsloth https://github.com…
Runs locally from ~3.51 GB disk (4 GB VRAM class GPUs with llama.cpp / guIDE).
Repository Files & Downloads
8 GGUF files detected
Direct downloads for local inference
| File | Type | Quantization | Size | Link |
|---|---|---|---|---|
| Nemotron-Cascade-8B-Thinking-Claude-4.5-Opus-Distill.bf16.gguf | GGUF | GGUF | 15.26 GB | Download |
| Nemotron-Cascade-8B-Thinking-Claude-4.5-Opus-Distill.iq4_nl.gguf | GGUF | GGUF | 4.49 GB | Download |
| Nemotron-Cascade-8B-Thinking-Claude-4.5-Opus-Distill.q3_k_m.gguf | GGUF | GGUF | 3.84 GB | Download |
| Nemotron-Cascade-8B-Thinking-Claude-4.5-Opus-Distill.q3_k_s.gguf | GGUF | GGUF | 3.51 GB | Download |
| Nemotron-Cascade-8B-Thinking-Claude-4.5-Opus-Distill.q4_k_m.gguf | GGUF | GGUF | 4.68 GB | Download |
| Nemotron-Cascade-8B-Thinking-Claude-4.5-Opus-Distill.q8_0.gguf | GGUF | GGUF | 8.11 GB | Download |
| Nemotron-Cascade-8B-Thinking.F16.gguf | GGUF | GGUF | 15.26 GB | Download |
| Nemotron-Cascade-8B-Thinking.Q8_0.gguf | GGUF | GGUF | 8.11 GB | Download |
Model Details
Model README
---
tags:
- gguf
- llama.cpp
- unsloth
base_model:
- TeichAI/Nemotron-Cascade-8B-Thinking-Claude-4.5-Opus-High-Reasoning-Distill
datasets:
- TeichAI/claude-4.5-opus-high-reasoning-250x
---
Nemotron-Cascade-8B-Thinking-Claude-4.5-Opus-High-Reasoning-Distill-GGUF
This model was finetuned and converted to GGUF format using Unsloth.
Run ronaldcmz/Nemotron-Cascade-8B-Thinking-Claude-4.5-Opus-High-Reasoning-Distill-GGUF with guIDE
Download guIDE — the AI-native code editor with local LLM inference and 69 built-in tools.
Source: Hugging Face · Compare models