GraySoft
Projects Models Compare Cloud benchmarks FAQ Download guIDE →
Model Intelligence Sheet

ronaldcmz/Nemotron-Cascade-8B-Thinking-Claude-4.5-Opus-High-Reasoning-Distill-GGUF overview

Nemotron Cascade 8B Thinking Claude 4.5 Opus High Reasoning Distill GGUF This model was finetuned and converted to GGUF format using Unsloth https://github.com…

ggufqwen3llama.cppunslothdataset:TeichAI/claude-4.5-opus-high-reasoning-250xbase_model:TeichAI/Nemotron-Cascade-8B-Thinking-Claude-4.5-Opus-High-Reasoning-Distillbase_model:quantized:TeichAI/Nemotron-Cascade-8B-Thinking-Claude-4.5-Opus-High-Reasoning-Distillendpoints_compatibleregion:usconversational

Runs locally from ~3.51 GB disk (4 GB VRAM class GPUs with llama.cpp / guIDE).

Downloads
0
Likes
0
Pipeline
Author

Repository Files & Downloads

8 GGUF files detected
Direct downloads for local inference
FileTypeQuantizationSizeLink
Nemotron-Cascade-8B-Thinking-Claude-4.5-Opus-Distill.bf16.ggufGGUFGGUF15.26 GBDownload
Nemotron-Cascade-8B-Thinking-Claude-4.5-Opus-Distill.iq4_nl.ggufGGUFGGUF4.49 GBDownload
Nemotron-Cascade-8B-Thinking-Claude-4.5-Opus-Distill.q3_k_m.ggufGGUFGGUF3.84 GBDownload
Nemotron-Cascade-8B-Thinking-Claude-4.5-Opus-Distill.q3_k_s.ggufGGUFGGUF3.51 GBDownload
Nemotron-Cascade-8B-Thinking-Claude-4.5-Opus-Distill.q4_k_m.ggufGGUFGGUF4.68 GBDownload
Nemotron-Cascade-8B-Thinking-Claude-4.5-Opus-Distill.q8_0.ggufGGUFGGUF8.11 GBDownload
Nemotron-Cascade-8B-Thinking.F16.ggufGGUFGGUF15.26 GBDownload
Nemotron-Cascade-8B-Thinking.Q8_0.ggufGGUFGGUF8.11 GBDownload

Model Details

Model IDronaldcmz/Nemotron-Cascade-8B-Thinking-Claude-4.5-Opus-High-Reasoning-Distill-GGUF
Authorronaldcmz
Pipeline
License
Base modelTeichAI/Nemotron-Cascade-8B-Thinking-Claude-4.5-Opus-High-Reasoning-Distill
Last modified2026-06-20T06:20:24.000Z

Model README

---

tags:

  • gguf
  • llama.cpp
  • unsloth

base_model:

  • TeichAI/Nemotron-Cascade-8B-Thinking-Claude-4.5-Opus-High-Reasoning-Distill

datasets:

  • TeichAI/claude-4.5-opus-high-reasoning-250x

---

Nemotron-Cascade-8B-Thinking-Claude-4.5-Opus-High-Reasoning-Distill-GGUF

This model was finetuned and converted to GGUF format using Unsloth.

Run ronaldcmz/Nemotron-Cascade-8B-Thinking-Claude-4.5-Opus-High-Reasoning-Distill-GGUF with guIDE

Download guIDE — the AI-native code editor with local LLM inference and 69 built-in tools.

Download guIDE → · Browse 524k+ models · Compare models

Source: Hugging Face · Compare models