GraySoft
Model Comparison

nvidia/nvidia-nemotron-3-nano-4b-gguf vs teichai/devstral-small-2505-deepseek-v3.2-speciale-distill-gguf

Side-by-side comparison of nvidia/nvidia-nemotron-3-nano-4b-gguf and teichai/devstral-small-2505-deepseek-v3.2-speciale-distill-gguf: downloads, license, context length, tasks, and benchmarks.

nvidia/nvidia-nemotron-3-nano-4b-gguf

nvidia · text-generation

**Model Developer:** NVIDIA Corporation

**Model Dates:** Dec 2025 - Jan 2026

**Data Freshness:** The pretraining data has a cutoff date of September 2024.

teichai/devstral-small-2505-deepseek-v3.2-speciale-distill-gguf

TeichAI · —

This model was trained on a non-reasoning dataset of **DeepSeek v3.2 Speciale** (the reasoning traces were removed).

Side-by-side Specifications

| | nvidia/nvidia-nemotron-3-nano-4b-gguf | teichai/devstral-small-2505-deepseek-v3.2-speciale-distill-gguf |
| --- | --- | --- |
| Author | nvidia | TeichAI |
| Pipeline Task | text-generation | |
| Library | transformers | |
| Downloads | 26,805 | 22,100 |
| Likes | 118 | 11 |
| License | Unknown | Unknown |
| Context Length | | |
| Created | 2026-03-07 | 2026-02-04 |
| Last Modified | 2026-03-16 | 2026-02-04 |

Tags

nvidia/nvidia-nemotron-3-nano-4b-gguf: transformers, gguf, nvidia, pytorch, text-generation, en, dataset:nvidia/Nemotron-CC-v2, dataset:nvidia/Nemotron-Post-Training-Dataset-v2, dataset:nvidia/Nemotron-Science-v1, dataset:nvidia/Nemotron-Instruction-Following-Chat-v1

teichai/devstral-small-2505-deepseek-v3.2-speciale-distill-gguf: gguf, llama.cpp, unsloth, en, dataset:TeichAI/deepseek-v3.2-speciale-OpenCodeReasoning-3k, dataset:TeichAI/deepseek-v3.2-speciale-1000x, dataset:TeichAI/deepseek-v3.2-speciale-openr1-math-3k, base_model:TeichAI/Devstral-Small-2505-Deepseek-V3.2-Speciale-Distill, base_model:quantized:TeichAI/Devstral-Small-2505-Deepseek-V3.2-Speciale-Distill, endpoints_compatible

View full details: nvidia/nvidia-nemotron-3-nano-4b-gguf · teichai/devstral-small-2505-deepseek-v3.2-speciale-distill-gguf