GraySoft
Model Comparison

unsloth/deepseek-r1-0528-qwen3-8b-gguf vs unsloth/mistral-large-3-675b-instruct-2512-gguf

Side-by-side comparison of unsloth/deepseek-r1-0528-qwen3-8b-gguf and unsloth/mistral-large-3-675b-instruct-2512-gguf: downloads, license, context length, tasks, and benchmarks.

unsloth/deepseek-r1-0528-qwen3-8b-gguf

unsloth · text-generation

unsloth/mistral-large-3-675b-instruct-2512-gguf

unsloth · —

From our family of large models, **Mistral Large 3** is a state-of-the-art general-purpose **Multimodal granular Mixture-of-Experts** model with **41B active parameters** and **675B total parameters** trained from the ground up. This model is the instruct post-trained version, f…
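To put the two parameter counts in the description above in proportion, here is a minimal sketch that computes the share of parameters active per token. It uses only the 41B and 675B figures quoted in the description; the function name `active_fraction` is illustrative and not part of any library.

```python
# Parameter counts as quoted in the model description (41B active, 675B total).
TOTAL_PARAMS = 675e9
ACTIVE_PARAMS = 41e9


def active_fraction(active: float, total: float) -> float:
    """Return the share of total parameters that are active per token."""
    if total <= 0:
        raise ValueError("total must be positive")
    return active / total


fraction = active_fraction(ACTIVE_PARAMS, TOTAL_PARAMS)
print(f"Active share: {fraction:.1%}")  # prints "Active share: 6.1%"
```

By these figures, roughly one parameter in sixteen takes part in processing any given token, although the full 675B parameters still have to be stored.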

Side-by-side Specifications

| | unsloth/deepseek-r1-0528-qwen3-8b-gguf | unsloth/mistral-large-3-675b-instruct-2512-gguf |
| --- | --- | --- |
| Author | unsloth | unsloth |
| Pipeline Task | text-generation | |
| Library | transformers | |
| Downloads | 37,064 | 30,621 |
| Likes | 390 | 17 |
| License | Unknown | Unknown |
| Context Length | | |
| Created | 2025-05-29 | 2025-12-07 |
| Last Modified | 2025-06-16 | 2025-12-16 |
| Tags | transformers, gguf, qwen3, text-generation, unsloth, deepseek, qwen, arxiv:2501.12948, base_model:deepseek-ai/DeepSeek-R1-0528-Qwen3-8B, base_model:quantized:deepseek-ai/DeepSeek-R1-0528-Qwen3-8B | gguf, mistral-common, mistral, unsloth, en, fr, es, de, it, pt |
