Model Comparison

unsloth/deepseek-r1-0528-qwen3-8b-ggufvsunsloth/mistral-large-3-675b-instruct-2512-gguf

Side-by-side comparison of unsloth/deepseek-r1-0528-qwen3-8b-gguf and unsloth/mistral-large-3-675b-instruct-2512-gguf: downloads, license, context length, tasks, and benchmarks.

unsloth/deepseek-r1-0528-qwen3-8b-gguf

unsloth · text-generation

Paper Link👁️

unsloth/mistral-large-3-675b-instruct-2512-gguf

unsloth · —

From our family of large models, **Mistral Large 3** is a state-of-the-art general-purpose **Multimodal granular Mixture-of-Experts** model with **41B active parameters** and **675B total parameters** trained from the ground up. This model is the instruct post-trained version, f…

Side-by-side Specifications

	unsloth/deepseek-r1-0528-qwen3-8b-gguf	unsloth/mistral-large-3-675b-instruct-2512-gguf
Author	unsloth	unsloth
Pipeline Task	text-generation	—
Library	transformers	—
Downloads	37,064	30,621
Likes	390	17
License	Unknown	Unknown
Context Length	—	—
Created	2025-05-29	2025-12-07
Last Modified	2025-06-16	2025-12-16
Tags	transformersggufqwen3text-generationunslothdeepseekqwenarxiv:2501.12948base_model:deepseek-ai/DeepSeek-R1-0528-Qwen3-8Bbase_model:quantized:deepseek-ai/DeepSeek-R1-0528-Qwen3-8B	ggufmistral-commonmistralunslothenfresdeitpt

View full details: unsloth/deepseek-r1-0528-qwen3-8b-gguf · unsloth/mistral-large-3-675b-instruct-2512-gguf