GraySoft
Model Comparison

microsoft/bitnet-b1.58-2b-4t-ggufvsunsloth/mistral-large-3-675b-instruct-2512-gguf

Side-by-side comparison of microsoft/bitnet-b1.58-2b-4t-gguf and unsloth/mistral-large-3-675b-instruct-2512-gguf: downloads, license, context length, tasks, and benchmarks.

microsoft/bitnet-b1.58-2b-4t-gguf

microsoft · text-generation

This repository contains the weights for **BitNet b1.58 2B4T**, the first open-source, native 1-bit Large Language Model (LLM) at the 2-billion parameter scale, developed by Microsoft Research. Trained on a corpus of 4 trillion tokens, this model demonstrates that native 1-bit L…

unsloth/mistral-large-3-675b-instruct-2512-gguf

unsloth · —

From our family of large models, **Mistral Large 3** is a state-of-the-art general-purpose **Multimodal granular Mixture-of-Experts** model with **41B active parameters** and **675B total parameters** trained from the ground up. This model is the instruct post-trained version, f…

Side-by-side Specifications

microsoft/bitnet-b1.58-2b-4t-ggufunsloth/mistral-large-3-675b-instruct-2512-gguf
Authormicrosoftunsloth
Pipeline Tasktext-generation
Librarytransformers
Downloads36,03030,621
Likes26617
LicenseUnknownUnknown
Context Length
Created2025-04-152025-12-07
Last Modified2025-12-172025-12-16
Tags
transformersggufchatbitnettext-generationlarge-language-modelenarxiv:2504.12285license:mitendpoints_compatible
ggufmistral-commonmistralunslothenfresdeitpt

View full details: microsoft/bitnet-b1.58-2b-4t-gguf · unsloth/mistral-large-3-675b-instruct-2512-gguf