GraySoft
Model Comparison

microsoft/bitnet-b1.58-2b-4t-gguf vs mradermacher/mn-12b-mag-mell-r1-gguf

Side-by-side comparison of microsoft/bitnet-b1.58-2b-4t-gguf and mradermacher/mn-12b-mag-mell-r1-gguf: downloads, license, context length, tasks, and benchmarks.

microsoft/bitnet-b1.58-2b-4t-gguf

microsoft · text-generation

This repository contains the weights for **BitNet b1.58 2B4T**, the first open-source, native 1-bit Large Language Model (LLM) at the 2-billion parameter scale, developed by Microsoft Research. Trained on a corpus of 4 trillion tokens, this model demonstrates that native 1-bit L…

mradermacher/mn-12b-mag-mell-r1-gguf

mradermacher · —

About: static quants of https://huggingface.co/inflatebot/MN-12B-Mag-Mell-R1. Weighted/imatrix quants are available at https://huggingface.co/mradermacher/MN-12B-Mag-Mell-R1-i1-GGUF

Side-by-side Specifications

| | microsoft/bitnet-b1.58-2b-4t-gguf | mradermacher/mn-12b-mag-mell-r1-gguf |
| --- | --- | --- |
| Author | microsoft | mradermacher |
| Pipeline Task | text-generation | |
| Library | transformers | transformers |
| Downloads | 36,030 | 50,033 |
| Likes | 266 | 47 |
| License | Unknown | Unknown |
| Context Length | | |
| Created | 2025-04-15 | 2024-09-16 |
| Last Modified | 2025-12-17 | 2024-09-17 |
| Tags | transformers, gguf, chat, bitnet, text-generation, large-language-model, en, arxiv:2504.12285, license:mit, endpoints_compatible | transformers, gguf, mergekit, merge, en, base_model:inflatebot/MN-12B-Mag-Mell-R1, base_model:quantized:inflatebot/MN-12B-Mag-Mell-R1, endpoints_compatible, region:us, conversational |
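
The tag lists above mix plain labels (such as gguf) with namespaced entries of the form `key:value` (such as `license:mit` or `base_model:inflatebot/MN-12B-Mag-Mell-R1`). The following is a minimal Python sketch of how the namespaced entries could be separated out for comparison. The tag values are copied from this page; the helper name `split_tags` and the rule "split on the first colon" are assumptions made for illustration, not a documented parsing scheme.

```python
# Tag lists copied from the comparison on this page.
BITNET_TAGS = [
    "transformers", "gguf", "chat", "bitnet", "text-generation",
    "large-language-model", "en", "arxiv:2504.12285", "license:mit",
    "endpoints_compatible",
]
MAG_MELL_TAGS = [
    "transformers", "gguf", "mergekit", "merge", "en",
    "base_model:inflatebot/MN-12B-Mag-Mell-R1",
    "base_model:quantized:inflatebot/MN-12B-Mag-Mell-R1",
    "endpoints_compatible", "region:us", "conversational",
]


def split_tags(tags):
    """Separate plain tags from namespaced `key:value` tags.

    Assumption: the text before the first colon is the key. A key may
    occur more than once, so values are collected in lists.
    """
    plain, namespaced = [], {}
    for tag in tags:
        key, sep, value = tag.partition(":")
        if sep:
            namespaced.setdefault(key, []).append(value)
        else:
            plain.append(tag)
    return plain, namespaced


bitnet_plain, bitnet_meta = split_tags(BITNET_TAGS)
mag_mell_plain, mag_mell_meta = split_tags(MAG_MELL_TAGS)

print(bitnet_meta.get("license"))
print(mag_mell_meta.get("base_model"))
```

Read this way, the first model's tags carry a `license` entry even though its License row reads Unknown, while the second model's tags carry no `license` entry at all.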

View full details: microsoft/bitnet-b1.58-2b-4t-gguf · mradermacher/mn-12b-mag-mell-r1-gguf