GraySoft
Model Comparison

microsoft/bitnet-b1.58-2b-4t-gguf vs unsloth/deepseek-r1-0528-qwen3-8b-gguf

Side-by-side comparison of microsoft/bitnet-b1.58-2b-4t-gguf and unsloth/deepseek-r1-0528-qwen3-8b-gguf: downloads, license, context length, tasks, and benchmarks.

microsoft/bitnet-b1.58-2b-4t-gguf

microsoft · text-generation

This repository contains the weights for **BitNet b1.58 2B4T**, the first open-source, native 1-bit Large Language Model (LLM) at the 2-billion parameter scale, developed by Microsoft Research. Trained on a corpus of 4 trillion tokens, this model demonstrates that native 1-bit L…

unsloth/deepseek-r1-0528-qwen3-8b-gguf

unsloth · text-generation


Side-by-side Specifications

| | microsoft/bitnet-b1.58-2b-4t-gguf | unsloth/deepseek-r1-0528-qwen3-8b-gguf |
|---|---|---|
| Author | microsoft | unsloth |
| Pipeline Task | text-generation | text-generation |
| Library | transformers | transformers |
| Downloads | 36,030 | 37,064 |
| Likes | 266 | 390 |
| License | Unknown | Unknown |
| Context Length | | |
| Created | 2025-04-15 | 2025-05-29 |
| Last Modified | 2025-12-17 | 2025-06-16 |
| Tags | transformers, gguf, chat, bitnet, text-generation, large-language-model, en, arxiv:2504.12285, license:mit, endpoints_compatible | transformers, gguf, qwen3, text-generation, unsloth, deepseek, qwen, arxiv:2501.12948, base_model:deepseek-ai/DeepSeek-R1-0528-Qwen3-8B, base_model:quantized:deepseek-ai/DeepSeek-R1-0528-Qwen3-8B |

View full details: microsoft/bitnet-b1.58-2b-4t-gguf · unsloth/deepseek-r1-0528-qwen3-8b-gguf