Model Comparison
maziyarpanahi/deepseek-r1-0528-qwen3-8b-ggufvsmicrosoft/bitnet-b1.58-2b-4t-gguf
Side-by-side comparison of maziyarpanahi/deepseek-r1-0528-qwen3-8b-gguf and microsoft/bitnet-b1.58-2b-4t-gguf: downloads, license, context length, tasks, and benchmarks.
maziyarpanahi/deepseek-r1-0528-qwen3-8b-gguf
MaziyarPanahi · text-generation
# MaziyarPanahi/DeepSeek-R1-0528-Qwen3-8B-GGUF
microsoft/bitnet-b1.58-2b-4t-gguf
microsoft · text-generation
This repository contains the weights for **BitNet b1.58 2B4T**, the first open-source, native 1-bit Large Language Model (LLM) at the 2-billion parameter scale, developed by Microsoft Research. Trained on a corpus of 4 trillion tokens, this model demonstrates that native 1-bit L…
Side-by-side Specifications
| maziyarpanahi/deepseek-r1-0528-qwen3-8b-gguf | microsoft/bitnet-b1.58-2b-4t-gguf | |
|---|---|---|
| Author | MaziyarPanahi | microsoft |
| Pipeline Task | text-generation | text-generation |
| Library | — | transformers |
| Downloads | 90,103 | 36,030 |
| Likes | 9 | 266 |
| License | Unknown | Unknown |
| Context Length | — | — |
| Created | 2025-05-29 | 2025-04-15 |
| Last Modified | 2025-05-29 | 2025-12-17 |
| Tags |
View full details: maziyarpanahi/deepseek-r1-0528-qwen3-8b-gguf · microsoft/bitnet-b1.58-2b-4t-gguf