GraySoft
Model Comparison

maziyarpanahi/deepseek-r1-0528-qwen3-8b-gguf vs microsoft/bitnet-b1.58-2b-4t-gguf

Side-by-side comparison of maziyarpanahi/deepseek-r1-0528-qwen3-8b-gguf and microsoft/bitnet-b1.58-2b-4t-gguf: downloads, license, context length, tasks, and benchmarks.

maziyarpanahi/deepseek-r1-0528-qwen3-8b-gguf

MaziyarPanahi · text-generation

# MaziyarPanahi/DeepSeek-R1-0528-Qwen3-8B-GGUF

microsoft/bitnet-b1.58-2b-4t-gguf

microsoft · text-generation

This repository contains the weights for **BitNet b1.58 2B4T**, the first open-source, native 1-bit Large Language Model (LLM) at the 2-billion parameter scale, developed by Microsoft Research. Trained on a corpus of 4 trillion tokens, this model demonstrates that native 1-bit L…

Side-by-side Specifications

Model: maziyarpanahi/deepseek-r1-0528-qwen3-8b-gguf | microsoft/bitnet-b1.58-2b-4t-gguf
Author: MaziyarPanahi | microsoft
Pipeline Task: text-generation | text-generation
Library: transformers
Downloads: 90,103 | 36,030
Likes: 9266
License: Unknown | Unknown
Context Length:
Created: 2025-05-29 | 2025-04-15
Last Modified: 2025-05-29 | 2025-12-17
Tags (maziyarpanahi/deepseek-r1-0528-qwen3-8b-gguf): gguf, mistral, quantized, 2-bit, 3-bit, 4-bit, 5-bit, 6-bit, 8-bit, GGUF
Tags (microsoft/bitnet-b1.58-2b-4t-gguf): transformers, gguf, chat, bitnet, text-generation, large-language-model, en, arxiv:2504.12285, license:mit, endpoints_compatible
