GraySoft
Model Comparison

bartowski/deepseek-r1-distill-qwen-32b-abliterated-gguf vs microsoft/bitnet-b1.58-2b-4t-gguf

Side-by-side comparison of bartowski/deepseek-r1-distill-qwen-32b-abliterated-gguf and microsoft/bitnet-b1.58-2b-4t-gguf: downloads, license, context length, tasks, and benchmarks.

microsoft/bitnet-b1.58-2b-4t-gguf

microsoft · text-generation

This repository contains the weights for **BitNet b1.58 2B4T**, the first open-source, native 1-bit Large Language Model (LLM) at the 2-billion parameter scale, developed by Microsoft Research. Trained on a corpus of 4 trillion tokens, this model demonstrates that native 1-bit L…

Side-by-side Specifications

| | bartowski/deepseek-r1-distill-qwen-32b-abliterated-gguf | microsoft/bitnet-b1.58-2b-4t-gguf |
| --- | --- | --- |
| Author | bartowski | microsoft |
| Pipeline Task | text-generation | text-generation |
| Library | | transformers |
| Downloads | 27,541 | 36,030 |
| Likes | 136 | 266 |
| License | Unknown | Unknown |
| Context Length | | |
| Created | 2025-01-25 | 2025-04-15 |
| Last Modified | 2025-01-25 | 2025-12-17 |

Tags

bartowski/deepseek-r1-distill-qwen-32b-abliterated-gguf: gguf, abliterated, uncensored, text-generation, base_model:huihui-ai/DeepSeek-R1-Distill-Qwen-32B-abliterated, base_model:quantized:huihui-ai/DeepSeek-R1-Distill-Qwen-32B-abliterated, endpoints_compatible, region:us, imatrix, conversational

microsoft/bitnet-b1.58-2b-4t-gguf: transformers, gguf, chat, bitnet, text-generation, large-language-model, en, arxiv:2504.12285, license:mit, endpoints_compatible
