GraySoft
Model Comparison

microsoft/bitnet-b1.58-2b-4t-ggufvsteichai/qwen3-30b-a3b-thinking-2507-claude-4.5-sonnet-high-reasoning-distill-gguf

Side-by-side comparison of microsoft/bitnet-b1.58-2b-4t-gguf and teichai/qwen3-30b-a3b-thinking-2507-claude-4.5-sonnet-high-reasoning-distill-gguf: downloads, license, context length, tasks, and benchmarks.

microsoft/bitnet-b1.58-2b-4t-gguf

microsoft · text-generation

This repository contains the weights for **BitNet b1.58 2B4T**, the first open-source, native 1-bit Large Language Model (LLM) at the 2-billion parameter scale, developed by Microsoft Research. Trained on a corpus of 4 trillion tokens, this model demonstrates that native 1-bit L…

teichai/qwen3-30b-a3b-thinking-2507-claude-4.5-sonnet-high-reasoning-distill-gguf

TeichAI · text-generation

This model was trained on a **Claude Sonnet 4.5 (reasoning)** dataset with a high reasoning effort. | Model | Effective parameters | Active parameters | | ------------- | ------------- | ------------- | | TeichAI/gpt-oss-20b-claude-4.5-sonnet-high-reasoning-distill-GGUF | 20 B |…

Side-by-side Specifications

microsoft/bitnet-b1.58-2b-4t-ggufteichai/qwen3-30b-a3b-thinking-2507-claude-4.5-sonnet-high-reasoning-distill-gguf
AuthormicrosoftTeichAI
Pipeline Tasktext-generationtext-generation
Librarytransformerstransformers
Downloads36,03024,326
Likes26624
LicenseUnknownUnknown
Context Length
Created2025-04-152025-11-16
Last Modified2025-12-172025-11-19
Tags
transformersggufchatbitnettext-generationlarge-language-modelenarxiv:2504.12285license:mitendpoints_compatible
transformersgguftext-generation-inferenceunslothqwen3_moetext-generationendataset:TeichAI/claude-sonnet-4.5-high-reasoning-250xbase_model:TeichAI/Qwen3-30B-A3B-Thinking-2507-Claude-4.5-Sonnet-High-Reasoning-Distillbase_model:quantized:TeichAI/Qwen3-30B-A3B-Thinking-2507-Claude-4.5-Sonnet-High-Reasoning-Distill

View full details: microsoft/bitnet-b1.58-2b-4t-gguf · teichai/qwen3-30b-a3b-thinking-2507-claude-4.5-sonnet-high-reasoning-distill-gguf