GraySoft
Model Comparison

bartowski/deepseek-r1-distill-qwen-32b-gguf vs failspy/meta-llama-3-70b-instruct-abliterated-v3.5-gguf

Side-by-side comparison of bartowski/deepseek-r1-distill-qwen-32b-gguf and failspy/meta-llama-3-70b-instruct-abliterated-v3.5-gguf: downloads, license, context length, tasks, and benchmarks.

bartowski/deepseek-r1-distill-qwen-32b-gguf

bartowski · text-generation

failspy/meta-llama-3-70b-instruct-abliterated-v3.5-gguf

failspy · —

Llama-3-70B-Instruct-abliterated-v3.5 Model Card: My original Jupyter "cookbook" to replicate the methodology can be found here. My personal library o' code used (WIP, looking to improve and generalize). This is meta-llama/Meta-Llama-3-70B-Instruct with orthogonalized bfloat16 sa…

Side-by-side Specifications

Specification | bartowski/deepseek-r1-distill-qwen-32b-gguf | failspy/meta-llama-3-70b-instruct-abliterated-v3.5-gguf
Author | bartowski | failspy
Pipeline Task | text-generation | —
Library | — | transformers
Downloads | 23,958 | 36,982
Likes30125
License | Unknown | Unknown
Context Length | — | —
Created | 2025-01-20 | 2024-05-28
Last Modified | 2025-01-22 | 2024-05-30
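Tags (bartowski/deepseek-r1-distill-qwen-32b-gguf): gguf, text-generation, base_model:deepseek-ai/DeepSeek-R1-Distill-Qwen-32B, base_model:quantized:deepseek-ai/DeepSeek-R1-Distill-Qwen-32B, endpoints_compatible, region:us, imatrix, conversational
Tags (failspy/meta-llama-3-70b-instruct-abliterated-v3.5-gguf): transformers, gguf, license:llama3, endpoints_compatible, region:us, conversational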

View full details: bartowski/deepseek-r1-distill-qwen-32b-gguf · failspy/meta-llama-3-70b-instruct-abliterated-v3.5-gguf