Model Comparison

bartowski/deepseek-r1-distill-qwen-32b-ggufvsfailspy/meta-llama-3-70b-instruct-abliterated-v3.5-gguf

Side-by-side comparison of bartowski/deepseek-r1-distill-qwen-32b-gguf and failspy/meta-llama-3-70b-instruct-abliterated-v3.5-gguf: downloads, license, context length, tasks, and benchmarks.

bartowski/deepseek-r1-distill-qwen-32b-gguf

bartowski · text-generation

failspy/meta-llama-3-70b-instruct-abliterated-v3.5-gguf

failspy · —

# Llama-3-70B-Instruct-abliterated-v3.5 Model Card My original Jupyter "cookbook" to replicate the methodology can be found here My personal library o' code used (WIP, looking to improve and generalize) This is meta-llama/Meta-Llama-3-70B-Instruct with orthogonalized bfloat16 sa…

Side-by-side Specifications

	bartowski/deepseek-r1-distill-qwen-32b-gguf	failspy/meta-llama-3-70b-instruct-abliterated-v3.5-gguf
Author	bartowski	failspy
Pipeline Task	text-generation	—
Library	—	transformers
Downloads	23,958	36,982
Likes	301	25
License	Unknown	Unknown
Context Length	—	—
Created	2025-01-20	2024-05-28
Last Modified	2025-01-22	2024-05-30
Tags	gguftext-generationbase_model:deepseek-ai/DeepSeek-R1-Distill-Qwen-32Bbase_model:quantized:deepseek-ai/DeepSeek-R1-Distill-Qwen-32Bendpoints_compatibleregion:usimatrixconversational	transformersgguflicense:llama3endpoints_compatibleregion:usconversational

View full details: bartowski/deepseek-r1-distill-qwen-32b-gguf · failspy/meta-llama-3-70b-instruct-abliterated-v3.5-gguf