GraySoft
Model Comparison

bartowski/deepseek-r1-distill-qwen-32b-ggufvslmg-anon/vntl-llama3-8b-v2-gguf

Side-by-side comparison of bartowski/deepseek-r1-distill-qwen-32b-gguf and lmg-anon/vntl-llama3-8b-v2-gguf: downloads, license, context length, tasks, and benchmarks.

bartowski/deepseek-r1-distill-qwen-32b-gguf

bartowski · text-generation

lmg-anon/vntl-llama3-8b-v2-gguf

lmg-anon · translation

This is a LLaMA 3 Youko qlora fine-tune, created using a new version of the VNTL dataset. The purpose of this fine-tune is to improve performance of LLMs at translating Japanese visual novels to English. Unlike the previous version, this one doesn't includes the "chat mode".

Side-by-side Specifications

bartowski/deepseek-r1-distill-qwen-32b-gguflmg-anon/vntl-llama3-8b-v2-gguf
Authorbartowskilmg-anon
Pipeline Tasktext-generationtranslation
Library
Downloads23,9581,522,503
Likes30113
LicenseUnknownUnknown
Context Length
Created2025-01-202025-01-02
Last Modified2025-01-222025-01-02
Tags
gguftext-generationbase_model:deepseek-ai/DeepSeek-R1-Distill-Qwen-32Bbase_model:quantized:deepseek-ai/DeepSeek-R1-Distill-Qwen-32Bendpoints_compatibleregion:usimatrixconversational
gguftranslationjaendataset:lmg-anon/VNTL-v5-1kbase_model:rinna/llama-3-youko-8bbase_model:quantized:rinna/llama-3-youko-8blicense:llama3endpoints_compatibleregion:us

View full details: bartowski/deepseek-r1-distill-qwen-32b-gguf · lmg-anon/vntl-llama3-8b-v2-gguf