GraySoft
Model Comparison

unsloth/deepseek-r1-distill-llama-8b-ggufvsvaluefx9507/tifa-deepsexv2-7b-mgrpo-gguf-q8

Side-by-side comparison of unsloth/deepseek-r1-distill-llama-8b-gguf and valuefx9507/tifa-deepsexv2-7b-mgrpo-gguf-q8: downloads, license, context length, tasks, and benchmarks.

unsloth/deepseek-r1-distill-llama-8b-gguf

unsloth · text-generation

We have a free Google Colab notebook for turning Llama 3.1 (8B) into a reasoning model: https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Llama3.1_(8B)-GRPO.ipynb

valuefx9507/tifa-deepsexv2-7b-mgrpo-gguf-q8

ValueFX9507 · reinforcement-learning

prompt = """你是一个小女孩/你是一个XX角色... 我走进门,看到你冲上来迎接我 需要体现人物的气质 加入环境描写 保持对话风格 我看到XX进门...""" `` **参数推荐**: `python generation_config = { "temperature": 0.75, "top_p": 0.6, "repetition_penalty": 1.08, "max_new_tokens": 1536, "do_sample": True } ``

Side-by-side Specifications

unsloth/deepseek-r1-distill-llama-8b-ggufvaluefx9507/tifa-deepsexv2-7b-mgrpo-gguf-q8
AuthorunslothValueFX9507
Pipeline Tasktext-generationreinforcement-learning
Librarytransformerstransformers
Downloads41,45623,629
Likes295201
LicenseUnknownUnknown
Context Length
Created2025-01-202025-02-15
Last Modified2025-05-102025-03-28
Tags
transformersggufllamatext-generationdeepseekunslothllama-3metaenarxiv:2501.12948
transformersggufincremental-pretrainingsftreinforcement-learningroleplaycotzhenlicense:apache-2.0

View full details: unsloth/deepseek-r1-distill-llama-8b-gguf · valuefx9507/tifa-deepsexv2-7b-mgrpo-gguf-q8