Model Comparison
unsloth/deepseek-r1-distill-llama-8b-ggufvsvaluefx9507/tifa-deepsexv2-7b-mgrpo-gguf-q8
Side-by-side comparison of unsloth/deepseek-r1-distill-llama-8b-gguf and valuefx9507/tifa-deepsexv2-7b-mgrpo-gguf-q8: downloads, license, context length, tasks, and benchmarks.
unsloth/deepseek-r1-distill-llama-8b-gguf
unsloth · text-generation
We have a free Google Colab notebook for turning Llama 3.1 (8B) into a reasoning model: https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Llama3.1_(8B)-GRPO.ipynb
valuefx9507/tifa-deepsexv2-7b-mgrpo-gguf-q8
ValueFX9507 · reinforcement-learning
prompt = """你是一个小女孩/你是一个XX角色... 我走进门,看到你冲上来迎接我 需要体现人物的气质 加入环境描写 保持对话风格 我看到XX进门...""" `` **参数推荐**: `python generation_config = { "temperature": 0.75, "top_p": 0.6, "repetition_penalty": 1.08, "max_new_tokens": 1536, "do_sample": True } ``
Side-by-side Specifications
| unsloth/deepseek-r1-distill-llama-8b-gguf | valuefx9507/tifa-deepsexv2-7b-mgrpo-gguf-q8 | |
|---|---|---|
| Author | unsloth | ValueFX9507 |
| Pipeline Task | text-generation | reinforcement-learning |
| Library | transformers | transformers |
| Downloads | 41,456 | 23,629 |
| Likes | 295 | 201 |
| License | Unknown | Unknown |
| Context Length | — | — |
| Created | 2025-01-20 | 2025-02-15 |
| Last Modified | 2025-05-10 | 2025-03-28 |
| Tags |
View full details: unsloth/deepseek-r1-distill-llama-8b-gguf · valuefx9507/tifa-deepsexv2-7b-mgrpo-gguf-q8