Model Comparison

unsloth/deepseek-r1-distill-llama-8b-gguf vs valuefx9507/tifa-deepsexv2-7b-mgrpo-gguf-q8

Side-by-side comparison of unsloth/deepseek-r1-distill-llama-8b-gguf and valuefx9507/tifa-deepsexv2-7b-mgrpo-gguf-q8: downloads, license, context length, tasks, and benchmarks.

unsloth/deepseek-r1-distill-llama-8b-gguf

unsloth · text-generation

We have a free Google Colab notebook for turning Llama 3.1 (8B) into a reasoning model: https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Llama3.1_(8B)-GRPO.ipynb

valuefx9507/tifa-deepsexv2-7b-mgrpo-gguf-q8

ValueFX9507 · reinforcement-learning

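```python
prompt = """You are a little girl / You are character XX...
I walk through the door and see you rush up to greet me
Need to convey the character's temperament
Add environment description
Keep the dialogue style
I see XX come through the door..."""
```

**Recommended parameters**:

```python
generation_config = {
    "temperature": 0.75,
    "top_p": 0.6,
    "repetition_penalty": 1.08,
    "max_new_tokens": 1536,
    "do_sample": True,
}
```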
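The recommended parameters in the excerpt use Hugging Face `transformers` key names, while both models on this page are distributed as GGUF files, which are often run through llama.cpp bindings instead. As a minimal sketch, assuming the `llama-cpp-python` binding (not mentioned on this page) and a hypothetical helper named `to_llama_cpp_kwargs`, the settings could be renamed like this:

```python
# Sampling settings quoted from the model card excerpt above.
generation_config = {
    "temperature": 0.75,
    "top_p": 0.6,
    "repetition_penalty": 1.08,
    "max_new_tokens": 1536,
    "do_sample": True,
}

# Assumed mapping from transformers-style keys to the keyword names used by
# llama-cpp-python's completion call. This table is illustrative, not taken
# from either model card.
HF_TO_LLAMA_CPP = {
    "temperature": "temperature",
    "top_p": "top_p",
    "repetition_penalty": "repeat_penalty",
    "max_new_tokens": "max_tokens",
}


def to_llama_cpp_kwargs(config):
    """Hypothetical helper: rename known keys and drop the rest."""
    kwargs = {
        HF_TO_LLAMA_CPP[key]: value
        for key, value in config.items()
        if key in HF_TO_LLAMA_CPP
    }
    # "do_sample" has no direct counterpart here. As an assumption, a config
    # that explicitly disables sampling is approximated by temperature 0.
    if config.get("do_sample") is False:
        kwargs["temperature"] = 0.0
    return kwargs


llama_kwargs = to_llama_cpp_kwargs(generation_config)
print(llama_kwargs)
```

With a loaded `llama_cpp.Llama` instance `llm`, the result could then be passed as `llm.create_completion(prompt, **llama_kwargs)`. Whether these values suit a given quantization is not stated by the model card.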

Side-by-side Specifications

	unsloth/deepseek-r1-distill-llama-8b-gguf	valuefx9507/tifa-deepsexv2-7b-mgrpo-gguf-q8
Author	unsloth	ValueFX9507
Pipeline Task	text-generation	reinforcement-learning
Library	transformers	transformers
Downloads	41,456	23,629
Likes	295	201
License	Unknown	apache-2.0 (per model tags)
Context Length	—	—
Created	2025-01-20	2025-02-15
Last Modified	2025-05-10	2025-03-28
Tags	transformers, gguf, llama, text-generation, deepseek, unsloth, llama-3, meta, en, arxiv:2501.12948	transformers, gguf, incremental-pretraining, sft, reinforcement-learning, roleplay, cot, zh, en, license:apache-2.0
