quantfactory/llama-3-8b-tkk-elite-v1.0-gguf overview
This is quantized version of tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0 created using llama.cpp # Model Description Llama-3-TKK-8B-Elite-V1.0 Llama-3-TKK-8B-Elite-V1.0, a generative model built upon the LLaMA 8B architecture, represents my individual undergraduate graduation project. Developed during my studies in Software Engineering at Malatya Turgut Özal University, this project stands as a culmination of my academic endeavors. I extend my sincere appreciation to Assoc. Prof. Dr. Harun Bingöl, who served as both my department chair and thesis advisor. His invaluable guidance, unwavering support, and mentorship have significantly shaped my educational and research experiences. I am deeply grateful for his continuous encouragement, insightful feedback, and unwavering dedication. Thank you, Dr. Bingöl... !image/png Model Details Training took 133 hours and 59 minutes for a total of 37,420 steps and was conducted on 8 Tesla V100 GPUs. Base Model: LLaMA 8B based LLM Model Developers: Tarık Kaan Koç Thesis Advisor: Assoc. Prof. Dr. Harun Bingöl Input: Text only Output: Text only Training Dataset: Cleaned Turkish raw data with 1 million raw instruction Turkish data, private Training Method: Fine-tuning with LORA LORA Fine-Tuning Configuration !image/png loraalpha: 16 loradropout: 0.1 r: 64 bias: none tasktype: CAUSALLM ### Example Usage: ### Output:
Repository Files & Downloads
| File | Type | Quantization | Size | Link |
|---|---|---|---|---|
| Llama-3-8B-TKK-Elite-V1.0.Q2_K.f16.gguf | GGUF | Q2_K | 2.96 GB | Download |
| Llama-3-8B-TKK-Elite-V1.0.Q2_K.f32.gguf | GGUF | Q2_K | 2.96 GB | Download |
| Llama-3-8B-TKK-Elite-V1.0.Q3_K_L.f32.gguf | GGUF | Q3_K_L | 4.03 GB | Download |
| Llama-3-8B-TKK-Elite-V1.0.Q3_K_L.gguf | GGUF | Q3_K_L | 4.03 GB | Download |
| Llama-3-8B-TKK-Elite-V1.0.Q3_K_M.f32.gguf | GGUF | Q3_K_M | 3.74 GB | Download |
| Llama-3-8B-TKK-Elite-V1.0.Q3_K_M.gguf | GGUF | Q3_K_M | 3.74 GB | Download |
| Llama-3-8B-TKK-Elite-V1.0.Q3_K_S.f32.gguf | GGUF | Q3_K_S | 3.41 GB | Download |
| Llama-3-8B-TKK-Elite-V1.0.Q3_K_S.gguf | GGUF | Q3_K_S | 3.41 GB | Download |
| Llama-3-8B-TKK-Elite-V1.0.Q4_0.f32.gguf | GGUF | F32 | 4.34 GB | Download |
| Llama-3-8B-TKK-Elite-V1.0.Q4_0.gguf | GGUF | — | 4.34 GB | Download |
| Llama-3-8B-TKK-Elite-V1.0.Q4_1.f32.gguf | GGUF | F32 | 4.78 GB | Download |
| Llama-3-8B-TKK-Elite-V1.0.Q4_1.gguf | GGUF | — | 4.78 GB | Download |
| Llama-3-8B-TKK-Elite-V1.0.Q4_K_M.f32.gguf | GGUF | Q4_K_M | 4.58 GB | Download |
| Llama-3-8B-TKK-Elite-V1.0.Q4_K_M.gguf | GGUF | Q4_K_M | 4.58 GB | Download |
| Llama-3-8B-TKK-Elite-V1.0.Q4_K_S.f32.gguf | GGUF | Q4_K_S | 4.37 GB | Download |
| Llama-3-8B-TKK-Elite-V1.0.Q4_K_S.gguf | GGUF | Q4_K_S | 4.37 GB | Download |
| Llama-3-8B-TKK-Elite-V1.0.Q5_0.f32.gguf | GGUF | F32 | 5.21 GB | Download |
| Llama-3-8B-TKK-Elite-V1.0.Q5_0.gguf | GGUF | — | 5.21 GB | Download |
| Llama-3-8B-TKK-Elite-V1.0.Q5_1.f32.gguf | GGUF | F32 | 5.65 GB | Download |
| Llama-3-8B-TKK-Elite-V1.0.Q5_1.gguf | GGUF | — | 5.65 GB | Download |
| Llama-3-8B-TKK-Elite-V1.0.Q5_K_M.f32.gguf | GGUF | Q5_K_M | 5.34 GB | Download |
| Llama-3-8B-TKK-Elite-V1.0.Q5_K_M.gguf | GGUF | Q5_K_M | 5.34 GB | Download |
| Llama-3-8B-TKK-Elite-V1.0.Q5_K_S.f32.gguf | GGUF | Q5_K_S | 5.21 GB | Download |
| Llama-3-8B-TKK-Elite-V1.0.Q5_K_S.gguf | GGUF | Q5_K_S | 5.21 GB | Download |
| Llama-3-8B-TKK-Elite-V1.0.Q6_K.f32.gguf | GGUF | Q6_K | 6.14 GB | Download |
| Llama-3-8B-TKK-Elite-V1.0.Q6_K.gguf | GGUF | Q6_K | 6.14 GB | Download |
| Llama-3-8B-TKK-Elite-V1.0.Q8_0.f32.gguf | GGUF | F32 | 7.95 GB | Download |
| Llama-3-8B-TKK-Elite-V1.0.Q8_0.gguf | GGUF | — | 7.95 GB | Download |
Model Details Live
Metadata Inspector
Normalized metadata (stored in metadata_json)
{
"metadata": {},
"card_data": {
"license": "apache-2.0",
"base_model": "tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0",
"pipeline_tag": "text-generation",
"frontmatter": {
"license": "apache-2.0",
"base_model": "tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0",
"pipeline_tag": "text-generation"
},
"hero_image_url": "https://cdn-uploads.huggingface.co/production/uploads/62bdd8065f304e8ea762287f/yjhKqN_bkVuJRa7JMtMBW.png",
"summary": "This is quantized version of tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0 created using llama.cpp # Model Description Llama-3-TKK-8B-Elite-V1.0 Llama-3-TKK-8B-Elite-V1.0, a generative model built upon the LLaMA 8B architecture, represents my individual undergraduate graduation project. Developed during my studies in Software Engineering at Malatya Turgut Özal University, this project stands as a culmination of my academic endeavors. I extend my sincere appreciation to Assoc. Prof. Dr. Harun Bingöl, who served as both my department chair and thesis advisor. His invaluable guidance, unwavering support, and mentorship have significantly shaped my educational and research experiences. I am deeply grateful for his continuous encouragement, insightful feedback, and unwavering dedication. Thank you, Dr. Bingöl... !image/png Model Details Training took 133 hours and 59 minutes for a total of 37,420 steps and was conducted on 8 Tesla V100 GPUs. Base Model: LLaMA 8B based LLM Model Developers: Tarık Kaan Koç Thesis Advisor: Assoc. Prof. Dr. Harun Bingöl Input: Text only Output: Text only Training Dataset: Cleaned Turkish raw data with 1 million raw instruction Turkish data, private Training Method: Fine-tuning with LORA LORA Fine-Tuning Configuration !image/png lora_alpha: 16 lora_dropout: 0.1 r: 64 bias: none task_type: CAUSAL_LM ### Example Usage: ``python from transformers import AutoModelForCausalLM, AutoTokenizer, TextStreamer, pipeline import torch model_id = \"tarikkaankoc7/TKK-LLaMA3-8B-Elite-V1.0\" model = AutoModelForCausalLM.from_pretrained( model_id, torch_dtype=torch.bfloat16, device_map=\"auto\", trust_remote_code=True ) tokenizer = AutoTokenizer.from_pretrained( model_id, trust_remote_code=True ) streamer = TextStreamer(tokenizer) text_generation_pipeline = pipeline( \"text-generation\", model=model, tokenizer=tokenizer, model_kwargs={\"torch_dtype\": torch.bfloat16}, streamer=streamer ) messages = [ {\"role\": \"system\", \"content\": \"Sen yardımsever bir yapay zeka asistanısın ve kullanıcıların verdiği talimatlara doğrultusunda en iyi cevabı üretmeye çalışıyorsun.\"}, {\"role\": \"user\", \"content\": \"Leonardo da Vinci'nin en ünlü tablosu hangisidir?\"} ] prompt = tokenizer.apply_chat_template( messages, tokenize=False, add_generation_prompt=True ) terminators = [ tokenizer.eos_token_id ] outputs = text_generation_pipeline( prompt, max_new_tokens=2048, eos_token_id=terminators, do_sample=True, temperature=0.6, top_p=0.95 ) print(outputs[0][\"generated_text\"]) ` ### Output: ` Leonardo da Vinci'nin en ünlü tablosu Mona Lisa'dır. ``",
"quick_links": [],
"benchmark_table_html": "",
"readme_markdown": "---\nlicense: apache-2.0\nbase_model: tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0\npipeline_tag: text-generation\n---\n\n# QuantFactory/Llama-3-8B-TKK-Elite-V1.0-GGUF\nThis is quantized version of [tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0](https://huggingface.co/tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0) created using llama.cpp\n\n# Model Description\n\n<h1 style=\"text-align: center;\">Llama-3-TKK-8B-Elite-V1.0 </h1>\n\n<p style=\"text-align: center;\">\n Llama-3-TKK-8B-Elite-V1.0, a generative model built upon the LLaMA 8B architecture, represents my individual undergraduate graduation project. Developed during my studies in Software Engineering at Malatya Turgut Özal University, this project stands as a culmination of my academic endeavors. I extend my sincere appreciation to Assoc. Prof. Dr. Harun Bingöl, who served as both my department chair and thesis advisor. His invaluable guidance, unwavering support, and mentorship have significantly shaped my educational and research experiences. I am deeply grateful for his continuous encouragement, insightful feedback, and unwavering dedication. Thank you, Dr. Bingöl...\n</p>\n\n\n \n\n\n\n<h2>Model Details</h2>\n\n<p>\n Training took 133 hours and 59 minutes for a total of 37,420 steps and was conducted on 8 Tesla V100 GPUs.\n</p>\n\n\n<ul>\n <li><strong>Base Model:</strong> LLaMA 8B based LLM</li>\n <li><strong>Model Developers:</strong> Tarık Kaan Koç</li>\n <li><strong>Thesis Advisor:</strong> Assoc. Prof. Dr. Harun Bingöl</li>\n <li><strong>Input:</strong> Text only</li>\n <li><strong>Output:</strong> Text only</li>\n <li><strong>Training Dataset:</strong> Cleaned Turkish raw data with 1 million raw instruction Turkish data, private</li>\n <li><strong>Training Method:</strong> Fine-tuning with LORA</li>\n</ul>\n\n<h2>LORA Fine-Tuning Configuration</h2>\n\n\n\n\n<ul>\n <li><strong>lora_alpha:</strong> 16</li>\n <li><strong>lora_dropout:</strong> 0.1</li>\n <li><strong>r:</strong> 64</li>\n <li><strong>bias:</strong> none</li>\n <li><strong>task_type:</strong> CAUSAL_LM</li>\n</ul>\n\n\n\n### Example Usage:\n\n```python\nfrom transformers import AutoModelForCausalLM, AutoTokenizer, TextStreamer, pipeline\nimport torch\n\nmodel_id = \"tarikkaankoc7/TKK-LLaMA3-8B-Elite-V1.0\"\n\nmodel = AutoModelForCausalLM.from_pretrained(\n model_id,\n torch_dtype=torch.bfloat16,\n device_map=\"auto\",\n trust_remote_code=True\n)\n\ntokenizer = AutoTokenizer.from_pretrained(\n model_id,\n trust_remote_code=True\n)\n\nstreamer = TextStreamer(tokenizer)\n\ntext_generation_pipeline = pipeline(\n \"text-generation\",\n model=model,\n tokenizer=tokenizer,\n model_kwargs={\"torch_dtype\": torch.bfloat16},\n streamer=streamer\n)\n\nmessages = [\n {\"role\": \"system\", \"content\": \"Sen yardımsever bir yapay zeka asistanısın ve kullanıcıların verdiği talimatlara doğrultusunda en iyi cevabı üretmeye çalışıyorsun.\"},\n {\"role\": \"user\", \"content\": \"Leonardo da Vinci'nin en ünlü tablosu hangisidir?\"}\n]\n\nprompt = tokenizer.apply_chat_template(\n messages,\n tokenize=False,\n add_generation_prompt=True\n)\n\nterminators = [\n tokenizer.eos_token_id\n]\n\noutputs = text_generation_pipeline(\n prompt,\n max_new_tokens=2048,\n eos_token_id=terminators,\n do_sample=True,\n temperature=0.6,\n top_p=0.95\n)\n\nprint(outputs[0][\"generated_text\"])\n```\n\n### Output: \n\n```\nLeonardo da Vinci'nin en ünlü tablosu Mona Lisa'dır.\n```",
"related_quantizations": []
},
"tags": [
"gguf",
"text-generation",
"base_model:tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0",
"base_model:quantized:tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0",
"license:apache-2.0",
"endpoints_compatible",
"region:us",
"conversational"
],
"likes": 0,
"downloads": 159,
"gated": false,
"private": false,
"last_modified": "2024-06-04T09:21:05.000Z",
"created_at": "2024-06-02T04:10:53.000Z",
"pipeline_tag": "text-generation",
"library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
"_id": "665bf0cd20dee2925d7c8a50",
"id": "QuantFactory/Llama-3-8B-TKK-Elite-V1.0-GGUF",
"modelId": "QuantFactory/Llama-3-8B-TKK-Elite-V1.0-GGUF",
"sha": "f95a2ae53b2f152b4c715068be917593930d27b4",
"createdAt": "2024-06-02T04:10:53.000Z",
"lastModified": "2024-06-04T09:21:05.000Z",
"author": "QuantFactory",
"downloads": 159,
"likes": 0,
"gated": false,
"private": false,
"pipeline_tag": "text-generation",
"library_name": "",
"siblings_count": 30
}