Model Intelligence Sheet

quantfactory/llama-3-8b-tkk-elite-v1.0-gguf overview

This is quantized version of tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0 created using llama.cpp # Model Description Llama-3-TKK-8B-Elite-V1.0 Llama-3-TKK-8B-Elite-V1.0, a generative model built upon the LLaMA 8B architecture, represents my individual undergraduate graduation project. Developed during my studies in Software Engineering at Malatya Turgut Özal University, this project stands as a culmination of my academic endeavors. I extend my sincere appreciation to Assoc. Prof. Dr. Harun Bingöl, who served as both my department chair and thesis advisor. His invaluable guidance, unwavering support, and mentorship have significantly shaped my educational and research experiences. I am deeply grateful for his continuous encouragement, insightful feedback, and unwavering dedication. Thank you, Dr. Bingöl... !image/png Model Details Training took 133 hours and 59 minutes for a total of 37,420 steps and was conducted on 8 Tesla V100 GPUs. Base Model: LLaMA 8B based LLM Model Developers: Tarık Kaan Koç Thesis Advisor: Assoc. Prof. Dr. Harun Bingöl Input: Text only Output: Text only Training Dataset: Cleaned Turkish raw data with 1 million raw instruction Turkish data, private Training Method: Fine-tuning with LORA LORA Fine-Tuning Configuration !image/png loraalpha: 16 loradropout: 0.1 r: 64 bias: none tasktype: CAUSALLM ### Example Usage: ### Output:

gguftext-generationbase_model:tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0base_model:quantized:tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0license:apache-2.0endpoints_compatibleregion:usconversational

quantfactory/llama-3-8b-tkk-elite-v1.0-gguf visual

Downloads

159

Likes

Pipeline

text-generation

Library

—

Visibility

Public

Access

Open

Repository Files & Downloads

28 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
Llama-3-8B-TKK-Elite-V1.0.Q2_K.f16.gguf	GGUF	Q2_K	2.96 GB	Download
Llama-3-8B-TKK-Elite-V1.0.Q2_K.f32.gguf	GGUF	Q2_K	2.96 GB	Download
Llama-3-8B-TKK-Elite-V1.0.Q3_K_L.f32.gguf	GGUF	Q3_K_L	4.03 GB	Download
Llama-3-8B-TKK-Elite-V1.0.Q3_K_L.gguf	GGUF	Q3_K_L	4.03 GB	Download
Llama-3-8B-TKK-Elite-V1.0.Q3_K_M.f32.gguf	GGUF	Q3_K_M	3.74 GB	Download
Llama-3-8B-TKK-Elite-V1.0.Q3_K_M.gguf	GGUF	Q3_K_M	3.74 GB	Download
Llama-3-8B-TKK-Elite-V1.0.Q3_K_S.f32.gguf	GGUF	Q3_K_S	3.41 GB	Download
Llama-3-8B-TKK-Elite-V1.0.Q3_K_S.gguf	GGUF	Q3_K_S	3.41 GB	Download
Llama-3-8B-TKK-Elite-V1.0.Q4_0.f32.gguf	GGUF	F32	4.34 GB	Download
Llama-3-8B-TKK-Elite-V1.0.Q4_0.gguf	GGUF	—	4.34 GB	Download
Llama-3-8B-TKK-Elite-V1.0.Q4_1.f32.gguf	GGUF	F32	4.78 GB	Download
Llama-3-8B-TKK-Elite-V1.0.Q4_1.gguf	GGUF	—	4.78 GB	Download
Llama-3-8B-TKK-Elite-V1.0.Q4_K_M.f32.gguf	GGUF	Q4_K_M	4.58 GB	Download
Llama-3-8B-TKK-Elite-V1.0.Q4_K_M.gguf	GGUF	Q4_K_M	4.58 GB	Download
Llama-3-8B-TKK-Elite-V1.0.Q4_K_S.f32.gguf	GGUF	Q4_K_S	4.37 GB	Download
Llama-3-8B-TKK-Elite-V1.0.Q4_K_S.gguf	GGUF	Q4_K_S	4.37 GB	Download
Llama-3-8B-TKK-Elite-V1.0.Q5_0.f32.gguf	GGUF	F32	5.21 GB	Download
Llama-3-8B-TKK-Elite-V1.0.Q5_0.gguf	GGUF	—	5.21 GB	Download
Llama-3-8B-TKK-Elite-V1.0.Q5_1.f32.gguf	GGUF	F32	5.65 GB	Download
Llama-3-8B-TKK-Elite-V1.0.Q5_1.gguf	GGUF	—	5.65 GB	Download
Llama-3-8B-TKK-Elite-V1.0.Q5_K_M.f32.gguf	GGUF	Q5_K_M	5.34 GB	Download
Llama-3-8B-TKK-Elite-V1.0.Q5_K_M.gguf	GGUF	Q5_K_M	5.34 GB	Download
Llama-3-8B-TKK-Elite-V1.0.Q5_K_S.f32.gguf	GGUF	Q5_K_S	5.21 GB	Download
Llama-3-8B-TKK-Elite-V1.0.Q5_K_S.gguf	GGUF	Q5_K_S	5.21 GB	Download
Llama-3-8B-TKK-Elite-V1.0.Q6_K.f32.gguf	GGUF	Q6_K	6.14 GB	Download
Llama-3-8B-TKK-Elite-V1.0.Q6_K.gguf	GGUF	Q6_K	6.14 GB	Download
Llama-3-8B-TKK-Elite-V1.0.Q8_0.f32.gguf	GGUF	F32	7.95 GB	Download
Llama-3-8B-TKK-Elite-V1.0.Q8_0.gguf	GGUF	—	7.95 GB	Download

Model Details Live

Model Slug

quantfactory/llama-3-8b-tkk-elite-v1.0-gguf

Author

QuantFactory

Pipeline Task

text-generation

Library

—

Created

2024-06-02

Last Modified

2024-06-04

Gated

Private

HF SHA

f95a2ae53b2f152b4c715068be917593930d27b4

License

apache-2.0

Language

Unknown

Base Model

tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "license": "apache-2.0",
    "base_model": "tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0",
    "pipeline_tag": "text-generation",
    "frontmatter": {
      "license": "apache-2.0",
      "base_model": "tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0",
      "pipeline_tag": "text-generation"
    },
    "hero_image_url": "https://cdn-uploads.huggingface.co/production/uploads/62bdd8065f304e8ea762287f/yjhKqN_bkVuJRa7JMtMBW.png",
    "summary": "This is quantized version of tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0 created using llama.cpp # Model Description Llama-3-TKK-8B-Elite-V1.0   Llama-3-TKK-8B-Elite-V1.0, a generative model built upon the LLaMA 8B architecture, represents my individual undergraduate graduation project. Developed during my studies in Software Engineering at Malatya Turgut Özal University, this project stands as a culmination of my academic endeavors. I extend my sincere appreciation to Assoc. Prof. Dr. Harun Bingöl, who served as both my department chair and thesis advisor. His invaluable guidance, unwavering support, and mentorship have significantly shaped my educational and research experiences. I am deeply grateful for his continuous encouragement, insightful feedback, and unwavering dedication. Thank you, Dr. Bingöl...  !image/png Model Details  Training took 133 hours and 59 minutes for a total of 37,420 steps and was conducted on 8 Tesla V100 GPUs.   Base Model: LLaMA 8B based LLM Model Developers: Tarık Kaan Koç Thesis Advisor: Assoc. Prof. Dr. Harun Bingöl Input: Text only Output: Text only Training Dataset: Cleaned Turkish raw data with 1 million raw instruction Turkish data, private Training Method: Fine-tuning with LORA  LORA Fine-Tuning Configuration !image/png  lora_alpha: 16 lora_dropout: 0.1 r: 64 bias: none task_type: CAUSAL_LM  ### Example Usage: ``python from transformers import AutoModelForCausalLM, AutoTokenizer, TextStreamer, pipeline import torch model_id = \"tarikkaankoc7/TKK-LLaMA3-8B-Elite-V1.0\" model = AutoModelForCausalLM.from_pretrained( model_id, torch_dtype=torch.bfloat16, device_map=\"auto\", trust_remote_code=True ) tokenizer = AutoTokenizer.from_pretrained( model_id, trust_remote_code=True ) streamer = TextStreamer(tokenizer) text_generation_pipeline = pipeline( \"text-generation\", model=model, tokenizer=tokenizer, model_kwargs={\"torch_dtype\": torch.bfloat16}, streamer=streamer ) messages = [ {\"role\": \"system\", \"content\": \"Sen yardımsever bir yapay zeka asistanısın ve kullanıcıların verdiği talimatlara doğrultusunda en iyi cevabı üretmeye çalışıyorsun.\"}, {\"role\": \"user\", \"content\": \"Leonardo da Vinci'nin en ünlü tablosu hangisidir?\"} ] prompt = tokenizer.apply_chat_template( messages, tokenize=False, add_generation_prompt=True ) terminators = [ tokenizer.eos_token_id ] outputs = text_generation_pipeline( prompt, max_new_tokens=2048, eos_token_id=terminators, do_sample=True, temperature=0.6, top_p=0.95 ) print(outputs[0][\"generated_text\"]) ` ### Output: ` Leonardo da Vinci'nin en ünlü tablosu Mona Lisa'dır. ``",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: apache-2.0\nbase_model: tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0\npipeline_tag: text-generation\n---\n\n# QuantFactory/Llama-3-8B-TKK-Elite-V1.0-GGUF\nThis is quantized version of [tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0](https://huggingface.co/tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0) created using llama.cpp\n\n# Model Description\n\n<h1 style=\"text-align: center;\">Llama-3-TKK-8B-Elite-V1.0 </h1>\n\n<p style=\"text-align: center;\">\n    Llama-3-TKK-8B-Elite-V1.0, a generative model built upon the LLaMA 8B architecture, represents my individual undergraduate graduation project. Developed during my studies in Software Engineering at Malatya Turgut Özal University, this project stands as a culmination of my academic endeavors. I extend my sincere appreciation to Assoc. Prof. Dr. Harun Bingöl, who served as both my department chair and thesis advisor. His invaluable guidance, unwavering support, and mentorship have significantly shaped my educational and research experiences. I am deeply grateful for his continuous encouragement, insightful feedback, and unwavering dedication. Thank you, Dr. Bingöl...\n</p>\n\n\n![image/png](https://cdn-uploads.huggingface.co/production/uploads/62bdd8065f304e8ea762287f/yjhKqN_bkVuJRa7JMtMBW.png) \n\n\n\n<h2>Model Details</h2>\n\n<p>\n    Training took 133 hours and 59 minutes for a total of 37,420 steps and was conducted on 8 Tesla V100 GPUs.\n</p>\n\n\n<ul>\n    <li><strong>Base Model:</strong> LLaMA 8B based LLM</li>\n    <li><strong>Model Developers:</strong> Tarık Kaan Koç</li>\n    <li><strong>Thesis Advisor:</strong> Assoc. Prof. Dr. Harun Bingöl</li>\n    <li><strong>Input:</strong> Text only</li>\n    <li><strong>Output:</strong> Text only</li>\n    <li><strong>Training Dataset:</strong> Cleaned Turkish raw data with 1 million raw instruction Turkish data, private</li>\n    <li><strong>Training Method:</strong> Fine-tuning with LORA</li>\n</ul>\n\n<h2>LORA Fine-Tuning Configuration</h2>\n\n\n![image/png](https://cdn-uploads.huggingface.co/production/uploads/62bdd8065f304e8ea762287f/TYPXlGYUilOJ5fsQDK9-O.png)\n\n<ul>\n    <li><strong>lora_alpha:</strong> 16</li>\n    <li><strong>lora_dropout:</strong> 0.1</li>\n    <li><strong>r:</strong> 64</li>\n    <li><strong>bias:</strong> none</li>\n    <li><strong>task_type:</strong> CAUSAL_LM</li>\n</ul>\n\n\n\n### Example Usage:\n\n```python\nfrom transformers import AutoModelForCausalLM, AutoTokenizer, TextStreamer, pipeline\nimport torch\n\nmodel_id = \"tarikkaankoc7/TKK-LLaMA3-8B-Elite-V1.0\"\n\nmodel = AutoModelForCausalLM.from_pretrained(\n    model_id,\n    torch_dtype=torch.bfloat16,\n    device_map=\"auto\",\n    trust_remote_code=True\n)\n\ntokenizer = AutoTokenizer.from_pretrained(\n    model_id,\n    trust_remote_code=True\n)\n\nstreamer = TextStreamer(tokenizer)\n\ntext_generation_pipeline = pipeline(\n    \"text-generation\",\n    model=model,\n    tokenizer=tokenizer,\n    model_kwargs={\"torch_dtype\": torch.bfloat16},\n    streamer=streamer\n)\n\nmessages = [\n    {\"role\": \"system\", \"content\": \"Sen yardımsever bir yapay zeka asistanısın ve kullanıcıların verdiği talimatlara doğrultusunda en iyi cevabı üretmeye çalışıyorsun.\"},\n    {\"role\": \"user\", \"content\": \"Leonardo da Vinci'nin en ünlü tablosu hangisidir?\"}\n]\n\nprompt = tokenizer.apply_chat_template(\n    messages,\n    tokenize=False,\n    add_generation_prompt=True\n)\n\nterminators = [\n    tokenizer.eos_token_id\n]\n\noutputs = text_generation_pipeline(\n    prompt,\n    max_new_tokens=2048,\n    eos_token_id=terminators,\n    do_sample=True,\n    temperature=0.6,\n    top_p=0.95\n)\n\nprint(outputs[0][\"generated_text\"])\n```\n\n### Output: \n\n```\nLeonardo da Vinci'nin en ünlü tablosu Mona Lisa'dır.\n```",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "text-generation",
    "base_model:tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0",
    "base_model:quantized:tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0",
    "license:apache-2.0",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 0,
  "downloads": 159,
  "gated": false,
  "private": false,
  "last_modified": "2024-06-04T09:21:05.000Z",
  "created_at": "2024-06-02T04:10:53.000Z",
  "pipeline_tag": "text-generation",
  "library_name": ""
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "665bf0cd20dee2925d7c8a50",
  "id": "QuantFactory/Llama-3-8B-TKK-Elite-V1.0-GGUF",
  "modelId": "QuantFactory/Llama-3-8B-TKK-Elite-V1.0-GGUF",
  "sha": "f95a2ae53b2f152b4c715068be917593930d27b4",
  "createdAt": "2024-06-02T04:10:53.000Z",
  "lastModified": "2024-06-04T09:21:05.000Z",
  "author": "QuantFactory",
  "downloads": 159,
  "likes": 0,
  "gated": false,
  "private": false,
  "pipeline_tag": "text-generation",
  "library_name": "",
  "siblings_count": 30
}