GraySoft
Projects Models About FAQ Contact Download guIDE →
Model Intelligence Sheet

quantfactory/llama-3-8b-tkk-elite-v1.0-gguf overview

This is quantized version of tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0 created using llama.cpp # Model Description Llama-3-TKK-8B-Elite-V1.0 Llama-3-TKK-8B-Elite-V1.0, a generative model built upon the LLaMA 8B architecture, represents my individual undergraduate graduation project. Developed during my studies in Software Engineering at Malatya Turgut Özal University, this project stands as a culmination of my academic endeavors. I extend my sincere appreciation to Assoc. Prof. Dr. Harun Bingöl, who served as both my department chair and thesis advisor. His invaluable guidance, unwavering support, and mentorship have significantly shaped my educational and research experiences. I am deeply grateful for his continuous encouragement, insightful feedback, and unwavering dedication. Thank you, Dr. Bingöl... !image/png Model Details Training took 133 hours and 59 minutes for a total of 37,420 steps and was conducted on 8 Tesla V100 GPUs. Base Model: LLaMA 8B based LLM Model Developers: Tarık Kaan Koç Thesis Advisor: Assoc. Prof. Dr. Harun Bingöl Input: Text only Output: Text only Training Dataset: Cleaned Turkish raw data with 1 million raw instruction Turkish data, private Training Method: Fine-tuning with LORA LORA Fine-Tuning Configuration !image/png loraalpha: 16 loradropout: 0.1 r: 64 bias: none tasktype: CAUSALLM ### Example Usage: ### Output:

gguftext-generationbase_model:tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0base_model:quantized:tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0license:apache-2.0endpoints_compatibleregion:usconversational
quantfactory/llama-3-8b-tkk-elite-v1.0-gguf visual
Downloads
159
Likes
0
Pipeline
text-generation
Library
Visibility
Public
Access
Open

Repository Files & Downloads

28 files detected
Direct downloads for all repository files
FileTypeQuantizationSizeLink
Llama-3-8B-TKK-Elite-V1.0.Q2_K.f16.gguf GGUF Q2_K 2.96 GB Download
Llama-3-8B-TKK-Elite-V1.0.Q2_K.f32.gguf GGUF Q2_K 2.96 GB Download
Llama-3-8B-TKK-Elite-V1.0.Q3_K_L.f32.gguf GGUF Q3_K_L 4.03 GB Download
Llama-3-8B-TKK-Elite-V1.0.Q3_K_L.gguf GGUF Q3_K_L 4.03 GB Download
Llama-3-8B-TKK-Elite-V1.0.Q3_K_M.f32.gguf GGUF Q3_K_M 3.74 GB Download
Llama-3-8B-TKK-Elite-V1.0.Q3_K_M.gguf GGUF Q3_K_M 3.74 GB Download
Llama-3-8B-TKK-Elite-V1.0.Q3_K_S.f32.gguf GGUF Q3_K_S 3.41 GB Download
Llama-3-8B-TKK-Elite-V1.0.Q3_K_S.gguf GGUF Q3_K_S 3.41 GB Download
Llama-3-8B-TKK-Elite-V1.0.Q4_0.f32.gguf GGUF F32 4.34 GB Download
Llama-3-8B-TKK-Elite-V1.0.Q4_0.gguf GGUF 4.34 GB Download
Llama-3-8B-TKK-Elite-V1.0.Q4_1.f32.gguf GGUF F32 4.78 GB Download
Llama-3-8B-TKK-Elite-V1.0.Q4_1.gguf GGUF 4.78 GB Download
Llama-3-8B-TKK-Elite-V1.0.Q4_K_M.f32.gguf GGUF Q4_K_M 4.58 GB Download
Llama-3-8B-TKK-Elite-V1.0.Q4_K_M.gguf GGUF Q4_K_M 4.58 GB Download
Llama-3-8B-TKK-Elite-V1.0.Q4_K_S.f32.gguf GGUF Q4_K_S 4.37 GB Download
Llama-3-8B-TKK-Elite-V1.0.Q4_K_S.gguf GGUF Q4_K_S 4.37 GB Download
Llama-3-8B-TKK-Elite-V1.0.Q5_0.f32.gguf GGUF F32 5.21 GB Download
Llama-3-8B-TKK-Elite-V1.0.Q5_0.gguf GGUF 5.21 GB Download
Llama-3-8B-TKK-Elite-V1.0.Q5_1.f32.gguf GGUF F32 5.65 GB Download
Llama-3-8B-TKK-Elite-V1.0.Q5_1.gguf GGUF 5.65 GB Download
Llama-3-8B-TKK-Elite-V1.0.Q5_K_M.f32.gguf GGUF Q5_K_M 5.34 GB Download
Llama-3-8B-TKK-Elite-V1.0.Q5_K_M.gguf GGUF Q5_K_M 5.34 GB Download
Llama-3-8B-TKK-Elite-V1.0.Q5_K_S.f32.gguf GGUF Q5_K_S 5.21 GB Download
Llama-3-8B-TKK-Elite-V1.0.Q5_K_S.gguf GGUF Q5_K_S 5.21 GB Download
Llama-3-8B-TKK-Elite-V1.0.Q6_K.f32.gguf GGUF Q6_K 6.14 GB Download
Llama-3-8B-TKK-Elite-V1.0.Q6_K.gguf GGUF Q6_K 6.14 GB Download
Llama-3-8B-TKK-Elite-V1.0.Q8_0.f32.gguf GGUF F32 7.95 GB Download
Llama-3-8B-TKK-Elite-V1.0.Q8_0.gguf GGUF 7.95 GB Download

Model Details Live

Model Slug
quantfactory/llama-3-8b-tkk-elite-v1.0-gguf
Author
QuantFactory
Pipeline Task
text-generation
Library
Created
2024-06-02
Last Modified
2024-06-04
Gated
No
Private
No
HF SHA
f95a2ae53b2f152b4c715068be917593930d27b4
License
apache-2.0
Language
Unknown
Base Model
tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "license": "apache-2.0",
    "base_model": "tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0",
    "pipeline_tag": "text-generation",
    "frontmatter": {
      "license": "apache-2.0",
      "base_model": "tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0",
      "pipeline_tag": "text-generation"
    },
    "hero_image_url": "https://cdn-uploads.huggingface.co/production/uploads/62bdd8065f304e8ea762287f/yjhKqN_bkVuJRa7JMtMBW.png",
    "summary": "This is quantized version of tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0 created using llama.cpp # Model Description Llama-3-TKK-8B-Elite-V1.0   Llama-3-TKK-8B-Elite-V1.0, a generative model built upon the LLaMA 8B architecture, represents my individual undergraduate graduation project. Developed during my studies in Software Engineering at Malatya Turgut Özal University, this project stands as a culmination of my academic endeavors. I extend my sincere appreciation to Assoc. Prof. Dr. Harun Bingöl, who served as both my department chair and thesis advisor. His invaluable guidance, unwavering support, and mentorship have significantly shaped my educational and research experiences. I am deeply grateful for his continuous encouragement, insightful feedback, and unwavering dedication. Thank you, Dr. Bingöl...  !image/png Model Details  Training took 133 hours and 59 minutes for a total of 37,420 steps and was conducted on 8 Tesla V100 GPUs.   Base Model: LLaMA 8B based LLM Model Developers: Tarık Kaan Koç Thesis Advisor: Assoc. Prof. Dr. Harun Bingöl Input: Text only Output: Text only Training Dataset: Cleaned Turkish raw data with 1 million raw instruction Turkish data, private Training Method: Fine-tuning with LORA  LORA Fine-Tuning Configuration !image/png  lora_alpha: 16 lora_dropout: 0.1 r: 64 bias: none task_type: CAUSAL_LM  ### Example Usage: ``python from transformers import AutoModelForCausalLM, AutoTokenizer, TextStreamer, pipeline import torch model_id = \"tarikkaankoc7/TKK-LLaMA3-8B-Elite-V1.0\" model = AutoModelForCausalLM.from_pretrained( model_id, torch_dtype=torch.bfloat16, device_map=\"auto\", trust_remote_code=True ) tokenizer = AutoTokenizer.from_pretrained( model_id, trust_remote_code=True ) streamer = TextStreamer(tokenizer) text_generation_pipeline = pipeline( \"text-generation\", model=model, tokenizer=tokenizer, model_kwargs={\"torch_dtype\": torch.bfloat16}, streamer=streamer ) messages = [ {\"role\": \"system\", \"content\": \"Sen yardımsever bir yapay zeka asistanısın ve kullanıcıların verdiği talimatlara doğrultusunda en iyi cevabı üretmeye çalışıyorsun.\"}, {\"role\": \"user\", \"content\": \"Leonardo da Vinci'nin en ünlü tablosu hangisidir?\"} ] prompt = tokenizer.apply_chat_template( messages, tokenize=False, add_generation_prompt=True ) terminators = [ tokenizer.eos_token_id ] outputs = text_generation_pipeline( prompt, max_new_tokens=2048, eos_token_id=terminators, do_sample=True, temperature=0.6, top_p=0.95 ) print(outputs[0][\"generated_text\"]) ` ### Output: ` Leonardo da Vinci'nin en ünlü tablosu Mona Lisa'dır. ``",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: apache-2.0\nbase_model: tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0\npipeline_tag: text-generation\n---\n\n# QuantFactory/Llama-3-8B-TKK-Elite-V1.0-GGUF\nThis is quantized version of [tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0](https://huggingface.co/tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0) created using llama.cpp\n\n# Model Description\n\n<h1 style=\"text-align: center;\">Llama-3-TKK-8B-Elite-V1.0 </h1>\n\n<p style=\"text-align: center;\">\n    Llama-3-TKK-8B-Elite-V1.0, a generative model built upon the LLaMA 8B architecture, represents my individual undergraduate graduation project. Developed during my studies in Software Engineering at Malatya Turgut Özal University, this project stands as a culmination of my academic endeavors. I extend my sincere appreciation to Assoc. Prof. Dr. Harun Bingöl, who served as both my department chair and thesis advisor. His invaluable guidance, unwavering support, and mentorship have significantly shaped my educational and research experiences. I am deeply grateful for his continuous encouragement, insightful feedback, and unwavering dedication. Thank you, Dr. Bingöl...\n</p>\n\n\n![image/png](https://cdn-uploads.huggingface.co/production/uploads/62bdd8065f304e8ea762287f/yjhKqN_bkVuJRa7JMtMBW.png) \n\n\n\n<h2>Model Details</h2>\n\n<p>\n    Training took 133 hours and 59 minutes for a total of 37,420 steps and was conducted on 8 Tesla V100 GPUs.\n</p>\n\n\n<ul>\n    <li><strong>Base Model:</strong> LLaMA 8B based LLM</li>\n    <li><strong>Model Developers:</strong> Tarık Kaan Koç</li>\n    <li><strong>Thesis Advisor:</strong> Assoc. Prof. Dr. Harun Bingöl</li>\n    <li><strong>Input:</strong> Text only</li>\n    <li><strong>Output:</strong> Text only</li>\n    <li><strong>Training Dataset:</strong> Cleaned Turkish raw data with 1 million raw instruction Turkish data, private</li>\n    <li><strong>Training Method:</strong> Fine-tuning with LORA</li>\n</ul>\n\n<h2>LORA Fine-Tuning Configuration</h2>\n\n\n![image/png](https://cdn-uploads.huggingface.co/production/uploads/62bdd8065f304e8ea762287f/TYPXlGYUilOJ5fsQDK9-O.png)\n\n<ul>\n    <li><strong>lora_alpha:</strong> 16</li>\n    <li><strong>lora_dropout:</strong> 0.1</li>\n    <li><strong>r:</strong> 64</li>\n    <li><strong>bias:</strong> none</li>\n    <li><strong>task_type:</strong> CAUSAL_LM</li>\n</ul>\n\n\n\n### Example Usage:\n\n```python\nfrom transformers import AutoModelForCausalLM, AutoTokenizer, TextStreamer, pipeline\nimport torch\n\nmodel_id = \"tarikkaankoc7/TKK-LLaMA3-8B-Elite-V1.0\"\n\nmodel = AutoModelForCausalLM.from_pretrained(\n    model_id,\n    torch_dtype=torch.bfloat16,\n    device_map=\"auto\",\n    trust_remote_code=True\n)\n\ntokenizer = AutoTokenizer.from_pretrained(\n    model_id,\n    trust_remote_code=True\n)\n\nstreamer = TextStreamer(tokenizer)\n\ntext_generation_pipeline = pipeline(\n    \"text-generation\",\n    model=model,\n    tokenizer=tokenizer,\n    model_kwargs={\"torch_dtype\": torch.bfloat16},\n    streamer=streamer\n)\n\nmessages = [\n    {\"role\": \"system\", \"content\": \"Sen yardımsever bir yapay zeka asistanısın ve kullanıcıların verdiği talimatlara doğrultusunda en iyi cevabı üretmeye çalışıyorsun.\"},\n    {\"role\": \"user\", \"content\": \"Leonardo da Vinci'nin en ünlü tablosu hangisidir?\"}\n]\n\nprompt = tokenizer.apply_chat_template(\n    messages,\n    tokenize=False,\n    add_generation_prompt=True\n)\n\nterminators = [\n    tokenizer.eos_token_id\n]\n\noutputs = text_generation_pipeline(\n    prompt,\n    max_new_tokens=2048,\n    eos_token_id=terminators,\n    do_sample=True,\n    temperature=0.6,\n    top_p=0.95\n)\n\nprint(outputs[0][\"generated_text\"])\n```\n\n### Output: \n\n```\nLeonardo da Vinci'nin en ünlü tablosu Mona Lisa'dır.\n```",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "text-generation",
    "base_model:tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0",
    "base_model:quantized:tarikkaankoc7/Llama-3-8B-TKK-Elite-V1.0",
    "license:apache-2.0",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 0,
  "downloads": 159,
  "gated": false,
  "private": false,
  "last_modified": "2024-06-04T09:21:05.000Z",
  "created_at": "2024-06-02T04:10:53.000Z",
  "pipeline_tag": "text-generation",
  "library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
  "_id": "665bf0cd20dee2925d7c8a50",
  "id": "QuantFactory/Llama-3-8B-TKK-Elite-V1.0-GGUF",
  "modelId": "QuantFactory/Llama-3-8B-TKK-Elite-V1.0-GGUF",
  "sha": "f95a2ae53b2f152b4c715068be917593930d27b4",
  "createdAt": "2024-06-02T04:10:53.000Z",
  "lastModified": "2024-06-04T09:21:05.000Z",
  "author": "QuantFactory",
  "downloads": 159,
  "likes": 0,
  "gated": false,
  "private": false,
  "pipeline_tag": "text-generation",
  "library_name": "",
  "siblings_count": 30
}