GraySoft
Projects Models About FAQ Contact Download guIDE →
Model Intelligence Sheet

richarderkhov/mesolitica_-_malaysian-llama-3-8b-instruct-16k-gguf overview

README at https://github.com/huseinzol05/malaya/tree/master/session/llama3 WandB, https://wandb.ai/huseinzol05/fpf-llama-3-8b-8192-hf-packing?nw=nwuserhuseinzol05

ggufendpoints_compatibleregion:usconversational
richarderkhov/mesolitica_-_malaysian-llama-3-8b-instruct-16k-gguf visual
Downloads
131
Likes
0
Pipeline
Library
Visibility
Public
Access
Open

Repository Files & Downloads

22 files detected
Direct downloads for all repository files
FileTypeQuantizationSizeLink
malaysian-llama-3-8b-instruct-16k.IQ3_M.gguf GGUF IQ3_M 3.52 GB Download
malaysian-llama-3-8b-instruct-16k.IQ3_S.gguf GGUF IQ3_S 3.43 GB Download
malaysian-llama-3-8b-instruct-16k.IQ3_XS.gguf GGUF IQ3_XS 3.28 GB Download
malaysian-llama-3-8b-instruct-16k.IQ4_NL.gguf GGUF IQ4_NL 4.38 GB Download
malaysian-llama-3-8b-instruct-16k.IQ4_XS.gguf GGUF IQ4_XS 4.18 GB Download
malaysian-llama-3-8b-instruct-16k.Q2_K.gguf GGUF Q2_K 2.96 GB Download
malaysian-llama-3-8b-instruct-16k.Q3_K.gguf GGUF Q3_K 3.74 GB Download
malaysian-llama-3-8b-instruct-16k.Q3_K_L.gguf GGUF Q3_K_L 4.03 GB Download
malaysian-llama-3-8b-instruct-16k.Q3_K_M.gguf GGUF Q3_K_M 3.74 GB Download
malaysian-llama-3-8b-instruct-16k.Q3_K_S.gguf GGUF Q3_K_S 3.41 GB Download
malaysian-llama-3-8b-instruct-16k.Q4_0.gguf GGUF 4.34 GB Download
malaysian-llama-3-8b-instruct-16k.Q4_1.gguf GGUF 4.78 GB Download
malaysian-llama-3-8b-instruct-16k.Q4_K.gguf GGUF Q4_K 4.58 GB Download
malaysian-llama-3-8b-instruct-16k.Q4_K_M.gguf GGUF Q4_K_M 4.58 GB Download
malaysian-llama-3-8b-instruct-16k.Q4_K_S.gguf GGUF Q4_K_S 4.37 GB Download
malaysian-llama-3-8b-instruct-16k.Q5_0.gguf GGUF 5.21 GB Download
malaysian-llama-3-8b-instruct-16k.Q5_1.gguf GGUF 5.65 GB Download
malaysian-llama-3-8b-instruct-16k.Q5_K.gguf GGUF Q5_K 5.34 GB Download
malaysian-llama-3-8b-instruct-16k.Q5_K_M.gguf GGUF Q5_K_M 5.34 GB Download
malaysian-llama-3-8b-instruct-16k.Q5_K_S.gguf GGUF Q5_K_S 5.21 GB Download
malaysian-llama-3-8b-instruct-16k.Q6_K.gguf GGUF Q6_K 6.14 GB Download
malaysian-llama-3-8b-instruct-16k.Q8_0.gguf GGUF 7.95 GB Download

Model Details Live

Model Slug
richarderkhov/mesolitica_-_malaysian-llama-3-8b-instruct-16k-gguf
Author
RichardErkhov
Pipeline Task
Library
Created
2024-08-23
Last Modified
2024-08-23
Gated
No
Private
No
HF SHA
e124e274c17274eaadd9fe5606c53e0ac15387af
License
Unknown
Language
Unknown
Base Model
Unknown

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "frontmatter": {},
    "hero_image_url": "",
    "summary": "README at https://github.com/huseinzol05/malaya/tree/master/session/llama3 WandB, https://wandb.ai/huseinzol05/fpf-llama-3-8b-8192-hf-packing?nw=nwuserhuseinzol05",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "Quantization made by Richard Erkhov.\n\n[Github](https://github.com/RichardErkhov)\n\n[Discord](https://discord.gg/pvy7H8DZMG)\n\n[Request more models](https://github.com/RichardErkhov/quant_request)\n\n\nmalaysian-llama-3-8b-instruct-16k - GGUF\n- Model creator: https://huggingface.co/mesolitica/\n- Original model: https://huggingface.co/mesolitica/malaysian-llama-3-8b-instruct-16k/\n\n\n| Name | Quant method | Size |\n| ---- | ---- | ---- |\n| [malaysian-llama-3-8b-instruct-16k.Q2_K.gguf](https://huggingface.co/RichardErkhov/mesolitica_-_malaysian-llama-3-8b-instruct-16k-gguf/blob/main/malaysian-llama-3-8b-instruct-16k.Q2_K.gguf) | Q2_K | 2.96GB |\n| [malaysian-llama-3-8b-instruct-16k.IQ3_XS.gguf](https://huggingface.co/RichardErkhov/mesolitica_-_malaysian-llama-3-8b-instruct-16k-gguf/blob/main/malaysian-llama-3-8b-instruct-16k.IQ3_XS.gguf) | IQ3_XS | 3.28GB |\n| [malaysian-llama-3-8b-instruct-16k.IQ3_S.gguf](https://huggingface.co/RichardErkhov/mesolitica_-_malaysian-llama-3-8b-instruct-16k-gguf/blob/main/malaysian-llama-3-8b-instruct-16k.IQ3_S.gguf) | IQ3_S | 3.43GB |\n| [malaysian-llama-3-8b-instruct-16k.Q3_K_S.gguf](https://huggingface.co/RichardErkhov/mesolitica_-_malaysian-llama-3-8b-instruct-16k-gguf/blob/main/malaysian-llama-3-8b-instruct-16k.Q3_K_S.gguf) | Q3_K_S | 3.41GB |\n| [malaysian-llama-3-8b-instruct-16k.IQ3_M.gguf](https://huggingface.co/RichardErkhov/mesolitica_-_malaysian-llama-3-8b-instruct-16k-gguf/blob/main/malaysian-llama-3-8b-instruct-16k.IQ3_M.gguf) | IQ3_M | 3.52GB |\n| [malaysian-llama-3-8b-instruct-16k.Q3_K.gguf](https://huggingface.co/RichardErkhov/mesolitica_-_malaysian-llama-3-8b-instruct-16k-gguf/blob/main/malaysian-llama-3-8b-instruct-16k.Q3_K.gguf) | Q3_K | 3.74GB |\n| [malaysian-llama-3-8b-instruct-16k.Q3_K_M.gguf](https://huggingface.co/RichardErkhov/mesolitica_-_malaysian-llama-3-8b-instruct-16k-gguf/blob/main/malaysian-llama-3-8b-instruct-16k.Q3_K_M.gguf) | Q3_K_M | 3.74GB |\n| [malaysian-llama-3-8b-instruct-16k.Q3_K_L.gguf](https://huggingface.co/RichardErkhov/mesolitica_-_malaysian-llama-3-8b-instruct-16k-gguf/blob/main/malaysian-llama-3-8b-instruct-16k.Q3_K_L.gguf) | Q3_K_L | 4.03GB |\n| [malaysian-llama-3-8b-instruct-16k.IQ4_XS.gguf](https://huggingface.co/RichardErkhov/mesolitica_-_malaysian-llama-3-8b-instruct-16k-gguf/blob/main/malaysian-llama-3-8b-instruct-16k.IQ4_XS.gguf) | IQ4_XS | 4.18GB |\n| [malaysian-llama-3-8b-instruct-16k.Q4_0.gguf](https://huggingface.co/RichardErkhov/mesolitica_-_malaysian-llama-3-8b-instruct-16k-gguf/blob/main/malaysian-llama-3-8b-instruct-16k.Q4_0.gguf) | Q4_0 | 4.34GB |\n| [malaysian-llama-3-8b-instruct-16k.IQ4_NL.gguf](https://huggingface.co/RichardErkhov/mesolitica_-_malaysian-llama-3-8b-instruct-16k-gguf/blob/main/malaysian-llama-3-8b-instruct-16k.IQ4_NL.gguf) | IQ4_NL | 4.38GB |\n| [malaysian-llama-3-8b-instruct-16k.Q4_K_S.gguf](https://huggingface.co/RichardErkhov/mesolitica_-_malaysian-llama-3-8b-instruct-16k-gguf/blob/main/malaysian-llama-3-8b-instruct-16k.Q4_K_S.gguf) | Q4_K_S | 4.37GB |\n| [malaysian-llama-3-8b-instruct-16k.Q4_K.gguf](https://huggingface.co/RichardErkhov/mesolitica_-_malaysian-llama-3-8b-instruct-16k-gguf/blob/main/malaysian-llama-3-8b-instruct-16k.Q4_K.gguf) | Q4_K | 4.58GB |\n| [malaysian-llama-3-8b-instruct-16k.Q4_K_M.gguf](https://huggingface.co/RichardErkhov/mesolitica_-_malaysian-llama-3-8b-instruct-16k-gguf/blob/main/malaysian-llama-3-8b-instruct-16k.Q4_K_M.gguf) | Q4_K_M | 4.58GB |\n| [malaysian-llama-3-8b-instruct-16k.Q4_1.gguf](https://huggingface.co/RichardErkhov/mesolitica_-_malaysian-llama-3-8b-instruct-16k-gguf/blob/main/malaysian-llama-3-8b-instruct-16k.Q4_1.gguf) | Q4_1 | 4.78GB |\n| [malaysian-llama-3-8b-instruct-16k.Q5_0.gguf](https://huggingface.co/RichardErkhov/mesolitica_-_malaysian-llama-3-8b-instruct-16k-gguf/blob/main/malaysian-llama-3-8b-instruct-16k.Q5_0.gguf) | Q5_0 | 5.21GB |\n| [malaysian-llama-3-8b-instruct-16k.Q5_K_S.gguf](https://huggingface.co/RichardErkhov/mesolitica_-_malaysian-llama-3-8b-instruct-16k-gguf/blob/main/malaysian-llama-3-8b-instruct-16k.Q5_K_S.gguf) | Q5_K_S | 5.21GB |\n| [malaysian-llama-3-8b-instruct-16k.Q5_K.gguf](https://huggingface.co/RichardErkhov/mesolitica_-_malaysian-llama-3-8b-instruct-16k-gguf/blob/main/malaysian-llama-3-8b-instruct-16k.Q5_K.gguf) | Q5_K | 5.34GB |\n| [malaysian-llama-3-8b-instruct-16k.Q5_K_M.gguf](https://huggingface.co/RichardErkhov/mesolitica_-_malaysian-llama-3-8b-instruct-16k-gguf/blob/main/malaysian-llama-3-8b-instruct-16k.Q5_K_M.gguf) | Q5_K_M | 5.34GB |\n| [malaysian-llama-3-8b-instruct-16k.Q5_1.gguf](https://huggingface.co/RichardErkhov/mesolitica_-_malaysian-llama-3-8b-instruct-16k-gguf/blob/main/malaysian-llama-3-8b-instruct-16k.Q5_1.gguf) | Q5_1 | 5.65GB |\n| [malaysian-llama-3-8b-instruct-16k.Q6_K.gguf](https://huggingface.co/RichardErkhov/mesolitica_-_malaysian-llama-3-8b-instruct-16k-gguf/blob/main/malaysian-llama-3-8b-instruct-16k.Q6_K.gguf) | Q6_K | 6.14GB |\n| [malaysian-llama-3-8b-instruct-16k.Q8_0.gguf](https://huggingface.co/RichardErkhov/mesolitica_-_malaysian-llama-3-8b-instruct-16k-gguf/blob/main/malaysian-llama-3-8b-instruct-16k.Q8_0.gguf) | Q8_0 | 7.95GB |\n\n\n\n\nOriginal model description:\n---\nlanguage:\n- ms\n---\n\n# Full Parameter Finetuning Malaysian Llama-3 16384 context length on Malaysian chat completion 3B tokens\n\nREADME at https://github.com/huseinzol05/malaya/tree/master/session/llama3\n\nWandB, https://wandb.ai/huseinzol05/fpf-llama-3-8b-8192-hf-packing?nw=nwuserhuseinzol05\n\n## how-to\n\n```python\nfrom transformers import AutoTokenizer, AutoModelForCausalLM\nimport torch\n\ntokenizer = AutoTokenizer.from_pretrained('mesolitica/malaysian-llama-3-8b-instruct-16k')\nmodel = AutoModelForCausalLM.from_pretrained('mesolitica/malaysian-llama-3-8b-instruct-16k', \n    torch_dtype=torch.bfloat16,\n    use_flash_attention_2 = True\n)\n_ = model.cuda()\n\nmessages = [\n    {'role': 'user', 'content': 'kwsp tu apa'}\n]\nprompt = tokenizer.apply_chat_template(messages, tokenize = False)\ninputs = tokenizer([prompt], return_tensors='pt', add_special_tokens=False).to('cuda')\ngenerate_kwargs = dict(\n    inputs,\n    max_new_tokens=1024,\n    top_p=0.95,\n    top_k=50,\n    temperature=0.9,\n    do_sample=True,\n    num_beams=1,\n)\nr = model.generate(**generate_kwargs)\ntokenizer.decode(r[0])\n```\n\n```\n<|begin_of_text|><|start_header_id|>user<|end_header_id|>\n\nkwsp tu apa<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\nKWSP adalah singkatan kepada Kumpulan Wang Simpanan Pekerja, iaitu sejenis skim simpanan persaraan yang ditadbir oleh kerajaan Malaysia. Ia adalah sistem pencen yang disumbangkan oleh kedua-dua majikan dan pekerja. KWSP adalah skim simpanan wajib di Malaysia, dan kebanyakan pekerja mesti mendaftar dan menyumbang kepada KWSP. Skim ini ditadbir oleh Kumpulan Wang Simpanan Pekerja Malaysia. KWSP menyediakan pelbagai pilihan pelaburan untuk ahlinya, termasuk unit amanah, bon, dan juga simpanan tetap. Ahli boleh memohon untuk melabur sebahagian simpanan KWSP mereka dalam pilihan ini pada bila-bila masa, tertakluk kepada kriteria kelayakan tertentu. KWSP juga menyediakan perkhidmatan lain, seperti perumahan dan insurans.<|eot_id|>\n```\n\n```python\nmessages = [\n    {'role': 'user', 'content': 'nasi kandaq apa sedap kat sungai petani kedah'}\n]\nprompt = tokenizer.apply_chat_template(messages, tokenize = False)\ninputs = tokenizer([prompt], return_tensors='pt', add_special_tokens=False).to('cuda')\ngenerate_kwargs = dict(\n    inputs,\n    max_new_tokens=1024,\n    top_p=0.95,\n    top_k=50,\n    temperature=0.9,\n    do_sample=True,\n    num_beams=1,\n)\nr = model.generate(**generate_kwargs)\nprint(tokenizer.decode(r[0]))\n```\n\n```\n<|begin_of_text|><|start_header_id|>user<|end_header_id|>\n\nnasi kandaq apa sedap kat sungai petani kedah<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\nDi Sungai Petani, Kedah, terdapat sebuah gerai yang menyajikan \"nasi kandaq\" yang sangat lazat dan popular di kalangan penduduk setempat dan pelancong. Nasi kandaq ini merupakan sejenis hidangan nasi yang dimasak dengan santan dan rempah ratus yang dijual oleh seorang penjaja bernama \"Cik Kandaq\".\n\nGerai nasi kandaq yang terkenal ini terletak di Pekan Lama, Sungai Petani dan telah beroperasi selama lebih dari 30 tahun. Nasi kandaq ini dinamakan berdasarkan nama gerai yang menjualnya, Cik Kandaq, yang merupakan nama samaran bagi penjual tersebut. Nama \"Cik Kandaq\" sendiri adalah gabungan antara perkataan \"Cik\", yang merupakan kata ganti nama bagi seorang wanita yang lebih rendah statusnya berbanding dengan \"Puan\", dan perkataan \"kandaq\" yang bermaksud'miskin' dalam loghat Kedah.\n\nNasi kandaq yang dijual oleh Cik Kandaq sangat terkenal kerana rasa nasi yang lemak dan wangi dari campuran santan dan rempah ratus, serta kerangupan lauk-pauk yang dihidangkan bersama nasi tersebut. Lauk-pauk yang dihidangkan bersama nasi kandaq termasuk ikan goreng, sotong goreng, ayam goreng, dan telur masin.\n\nGerai ini hanya beroperasi pada waktu pagi sahaja dan seringkali pelanggan akan berbaris panjang untuk mendapatkan hidangan nasi kandaq yang sedap ini. Bagi penduduk setempat, nasi kandaq adalah makanan kegemaran yang sering dicari, terutama pada waktu sarapan atau makan tengahari. Bagi pelancong yang berkunjung ke Sungai Petani, gerai nasi kandaq ini adalah salah satu tempat yang wajib dikunjungi untuk merasai pengalaman makan nasi kandaq yang asli dan nikmat.<|eot_id|>\n```\n\n```python\nmessages = [\n    {'role': 'user', 'content': 'asdsd231asd'},\n    {'role': 'assistant', 'content': 'Saya minta maaf, saya tidak dapat menjana respons yang bermakna kepada input itu. Bolehkah anda berikan lebih banyak konteks atau jelaskan soalan anda?'},\n    {'role': 'user', 'content': 'awak giler eh'},\n]\nprompt = tokenizer.apply_chat_template(messages, tokenize = False)\ninputs = tokenizer([prompt], return_tensors='pt', add_special_tokens=False).to('cuda')\ngenerate_kwargs = dict(\n    inputs,\n    max_new_tokens=1024,\n    top_p=0.95,\n    top_k=50,\n    temperature=0.9,\n    do_sample=True,\n    num_beams=1,\n)\nr = model.generate(**generate_kwargs)\nprint(tokenizer.decode(r[0]))\n```\n\n```\n<|begin_of_text|><|start_header_id|>user<|end_header_id|>\n\nasdsd231asd<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\nSaya minta maaf, saya tidak dapat menjana respons yang bermakna kepada input itu. Bolehkah anda berikan lebih banyak konteks atau jelaskan soalan anda?<|eot_id|><|start_header_id|>user<|end_header_id|>\n\nawak giler eh<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\nSaya memohon maaf jika saya memberi anda maklumat yang salah atau tidak membantu. Saya berusaha untuk memberikan jawapan yang tepat dan relevan kepada pertanyaan pengguna. Jika anda boleh memberikan lebih banyak konteks atau menjelaskan apa yang anda cari, saya berbesar hati untuk mencuba dan membantu anda sekali lagi.<|eot_id|>\n```\n\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 0,
  "downloads": 131,
  "gated": false,
  "private": false,
  "last_modified": "2024-08-23T16:54:32.000Z",
  "created_at": "2024-08-23T13:31:39.000Z",
  "pipeline_tag": "",
  "library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
  "_id": "66c88f3bda96b725102d46db",
  "id": "RichardErkhov/mesolitica_-_malaysian-llama-3-8b-instruct-16k-gguf",
  "modelId": "RichardErkhov/mesolitica_-_malaysian-llama-3-8b-instruct-16k-gguf",
  "sha": "e124e274c17274eaadd9fe5606c53e0ac15387af",
  "createdAt": "2024-08-23T13:31:39.000Z",
  "lastModified": "2024-08-23T16:54:32.000Z",
  "author": "RichardErkhov",
  "downloads": 131,
  "likes": 0,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 24
}