Model Intelligence Sheet

richarderkhov/dmis-lab_-_llama-3-meerkat-70b-v1.0-gguf overview

🚀 Meerkat-70B is a new instruction-tuned medical AI system in the Meerkat model family. The model is based on Meta's Llama-3-70B-Instruct and was fine-tuned on our new synthetic dataset of high-quality chain-of-thought reasoning paths sourced from 18 medical textbooks, along with diverse instruction-following datasets. This equips the model with the high-level medical reasoning capabilities required for solving complex medical problems. For further details, please refer to our paper.

📄 Paper: Small Language Models Learn Enhanced Reasoning Skills from Medical Textbooks (arXiv:2404.00376)

Tags: gguf, arxiv:2404.00376, arxiv:2009.13081, arxiv:2402.18060, arxiv:2203.14371, arxiv:2009.03300, endpoints_compatible, region:us, conversational

richarderkhov/dmis-lab_-_llama-3-meerkat-70b-v1.0-gguf visual

Downloads

1,898

Likes

0

Pipeline

—

Library

—

Visibility

Public

Access

Open

Repository Files & Downloads

34 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
llama-3-meerkat-70b-v1.0.IQ3_M.gguf	GGUF	IQ3_M	29.74 GB	Download
llama-3-meerkat-70b-v1.0.IQ3_S.gguf	GGUF	IQ3_S	28.79 GB	Download
llama-3-meerkat-70b-v1.0.IQ3_XS.gguf	GGUF	IQ3_XS	27.29 GB	Download
llama-3-meerkat-70b-v1.0.IQ4_XS.gguf	GGUF	IQ4_XS	35.64 GB	Download
llama-3-meerkat-70b-v1.0.Q2_K.gguf	GGUF	Q2_K	24.56 GB	Download
llama-3-meerkat-70b-v1.0.Q3_K.gguf	GGUF	Q3_K	31.91 GB	Download
llama-3-meerkat-70b-v1.0.Q3_K_L.gguf	GGUF	Q3_K_L	34.59 GB	Download
llama-3-meerkat-70b-v1.0.Q3_K_M.gguf	GGUF	Q3_K_M	31.91 GB	Download
llama-3-meerkat-70b-v1.0.Q3_K_S.gguf	GGUF	Q3_K_S	28.79 GB	Download
llama-3-meerkat-70b-v1.0.Q4_0.gguf	GGUF	Q4_0	37.22 GB	Download
llama-3-meerkat-70b-v1.0_IQ4_NL-00001-of-00002.gguf	GGUF	IQ4_NL	36.77 GB	Download
llama-3-meerkat-70b-v1.0_IQ4_NL-00002-of-00002.gguf	GGUF	IQ4_NL	821.95 MB	Download
llama-3-meerkat-70b-v1.0_Q4_1-00001-of-00002.gguf	GGUF	Q4_1	37.25 GB	Download
llama-3-meerkat-70b-v1.0_Q4_1-00002-of-00002.gguf	GGUF	Q4_1	4.02 GB	Download
llama-3-meerkat-70b-v1.0_Q4_K-00001-of-00002.gguf	GGUF	Q4_K	37.24 GB	Download
llama-3-meerkat-70b-v1.0_Q4_K-00002-of-00002.gguf	GGUF	Q4_K	2.36 GB	Download
llama-3-meerkat-70b-v1.0_Q4_K_M-00001-of-00002.gguf	GGUF	Q4_K_M	37.24 GB	Download
llama-3-meerkat-70b-v1.0_Q4_K_M-00002-of-00002.gguf	GGUF	Q4_K_M	2.36 GB	Download
llama-3-meerkat-70b-v1.0_Q4_K_S-00001-of-00002.gguf	GGUF	Q4_K_S	36.77 GB	Download
llama-3-meerkat-70b-v1.0_Q4_K_S-00002-of-00002.gguf	GGUF	Q4_K_S	821.95 MB	Download
llama-3-meerkat-70b-v1.0_Q5_0-00001-of-00002.gguf	GGUF	Q5_0	37.14 GB	Download
llama-3-meerkat-70b-v1.0_Q5_0-00002-of-00002.gguf	GGUF	Q5_0	8.17 GB	Download
llama-3-meerkat-70b-v1.0_Q5_1-00001-of-00002.gguf	GGUF	Q5_1	37.20 GB	Download
llama-3-meerkat-70b-v1.0_Q5_1-00002-of-00002.gguf	GGUF	Q5_1	12.16 GB	Download
llama-3-meerkat-70b-v1.0_Q5_K-00001-of-00002.gguf	GGUF	Q5_K	37.14 GB	Download
llama-3-meerkat-70b-v1.0_Q5_K-00002-of-00002.gguf	GGUF	Q5_K	9.38 GB	Download
llama-3-meerkat-70b-v1.0_Q5_K_M-00001-of-00002.gguf	GGUF	Q5_K_M	37.14 GB	Download
llama-3-meerkat-70b-v1.0_Q5_K_M-00002-of-00002.gguf	GGUF	Q5_K_M	9.38 GB	Download
llama-3-meerkat-70b-v1.0_Q5_K_S-00001-of-00002.gguf	GGUF	Q5_K_S	37.14 GB	Download
llama-3-meerkat-70b-v1.0_Q5_K_S-00002-of-00002.gguf	GGUF	Q5_K_S	8.17 GB	Download
llama-3-meerkat-70b-v1.0_Q6_K-00001-of-00002.gguf	GGUF	Q6_K	37.13 GB	Download
llama-3-meerkat-70b-v1.0_Q6_K-00002-of-00002.gguf	GGUF	Q6_K	16.79 GB	Download
llama-3-meerkat-70b-v1.0_Q8_0-00001-of-00002.gguf	GGUF	Q8_0	37.07 GB	Download
llama-3-meerkat-70b-v1.0_Q8_0-00002-of-00002.gguf	GGUF	Q8_0	32.75 GB	Download
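
Several quantizations in the table are split into two parts, so the size of one quantization is the sum of its rows. The sketch below groups a few rows copied from the table by quantization name and adds up the parts. The regular expression, the helper names (`parse_size_gb`, `total_sizes_gb`), and the 1 GB = 1024 MB conversion are assumptions made for illustration; the page does not state which unit convention it uses.

```python
import re
from collections import defaultdict

# A few (filename, size) rows copied from the table above.
ROWS = [
    ("llama-3-meerkat-70b-v1.0.Q2_K.gguf", "24.56 GB"),
    ("llama-3-meerkat-70b-v1.0_IQ4_NL-00001-of-00002.gguf", "36.77 GB"),
    ("llama-3-meerkat-70b-v1.0_IQ4_NL-00002-of-00002.gguf", "821.95 MB"),
    ("llama-3-meerkat-70b-v1.0_Q4_K_M-00001-of-00002.gguf", "37.24 GB"),
    ("llama-3-meerkat-70b-v1.0_Q4_K_M-00002-of-00002.gguf", "2.36 GB"),
    ("llama-3-meerkat-70b-v1.0_Q8_0-00001-of-00002.gguf", "37.07 GB"),
    ("llama-3-meerkat-70b-v1.0_Q8_0-00002-of-00002.gguf", "32.75 GB"),
]

# Single files use ".<QUANT>.gguf"; split files use "_<QUANT>-NNNNN-of-NNNNN.gguf".
NAME_RE = re.compile(
    r"^llama-3-meerkat-70b-v1\.0[._](?P<quant>[A-Z0-9_]+)"
    r"(?:-\d{5}-of-\d{5})?\.gguf$"
)


def parse_size_gb(text: str) -> float:
    """Convert '37.24 GB' or '821.95 MB' to GB (assumes 1 GB = 1024 MB)."""
    value, unit = text.split()
    return float(value) / 1024 if unit == "MB" else float(value)


def total_sizes_gb(rows):
    """Sum the sizes of all parts that belong to the same quantization."""
    totals = defaultdict(float)
    for name, size in rows:
        match = NAME_RE.match(name)
        if match is None:
            raise ValueError(f"unrecognized file name: {name}")
        totals[match.group("quant")] += parse_size_gb(size)
    return dict(totals)


# Print the quantizations from smallest to largest total size.
for quant, size in sorted(total_sizes_gb(ROWS).items(), key=lambda kv: kv[1]):
    print(f"{quant:8s} {size:6.2f} GB")
```

With these rows, the two Q4_K_M parts sum to 39.60 GB, in line with the 39.6GB listed in the repository README. Other totals can differ from the README in the last digit because each part's size is already rounded. File size is only a lower bound on the memory needed to run a quantization.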

Model Details

Model Slug

richarderkhov/dmis-lab_-_llama-3-meerkat-70b-v1.0-gguf

Author

RichardErkhov

Pipeline Task

—

Library

—

Created

2024-09-07

Last Modified

2024-09-08

Gated

No

Private

No

HF SHA

6e284f19c16ceeea0cf01b58700d945c6cd5412e

License

Unknown for this repository (the original dmis-lab model card declares cc-by-nc-4.0)

Language

Unknown

Base Model

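Unknown in the repository metadata (the README names dmis-lab/llama-3-meerkat-70b-v1.0 as the original model)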
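
The model slug and commit SHA listed in this section are enough to build a download URL pinned to this exact revision. A minimal sketch, assuming the Hub's usual `https://huggingface.co/<repo>/resolve/<revision>/<file>` URL layout; the helper name `pinned_url` is hypothetical, and the repo id uses the capitalization that appears in the source payload.

```python
from urllib.parse import quote

REPO_ID = "RichardErkhov/dmis-lab_-_llama-3-meerkat-70b-v1.0-gguf"
REVISION = "6e284f19c16ceeea0cf01b58700d945c6cd5412e"  # HF SHA from this sheet


def pinned_url(filename: str, repo_id: str = REPO_ID, revision: str = REVISION) -> str:
    """Build a download URL for one file at a fixed commit (assumed URL layout)."""
    return (
        f"https://huggingface.co/{repo_id}/resolve/"
        f"{quote(revision, safe='')}/{quote(filename)}"
    )


print(pinned_url("llama-3-meerkat-70b-v1.0.Q2_K.gguf"))
```

Pinning to the SHA rather than `main` keeps downloads reproducible if the repository changes later. The `huggingface_hub` library's `hf_hub_download` function takes the same repo id, filename, and `revision` values. For the two-part quantizations, both parts are needed.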

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "frontmatter": {},
    "hero_image_url": "",
    "summary": "🚀 Meerkat-70B is a new instruction-tuned medical AI system of the Meerkat model family. The model was based on the Meta's Llama-3-70B-Instruct model and fine-tuned using our new synthetic dataset consisting of high-quality chain-of-thought reasoning paths sourced from 18 medical textbooks, along with diverse instruction-following datasets. This equips the model with high-level medical reasoning capabilities required for solving complex medical problems. For further insights into our model, please refer to our paper! 📄 **Paper**: Small Language Models Learn Enhanced Reasoning Skills from Medical Textbooks",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "Quantization made by Richard Erkhov.\n\n[Github](https://github.com/RichardErkhov)\n\n[Discord](https://discord.gg/pvy7H8DZMG)\n\n[Request more models](https://github.com/RichardErkhov/quant_request)\n\n\nllama-3-meerkat-70b-v1.0 - GGUF\n- Model creator: https://huggingface.co/dmis-lab/\n- Original model: https://huggingface.co/dmis-lab/llama-3-meerkat-70b-v1.0/\n\n\n| Name | Quant method | Size |\n| ---- | ---- | ---- |\n| [llama-3-meerkat-70b-v1.0.Q2_K.gguf](https://huggingface.co/RichardErkhov/dmis-lab_-_llama-3-meerkat-70b-v1.0-gguf/blob/main/llama-3-meerkat-70b-v1.0.Q2_K.gguf) | Q2_K | 24.56GB |\n| [llama-3-meerkat-70b-v1.0.IQ3_XS.gguf](https://huggingface.co/RichardErkhov/dmis-lab_-_llama-3-meerkat-70b-v1.0-gguf/blob/main/llama-3-meerkat-70b-v1.0.IQ3_XS.gguf) | IQ3_XS | 27.29GB |\n| [llama-3-meerkat-70b-v1.0.IQ3_S.gguf](https://huggingface.co/RichardErkhov/dmis-lab_-_llama-3-meerkat-70b-v1.0-gguf/blob/main/llama-3-meerkat-70b-v1.0.IQ3_S.gguf) | IQ3_S | 28.79GB |\n| [llama-3-meerkat-70b-v1.0.Q3_K_S.gguf](https://huggingface.co/RichardErkhov/dmis-lab_-_llama-3-meerkat-70b-v1.0-gguf/blob/main/llama-3-meerkat-70b-v1.0.Q3_K_S.gguf) | Q3_K_S | 28.79GB |\n| [llama-3-meerkat-70b-v1.0.IQ3_M.gguf](https://huggingface.co/RichardErkhov/dmis-lab_-_llama-3-meerkat-70b-v1.0-gguf/blob/main/llama-3-meerkat-70b-v1.0.IQ3_M.gguf) | IQ3_M | 29.74GB |\n| [llama-3-meerkat-70b-v1.0.Q3_K.gguf](https://huggingface.co/RichardErkhov/dmis-lab_-_llama-3-meerkat-70b-v1.0-gguf/blob/main/llama-3-meerkat-70b-v1.0.Q3_K.gguf) | Q3_K | 31.91GB |\n| [llama-3-meerkat-70b-v1.0.Q3_K_M.gguf](https://huggingface.co/RichardErkhov/dmis-lab_-_llama-3-meerkat-70b-v1.0-gguf/blob/main/llama-3-meerkat-70b-v1.0.Q3_K_M.gguf) | Q3_K_M | 31.91GB |\n| [llama-3-meerkat-70b-v1.0.Q3_K_L.gguf](https://huggingface.co/RichardErkhov/dmis-lab_-_llama-3-meerkat-70b-v1.0-gguf/blob/main/llama-3-meerkat-70b-v1.0.Q3_K_L.gguf) | Q3_K_L | 34.59GB |\n| 
[llama-3-meerkat-70b-v1.0.IQ4_XS.gguf](https://huggingface.co/RichardErkhov/dmis-lab_-_llama-3-meerkat-70b-v1.0-gguf/blob/main/llama-3-meerkat-70b-v1.0.IQ4_XS.gguf) | IQ4_XS | 35.64GB |\n| [llama-3-meerkat-70b-v1.0.Q4_0.gguf](https://huggingface.co/RichardErkhov/dmis-lab_-_llama-3-meerkat-70b-v1.0-gguf/blob/main/llama-3-meerkat-70b-v1.0.Q4_0.gguf) | Q4_0 | 37.22GB |\n| [llama-3-meerkat-70b-v1.0.IQ4_NL.gguf](https://huggingface.co/RichardErkhov/dmis-lab_-_llama-3-meerkat-70b-v1.0-gguf/tree/main/) | IQ4_NL | 37.58GB |\n| [llama-3-meerkat-70b-v1.0.Q4_K_S.gguf](https://huggingface.co/RichardErkhov/dmis-lab_-_llama-3-meerkat-70b-v1.0-gguf/tree/main/) | Q4_K_S | 37.58GB |\n| [llama-3-meerkat-70b-v1.0.Q4_K.gguf](https://huggingface.co/RichardErkhov/dmis-lab_-_llama-3-meerkat-70b-v1.0-gguf/tree/main/) | Q4_K | 39.6GB |\n| [llama-3-meerkat-70b-v1.0.Q4_K_M.gguf](https://huggingface.co/RichardErkhov/dmis-lab_-_llama-3-meerkat-70b-v1.0-gguf/tree/main/) | Q4_K_M | 39.6GB |\n| [llama-3-meerkat-70b-v1.0.Q4_1.gguf](https://huggingface.co/RichardErkhov/dmis-lab_-_llama-3-meerkat-70b-v1.0-gguf/tree/main/) | Q4_1 | 41.27GB |\n| [llama-3-meerkat-70b-v1.0.Q5_0.gguf](https://huggingface.co/RichardErkhov/dmis-lab_-_llama-3-meerkat-70b-v1.0-gguf/tree/main/) | Q5_0 | 45.32GB |\n| [llama-3-meerkat-70b-v1.0.Q5_K_S.gguf](https://huggingface.co/RichardErkhov/dmis-lab_-_llama-3-meerkat-70b-v1.0-gguf/tree/main/) | Q5_K_S | 45.32GB |\n| [llama-3-meerkat-70b-v1.0.Q5_K.gguf](https://huggingface.co/RichardErkhov/dmis-lab_-_llama-3-meerkat-70b-v1.0-gguf/tree/main/) | Q5_K | 46.52GB |\n| [llama-3-meerkat-70b-v1.0.Q5_K_M.gguf](https://huggingface.co/RichardErkhov/dmis-lab_-_llama-3-meerkat-70b-v1.0-gguf/tree/main/) | Q5_K_M | 46.52GB |\n| [llama-3-meerkat-70b-v1.0.Q5_1.gguf](https://huggingface.co/RichardErkhov/dmis-lab_-_llama-3-meerkat-70b-v1.0-gguf/tree/main/) | Q5_1 | 49.36GB |\n| 
[llama-3-meerkat-70b-v1.0.Q6_K.gguf](https://huggingface.co/RichardErkhov/dmis-lab_-_llama-3-meerkat-70b-v1.0-gguf/tree/main/) | Q6_K | 53.91GB |\n| [llama-3-meerkat-70b-v1.0.Q8_0.gguf](https://huggingface.co/RichardErkhov/dmis-lab_-_llama-3-meerkat-70b-v1.0-gguf/tree/main/) | Q8_0 | 69.83GB |\n\n\n\n\nOriginal model description:\n---\nlicense: cc-by-nc-4.0\npipeline_tag: text-generation\ntags:\n- medical\n- small LM\n- instruction-tuned\n- usmle\n- synthetic data\n---\n\n\n# Meerkat-70B (Version 1.0)\n\n🚀 Meerkat-70B is a new instruction-tuned medical AI system of the Meerkat model family.\nThe model was based on the Meta's Llama-3-70B-Instruct model and fine-tuned using our new synthetic dataset consisting of high-quality chain-of-thought reasoning paths sourced from 18 medical textbooks, along with diverse instruction-following datasets. \nThis equips the model with high-level medical reasoning capabilities required for solving complex medical problems.\nFor further insights into our model, please refer to our paper!\n\n📄 **Paper**: [Small Language Models Learn Enhanced Reasoning Skills from Medical Textbooks](https://arxiv.org/abs/2404.00376) \n\n\n## Quick Start\n\n```python\nfrom transformers import AutoTokenizer, AutoModelForCausalLM\nimport torch\n\nmodel_id = \"dmis-lab/llama-3-meerkat-70b-v1.0\"\n\ntokenizer = AutoTokenizer.from_pretrained(model_id)\nmodel = AutoModelForCausalLM.from_pretrained(\n    model_id,\n    torch_dtype=torch.bfloat16,  # You can choose to use this when there's not enough GPU memory available.\n    device_map=\"auto\",\n)\n\n# Multi-turn dialogue example\nmessages =[\n    {\"role\": \"system\", \"content\": \"You are a helpful doctor or healthcare professional. Guide the conversation to provide useful, complete, and scientifically-grounded answers to user questions. You have the option to compose a concise, single-turn conversation if the user's input is comprehensive to provide accurate answers. 
However, if essential details are missing, you should engage in a multi-turn dialogue, asking follow-up questions to gather a thorough medical history and records.\\n\\n\"},\n    {\"role\": \"user\", \"content\": \"Hello, doctor. I'm really concerned about my 10-year-old son. We recently discovered a painless mass in his left testicle, so we brought him to the pediatrician.\"},\n    {\"role\": \"assistant\", \"content\": \"I understand your concern. Let's gather some more information. Has your son experienced any other symptoms along with the mass?\"},\n    {\"role\": \"user\", \"content\": \"Other than the mass, my son hasn't shown any symptoms. He's been his usual self, playing and eating normally.\"}\n]\n\ninput_ids = tokenizer.apply_chat_template(\n    messages,\n    add_generation_prompt=True,\n    return_tensors=\"pt\"\n).to(model.device)\n\nterminators = [\n    tokenizer.eos_token_id,\n    tokenizer.convert_tokens_to_ids(\"<|eot_id|>\")\n]\n\noutputs = model.generate(\n    input_ids,\n    max_new_tokens=1000,\n    eos_token_id=terminators,\n    do_sample=True,\n    temperature=0.7,\n)\nresponse = outputs[0][input_ids.shape[-1]:]\nprint(tokenizer.decode(response, skip_special_tokens=True))\n```\n\n## Prompt Details\n\nTo reproduce the results reported in our paper, it is advisable to utilize the identical system messages used during model training. Please refer to the guidelines detailed below.\n\n### USMLE\n\nWhen solving USMLE-style questions such as [MedQA](https://arxiv.org/abs/2009.13081) and [MedBullets](https://arxiv.org/abs/2402.18060), use the following system message:\n```\nmessages = [\n    {\"role\": \"system\", \"content\": \"The following is a multiple-choice question about medical knowledge. Solve this in a step-by-step fashion, starting by summarizing the available information. Output a single option from the given options as the final answer. 
You are strongly required to follow the specified output format; conclude your response with the phrase \\\"the answer is ([option_id]) [answer_string]\\\".\\n\\n\"},\n    {\"role\": \"user\", \"content\": \"Two weeks after undergoing an emergency cardiac catherization with stenting for unstable angina pectoris, a 61-year-old man has decreased urinary output and malaise. He has type 2 diabetes mellitus and osteoarthritis of the hips. Prior to admission, his medications were insulin and naproxen. He was also started on aspirin, clopidogrel, and metoprolol after the coronary intervention. His temperature is 38\\u00b0C (100.4\\u00b0F), pulse is 93/min, and blood pressure is 125/85 mm Hg. Examination shows mottled, reticulated purplish discoloration of the feet. Laboratory studies show:\\nHemoglobin count 14 g/dL\\nLeukocyte count 16,400/mm3\\nSegmented neutrophils 56%\\nEosinophils 11%\\nLymphocytes 31%\\nMonocytes 2%\\nPlatelet count 260,000/mm3\\nErythrocyte sedimentation rate 68 mm/h\\nSerum\\nUrea nitrogen 25 mg/dL\\nCreatinine 4.2 mg/dL\\nRenal biopsy shows intravascular spindle-shaped vacuoles. Which of the following is the most likely cause of this patient's symptoms?\\\" (A) Renal papillary necrosis (B) Cholesterol embolization (C) Eosinophilic granulomatosis with polyangiitis (D) Polyarteritis nodosa\"},\n]\n```\nThe model generates reasoning paths to solve the problem and then sequentially provides the predicted answers. 
\nSince the model ends its response with \"the answer is,\"  it is straightforward to extract the predicted answer for comparison with the actual answer.\n\n### Multiple-choice Exams\n\nFor other types of multiple-choice exams such as [MedMCQA](https://arxiv.org/abs/2203.14371) or [MMLU](https://arxiv.org/abs/2009.03300), use the following simple system message:\n```\nmessages = [\n    {\"role\": \"system\", \"content\": \"Answer the multiple-choice question about medical knowledge.\\n\\n\"},\n    {\"role\": \"user\", \"content\": \"In a Robertsonian translocation fusion occurs at the: (A) telomeres. (B) centromeres. (C) histones. (D) ends of the long arms.\"},\n]\n```\n\n### Other Use Cases\nOur model was trained using the [AlpaCare](https://github.com/xzhang97666/alpacare) instruction dataset comprising 52K examples, to enhance its generalization capabilities across diverse user prompts. \nFeel free to design and test your prompts and to share your thoughts with us, whether the model exceeds expectations or falls short!\n\n\n## Reproducing MedQA Performance with vLLM\n\nHere is an example code for fast model evaluation in MedQA using vLLM. To adapt this code for other datasets like MedMCQA or MMLU, simply modify the instructions and update the dataset paths as needed.\n```python\n# export CUDA_VISIBLE_DEVICES=0,1\n\nimport re\nfrom datasets import load_dataset\nfrom vllm import LLM, SamplingParams\nUSMLE_INSTRUCTION = (\n    \"The following is a multiple-choice question about medical knowledge. Solve this in\"\n    \" a step-by-step fashion, starting by summarizing the available information. Output\"\n    \" a single option from the given options as the final answer. 
You are strongly\"\n    \" required to follow the specified output format; conclude your response with the\"\n    ' phrase \"the answer is ([option_id]) [answer_string]\".\\n\\n'\n)\nllm = LLM(\n    model=\"dmis-lab/llama-3-meerkat-70b-v1.0\",\n    dtype=\"bfloat16\",\n    gpu_memory_utilization=0.9,\n    max_model_len=2048,\n    trust_remote_code=True,\n    tensor_parallel_size=2\n)\n\ntokenizer = llm.get_tokenizer()\n\ninputs, labels = [], []\nfor sample in load_dataset(\n    \"GBaker/MedQA-USMLE-4-options\", split=\"test\", trust_remote_code=True\n):\n    options = sorted(sample[\"options\"].items())\n    options = \" \".join(map(lambda x: f\"({x[0]}) {x[1]}\", options))\n    content = tokenizer.apply_chat_template(\n        [{\"role\": \"system\", \"content\": USMLE_INSTRUCTION}, {\"role\": \"user\", \"content\": sample[\"question\"] + \" \" + options}],\n        add_generation_prompt=True,\n        tokenize=False,\n    )\n    inputs.append(content)\n    labels.append(sample[\"answer_idx\"])\n\ngenerated = llm.generate(\n    inputs,\n    SamplingParams(\n        temperature=0.0,\n        stop_token_ids=[tokenizer.vocab[\"<|eot_id|>\"]],\n        max_tokens=1024,\n    ),\n)\ndef extract_answer(text: str, options: str = \"ABCD\") -> str:\n    return (re.findall(rf\"he answer is \\(([{options}])\\)\", text) or [options[0]])[-1]\n\ncorrectness = []\n\nfor g, l in zip(generated, labels):\n    correctness.append(extract_answer(g.outputs[0].text) == l)\n\nprint(sum(correctness) / len(correctness))\n```\n\n\n## Evaluation\n\nWe tested models on seven medical benchmarks: [MedQA](https://arxiv.org/abs/2009.13081), [USMLE sample test](https://www.usmle.org/prepare-your-exam), [Medbullets-4](https://arxiv.org/abs/2402.18060), [Medbullets-5](https://arxiv.org/abs/2402.18060) , [MedMCQA](https://arxiv.org/abs/2203.14371), [MMLU-Medical](https://arxiv.org/abs/2009.03300), and [JAMA Clinical Challenge](https://arxiv.org/abs/2402.18060).\n\n| **Model**                       | 
**Average** | **MedQA** | **USMLE** | **Medbullets-4** | **Medbullets-5** | **MedMCQA** | **MMLU-Medical** |\n|:--------------------------------|:-----------:|:---------:|:---------:|:----------------:|:----------------:|:-----------:|:----------------:|\n| GPT-4                           | 76.6        | 81.4      | 86.6      | 68.8             | 63.3             | 72.4        | **87.1**         |\n| GPT-3.5                         | 54.8        | 53.6      | 58.5      | 51.0             | 47.4             | 51.0        | 67.3             |\n| MediTron-70B (Ensemble, 5 runs) | -           | 70.2      | -         | -                | -                | 66.0        | 78.0             |\n| MediTron-7B                     | 51.0        | 50.2      | 44.6      | 51.1             | 45.5             | 57.9        | 56.7             |\n| BioMistral-7B                   | 55.4        | 54.3      | 51.4      | 52.3             | 48.7             | 61.1        | 64.6             |\n| Meerkat-7B                      | 62.6        | 70.6      | 70.3      | 58.7             | 52.9             | 60.6        | 70.5             |\n| Meerkat-8B (**New**)            | 67.3        | 74.0      | 74.2      | 62.3             | 55.5             | 62.7        | 75.2             |\n| Meerkat-70B (**New**)           | **77.9**    | **82.6**  | **87.4**  | **71.4**         | **65.3**         | **73.9**    | 86.9             |\n\nPlease note that the scores in MMLU-Medical were calculated based on the average accuracies across six medical-related subjects in the original MMLU benchmark, and each result for a single subject is presented below.\n\n| **Model**                       | **Average** | **Cliniq Knowledge** | **Medical Genetics** | **Anatomy** | **Professional Medicine** | **College Biology** | **College Medicine** 
|\n|:--------------------------------|:-----------:|:--------------------:|:--------------------:|:-----------:|:-------------------------:|:-------------------:|:--------------------:|\n| GPT-4                           | **87.1**    | 86.4                 | **92.0**             | 80.0        | **93.8**                  | **93.8**            | 76.3                 |\n| GPT-3.5                         | 67.3        | 68.7                 | 68.0                 | 60.7        | 69.9                      | 72.9                | 63.6                 |\n| MediTron-70B (Ensemble, 5 runs) | 78.0        | 75.5                 | 85.9                 | 69.4        | 82.3                      | 86.7                | 68.0                 |\n| MediTron-7B                     | 56.7        | 57.7                 | 63.8                 | 56.9        | 56.0                      | 57.1                | 48.9                 |\n| BioMistral-7B                   | 64.6        | 59.9                 | 64.0                 | 56.5        | 60.4                      | 59.0                | 54.7                 |\n| Meerkat-7B                      | 70.5        | 71.6                 | 74.8                 | 63.2        | 77.3                      | 70.8                | 65.2                 |\n| Meerkat-8B (**New**)            | 75.2        | 74.3                 | 76.7                 | 74.8        | 75.3                      | 76.1                | 74.3                 |\n| Meerkat-70B (**New**)           | 86.9        | **87.2**             | 88.2                 | **84.4**    | 87.2                      | 87.9                | **86.6**             |\n\n\n## Reference\n\nPlease see the information below to cite our paper.\n```bibtex\n@article{kim2024small,\n  title={Small language models learn enhanced reasoning skills from medical textbooks},\n  author={Kim, Hyunjae and Hwang, Hyeon and Lee, Jiwoo and Park, Sihyeon and Kim, Dain and Lee, Taewhoo and Yoon, Chanwoong and Sohn, Jiwoong 
and Choi, Donghee and Kang, Jaewoo},\n  journal={arXiv preprint arXiv:2404.00376},\n  year={2024}\n}\n```\n\n## Acknowledgement\n\nResearch supported with Cloud TPUs from Google’s TPU Research Cloud (TRC).\n\n## Contact\n\nFeel free to email `hyunjae-kim@korea.ac.kr` if you have any questions.\n\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "arxiv:2404.00376",
    "arxiv:2009.13081",
    "arxiv:2402.18060",
    "arxiv:2203.14371",
    "arxiv:2009.03300",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 0,
  "downloads": 1898,
  "gated": false,
  "private": false,
  "last_modified": "2024-09-08T04:46:54.000Z",
  "created_at": "2024-09-07T07:30:13.000Z",
  "pipeline_tag": "",
  "library_name": ""
}
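
The `tags` array mixes plain labels with paper references in the form `arxiv:<id>`. A small sketch of separating the two and turning the IDs into abstract links; the helper name `split_tags` is hypothetical, and the `https://arxiv.org/abs/<id>` pattern is the one used by the links inside `readme_markdown`.

```python
# Tag list copied from the normalized metadata above.
TAGS = [
    "gguf",
    "arxiv:2404.00376",
    "arxiv:2009.13081",
    "arxiv:2402.18060",
    "arxiv:2203.14371",
    "arxiv:2009.03300",
    "endpoints_compatible",
    "region:us",
    "conversational",
]


def split_tags(tags):
    """Return (plain_labels, arxiv_urls) from a Hub-style tag list."""
    labels, papers = [], []
    for tag in tags:
        prefix, _, value = tag.partition(":")
        if prefix == "arxiv" and value:
            papers.append(f"https://arxiv.org/abs/{value}")
        else:
            # Other prefixed tags such as "region:us" stay as plain labels.
            labels.append(tag)
    return labels, papers


labels, papers = split_tags(TAGS)
print(labels)
print(papers)
```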

Source payload excerpt (from Hugging Face API)

{
  "_id": "66dc010521f901b0499adc0c",
  "id": "RichardErkhov/dmis-lab_-_llama-3-meerkat-70b-v1.0-gguf",
  "modelId": "RichardErkhov/dmis-lab_-_llama-3-meerkat-70b-v1.0-gguf",
  "sha": "6e284f19c16ceeea0cf01b58700d945c6cd5412e",
  "createdAt": "2024-09-07T07:30:13.000Z",
  "lastModified": "2024-09-08T04:46:54.000Z",
  "author": "RichardErkhov",
  "downloads": 1898,
  "likes": 0,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 36
}
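
The payload timestamps are ISO 8601 strings in UTC with millisecond precision and a trailing `Z`. A minimal sketch of parsing them with the standard library and measuring the time between creation and the last modification; the helper name `parse_hub_timestamp` is hypothetical.

```python
from datetime import datetime, timezone


def parse_hub_timestamp(text: str) -> datetime:
    """Parse a string such as '2024-09-07T07:30:13.000Z' into an aware UTC datetime."""
    naive = datetime.strptime(text, "%Y-%m-%dT%H:%M:%S.%fZ")
    return naive.replace(tzinfo=timezone.utc)


created = parse_hub_timestamp("2024-09-07T07:30:13.000Z")   # createdAt
modified = parse_hub_timestamp("2024-09-08T04:46:54.000Z")  # lastModified

# Elapsed time between repository creation and the last recorded change.
print(modified - created)
```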