brittlewis12/llama-3.1-tulu-3.1-8b-gguf Q6_K GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

brittlewis12/llama-3.1-tulu-3.1-8b-gguf overview

Original model: Tülu 3.1 8B Model creator: allenai Version 3.1 update: The new version of our Tülu model is from an improvement only in the final RL stage of training. We switched from PPO to GRPO (no reward model) and did further hyperparameter tuning to achieve substantial performance improvements across the board over the original Tülu 3 8B model, as shown in the comparison below: Tülu 3 is a leading instruction following model family, offering a post-training package with fully open-source data, code, and recipes designed to serve as a comprehensive guide for modern techniques. This is one step of a bigger process to training fully open-source models, like our OLMo models. Tülu 3 is designed for state-of-the-art performance on a diversity of tasks in addition to chat, such as MATH, GSM8K, and IFEval. This repo contains GGUF format model files for the Allen Institute for AI’s Tülu 3.1 8B. ### What is GGUF? GGUF is a file format for representing AI models. It is the third version of the format, introduced by the llama.cpp team on August 21st 2023. It is a replacement for GGML, which is no longer supported by llama.cpp. Converted with llama.cpp build 4699 (revision 31afcbe), using autogguf-rs. ### Prompt template ---

gguftext-generationendataset:allenai/RLVR-GSM-MATH-IF-Mixed-Constraintsbase_model:allenai/Llama-3.1-Tulu-3.1-8Bbase_model:quantized:allenai/Llama-3.1-Tulu-3.1-8Blicense:llama3.1endpoints_compatibleregion:usconversational

brittlewis12/llama-3.1-tulu-3.1-8b-gguf visual

Downloads

150

Likes

Pipeline

text-generation

Library

—

Visibility

Public

Access

Open

Repository Files & Downloads

28 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
llama-3.1-tulu-3.1-8b.IQ1_M.gguf	GGUF	IQ1_M	2.01 GB	Download
llama-3.1-tulu-3.1-8b.IQ1_S.gguf	GGUF	IQ1_S	1.88 GB	Download
llama-3.1-tulu-3.1-8b.IQ2_M.gguf	GGUF	IQ2_M	2.75 GB	Download
llama-3.1-tulu-3.1-8b.IQ2_S.gguf	GGUF	IQ2_S	2.57 GB	Download
llama-3.1-tulu-3.1-8b.IQ2_XS.gguf	GGUF	IQ2_XS	2.43 GB	Download
llama-3.1-tulu-3.1-8b.IQ2_XXS.gguf	GGUF	IQ2_XXS	2.23 GB	Download
llama-3.1-tulu-3.1-8b.IQ3_M.gguf	GGUF	IQ3_M	3.52 GB	Download
llama-3.1-tulu-3.1-8b.IQ3_S.gguf	GGUF	IQ3_S	3.43 GB	Download
llama-3.1-tulu-3.1-8b.IQ3_XS.gguf	GGUF	IQ3_XS	3.28 GB	Download
llama-3.1-tulu-3.1-8b.IQ3_XXS.gguf	GGUF	IQ3_XXS	3.05 GB	Download
llama-3.1-tulu-3.1-8b.IQ4_NL.gguf	GGUF	IQ4_NL	4.36 GB	Download
llama-3.1-tulu-3.1-8b.IQ4_XS.gguf	GGUF	IQ4_XS	4.14 GB	Download
llama-3.1-tulu-3.1-8b.Q2_K.gguf	GGUF	Q2_K	2.96 GB	Download
llama-3.1-tulu-3.1-8b.Q2_K_S.gguf	GGUF	Q2_K_S	2.78 GB	Download
llama-3.1-tulu-3.1-8b.Q3_K_L.gguf	GGUF	Q3_K_L	4.03 GB	Download
llama-3.1-tulu-3.1-8b.Q3_K_M.gguf	GGUF	Q3_K_M	3.74 GB	Download
llama-3.1-tulu-3.1-8b.Q3_K_S.gguf	GGUF	Q3_K_S	3.41 GB	Download
llama-3.1-tulu-3.1-8b.Q4_0.gguf	GGUF	—	4.34 GB	Download
llama-3.1-tulu-3.1-8b.Q4_1.gguf	GGUF	—	4.78 GB	Download
llama-3.1-tulu-3.1-8b.Q4_K_M.gguf	GGUF	Q4_K_M	4.58 GB	Download
llama-3.1-tulu-3.1-8b.Q4_K_S.gguf	GGUF	Q4_K_S	4.37 GB	Download
llama-3.1-tulu-3.1-8b.Q5_0.gguf	GGUF	—	5.21 GB	Download
llama-3.1-tulu-3.1-8b.Q5_1.gguf	GGUF	—	5.65 GB	Download
llama-3.1-tulu-3.1-8b.Q5_K_M.gguf	GGUF	Q5_K_M	5.34 GB	Download
llama-3.1-tulu-3.1-8b.Q5_K_S.gguf	GGUF	Q5_K_S	5.21 GB	Download
llama-3.1-tulu-3.1-8b.Q6_K.gguf	GGUF	Q6_K	6.14 GB	Download
llama-3.1-tulu-3.1-8b.Q8_0.gguf	GGUF	—	7.95 GB	Download
llama-3.1-tulu-3.1-8b.f16.gguf	GGUF	F16	14.97 GB	Download

Model Details Live

Model Slug

brittlewis12/llama-3.1-tulu-3.1-8b-gguf

Author

brittlewis12

Pipeline Task

text-generation

Library

—

Created

2025-02-12

Last Modified

2025-02-15

Gated

Private

HF SHA

79835e9a7a46b3c896e8dcf2f29da171fa2eb522

License

llama3.1

Language

Base Model

allenai/Llama-3.1-Tulu-3.1-8B

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "base_model": "allenai/Llama-3.1-Tulu-3.1-8B",
    "pipeline_tag": "text-generation",
    "inference": true,
    "language": [
      "en"
    ],
    "license": "llama3.1",
    "model_creator": "allenai",
    "model_name": "Llama-3.1-Tulu-3.1-8B",
    "model_type": "llama",
    "datasets": [
      "allenai/RLVR-GSM-MATH-IF-Mixed-Constraints"
    ],
    "quantized_by": "brittlewis12",
    "frontmatter": {
      "base_model": "allenai/Llama-3.1-Tulu-3.1-8B",
      "pipeline_tag": "text-generation",
      "inference": "true",
      "language": [
        "en"
      ],
      "license": "llama3.1",
      "model_creator": "allenai",
      "model_name": "Llama-3.1-Tulu-3.1-8B",
      "model_type": "llama",
      "datasets": [
        "allenai/RLVR-GSM-MATH-IF-Mixed-Constraints"
      ],
      "quantized_by": "brittlewis12"
    },
    "hero_image_url": "https://pbs.twimg.com/profile_images/1744049151241797632/0mIP-P9e_400x400.jpg",
    "summary": "**Original model**: Tülu 3.1 8B **Model creator**: allenai > **Version 3.1 update**: The new version of our Tülu model is from an improvement only in the final RL stage of training. We switched from PPO to GRPO (no reward model) and did further hyperparameter tuning to achieve substantial performance improvements across the board over the original Tülu 3 8B model, as shown in the comparison below: > Tülu 3 is a leading instruction following model family, offering a post-training package with fully open-source data, code, and recipes designed to serve as a comprehensive guide for modern techniques. This is one step of a bigger process to training fully open-source models, like our OLMo models. Tülu 3 is designed for state-of-the-art performance on a diversity of tasks in addition to chat, such as MATH, GSM8K, and IFEval. This repo contains GGUF format model files for the Allen Institute for AI’s Tülu 3.1 8B. ### What is GGUF? GGUF is a file format for representing AI models. It is the third version of the format, introduced by the llama.cpp team on August 21st 2023. It is a replacement for GGML, which is no longer supported by llama.cpp. Converted with llama.cpp build 4699 (revision 31afcbe), using autogguf-rs. ### Prompt template ``  {{system_message}}  {{prompt}}  {{assistant_message}} `` ---",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nbase_model: allenai/Llama-3.1-Tulu-3.1-8B\npipeline_tag: text-generation\ninference: true\nlanguage:\n- en\nlicense: llama3.1\nmodel_creator: allenai\nmodel_name: Llama-3.1-Tulu-3.1-8B\nmodel_type: llama\ndatasets:\n  - allenai/RLVR-GSM-MATH-IF-Mixed-Constraints\nquantized_by: brittlewis12\n\n---\n\n# Tülu 3.1 8B GGUF\n\n**Original model**: [Tülu 3.1 8B](https://huggingface.co/allenai/Llama-3.1-Tulu-3.1-8B)\n\n**Model creator**: [allenai](https://huggingface.co/allenai)\n \n> **Version 3.1 update**: The new version of our Tülu model is from an improvement only in the final RL stage of training. We switched from PPO to GRPO (no reward model) and did further hyperparameter tuning to achieve substantial performance improvements across the board over the original Tülu 3 8B model, as shown in the comparison below:\n\n> Tülu 3 is a leading instruction following model family, offering a post-training package with fully open-source data, code, and recipes designed to serve as a comprehensive guide for modern techniques. This is one step of a bigger process to training fully open-source models, like our OLMo models. Tülu 3 is designed for state-of-the-art performance on a diversity of tasks in addition to chat, such as MATH, GSM8K, and IFEval.\n\n\nThis repo contains GGUF format model files for the Allen Institute for AI’s Tülu 3.1 8B.\n\n### What is GGUF?\n\nGGUF is a file format for representing AI models. It is the third version of the format, \nintroduced by the llama.cpp team on August 21st 2023. It is a replacement for GGML, which is no longer supported by llama.cpp. \nConverted with llama.cpp build 4699 (revision [31afcbe](https://github.com/ggerganov/llama.cpp/commits/31afcbee0eebbc998ab311aa314e389d60aa2127)),\nusing [autogguf-rs](https://github.com/brittlewis12/autogguf-rs).\n\n### Prompt template\n\n```\n<|system|>\n{{system_message}}\n<|user|>\n{{prompt}}\n<|assistant|>\n{{assistant_message}}<|end_of_text|>\n\n```\n\n---\n\n## Download & run with [cnvrs](https://twitter.com/cnvrsai) on iPhone, iPad, and Mac!\n\n![cnvrs.ai](https://pbs.twimg.com/profile_images/1744049151241797632/0mIP-P9e_400x400.jpg)\n\n[cnvrs](https://testflight.apple.com/join/sFWReS7K) is the best app for private, local AI on your device:\n- create & save **Characters** with custom system prompts & temperature settings\n- download and experiment with any **GGUF model** you can [find on HuggingFace](https://huggingface.co/models?library=gguf)!\n    * or, use an API key with the chat completions-compatible model provider of your choice -- ChatGPT, Claude, Gemini, DeepSeek, & more!\n- make it your own with custom **Theme colors**\n- powered by Metal ⚡️ & [Llama.cpp](https://github.com/ggerganov/llama.cpp), with **haptics** during response streaming!\n- **try it out** yourself today, on [Testflight](https://testflight.apple.com/join/sFWReS7K)!\n- follow [cnvrs on twitter](https://twitter.com/cnvrsai) to stay up to date\n\n---\n\n## Original Model Evaluation\n\n| Benchmark (eval)                | Tülu 3 SFT 8B | Tülu 3 DPO 8B | Tülu 3 8B | **Tülu 3.1 8B (NEW)** | Llama 3.1 8B Instruct | Qwen 2.5 7B Instruct | Magpie 8B | Gemma 2 9B Instruct | Ministral 8B Instruct |\n|---------------------------------|--------------|--------------|-----------|------------|------------------------|----------------------|-----------|---------------------|-----------------------|\n| **Avg.**                        | 60.4         | 64.4         | 64.8  | 66.3       | 62.2                  | **66.5**            | 44.7      | 55.2               | 58.3                 |\n| **MMLU (0 shot, CoT)**          | 65.9         | 68.7         | 68.2      | 69.5       | 71.2                  | **76.6**            | 62.0      | 74.6               | 68.5                 |\n| **PopQA (15 shot)**             | **29.3**     | 29.3         | 29.1      | 30.2       | 20.2                  | 18.1                | 22.5      | 28.3               | 20.2                 |\n| **TruthfulQA (6 shot)**         | 46.8         | 56.1         | 55.0      | 59.9       | 55.1                  | **63.1**            | 57.0      | 61.4               | 55.5                 |\n| **BigBenchHard (3 shot, CoT)**  | **67.9**     | 65.8         | 66.0      | 68.9       | 62.8                  | 70.2                | 0.9       | 2.5                | 56.2                 |\n| **DROP (3 shot)**               | 61.3         | 62.5         | 62.6  | **63.9**   | 61.5                  | 54.4                | 49.4      | 58.8               | 56.2                 |\n| **MATH (4 shot CoT, Flex)**     | 31.5         | 42.0         |43.7  | 47.8       | 42.5                  | **69.9**            | 5.1       | 29.8               | 40.0                 |\n| **GSM8K (8 shot, CoT)**         | 76.2         | 84.3         | 87.6  | **90.0**   | 83.4                  | 83.8                | 61.2      | 79.7               | 80.0                 |\n| **HumanEval (pass@10)**         | 86.2         | 83.9         | 83.9      | 84.8       | 86.3                  | **93.1**            | 75.4      | 71.7               | 91.0                 |\n| **HumanEval+ (pass@10)**        | 81.4         | 78.6         | 79.2      | 80.4       | 82.9                  | **89.7**            | 69.1      | 67.0               | 88.5                 |\n| **IFEval (prompt loose)**       | 72.8         | 81.1         | 82.4  | **83.9**   | 80.6                  | 74.7                | 38.8      | 69.9               | 56.4                 |\n| **AlpacaEval 2 (LC % win)**     | 12.4         | 33.5         | 34.5      | 34.9       | 24.2                  | 29.0                | **49.0**  | 43.7               | 31.4                 |\n| **Safety (6 task avg.)**        | **93.1**     | 87.2         | 85.5      | 81.2       | 75.2                  | 75.0                | 46.4      | 75.5               | 56.2                 |\n\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "text-generation",
    "en",
    "dataset:allenai/RLVR-GSM-MATH-IF-Mixed-Constraints",
    "base_model:allenai/Llama-3.1-Tulu-3.1-8B",
    "base_model:quantized:allenai/Llama-3.1-Tulu-3.1-8B",
    "license:llama3.1",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 3,
  "downloads": 150,
  "gated": false,
  "private": false,
  "last_modified": "2025-02-15T05:30:24.000Z",
  "created_at": "2025-02-12T22:58:47.000Z",
  "pipeline_tag": "text-generation",
  "library_name": ""
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "67ad27a7e9e63551553e5d68",
  "id": "brittlewis12/Llama-3.1-Tulu-3.1-8B-GGUF",
  "modelId": "brittlewis12/Llama-3.1-Tulu-3.1-8B-GGUF",
  "sha": "79835e9a7a46b3c896e8dcf2f29da171fa2eb522",
  "createdAt": "2025-02-12T22:58:47.000Z",
  "lastModified": "2025-02-15T05:30:24.000Z",
  "author": "brittlewis12",
  "downloads": 150,
  "likes": 3,
  "gated": false,
  "private": false,
  "pipeline_tag": "text-generation",
  "library_name": "",
  "siblings_count": 31
}

brittlewis12/llama-3.1-tulu-3.1-8b-gguf overview

Repository Files & Downloads

Model Details Live

Metadata Inspector

More models in this shard