GraySoft
Projects Models About FAQ Contact Download guIDE →

brittlewis12/llama-3.1-tulu-3.1-8b-gguf Q6_K GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

brittlewis12/llama-3.1-tulu-3.1-8b-gguf overview

Original model: Tülu 3.1 8B Model creator: allenai Version 3.1 update: The new version of our Tülu model is from an improvement only in the final RL stage of training. We switched from PPO to GRPO (no reward model) and did further hyperparameter tuning to achieve substantial performance improvements across the board over the original Tülu 3 8B model, as shown in the comparison below: Tülu 3 is a leading instruction following model family, offering a post-training package with fully open-source data, code, and recipes designed to serve as a comprehensive guide for modern techniques. This is one step of a bigger process to training fully open-source models, like our OLMo models. Tülu 3 is designed for state-of-the-art performance on a diversity of tasks in addition to chat, such as MATH, GSM8K, and IFEval. This repo contains GGUF format model files for the Allen Institute for AI’s Tülu 3.1 8B. ### What is GGUF? GGUF is a file format for representing AI models. It is the third version of the format, introduced by the llama.cpp team on August 21st 2023. It is a replacement for GGML, which is no longer supported by llama.cpp. Converted with llama.cpp build 4699 (revision 31afcbe), using autogguf-rs. ### Prompt template ---

gguftext-generationendataset:allenai/RLVR-GSM-MATH-IF-Mixed-Constraintsbase_model:allenai/Llama-3.1-Tulu-3.1-8Bbase_model:quantized:allenai/Llama-3.1-Tulu-3.1-8Blicense:llama3.1endpoints_compatibleregion:usconversational
brittlewis12/llama-3.1-tulu-3.1-8b-gguf visual
Downloads
150
Likes
3
Pipeline
text-generation
Library
Visibility
Public
Access
Open

Repository Files & Downloads

28 files detected
Direct downloads for all repository files
FileTypeQuantizationSizeLink
llama-3.1-tulu-3.1-8b.IQ1_M.gguf GGUF IQ1_M 2.01 GB Download
llama-3.1-tulu-3.1-8b.IQ1_S.gguf GGUF IQ1_S 1.88 GB Download
llama-3.1-tulu-3.1-8b.IQ2_M.gguf GGUF IQ2_M 2.75 GB Download
llama-3.1-tulu-3.1-8b.IQ2_S.gguf GGUF IQ2_S 2.57 GB Download
llama-3.1-tulu-3.1-8b.IQ2_XS.gguf GGUF IQ2_XS 2.43 GB Download
llama-3.1-tulu-3.1-8b.IQ2_XXS.gguf GGUF IQ2_XXS 2.23 GB Download
llama-3.1-tulu-3.1-8b.IQ3_M.gguf GGUF IQ3_M 3.52 GB Download
llama-3.1-tulu-3.1-8b.IQ3_S.gguf GGUF IQ3_S 3.43 GB Download
llama-3.1-tulu-3.1-8b.IQ3_XS.gguf GGUF IQ3_XS 3.28 GB Download
llama-3.1-tulu-3.1-8b.IQ3_XXS.gguf GGUF IQ3_XXS 3.05 GB Download
llama-3.1-tulu-3.1-8b.IQ4_NL.gguf GGUF IQ4_NL 4.36 GB Download
llama-3.1-tulu-3.1-8b.IQ4_XS.gguf GGUF IQ4_XS 4.14 GB Download
llama-3.1-tulu-3.1-8b.Q2_K.gguf GGUF Q2_K 2.96 GB Download
llama-3.1-tulu-3.1-8b.Q2_K_S.gguf GGUF Q2_K_S 2.78 GB Download
llama-3.1-tulu-3.1-8b.Q3_K_L.gguf GGUF Q3_K_L 4.03 GB Download
llama-3.1-tulu-3.1-8b.Q3_K_M.gguf GGUF Q3_K_M 3.74 GB Download
llama-3.1-tulu-3.1-8b.Q3_K_S.gguf GGUF Q3_K_S 3.41 GB Download
llama-3.1-tulu-3.1-8b.Q4_0.gguf GGUF 4.34 GB Download
llama-3.1-tulu-3.1-8b.Q4_1.gguf GGUF 4.78 GB Download
llama-3.1-tulu-3.1-8b.Q4_K_M.gguf GGUF Q4_K_M 4.58 GB Download
llama-3.1-tulu-3.1-8b.Q4_K_S.gguf GGUF Q4_K_S 4.37 GB Download
llama-3.1-tulu-3.1-8b.Q5_0.gguf GGUF 5.21 GB Download
llama-3.1-tulu-3.1-8b.Q5_1.gguf GGUF 5.65 GB Download
llama-3.1-tulu-3.1-8b.Q5_K_M.gguf GGUF Q5_K_M 5.34 GB Download
llama-3.1-tulu-3.1-8b.Q5_K_S.gguf GGUF Q5_K_S 5.21 GB Download
llama-3.1-tulu-3.1-8b.Q6_K.gguf GGUF Q6_K 6.14 GB Download
llama-3.1-tulu-3.1-8b.Q8_0.gguf GGUF 7.95 GB Download
llama-3.1-tulu-3.1-8b.f16.gguf GGUF F16 14.97 GB Download

Model Details Live

Model Slug
brittlewis12/llama-3.1-tulu-3.1-8b-gguf
Author
brittlewis12
Pipeline Task
text-generation
Library
Created
2025-02-12
Last Modified
2025-02-15
Gated
No
Private
No
HF SHA
79835e9a7a46b3c896e8dcf2f29da171fa2eb522
License
llama3.1
Language
en
Base Model
allenai/Llama-3.1-Tulu-3.1-8B

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "base_model": "allenai/Llama-3.1-Tulu-3.1-8B",
    "pipeline_tag": "text-generation",
    "inference": true,
    "language": [
      "en"
    ],
    "license": "llama3.1",
    "model_creator": "allenai",
    "model_name": "Llama-3.1-Tulu-3.1-8B",
    "model_type": "llama",
    "datasets": [
      "allenai/RLVR-GSM-MATH-IF-Mixed-Constraints"
    ],
    "quantized_by": "brittlewis12",
    "frontmatter": {
      "base_model": "allenai/Llama-3.1-Tulu-3.1-8B",
      "pipeline_tag": "text-generation",
      "inference": "true",
      "language": [
        "en"
      ],
      "license": "llama3.1",
      "model_creator": "allenai",
      "model_name": "Llama-3.1-Tulu-3.1-8B",
      "model_type": "llama",
      "datasets": [
        "allenai/RLVR-GSM-MATH-IF-Mixed-Constraints"
      ],
      "quantized_by": "brittlewis12"
    },
    "hero_image_url": "https://pbs.twimg.com/profile_images/1744049151241797632/0mIP-P9e_400x400.jpg",
    "summary": "**Original model**: Tülu 3.1 8B **Model creator**: allenai > **Version 3.1 update**: The new version of our Tülu model is from an improvement only in the final RL stage of training. We switched from PPO to GRPO (no reward model) and did further hyperparameter tuning to achieve substantial performance improvements across the board over the original Tülu 3 8B model, as shown in the comparison below: > Tülu 3 is a leading instruction following model family, offering a post-training package with fully open-source data, code, and recipes designed to serve as a comprehensive guide for modern techniques. This is one step of a bigger process to training fully open-source models, like our OLMo models. Tülu 3 is designed for state-of-the-art performance on a diversity of tasks in addition to chat, such as MATH, GSM8K, and IFEval. This repo contains GGUF format model files for the Allen Institute for AI’s Tülu 3.1 8B. ### What is GGUF? GGUF is a file format for representing AI models. It is the third version of the format, introduced by the llama.cpp team on August 21st 2023. It is a replacement for GGML, which is no longer supported by llama.cpp. Converted with llama.cpp build 4699 (revision 31afcbe), using autogguf-rs. ### Prompt template ``  {{system_message}}  {{prompt}}  {{assistant_message}} `` ---",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nbase_model: allenai/Llama-3.1-Tulu-3.1-8B\npipeline_tag: text-generation\ninference: true\nlanguage:\n- en\nlicense: llama3.1\nmodel_creator: allenai\nmodel_name: Llama-3.1-Tulu-3.1-8B\nmodel_type: llama\ndatasets:\n  - allenai/RLVR-GSM-MATH-IF-Mixed-Constraints\nquantized_by: brittlewis12\n\n---\n\n# Tülu 3.1 8B GGUF\n\n**Original model**: [Tülu 3.1 8B](https://huggingface.co/allenai/Llama-3.1-Tulu-3.1-8B)\n\n**Model creator**: [allenai](https://huggingface.co/allenai)\n \n> **Version 3.1 update**: The new version of our Tülu model is from an improvement only in the final RL stage of training. We switched from PPO to GRPO (no reward model) and did further hyperparameter tuning to achieve substantial performance improvements across the board over the original Tülu 3 8B model, as shown in the comparison below:\n\n> Tülu 3 is a leading instruction following model family, offering a post-training package with fully open-source data, code, and recipes designed to serve as a comprehensive guide for modern techniques. This is one step of a bigger process to training fully open-source models, like our OLMo models. Tülu 3 is designed for state-of-the-art performance on a diversity of tasks in addition to chat, such as MATH, GSM8K, and IFEval.\n\n\nThis repo contains GGUF format model files for the Allen Institute for AI’s Tülu 3.1 8B.\n\n### What is GGUF?\n\nGGUF is a file format for representing AI models. It is the third version of the format, \nintroduced by the llama.cpp team on August 21st 2023. It is a replacement for GGML, which is no longer supported by llama.cpp. \nConverted with llama.cpp build 4699 (revision [31afcbe](https://github.com/ggerganov/llama.cpp/commits/31afcbee0eebbc998ab311aa314e389d60aa2127)),\nusing [autogguf-rs](https://github.com/brittlewis12/autogguf-rs).\n\n### Prompt template\n\n```\n<|system|>\n{{system_message}}\n<|user|>\n{{prompt}}\n<|assistant|>\n{{assistant_message}}<|end_of_text|>\n\n```\n\n---\n\n## Download & run with [cnvrs](https://twitter.com/cnvrsai) on iPhone, iPad, and Mac!\n\n![cnvrs.ai](https://pbs.twimg.com/profile_images/1744049151241797632/0mIP-P9e_400x400.jpg)\n\n[cnvrs](https://testflight.apple.com/join/sFWReS7K) is the best app for private, local AI on your device:\n- create & save **Characters** with custom system prompts & temperature settings\n- download and experiment with any **GGUF model** you can [find on HuggingFace](https://huggingface.co/models?library=gguf)!\n    * or, use an API key with the chat completions-compatible model provider of your choice -- ChatGPT, Claude, Gemini, DeepSeek, & more!\n- make it your own with custom **Theme colors**\n- powered by Metal ⚡️ & [Llama.cpp](https://github.com/ggerganov/llama.cpp), with **haptics** during response streaming!\n- **try it out** yourself today, on [Testflight](https://testflight.apple.com/join/sFWReS7K)!\n- follow [cnvrs on twitter](https://twitter.com/cnvrsai) to stay up to date\n\n---\n\n## Original Model Evaluation\n\n| Benchmark (eval)                | Tülu 3 SFT 8B | Tülu 3 DPO 8B | Tülu 3 8B | **Tülu 3.1 8B (NEW)** | Llama 3.1 8B Instruct | Qwen 2.5 7B Instruct | Magpie 8B | Gemma 2 9B Instruct | Ministral 8B Instruct |\n|---------------------------------|--------------|--------------|-----------|------------|------------------------|----------------------|-----------|---------------------|-----------------------|\n| **Avg.**                        | 60.4         | 64.4         | 64.8  | 66.3       | 62.2                  | **66.5**            | 44.7      | 55.2               | 58.3                 |\n| **MMLU (0 shot, CoT)**          | 65.9         | 68.7         | 68.2      | 69.5       | 71.2                  | **76.6**            | 62.0      | 74.6               | 68.5                 |\n| **PopQA (15 shot)**             | **29.3**     | 29.3         | 29.1      | 30.2       | 20.2                  | 18.1                | 22.5      | 28.3               | 20.2                 |\n| **TruthfulQA (6 shot)**         | 46.8         | 56.1         | 55.0      | 59.9       | 55.1                  | **63.1**            | 57.0      | 61.4               | 55.5                 |\n| **BigBenchHard (3 shot, CoT)**  | **67.9**     | 65.8         | 66.0      | 68.9       | 62.8                  | 70.2                | 0.9       | 2.5                | 56.2                 |\n| **DROP (3 shot)**               | 61.3         | 62.5         | 62.6  | **63.9**   | 61.5                  | 54.4                | 49.4      | 58.8               | 56.2                 |\n| **MATH (4 shot CoT, Flex)**     | 31.5         | 42.0         |43.7  | 47.8       | 42.5                  | **69.9**            | 5.1       | 29.8               | 40.0                 |\n| **GSM8K (8 shot, CoT)**         | 76.2         | 84.3         | 87.6  | **90.0**   | 83.4                  | 83.8                | 61.2      | 79.7               | 80.0                 |\n| **HumanEval (pass@10)**         | 86.2         | 83.9         | 83.9      | 84.8       | 86.3                  | **93.1**            | 75.4      | 71.7               | 91.0                 |\n| **HumanEval+ (pass@10)**        | 81.4         | 78.6         | 79.2      | 80.4       | 82.9                  | **89.7**            | 69.1      | 67.0               | 88.5                 |\n| **IFEval (prompt loose)**       | 72.8         | 81.1         | 82.4  | **83.9**   | 80.6                  | 74.7                | 38.8      | 69.9               | 56.4                 |\n| **AlpacaEval 2 (LC % win)**     | 12.4         | 33.5         | 34.5      | 34.9       | 24.2                  | 29.0                | **49.0**  | 43.7               | 31.4                 |\n| **Safety (6 task avg.)**        | **93.1**     | 87.2         | 85.5      | 81.2       | 75.2                  | 75.0                | 46.4      | 75.5               | 56.2                 |\n\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "text-generation",
    "en",
    "dataset:allenai/RLVR-GSM-MATH-IF-Mixed-Constraints",
    "base_model:allenai/Llama-3.1-Tulu-3.1-8B",
    "base_model:quantized:allenai/Llama-3.1-Tulu-3.1-8B",
    "license:llama3.1",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 3,
  "downloads": 150,
  "gated": false,
  "private": false,
  "last_modified": "2025-02-15T05:30:24.000Z",
  "created_at": "2025-02-12T22:58:47.000Z",
  "pipeline_tag": "text-generation",
  "library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
  "_id": "67ad27a7e9e63551553e5d68",
  "id": "brittlewis12/Llama-3.1-Tulu-3.1-8B-GGUF",
  "modelId": "brittlewis12/Llama-3.1-Tulu-3.1-8B-GGUF",
  "sha": "79835e9a7a46b3c896e8dcf2f29da171fa2eb522",
  "createdAt": "2025-02-12T22:58:47.000Z",
  "lastModified": "2025-02-15T05:30:24.000Z",
  "author": "brittlewis12",
  "downloads": 150,
  "likes": 3,
  "gated": false,
  "private": false,
  "pipeline_tag": "text-generation",
  "library_name": "",
  "siblings_count": 31
}