maziyarpanahi/llama-3.1-nemotron-70b-instruct-hf-gguf Q3_K_L GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

maziyarpanahi/llama-3.1-nemotron-70b-instruct-hf-gguf overview

MaziyarPanahi/Mistral-Small-Instruct-2409-GGUF

ggufquantized2-bit3-bit4-bit5-bit6-bit8-bitGGUFtext-generationbase_model:mistralai/Mistral-Small-Instruct-2409base_model:quantized:mistralai/Mistral-Small-Instruct-2409region:usimatrixconversational

maziyarpanahi/llama-3.1-nemotron-70b-instruct-hf-gguf visual

Downloads

277

Likes

Pipeline

text-generation

Library

—

Visibility

Public

Access

Open

Repository Files & Downloads

28 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
Llama-3.1-Nemotron-70B-Instruct-HF.IQ1_M.gguf	GGUF	IQ1_M	15.60 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF.IQ1_S.gguf	GGUF	IQ1_S	14.29 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF.IQ2_XS.gguf	GGUF	IQ2_XS	19.69 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF.IQ3_XS.gguf	GGUF	IQ3_XS	27.29 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF.IQ4_XS.gguf	GGUF	IQ4_XS	35.30 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF.Q2_K.gguf	GGUF	Q2_K	24.56 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF.Q3_K_L.gguf	GGUF	Q3_K_L	34.59 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF.Q3_K_M.gguf	GGUF	Q3_K_M	31.91 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF.Q3_K_S.gguf	GGUF	Q3_K_S	28.79 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF.Q4_K_M.gguf	GGUF	Q4_K_M	39.60 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF.Q4_K_S.gguf	GGUF	Q4_K_S	37.58 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF.Q5_K_M.gguf	GGUF	Q5_K_M	46.52 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF.Q5_K_S.gguf	GGUF	Q5_K_S	45.32 GB	Download
Mistral-Small-Instruct-2409.IQ1_M.gguf	GGUF	IQ1_M	4.91 GB	Download
Mistral-Small-Instruct-2409.IQ1_S.gguf	GGUF	IQ1_S	4.50 GB	Download
Mistral-Small-Instruct-2409.IQ2_XS.gguf	GGUF	IQ2_XS	6.19 GB	Download
Mistral-Small-Instruct-2409.IQ3_XS.gguf	GGUF	IQ3_XS	8.55 GB	Download
Mistral-Small-Instruct-2409.IQ4_XS.gguf	GGUF	IQ4_XS	11.12 GB	Download
Mistral-Small-Instruct-2409.Q2_K.gguf	GGUF	Q2_K	7.70 GB	Download
Mistral-Small-Instruct-2409.Q3_K_L.gguf	GGUF	Q3_K_L	10.92 GB	Download
Mistral-Small-Instruct-2409.Q3_K_M.gguf	GGUF	Q3_K_M	10.02 GB	Download
Mistral-Small-Instruct-2409.Q3_K_S.gguf	GGUF	Q3_K_S	8.98 GB	Download
Mistral-Small-Instruct-2409.Q4_K_M.gguf	GGUF	Q4_K_M	12.42 GB	Download
Mistral-Small-Instruct-2409.Q4_K_S.gguf	GGUF	Q4_K_S	11.79 GB	Download
Mistral-Small-Instruct-2409.Q5_K_M.gguf	GGUF	Q5_K_M	14.64 GB	Download
Mistral-Small-Instruct-2409.Q5_K_S.gguf	GGUF	Q5_K_S	14.27 GB	Download
Mistral-Small-Instruct-2409.Q6_K.gguf	GGUF	Q6_K	17.00 GB	Download
Mistral-Small-Instruct-2409.Q8_0.gguf	GGUF	—	22.02 GB	Download

Model Details Live

Model Slug

maziyarpanahi/llama-3.1-nemotron-70b-instruct-hf-gguf

Author

MaziyarPanahi

Pipeline Task

text-generation

Library

—

Created

2024-10-16

Last Modified

2024-10-17

Gated

Private

HF SHA

71fb57a34a909ba518283737137b753850784e21

License

Unknown

Language

Unknown

Base Model

mistralai/Mistral-Small-Instruct-2409

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "tags": [
      "quantized",
      "2-bit",
      "3-bit",
      "4-bit",
      "5-bit",
      "6-bit",
      "8-bit",
      "GGUF",
      "text-generation",
      "text-generation"
    ],
    "model_name": "Mistral-Small-Instruct-2409-GGUF",
    "base_model": "mistralai/Mistral-Small-Instruct-2409",
    "inference": false,
    "model_creator": "mistralai",
    "pipeline_tag": "text-generation",
    "quantized_by": "MaziyarPanahi",
    "frontmatter": {
      "tags": [
        "quantized",
        "2-bit",
        "3-bit",
        "4-bit",
        "5-bit",
        "6-bit",
        "8-bit",
        "GGUF",
        "text-generation",
        "text-generation"
      ],
      "model_name": "Mistral-Small-Instruct-2409-GGUF",
      "base_model": "mistralai/Mistral-Small-Instruct-2409",
      "inference": "false",
      "model_creator": "mistralai",
      "pipeline_tag": "text-generation",
      "quantized_by": "MaziyarPanahi"
    },
    "hero_image_url": "",
    "summary": "# MaziyarPanahi/Mistral-Small-Instruct-2409-GGUF",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\ntags:\n- quantized\n- 2-bit\n- 3-bit\n- 4-bit\n- 5-bit\n- 6-bit\n- 8-bit\n- GGUF\n- text-generation\n- text-generation\nmodel_name: Mistral-Small-Instruct-2409-GGUF\nbase_model: mistralai/Mistral-Small-Instruct-2409\ninference: false\nmodel_creator: mistralai\npipeline_tag: text-generation\nquantized_by: MaziyarPanahi\n---\n# [MaziyarPanahi/Mistral-Small-Instruct-2409-GGUF](https://huggingface.co/MaziyarPanahi/Mistral-Small-Instruct-2409-GGUF)\n- Model creator: [mistralai](https://huggingface.co/mistralai)\n- Original model: [mistralai/Mistral-Small-Instruct-2409](https://huggingface.co/mistralai/Mistral-Small-Instruct-2409)\n\n## Description\n[MaziyarPanahi/Mistral-Small-Instruct-2409-GGUF](https://huggingface.co/MaziyarPanahi/Mistral-Small-Instruct-2409-GGUF) contains GGUF format model files for [mistralai/Mistral-Small-Instruct-2409](https://huggingface.co/mistralai/Mistral-Small-Instruct-2409).\n\n### About GGUF\n\nGGUF is a new format introduced by the llama.cpp team on August 21st 2023. It is a replacement for GGML, which is no longer supported by llama.cpp.\n\nHere is an incomplete list of clients and libraries that are known to support GGUF:\n\n* [llama.cpp](https://github.com/ggerganov/llama.cpp). The source project for GGUF. Offers a CLI and a server option.\n* [llama-cpp-python](https://github.com/abetlen/llama-cpp-python), a Python library with GPU accel, LangChain support, and OpenAI-compatible API server.\n* [LM Studio](https://lmstudio.ai/), an easy-to-use and powerful local GUI for Windows and macOS (Silicon), with GPU acceleration. Linux available, in beta as of 27/11/2023.\n* [text-generation-webui](https://github.com/oobabooga/text-generation-webui), the most widely used web UI, with many features and powerful extensions. Supports GPU acceleration.\n* [KoboldCpp](https://github.com/LostRuins/koboldcpp), a fully featured web UI, with GPU accel across all platforms and GPU architectures. Especially good for story telling.\n* [GPT4All](https://gpt4all.io/index.html), a free and open source local running GUI, supporting Windows, Linux and macOS with full GPU accel.\n* [LoLLMS Web UI](https://github.com/ParisNeo/lollms-webui), a great web UI with many interesting and unique features, including a full model library for easy model selection.\n* [Faraday.dev](https://faraday.dev/), an attractive and easy to use character-based chat GUI for Windows and macOS (both Silicon and Intel), with GPU acceleration.\n* [candle](https://github.com/huggingface/candle), a Rust ML framework with a focus on performance, including GPU support, and ease of use.\n* [ctransformers](https://github.com/marella/ctransformers), a Python library with GPU accel, LangChain support, and OpenAI-compatible AI server. Note, as of time of writing (November 27th 2023), ctransformers has not been updated in a long time and does not support many recent models.\n\n## Special thanks\n\n🙏 Special thanks to [Georgi Gerganov](https://github.com/ggerganov) and the whole team working on [llama.cpp](https://github.com/ggerganov/llama.cpp/) for making all of this possible.",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "quantized",
    "2-bit",
    "3-bit",
    "4-bit",
    "5-bit",
    "6-bit",
    "8-bit",
    "GGUF",
    "text-generation",
    "base_model:mistralai/Mistral-Small-Instruct-2409",
    "base_model:quantized:mistralai/Mistral-Small-Instruct-2409",
    "region:us",
    "imatrix",
    "conversational"
  ],
  "likes": 3,
  "downloads": 277,
  "gated": false,
  "private": false,
  "last_modified": "2024-10-17T21:39:39.000Z",
  "created_at": "2024-10-16T17:59:01.000Z",
  "pipeline_tag": "text-generation",
  "library_name": ""
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "670ffee57918039dee1605b7",
  "id": "MaziyarPanahi/Llama-3.1-Nemotron-70B-Instruct-HF-GGUF",
  "modelId": "MaziyarPanahi/Llama-3.1-Nemotron-70B-Instruct-HF-GGUF",
  "sha": "71fb57a34a909ba518283737137b753850784e21",
  "createdAt": "2024-10-16T17:59:01.000Z",
  "lastModified": "2024-10-17T21:39:39.000Z",
  "author": "MaziyarPanahi",
  "downloads": 277,
  "likes": 3,
  "gated": false,
  "private": false,
  "pipeline_tag": "text-generation",
  "library_name": "",
  "siblings_count": 31
}

maziyarpanahi/llama-3.1-nemotron-70b-instruct-hf-gguf overview

Repository Files & Downloads

Model Details Live

Metadata Inspector

More models in this shard