gaianet/llama-3.1-nemotron-70b-instruct-hf-gguf Q5_0 GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

gaianet/llama-3.1-nemotron-70b-instruct-hf-gguf overview

Comprehensive model page for gaianet/llama-3.1-nemotron-70b-instruct-hf-gguf

transformersggufllamatext-generationnvidiallama3.1base_model:nvidia/Llama-3.1-Nemotron-70B-Instruct-HFbase_model:quantized:nvidia/Llama-3.1-Nemotron-70B-Instruct-HFlicense:llama3.1region:usconversational

gaianet/llama-3.1-nemotron-70b-instruct-hf-gguf visual

Downloads

112

Likes

Pipeline

text-generation

Library

transformers

Visibility

Public

Access

Open

Repository Files & Downloads

20 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
Llama-3.1-Nemotron-70B-Instruct-HF-Q2_K.gguf	GGUF	Q2_K	24.56 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF-Q3_K_L.gguf	GGUF	Q3_K_L	34.59 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF-Q3_K_M.gguf	GGUF	Q3_K_M	31.91 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF-Q3_K_S.gguf	GGUF	Q3_K_S	28.79 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF-Q4_0.gguf	GGUF	—	37.22 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF-Q4_K_M.gguf	GGUF	Q4_K_M	39.60 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF-Q4_K_S.gguf	GGUF	Q4_K_S	37.58 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF-Q5_0.gguf	GGUF	—	45.32 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF-Q5_K_M.gguf	GGUF	Q5_K_M	46.52 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF-Q5_K_S.gguf	GGUF	Q5_K_S	45.32 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF-Q6_K-00001-of-00002.gguf	GGUF	Q6_K	27.79 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF-Q6_K-00002-of-00002.gguf	GGUF	Q6_K	26.12 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF-Q8_0-00001-of-00003.gguf	GGUF	—	27.76 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF-Q8_0-00002-of-00003.gguf	GGUF	—	27.71 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF-Q8_0-00003-of-00003.gguf	GGUF	—	14.36 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF-f16-00001-of-00005.gguf	GGUF	F16	27.90 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF-f16-00002-of-00005.gguf	GGUF	F16	27.53 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF-f16-00003-of-00005.gguf	GGUF	F16	27.81 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF-f16-00004-of-00005.gguf	GGUF	F16	27.53 GB	Download
Llama-3.1-Nemotron-70B-Instruct-HF-f16-00005-of-00005.gguf	GGUF	F16	20.65 GB	Download

Model Details Live

Model Slug

gaianet/llama-3.1-nemotron-70b-instruct-hf-gguf

Author

gaianet

Pipeline Task

text-generation

Library

transformers

Created

2024-10-17

Last Modified

2024-10-18

Gated

Private

HF SHA

f45d1e6d22227fff72eceb58bd8735a18759962a

License

llama3.1

Language

Unknown

Base Model

nvidia/Llama-3.1-Nemotron-70B-Instruct-HF

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "license": "llama3.1",
    "model_name": "Llama-3.1-Nemotron-70B-Instruct-HF",
    "base_model": "nvidia/Llama-3.1-Nemotron-70B-Instruct-HF",
    "inference": false,
    "pipeline_tag": "text-generation",
    "library_name": "transformers",
    "model_creator": "nvidia",
    "quantized_by": "Second State Inc.",
    "tags": [
      "nvidia",
      "llama3.1"
    ],
    "frontmatter": {
      "license": "llama3.1",
      "model_name": "Llama-3.1-Nemotron-70B-Instruct-HF",
      "base_model": "nvidia/Llama-3.1-Nemotron-70B-Instruct-HF",
      "inference": "false",
      "pipeline_tag": "text-generation",
      "library_name": "transformers",
      "model_creator": "nvidia",
      "quantized_by": "Second State Inc.",
      "tags": [
        "nvidia",
        "llama3.1"
      ]
    },
    "hero_image_url": "https://github.com/GaiaNet-AI/.github/assets/45785633/d6976adc-f97d-4f86-a648-0f2f5c8e7eee",
    "summary": "",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: llama3.1\nmodel_name: Llama-3.1-Nemotron-70B-Instruct-HF\nbase_model: nvidia/Llama-3.1-Nemotron-70B-Instruct-HF\ninference: false\npipeline_tag: text-generation\nlibrary_name: transformers\nmodel_creator: nvidia\nquantized_by: Second State Inc.\ntags:\n- nvidia\n- llama3.1\n---\n\n![](https://github.com/GaiaNet-AI/.github/assets/45785633/d6976adc-f97d-4f86-a648-0f2f5c8e7eee)\n\n# Llama-3.1-Nemotron-70B-Instruct-HF-GGUF\n\n## Original Model\n\n[nvidia/Llama-3.1-Nemotron-70B-Instruct-HF](https://huggingface.co/nvidia/Llama-3.1-Nemotron-70B-Instruct-HF)\n\n## Run with Gaianet\n\n**Prompt template:**\n\nprompt template: `llama-3-chat`\n\n**Context size:**\n\nchat_ctx_size: `128000`\n\n**Run with GaiaNet:**\n\n- Quick start: https://docs.gaianet.ai/node-guide/quick-start\n\n- Customize your node: https://docs.gaianet.ai/node-guide/customize\n\n*Quantized with llama.cpp b3932*\n",
    "related_quantizations": []
  },
  "tags": [
    "transformers",
    "gguf",
    "llama",
    "text-generation",
    "nvidia",
    "llama3.1",
    "base_model:nvidia/Llama-3.1-Nemotron-70B-Instruct-HF",
    "base_model:quantized:nvidia/Llama-3.1-Nemotron-70B-Instruct-HF",
    "license:llama3.1",
    "region:us",
    "conversational"
  ],
  "likes": 0,
  "downloads": 112,
  "gated": false,
  "private": false,
  "last_modified": "2024-10-18T04:38:33.000Z",
  "created_at": "2024-10-17T05:13:10.000Z",
  "pipeline_tag": "text-generation",
  "library_name": "transformers"
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "67109ce6f1279fe0dfd965b7",
  "id": "gaianet/Llama-3.1-Nemotron-70B-Instruct-HF-GGUF",
  "modelId": "gaianet/Llama-3.1-Nemotron-70B-Instruct-HF-GGUF",
  "sha": "f45d1e6d22227fff72eceb58bd8735a18759962a",
  "createdAt": "2024-10-17T05:13:10.000Z",
  "lastModified": "2024-10-18T04:38:33.000Z",
  "author": "gaianet",
  "downloads": 112,
  "likes": 0,
  "gated": false,
  "private": false,
  "pipeline_tag": "text-generation",
  "library_name": "transformers",
  "siblings_count": 23
}

gaianet/llama-3.1-nemotron-70b-instruct-hf-gguf overview

Repository Files & Downloads

Model Details Live

Metadata Inspector

More models in this shard