joseph717171/hermes-3-llama-3.2-3b-oq8_0-f32.ef32.iq4_k-q8_0-gguf - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

joseph717171/hermes-3-llama-3.2-3b-oq8_0-f32.ef32.iq4_k-q8_0-gguf overview

Custom GGUF quants of Hermes-3-Llama-3.2-3B, where the Output Tensors are quantized to Q8_0 or upcast to F32, while the Embeddings are kept at F32. Enjoy! 🧠🔥🚀

ggufendpoints_compatibleregion:usimatrixconversational

joseph717171/hermes-3-llama-3.2-3b-oq8_0-f32.ef32.iq4_k-q8_0-gguf visual

Downloads

135

Likes

Pipeline

—

Library

—

Visibility

Public

Access

Open

Repository Files & Downloads

6 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
Hermes-3-Llama-3.2-3B-OF32.EF32.IQ4_K_M.gguf	GGUF	F32	3.05 GB	Download
Hermes-3-Llama-3.2-3B-OF32.EF32.IQ6_K.gguf	GGUF	F32	3.63 GB	Download
Hermes-3-Llama-3.2-3B-OF32.EF32.IQ8_0.gguf	GGUF	F32	4.26 GB	Download
Hermes-3-Llama-3.2-3B-OQ8_0.EF32.IQ4_K_M.gguf	GGUF	F32	3.05 GB	Download
Hermes-3-Llama-3.2-3B-OQ8_0.EF32.IQ6_K.gguf	GGUF	F32	3.63 GB	Download
Hermes-3-Llama-3.2-3B-OQ8_0.EF32.IQ8_0.gguf	GGUF	F32	4.26 GB	Download

Model Details Live

Model Slug

joseph717171/hermes-3-llama-3.2-3b-oq8_0-f32.ef32.iq4_k-q8_0-gguf

Author

Joseph717171

Pipeline Task

—

Library

—

Created

2024-12-13

Last Modified

2024-12-13

Gated

Private

HF SHA

1988f06c1a99c46242c7e7dd0b128d04c93e27ca

License

Unknown

Language

Unknown

Base Model

Unknown

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "frontmatter": {},
    "hero_image_url": "",
    "summary": "Custom GGUF quants of Hermes-3-Llama-3.2-3B, where the Output Tensors are quantized to Q8_0 or upcast to F32, while the Embeddings are kept at F32. Enjoy! 🧠🔥🚀",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "Custom GGUF quants of Hermes-3-Llama-3.2-3B, where the Output Tensors are quantized to Q8_0 or upcast to F32, while the Embeddings are kept at F32. Enjoy! 🧠🔥🚀  ",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "endpoints_compatible",
    "region:us",
    "imatrix",
    "conversational"
  ],
  "likes": 1,
  "downloads": 135,
  "gated": false,
  "private": false,
  "last_modified": "2024-12-13T23:42:38.000Z",
  "created_at": "2024-12-13T00:38:13.000Z",
  "pipeline_tag": "",
  "library_name": ""
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "675b81f576d0e555d616477a",
  "id": "Joseph717171/Hermes-3-Llama-3.2-3B-OQ8_0-F32.EF32.IQ4_K-Q8_0-GGUF",
  "modelId": "Joseph717171/Hermes-3-Llama-3.2-3B-OQ8_0-F32.EF32.IQ4_K-Q8_0-GGUF",
  "sha": "1988f06c1a99c46242c7e7dd0b128d04c93e27ca",
  "createdAt": "2024-12-13T00:38:13.000Z",
  "lastModified": "2024-12-13T23:42:38.000Z",
  "author": "Joseph717171",
  "downloads": 135,
  "likes": 1,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 8
}

joseph717171/hermes-3-llama-3.2-3b-oq8_0-f32.ef32.iq4_k-q8_0-gguf overview

Repository Files & Downloads

Model Details Live

Metadata Inspector

More models in this shard