GraySoft
Projects Models About FAQ Contact Download guIDE โ†’

joseph717171/hermes-3-llama-3.2-3b-oq8_0-f32.ef32.iq4_k-q8_0-gguf - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

joseph717171/hermes-3-llama-3.2-3b-oq8_0-f32.ef32.iq4_k-q8_0-gguf overview

Custom GGUF quants of Hermes-3-Llama-3.2-3B, where the Output Tensors are quantized to Q8_0 or upcast to F32, while the Embeddings are kept at F32. Enjoy! ๐Ÿง ๐Ÿ”ฅ๐Ÿš€

ggufendpoints_compatibleregion:usimatrixconversational
joseph717171/hermes-3-llama-3.2-3b-oq8_0-f32.ef32.iq4_k-q8_0-gguf visual
Downloads
135
Likes
1
Pipeline
โ€”
Library
โ€”
Visibility
Public
Access
Open

Repository Files & Downloads

6 files detected
Direct downloads for all repository files
FileTypeQuantizationSizeLink
Hermes-3-Llama-3.2-3B-OF32.EF32.IQ4_K_M.gguf GGUF F32 3.05 GB Download
Hermes-3-Llama-3.2-3B-OF32.EF32.IQ6_K.gguf GGUF F32 3.63 GB Download
Hermes-3-Llama-3.2-3B-OF32.EF32.IQ8_0.gguf GGUF F32 4.26 GB Download
Hermes-3-Llama-3.2-3B-OQ8_0.EF32.IQ4_K_M.gguf GGUF F32 3.05 GB Download
Hermes-3-Llama-3.2-3B-OQ8_0.EF32.IQ6_K.gguf GGUF F32 3.63 GB Download
Hermes-3-Llama-3.2-3B-OQ8_0.EF32.IQ8_0.gguf GGUF F32 4.26 GB Download

Model Details Live

Model Slug
joseph717171/hermes-3-llama-3.2-3b-oq8_0-f32.ef32.iq4_k-q8_0-gguf
Author
Joseph717171
Pipeline Task
โ€”
Library
โ€”
Created
2024-12-13
Last Modified
2024-12-13
Gated
No
Private
No
HF SHA
1988f06c1a99c46242c7e7dd0b128d04c93e27ca
License
Unknown
Language
Unknown
Base Model
Unknown

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "frontmatter": {},
    "hero_image_url": "",
    "summary": "Custom GGUF quants of Hermes-3-Llama-3.2-3B, where the Output Tensors are quantized to Q8_0 or upcast to F32, while the Embeddings are kept at F32. Enjoy! ๐Ÿง ๐Ÿ”ฅ๐Ÿš€",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "Custom GGUF quants of Hermes-3-Llama-3.2-3B, where the Output Tensors are quantized to Q8_0 or upcast to F32, while the Embeddings are kept at F32. Enjoy! ๐Ÿง ๐Ÿ”ฅ๐Ÿš€  ",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "endpoints_compatible",
    "region:us",
    "imatrix",
    "conversational"
  ],
  "likes": 1,
  "downloads": 135,
  "gated": false,
  "private": false,
  "last_modified": "2024-12-13T23:42:38.000Z",
  "created_at": "2024-12-13T00:38:13.000Z",
  "pipeline_tag": "",
  "library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
  "_id": "675b81f576d0e555d616477a",
  "id": "Joseph717171/Hermes-3-Llama-3.2-3B-OQ8_0-F32.EF32.IQ4_K-Q8_0-GGUF",
  "modelId": "Joseph717171/Hermes-3-Llama-3.2-3B-OQ8_0-F32.EF32.IQ4_K-Q8_0-GGUF",
  "sha": "1988f06c1a99c46242c7e7dd0b128d04c93e27ca",
  "createdAt": "2024-12-13T00:38:13.000Z",
  "lastModified": "2024-12-13T23:42:38.000Z",
  "author": "Joseph717171",
  "downloads": 135,
  "likes": 1,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 8
}