joseph717171/deephermes-3-llama-3.1-8b-preview-oq8_0-f32.ef32.iq4_k-q8_0-gguf - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.
Model Intelligence Sheet
joseph717171/deephermes-3-llama-3.1-8b-preview-oq8_0-f32.ef32.iq4_k-q8_0-gguf overview
Custom GGUF quants of NousResearch/DeepHermes-3-Llama-3-8B-Preview, where the Output Tensors are kept at F32 or quantized to Q8_0, while the Embeddings are kept at F32. Enjoy! ๐ง ๐ฅ๐
Downloads
1,318
Likes
1
Pipeline
โ
Library
โ
Visibility
Public
Access
Open
Repository Files & Downloads
14 files detected
Direct downloads for all repository files
| File | Type | Quantization | Size | Link |
|---|---|---|---|---|
| DeepHermes-3-Llama-3-8B-Preview-OF32.EF32.IQ4_K_M.gguf | GGUF | F32 | 7.82 GB | Download |
| DeepHermes-3-Llama-3-8B-Preview-OF32.EF32.IQ5_K_M.gguf | GGUF | F32 | 8.52 GB | Download |
| DeepHermes-3-Llama-3-8B-Preview-OF32.EF32.IQ5_K_S.gguf | GGUF | F32 | 8.39 GB | Download |
| DeepHermes-3-Llama-3-8B-Preview-OF32.EF32.IQ6_K.gguf | GGUF | F32 | 9.25 GB | Download |
| DeepHermes-3-Llama-3-8B-Preview-OF32.EF32.IQ8_0.gguf | GGUF | F32 | 10.83 GB | Download |
| DeepHermes-3-Llama-3-8B-Preview-OQ8_0.EF32.IQ4_K_M.gguf | GGUF | F32 | 6.38 GB | Download |
| DeepHermes-3-Llama-3-8B-Preview-OQ8_0.EF32.IQ5_K_M.gguf | GGUF | F32 | 7.08 GB | Download |
| DeepHermes-3-Llama-3-8B-Preview-OQ8_0.EF32.IQ6_K.gguf | GGUF | F32 | 7.82 GB | Download |
| DeepHermes-3-Llama-3-8B-Preview-OQ8_0.EF32.IQ8_0.gguf | GGUF | F32 | 9.39 GB | Download |
| DeepHermes-3-Llama-3-8B-Preview-OQ8_0.EQ8_0.IQ4_K_M.gguf | GGUF | IQ4_K_M | 4.95 GB | Download |
| DeepHermes-3-Llama-3-8B-Preview-OQ8_0.EQ8_0.IQ5_K_M.gguf | GGUF | IQ5_K_M | 5.64 GB | Download |
| DeepHermes-3-Llama-3-8B-Preview-OQ8_0.EQ8_0.IQ5_K_S.gguf | GGUF | IQ5_K_S | 5.52 GB | Download |
| DeepHermes-3-Llama-3-8B-Preview-OQ8_0.EQ8_0.IQ6_K.gguf | GGUF | IQ6_K | 6.38 GB | Download |
| DeepHermes-3-Llama-3-8B-Preview-OQ8_0.EQ8_0.IQ8_0.gguf | GGUF | โ | 7.95 GB | Download |
Model Details Live
Metadata Inspector
Normalized metadata (stored in metadata_json)
{
"metadata": {},
"card_data": {
"frontmatter": {},
"hero_image_url": "",
"summary": "Custom GGUF quants of NousResearch/DeepHermes-3-Llama-3-8B-Preview, where the Output Tensors are kept at F32 or quantized to Q8_0, while the Embeddings are kept at F32. Enjoy! ๐ง ๐ฅ๐",
"quick_links": [],
"benchmark_table_html": "",
"readme_markdown": "Custom GGUF quants of [NousResearch/DeepHermes-3-Llama-3-8B-Preview](https://huggingface.co/NousResearch/DeepHermes-3-Llama-3-8B-Preview), where the Output Tensors are kept at F32 or quantized to Q8_0, while the Embeddings are kept at F32. Enjoy! ๐ง ๐ฅ๐ \n",
"related_quantizations": []
},
"tags": [
"gguf",
"endpoints_compatible",
"region:us",
"conversational"
],
"likes": 1,
"downloads": 1318,
"gated": false,
"private": false,
"last_modified": "2025-03-31T01:36:34.000Z",
"created_at": "2025-02-14T04:27:44.000Z",
"pipeline_tag": "",
"library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
"_id": "67aec640e09495b151e70ee4",
"id": "Joseph717171/DeepHermes-3-Llama-3.1-8B-Preview-OQ8_0-F32.EF32.IQ4_K-Q8_0-GGUF",
"modelId": "Joseph717171/DeepHermes-3-Llama-3.1-8B-Preview-OQ8_0-F32.EF32.IQ4_K-Q8_0-GGUF",
"sha": "4dfe3db09c5bb581ac469a0537eb883e4e7f00ad",
"createdAt": "2025-02-14T04:27:44.000Z",
"lastModified": "2025-03-31T01:36:34.000Z",
"author": "Joseph717171",
"downloads": 1318,
"likes": 1,
"gated": false,
"private": false,
"pipeline_tag": "",
"library_name": "",
"siblings_count": 17
}