hitonet/hito-1.7b-gguf Q5_K_S GGUF - Free GGUF Download

hitonet/hito-1.7b-gguf is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. Use this page to pick a local model for guIDE or other runtimes. Related models in the same shard are listed below.
Model Intelligence Sheet
hitonet/hito-1.7b-gguf overview
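Quantized versions for llama.cpp, Ollama, LM Studio, and more.
Links: Original Model (https://huggingface.co/hitonet/hito-1.7b), Website (https://hitonet.com), Chat (https://chat.hitonet.com), API (https://platform.hitonet.com)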
| Field | Value |
|---|---|
| Downloads | 153 |
| Likes | 1 |
| Pipeline | text-generation |
| Library | — |
| Visibility | Public |
| Access | Open |
Repository Files & Downloads
13 files detected
Direct downloads for all repository files
| File | Type | Quantization | Size | Link |
|---|---|---|---|---|
| hito-1.7b-F16.gguf | GGUF | F16 | 3.21 GB | Download |
| hito-1.7b-Q2_K.gguf | GGUF | Q2_K | 741.44 MB | Download |
| hito-1.7b-Q3_K_L.gguf | GGUF | Q3_K_L | 956.69 MB | Download |
| hito-1.7b-Q3_K_M.gguf | GGUF | Q3_K_M | 895.69 MB | Download |
| hito-1.7b-Q3_K_S.gguf | GGUF | Q3_K_S | 826.76 MB | Download |
| hito-1.7b-Q4_0.gguf | GGUF | Q4_0 | 1005.26 MB | Download |
| hito-1.7b-Q4_K_M.gguf | GGUF | Q4_K_M | 1.03 GB | Download |
| hito-1.7b-Q4_K_S.gguf | GGUF | Q4_K_S | 1010.76 MB | Download |
| hito-1.7b-Q5_0.gguf | GGUF | Q5_0 | 1.15 GB | Download |
| hito-1.7b-Q5_K_M.gguf | GGUF | Q5_K_M | 1.17 GB | Download |
| hito-1.7b-Q5_K_S.gguf | GGUF | Q5_K_S | 1.15 GB | Download |
| hito-1.7b-Q6_K.gguf | GGUF | Q6_K | 1.32 GB | Download |
| hito-1.7b-Q8_0.gguf | GGUF | Q8_0 | 1.71 GB | Download |
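A minimal sketch of how the table above could be used to choose a file: it picks the largest quantization that fits a size budget and builds its download URL. The URL follows the `resolve/main` pattern used in the repository README. The function names, the `SKIP` set, and the 1 GB = 1024 MB conversion are assumptions made for illustration, and file size is only a rough proxy for output quality.

```python
from typing import Optional

# Sizes in MB copied from the file table.
# Assumption: sizes shown in GB are converted with 1 GB = 1024 MB.
QUANT_SIZES_MB = {
    "F16": 3.21 * 1024,
    "Q2_K": 741.44,
    "Q3_K_L": 956.69,
    "Q3_K_M": 895.69,
    "Q3_K_S": 826.76,
    "Q4_0": 1005.26,
    "Q4_K_M": 1.03 * 1024,
    "Q4_K_S": 1010.76,
    "Q5_0": 1.15 * 1024,
    "Q5_K_M": 1.17 * 1024,
    "Q5_K_S": 1.15 * 1024,
    "Q6_K": 1.32 * 1024,
    "Q8_0": 1.71 * 1024,
}

# The repository README lists Q2_K as broken, so this sketch never selects it.
SKIP = {"Q2_K"}

BASE_URL = "https://huggingface.co/hitonet/hito-1.7b-GGUF/resolve/main"


def download_url(quant: str) -> str:
    """Build the direct download URL for one quantization."""
    return f"{BASE_URL}/hito-1.7b-{quant}.gguf"


def largest_quant_under(budget_mb: float) -> Optional[str]:
    """Return the largest non-skipped quantization that fits the budget, or None."""
    candidates = {
        quant: size
        for quant, size in QUANT_SIZES_MB.items()
        if size <= budget_mb and quant not in SKIP
    }
    if not candidates:
        return None
    return max(candidates, key=candidates.get)


choice = largest_quant_under(1100)
if choice is not None:
    print(choice, download_url(choice))
```

With a 1100 MB budget this selects `Q4_K_M`, which is also the quantization the README recommends.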
Model Details Live
Metadata Inspector
Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "license": "apache-2.0",
    "language": [
      "en"
    ],
    "tags": [
      "qwen3",
      "fine-tuned",
      "hito",
      "hitonet",
      "reasoning",
      "thinking",
      "llama-cpp",
      "ollama",
      "conversational",
      "gguf"
    ],
    "pipeline_tag": "text-generation",
    "base_model": "hitonet/hito-1.7b",
    "frontmatter": {
      "license": "apache-2.0",
      "language": [
        "en"
      ],
      "tags": [
        "qwen3",
        "fine-tuned",
        "hito",
        "hitonet",
        "reasoning",
        "thinking",
        "llama-cpp",
        "ollama",
        "conversational",
        "gguf"
      ],
      "pipeline_tag": "text-generation",
      "base_model": "hitonet/hito-1.7b"
    },
    "hero_image_url": "https://img.shields.io/badge/Model_Weights-Apache_2.0_(Open)-green?style=flat-square",
    "summary": "### Quantized versions for llama.cpp, Ollama, LM Studio, and more     --- ---",
    "quick_links": [],
    "benchmark_table_html": "",
"readme_markdown": "---\nlicense: apache-2.0\nlanguage:\n- en\ntags:\n- qwen3\n- fine-tuned\n- hito\n- hitonet\n- reasoning\n- thinking\n- llama-cpp\n- ollama\n- conversational\n- gguf\npipeline_tag: text-generation\nbase_model: hitonet/hito-1.7b\n---\n\n<div align=\"center\">\n\n# Hito 1.7B - GGUF\n\n### Quantized versions for llama.cpp, Ollama, LM Studio, and more\n\n[](https://huggingface.co/hitonet/hito-1.7b)\n[](https://hitonet.com)\n[](https://chat.hitonet.com)\n[](https://platform.hitonet.com)\n\n---\n\n<img src=\"https://img.shields.io/badge/Model_Weights-Apache_2.0_(Open)-green?style=flat-square\" alt=\"Model License\"/>\n<img src=\"https://img.shields.io/badge/Training_Method-Commercial_License_Required-red?style=flat-square\" alt=\"Method License\"/>\n\n</div>\n\n---\n\n## About\n\nThis repository contains **GGUF quantized versions** of [hitonet/hito-1.7b](https://huggingface.co/hitonet/hito-1.7b).\n\nHito is a 1.7B parameter model with structured thinking patterns that enable better accuracy and transparency.\n\nFor the original model (safetensors), training details, benchmarks, and full documentation, see the [main repository](https://huggingface.co/hitonet/hito-1.7b).\n\n---\n\n## Available Quantizations\n\n### Recommended\n\n| File | Quant | Size | Quality | Notes |\n|------|-------|------|---------|-------|\n| **[hito-1.7b-Q4_K_M.gguf](https://huggingface.co/hitonet/hito-1.7b-GGUF/resolve/main/hito-1.7b-Q4_K_M.gguf)** | Q4_K_M | 1.1 GB | **BEST** | Perfect balance of size and quality |\n| [hito-1.7b-Q5_K_M.gguf](https://huggingface.co/hitonet/hito-1.7b-GGUF/resolve/main/hito-1.7b-Q5_K_M.gguf) | Q5_K_M | 1.2 GB | Excellent | Slightly better than Q4_K_M |\n| [hito-1.7b-Q8_0.gguf](https://huggingface.co/hitonet/hito-1.7b-GGUF/resolve/main/hito-1.7b-Q8_0.gguf) | Q8_0 | 1.8 GB | Excellent | Highest quality quantization |\n\n### Good Quality\n\n| File | Quant | Size | Quality | Notes |\n|------|-------|------|---------|-------|\n| 
[hito-1.7b-Q4_0.gguf](https://huggingface.co/hitonet/hito-1.7b-GGUF/resolve/main/hito-1.7b-Q4_0.gguf) | Q4_0 | 1.0 GB | Good | Legacy format, works well |\n| [hito-1.7b-Q4_K_S.gguf](https://huggingface.co/hitonet/hito-1.7b-GGUF/resolve/main/hito-1.7b-Q4_K_S.gguf) | Q4_K_S | 1.0 GB | Good | Smaller Q4 variant |\n| [hito-1.7b-Q5_0.gguf](https://huggingface.co/hitonet/hito-1.7b-GGUF/resolve/main/hito-1.7b-Q5_0.gguf) | Q5_0 | 1.2 GB | Good | Legacy 5-bit |\n| [hito-1.7b-Q5_K_S.gguf](https://huggingface.co/hitonet/hito-1.7b-GGUF/resolve/main/hito-1.7b-Q5_K_S.gguf) | Q5_K_S | 1.2 GB | Good | Smaller Q5 variant |\n| [hito-1.7b-Q6_K.gguf](https://huggingface.co/hitonet/hito-1.7b-GGUF/resolve/main/hito-1.7b-Q6_K.gguf) | Q6_K | 1.4 GB | Excellent | Near full quality |\n| [hito-1.7b-F16.gguf](https://huggingface.co/hitonet/hito-1.7b-GGUF/resolve/main/hito-1.7b-F16.gguf) | F16 | 3.3 GB | Reference | Full precision GGUF |\n\n### Low Quality (Not Recommended)\n\n| File | Quant | Size | Quality | Notes |\n|------|-------|------|---------|-------|\n| [hito-1.7b-Q3_K_L.gguf](https://huggingface.co/hitonet/hito-1.7b-GGUF/resolve/main/hito-1.7b-Q3_K_L.gguf) | Q3_K_L | 957 MB | Fair | May get stuck in thinking |\n| [hito-1.7b-Q3_K_M.gguf](https://huggingface.co/hitonet/hito-1.7b-GGUF/resolve/main/hito-1.7b-Q3_K_M.gguf) | Q3_K_M | 896 MB | Fair | Occasional issues |\n| [hito-1.7b-Q3_K_S.gguf](https://huggingface.co/hitonet/hito-1.7b-GGUF/resolve/main/hito-1.7b-Q3_K_S.gguf) | Q3_K_S | 827 MB | Fair | Noticeable quality loss |\n\n### Broken (Do Not Use)\n\n| File | Quant | Size | Quality | Notes |\n|------|-------|------|---------|-------|\n| [hito-1.7b-Q2_K.gguf](https://huggingface.co/hitonet/hito-1.7b-GGUF/resolve/main/hito-1.7b-Q2_K.gguf) | Q2_K | 742 MB | Broken | Produces gibberish |\n\n---\n\n## Quick Start\n\n### Ollama\n\n```bash\n# Download the recommended quantization\nwget https://huggingface.co/hitonet/hito-1.7b-GGUF/resolve/main/hito-1.7b-Q4_K_M.gguf\n\n# Create 
Modelfile\ncat > Modelfile << 'EOF'\nFROM hito-1.7b-Q4_K_M.gguf\nSYSTEM \"You are Hito by Hitonet.com.\"\nPARAMETER temperature 0.7\nPARAMETER stop \"<|im_end|>\"\nEOF\n\n# Create and run\nollama create hito -f Modelfile\nollama run hito\n```\n\n### llama.cpp\n\n```bash\n./llama-cli -m hito-1.7b-Q4_K_M.gguf \\\n -sys \"You are Hito by Hitonet.com.\" \\\n -p \"What is your name?\" \\\n -n 256\n```\n\n### LM Studio\n\n1. Download any GGUF file from this repository\n2. Open LM Studio → Load Model\n3. Set system prompt: `You are Hito by Hitonet.com.`\n4. Start chatting!\n\n---\n\n## Compatibility\n\nThese GGUF files work with:\n\n- **Ollama** (recommended)\n- **llama.cpp**\n- **LM Studio**\n- **Jan**\n- **GPT4All**\n- **llama-cpp-python**\n- Any llama.cpp-compatible application\n\n---\n\n## What Makes Hito Special\n\n- **Structured Thinking**: Uses `<think>` tags for transparent reasoning\n- **Self-Correcting**: Catches errors mid-reasoning \n- **Humble by Design**: Admits uncertainty rather than hallucinating\n- **Efficient**: Only 1.7B parameters, runs on CPU\n\nFor full documentation, benchmarks, and training details, see the [main repository](https://huggingface.co/hitonet/hito-1.7b).\n\n---\n\n## Licensing\n\n| Component | License | Commercial Use |\n|-----------|---------|----------------|\n| **Model Weights** | Apache 2.0 | ✅ Free to use |\n| **Training Methodology** | Proprietary | ⚠️ **Commercial License Required** |\n\n### Model Weights (Apache 2.0)\nThe model weights are open source under Apache 2.0. 
You may use, modify, and distribute them freely.\n\n### Training Methodology (Commercial License Required)\nThe training methodology and cognitive framework used to create this model are proprietary to Hitonet.\n\n**Commercial use of the training methodology requires a license.**\n\n**Attribution is mandatory** when using this model or discussing its capabilities.\n\nFor commercial licensing inquiries: **legal@hitonet.com**\n\n---\n\n<div align=\"center\">\n<b>Made with genuine curiosity by Hitonet</b>\n</div>\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "qwen3",
    "fine-tuned",
    "hito",
    "hitonet",
    "reasoning",
    "thinking",
    "llama-cpp",
    "ollama",
    "conversational",
    "text-generation",
    "en",
    "base_model:hitonet/hito-1.7b",
    "base_model:quantized:hitonet/hito-1.7b",
    "license:apache-2.0",
    "endpoints_compatible",
    "region:us"
  ],
  "likes": 1,
  "downloads": 153,
  "gated": false,
  "private": false,
  "last_modified": "2025-12-11T19:54:33.000Z",
  "created_at": "2025-12-04T08:49:32.000Z",
  "pipeline_tag": "text-generation",
  "library_name": ""
}
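The top-level `tags` list above mixes plain labels (`gguf`, `qwen3`) with namespaced entries such as `license:apache-2.0` and `base_model:quantized:hitonet/hito-1.7b`. A small sketch of one way to separate the two kinds is shown here; the `split_tags` helper is hypothetical, and splitting on the first colon only is an assumption about how these tags are meant to be read.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# Tags copied from the normalized metadata.
TAGS = [
    "gguf", "qwen3", "fine-tuned", "hito", "hitonet", "reasoning",
    "thinking", "llama-cpp", "ollama", "conversational",
    "text-generation", "en",
    "base_model:hitonet/hito-1.7b",
    "base_model:quantized:hitonet/hito-1.7b",
    "license:apache-2.0",
    "endpoints_compatible",
    "region:us",
]


def split_tags(tags: List[str]) -> Tuple[List[str], Dict[str, List[str]]]:
    """Separate plain tags from 'namespace:value' tags.

    Only the first colon is treated as the separator, so
    'base_model:quantized:hitonet/hito-1.7b' keeps 'quantized:...' as its value.
    """
    plain: List[str] = []
    namespaced: Dict[str, List[str]] = defaultdict(list)
    for tag in tags:
        key, sep, value = tag.partition(":")
        if sep:
            namespaced[key].append(value)
        else:
            plain.append(tag)
    return plain, dict(namespaced)


plain, namespaced = split_tags(TAGS)
print(namespaced["license"])     # license values found in the tag list
print(namespaced["base_model"])  # base-model references, including the quantized one
```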
Source payload excerpt (from Hugging Face API)
{
  "_id": "69314b1c30dc4e50d14c1f0e",
  "id": "hitonet/hito-1.7b-GGUF",
  "modelId": "hitonet/hito-1.7b-GGUF",
  "sha": "01aa2127f8b5c0e18f19c78bf6b1d3ae7f8969cc",
  "createdAt": "2025-12-04T08:49:32.000Z",
  "lastModified": "2025-12-11T19:54:33.000Z",
  "author": "hitonet",
  "downloads": 153,
  "likes": 1,
  "gated": false,
  "private": false,
  "pipeline_tag": "text-generation",
  "library_name": "",
  "siblings_count": 15
}
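The `createdAt` and `lastModified` fields in the payload above are UTC timestamps with millisecond precision. The sketch below shows one way to parse them and compute the time between repository creation and the last update; the helper name and the format string are assumptions based only on the two values shown here.

```python
from datetime import datetime, timezone

# Assumed format, inferred from the two timestamps in the payload excerpt.
HF_TIMESTAMP_FORMAT = "%Y-%m-%dT%H:%M:%S.%fZ"


def parse_hf_timestamp(value: str) -> datetime:
    """Parse a timestamp such as '2025-12-04T08:49:32.000Z' as timezone-aware UTC."""
    return datetime.strptime(value, HF_TIMESTAMP_FORMAT).replace(tzinfo=timezone.utc)


created = parse_hf_timestamp("2025-12-04T08:49:32.000Z")
modified = parse_hf_timestamp("2025-12-11T19:54:33.000Z")

# Time between repository creation and the last recorded modification.
age = modified - created
print(f"Last modified {age.days} days after creation")
```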