GraySoft

daksh-neo/grpo-tax-qwen-1.5b-gguf is indexed on GraySoft with repository links, GGUF quant files (Q4_K_M and Q8_0), and Hugging Face metadata. Use this page to pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

daksh-neo/grpo-tax-qwen-1.5b-gguf overview

Built with NEO: Your Autonomous AI Agent.
GGUF quantized versions of Qwen2.5-1.5B-Instruct, fine-tuned with GRPO (Group Relative Policy Optimization) on tax and financial reasoning tasks.
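For readers unfamiliar with GRPO, the core idea is that several responses are sampled for the same prompt and each one is scored relative to the rest of its group, which avoids training a separate value (critic) model. The sketch below shows only that group-relative normalization step, following the commonly published formulation. It is not the author's training code, and the reward values are made up for illustration.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize each reward against the mean and spread of its own group."""
    mu = mean(rewards)
    # Population standard deviation is an assumption here; some
    # implementations use the sample standard deviation instead.
    sigma = pstdev(rewards)
    # eps guards against division by zero when all rewards are equal.
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical scores for four sampled answers to one tax question.
rewards = [1.0, 0.0, 0.5, 0.5]
advantages = group_relative_advantages(rewards)
print([round(a, 3) for a in advantages])  # [1.414, -1.414, 0.0, 0.0]
```

Answers that score above the group mean get a positive advantage and are reinforced; answers below the mean are discouraged.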

Tags: gguf, qwen2, grpo, tax, finance, fine-tuned, text-generation, en, base_model:Qwen/Qwen2.5-1.5B-Instruct, base_model:quantized:Qwen/Qwen2.5-1.5B-Instruct, license:apache-2.0, endpoints_compatible, region:us, conversational
Downloads: 121
Likes: 0
Pipeline: text-generation
Library: (not specified)
Visibility: Public
Access: Open

Repository Files & Downloads

2 files detected
Direct downloads for all repository files
File Type Quantization Size Link
grpo-tax-qwen-1.5b-Q4_K_M.gguf GGUF Q4_K_M 940.37 MB Download
grpo-tax-qwen-1.5b-Q8_0.gguf GGUF Q8_0 1.53 GB Download
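If you prefer to script the download rather than click through, the following sketch builds direct URLs for the two files listed above. It assumes Hugging Face's general `/resolve/<revision>/<filename>` URL convention and pins the revision to the HF SHA shown on this page. Whether the Download links on this page point to the same URLs is an assumption, and `resolve_url` is a hypothetical helper name.

```python
from urllib.parse import quote

REPO_ID = "daksh-neo/grpo-tax-qwen-1.5b-gguf"
# HF SHA listed in the model details; pinning it keeps downloads reproducible.
REVISION = "9d1c4d8442568934b268a1df21ab9673cb27b42a"

# Quantization label -> file name, as listed in the repository files table.
FILES = {
    "Q4_K_M": "grpo-tax-qwen-1.5b-Q4_K_M.gguf",
    "Q8_0": "grpo-tax-qwen-1.5b-Q8_0.gguf",
}


def resolve_url(quant, revision=REVISION):
    """Return a direct download URL for the given quantization (hypothetical helper)."""
    filename = FILES[quant]  # raises KeyError for an unknown quantization
    return (
        f"https://huggingface.co/{REPO_ID}/resolve/"
        f"{quote(revision, safe='')}/{quote(filename)}"
    )


for quant in FILES:
    print(quant, resolve_url(quant))
```

Passing `revision="main"` instead would follow the latest commit, at the cost of reproducibility.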

Model Details

Model Slug: daksh-neo/grpo-tax-qwen-1.5b-gguf
Author: daksh-neo
Pipeline Task: text-generation
Library: (not specified)
Created: 2026-03-31
Last Modified: 2026-04-10
Gated: No
Private: No
HF SHA: 9d1c4d8442568934b268a1df21ab9673cb27b42a
License: apache-2.0
Language: en
Base Model: Qwen/Qwen2.5-1.5B-Instruct

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "license": "apache-2.0",
    "language": [
      "en"
    ],
    "base_model": "Qwen/Qwen2.5-1.5B-Instruct",
    "tags": [
      "gguf",
      "qwen2",
      "grpo",
      "tax",
      "finance",
      "fine-tuned"
    ],
    "pipeline_tag": "text-generation",
    "frontmatter": {
      "license": "apache-2.0",
      "language": [
        "en"
      ],
      "base_model": "Qwen/Qwen2.5-1.5B-Instruct",
      "tags": [
        "gguf",
        "qwen2",
        "grpo",
        "tax",
        "finance",
        "fine-tuned"
      ],
      "pipeline_tag": "text-generation"
    },
    "hero_image_url": "",
    "summary": "> Built with NEO — Your Autonomous AI Agent GGUF quantized versions of **Qwen2.5-1.5B-Instruct** fine-tuned with **GRPO (Group Relative Policy Optimization)** on tax and financial reasoning tasks.",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: apache-2.0\nlanguage:\n- en\nbase_model: Qwen/Qwen2.5-1.5B-Instruct\ntags:\n- gguf\n- qwen2\n- grpo\n- tax\n- finance\n- fine-tuned\npipeline_tag: text-generation\n---\n\n# grpo-tax-qwen-1.5b-GGUF\n\n> Built with [NEO — Your Autonomous AI Agent](https://heyneo.com)\n\nGGUF quantized versions of **Qwen2.5-1.5B-Instruct** fine-tuned with **GRPO (Group Relative Policy Optimization)** on tax and financial reasoning tasks.\n\n## Model Details\n\n| Property | Value |\n|----------|-------|\n| Base Model | Qwen/Qwen2.5-1.5B-Instruct |\n| Fine-tuning Method | GRPO (Group Relative Policy Optimization) |\n| Domain | Tax & Financial Reasoning |\n| Architecture | Qwen2 |\n| Context Length | 32,768 tokens |\n| Format | GGUF |\n\n## Available Quantizations\n\n| File | Quantization | Size | Use Case |\n|------|-------------|------|----------|\n| `grpo-tax-qwen-1.5b-Q4_K_M.gguf` | Q4_K_M | ~1.0 GB | Best balance of speed and quality |\n| `grpo-tax-qwen-1.5b-Q8_0.gguf` | Q8_0 | ~1.6 GB | Higher quality, more RAM required |\n\n## Usage\n\n### With llama.cpp\n\n```bash\n# Download the model\nhuggingface-cli download daksh-neo/grpo-tax-qwen-1.5b-gguf grpo-tax-qwen-1.5b-Q4_K_M.gguf\n\n# Run inference\n./llama-cli -m grpo-tax-qwen-1.5b-Q4_K_M.gguf \\\n  -p \"<|im_start|>system\\nYou are a tax expert assistant.<|im_end|>\\n<|im_start|>user\\nWhat is the standard deduction for 2024?<|im_end|>\\n<|im_start|>assistant\\n\" \\\n  -n 512 --temp 0.7\n```\n\n### With Ollama\n\n```bash\n# Create a Modelfile\ncat > Modelfile << 'EOF'\nFROM ./grpo-tax-qwen-1.5b-Q4_K_M.gguf\nTEMPLATE \"\"\"<|im_start|>system\n{{ .System }}<|im_end|>\n<|im_start|>user\n{{ .Prompt }}<|im_end|>\n<|im_start|>assistant\n\"\"\"\nSYSTEM \"You are a helpful tax and financial assistant.\"\nEOF\n\nollama create grpo-tax-qwen-1.5b -f Modelfile\nollama run grpo-tax-qwen-1.5b\n```\n\n### With Python (llama-cpp-python)\n\n```python\nfrom llama_cpp import Llama\n\nllm = 
Llama.from_pretrained(\n    repo_id=\"daksh-neo/grpo-tax-qwen-1.5b-gguf\",\n    filename=\"grpo-tax-qwen-1.5b-Q4_K_M.gguf\",\n    n_ctx=4096,\n)\n\nresponse = llm.create_chat_completion(\n    messages=[\n        {\"role\": \"system\", \"content\": \"You are a helpful tax assistant.\"},\n        {\"role\": \"user\", \"content\": \"Explain what a W-2 form is.\"}\n    ]\n)\nprint(response[\"choices\"][0][\"message\"][\"content\"])\n```\n\n## Training Details\n\nThis model was fine-tuned using GRPO (Group Relative Policy Optimization), a reinforcement learning from human feedback (RLHF) variant that optimizes the model's responses on tax and financial reasoning tasks without requiring a separate reward model. GRPO trains by comparing groups of sampled responses and reinforcing higher-quality answers.\n\n**Training focus areas:**\n- Federal and state tax regulations\n- Tax form interpretation (W-2, 1099, Schedule C, etc.)\n- Deductions and credits\n- Tax planning strategies\n- Financial compliance questions\n\n## Limitations\n\n- This model is fine-tuned on tax knowledge up to its training cutoff and may not reflect the latest tax law changes.\n- Always consult a qualified tax professional for official tax advice.\n- The model is not a substitute for professional legal or financial guidance.\n\n## Related Models\n\n- [daksh-neo/grpo-tax-qwen-3b-gguf](https://huggingface.co/daksh-neo/grpo-tax-qwen-3b-gguf) — Larger 3B version with higher accuracy\n\n## License\n\nApache 2.0 — see [Qwen2.5 license](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct/blob/main/LICENSE) for base model terms.\n\n---\n\n<div align=\"center\">\nBuilt with <a href=\"https://heyneo.com\">NEO</a> — Your Autonomous AI Agent\n</div>\n",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "qwen2",
    "grpo",
    "tax",
    "finance",
    "fine-tuned",
    "text-generation",
    "en",
    "base_model:Qwen/Qwen2.5-1.5B-Instruct",
    "base_model:quantized:Qwen/Qwen2.5-1.5B-Instruct",
    "license:apache-2.0",
    "endpoints_compatible",
    "region:us",
    "conversational"
  ],
  "likes": 0,
  "downloads": 121,
  "gated": false,
  "private": false,
  "last_modified": "2026-04-10T06:06:50.000Z",
  "created_at": "2026-03-31T11:45:52.000Z",
  "pipeline_tag": "text-generation",
  "library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
  "_id": "69cbb3f091619429e24826fb",
  "id": "daksh-neo/grpo-tax-qwen-1.5b-gguf",
  "modelId": "daksh-neo/grpo-tax-qwen-1.5b-gguf",
  "sha": "9d1c4d8442568934b268a1df21ab9673cb27b42a",
  "createdAt": "2026-03-31T11:45:52.000Z",
  "lastModified": "2026-04-10T06:06:50.000Z",
  "author": "daksh-neo",
  "downloads": 121,
  "likes": 0,
  "gated": false,
  "private": false,
  "pipeline_tag": "text-generation",
  "library_name": "",
  "siblings_count": 4
}
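The payload fields above can be read with the standard library alone. The sketch below parses a subset of the excerpt and computes how long the repository was active between creation and last modification. The helper name `parse_hf_timestamp` is hypothetical, and the `Z` suffix is rewritten to `+00:00` so the code also runs on Python versions older than 3.11.

```python
import json
from datetime import datetime

# Subset of the source payload excerpt shown above.
payload = json.loads("""
{
  "id": "daksh-neo/grpo-tax-qwen-1.5b-gguf",
  "sha": "9d1c4d8442568934b268a1df21ab9673cb27b42a",
  "createdAt": "2026-03-31T11:45:52.000Z",
  "lastModified": "2026-04-10T06:06:50.000Z",
  "downloads": 121,
  "gated": false,
  "private": false
}
""")


def parse_hf_timestamp(value):
    """Parse an ISO 8601 UTC timestamp such as '2026-03-31T11:45:52.000Z'."""
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


created = parse_hf_timestamp(payload["createdAt"])
modified = parse_hf_timestamp(payload["lastModified"])
age = modified - created

print(f"{payload['id']} was last modified {age.days} days after creation")
print("openly downloadable:", not payload["gated"] and not payload["private"])
```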