Question 1

What is forkjoin-ai/qwen2.5-72b-instruct-gguf?

Accepted Answer

--- language: - en license: apache-2.0 library_name: gguf tags: - gguf - qwen2 - instruct - affectively - edgework - aether - distributed-inference - edge-deployment base_model: Qwen/Qwen2.5-72B-Instruct base_model_relation: quantized pipeline_tag: text-generation --- # Qwen2.5 72b Instruct (GGUF, Q4_K_M) > **Production-ready** GGUF quantization of [Qwen/Qwen2.5-72B-Instruct](https://huggingface.co/Qwen/Qwen2.5-72B-Instruct) for distributed text generation and conversation — powered by the [Aether](https://github.com/forkjoin-ai/aether) edge inference runtime on [Edgework.ai](https://edgework.ai). ## Model Details | Property | Value | |----------|-------| | Base model | [Qwen/Qwen2.5-72B-Instruct](https://huggingface.co/Qwen/Qwen2.5-72B-Instruct) | | Parameters | 72B | | Architecture | Qwen2 | | Quantization | Q4_K_M | | Format | GGUF | | Size | ~43 GB | | License | apache-2.0 | ## Usag…

Question 2

What license applies to forkjoin-ai/qwen2.5-72b-instruct-gguf?

Accepted Answer

License: apache-2.0. Verify terms on Hugging Face before commercial use.

Question 3

How do I run forkjoin-ai/qwen2.5-72b-instruct-gguf locally?

Accepted Answer

Download a GGUF file from this page and load it in guIDE or llama.cpp. Pipeline task: text-generation.

Question 4

How much VRAM or disk space does forkjoin-ai/qwen2.5-72b-instruct-gguf need?

Accepted Answer

Runs locally from ~44.16 GB disk (32 GB+ VRAM class GPUs with llama.cpp / guIDE).

forkjoin-ai/qwen2.5-72b-instruct-gguf overview

Repository Files & Downloads

Model Details

Model README

Qwen2.5 72b Instruct (GGUF, Q4_K_M)

Model Details

Usage

With llama.cpp

With Aether (Distributed Inference)

Also available: `.knot` (sovereign format)

Deployment Architecture

About

Run forkjoin-ai/qwen2.5-72b-instruct-gguf with guIDE

Model ID	forkjoin-ai/qwen2.5-72b-instruct-gguf
Author	forkjoin-ai
Pipeline	text-generation
License	apache-2.0
Base model	Qwen/Qwen2.5-72B-Instruct
Last modified	2026-06-08T21:25:26.000Z

forkjoin-ai/qwen2.5-72b-instruct-gguf overview

Repository Files & Downloads

Model Details

Model README

Qwen2.5 72b Instruct (GGUF, Q4_K_M)

Model Details

Usage

With llama.cpp

With Aether (Distributed Inference)

Also available: .knot (sovereign format)

Deployment Architecture

About

Run forkjoin-ai/qwen2.5-72b-instruct-gguf with guIDE

Also available: `.knot` (sovereign format)