inferenceillusionist/euryale-1.3-longlora-70b-rope8-32k-imat-gguf IQ2_XXS GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

inferenceillusionist/euryale-1.3-longlora-70b-rope8-32k-imat-gguf overview

Special request. Quantized from fp16 with love. Please note I have not tested context to the full 32k, but these quants have all passed the standard suite of coherence and KL-divergence benchmark tests. Any feedback is welcomed. * Quantizations made possible using .imatrix file from this repo (special thanks to ikawrakow again) For a brief rundown of iMatrix quant performance please see this PR All quants are verified working prior to uploading to repo for your safety and convenience. Importance matrix quantizations are a work in progress, IQ3 and above is recommended for best results. Tip: Pick a size that can fit in your GPU while still allowing some room for context for best speed. You may need to pad this further depending on if you are running image gen or TTS as well. Original model card can be found here

ggufmergestorywritingtext adventureiMatendpoints_compatibleregion:us

inferenceillusionist/euryale-1.3-longlora-70b-rope8-32k-imat-gguf visual

Downloads

421

Likes

Pipeline

—

Library

—

Visibility

Public

Access

Open

Repository Files & Downloads

16 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
Euryale-1.3-longLORA-70b-rope8-32k-iMat-IQ1_M.gguf	GGUF	IQ1_M	14.85 GB	Download
Euryale-1.3-longLORA-70b-rope8-32k-iMat-IQ2_M.gguf	GGUF	IQ2_M	21.64 GB	Download
Euryale-1.3-longLORA-70b-rope8-32k-iMat-IQ2_S.gguf	GGUF	IQ2_S	19.89 GB	Download
Euryale-1.3-longLORA-70b-rope8-32k-iMat-IQ2_XS.gguf	GGUF	IQ2_XS	18.94 GB	Download
Euryale-1.3-longLORA-70b-rope8-32k-iMat-IQ2_XXS.gguf	GGUF	IQ2_XXS	17.03 GB	Download
Euryale-1.3-longLORA-70b-rope8-32k-iMat-IQ3_M.gguf	GGUF	IQ3_M	28.82 GB	Download
Euryale-1.3-longLORA-70b-rope8-32k-iMat-IQ3_S.gguf	GGUF	IQ3_S	27.86 GB	Download
Euryale-1.3-longLORA-70b-rope8-32k-iMat-IQ3_XS.gguf	GGUF	IQ3_XS	26.37 GB	Download
Euryale-1.3-longLORA-70b-rope8-32k-iMat-IQ3_XXS.gguf	GGUF	IQ3_XXS	24.76 GB	Download
Euryale-1.3-longLORA-70b-rope8-32k-iMat-IQ4_XS.gguf	GGUF	IQ4_XS	34.30 GB	Download
Euryale-1.3-longLORA-70b-rope8-32k-iMat-Q2_K.gguf	GGUF	Q2_K	23.71 GB	Download
Euryale-1.3-longLORA-70b-rope8-32k-iMat-Q3_K_M.gguf	GGUF	Q3_K_M	30.99 GB	Download
Euryale-1.3-longLORA-70b-rope8-32k-iMat-Q4_K_M.gguf	GGUF	Q4_K_M	38.58 GB	Download
Euryale-1.3-longLORA-70b-rope8-32k-iMat-Q4_K_S.gguf	GGUF	Q4_K_S	36.55 GB	Download
Euryale-1.3-longLORA-70b-rope8-32k-iMat-Q5_K_M.gguf	GGUF	Q5_K_M	45.41 GB	Download
Euryale-1.3-longLORA-70b-rope8-32k-iMat-Q5_K_S.gguf	GGUF	Q5_K_S	44.20 GB	Download

Model Details Live

Model Slug

inferenceillusionist/euryale-1.3-longlora-70b-rope8-32k-imat-gguf

Author

InferenceIllusionist

Pipeline Task

—

Library

—

Created

2024-04-02

Last Modified

2024-04-18

Gated

Private

HF SHA

b335e1b4071dc19a3ac2d060d0a911cf948a0f77

License

Unknown

Language

Unknown

Base Model

Unknown

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "tags": [
      "merge",
      "gguf",
      "storywriting",
      "text adventure",
      "iMat"
    ],
    "frontmatter": {
      "tags": [
        "merge",
        "gguf",
        "storywriting",
        "text adventure",
        "iMat"
      ]
    },
    "hero_image_url": "https://i.imgur.com/P68dXux.png",
    "summary": "Special request. Quantized from fp16 with love. Please note I have not tested context to the full 32k, but these quants have all passed the standard suite of coherence and KL-divergence benchmark tests. Any feedback is welcomed. * Quantizations made possible using .imatrix file from this repo (special thanks to ikawrakow again) For a brief rundown of iMatrix quant performance please see this PR All quants are verified working prior to uploading to repo for your safety and convenience.  Importance matrix quantizations are a work in progress, IQ3 and above is recommended for best results. Tip: Pick a size that can fit in your GPU while still allowing some room for context for best speed. You may need to pad this further depending on if you are running image gen or TTS as well. Original model card can be found here",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\ntags:\n- merge\n- gguf\n- storywriting\n- text adventure\n- iMat\n---\n<img src=\"https://i.imgur.com/P68dXux.png\" width=\"400\"/>\n\n# Euryale-1.3-longLORA-70b-rope8-32k-iMat-GGUF\n\n\n<b>Special request.</b> Quantized from fp16 with love. Please note I have not tested context to the full 32k, but these quants have all passed the standard suite of coherence and KL-divergence benchmark tests. Any feedback is welcomed.\n* Quantizations made possible using .imatrix file from [this](https://huggingface.co/datasets/ikawrakow/imatrix-from-wiki-train) repo (special thanks to [ikawrakow](https://huggingface.co/ikawrakow) again)\n\nFor a brief rundown of iMatrix quant performance please see this [PR](https://github.com/ggerganov/llama.cpp/pull/5747)\n\n<i>All quants are verified working prior to uploading to repo for your safety and convenience. </i>\n\nImportance matrix quantizations are a work in progress, IQ3 and above is recommended for best results. \n\n<b>Tip:</b> Pick a size that can fit in your GPU while still allowing some room for context for best speed. You may need to pad this further depending on if you are running image gen or TTS as well.\n\nOriginal model card can be found [here](https://huggingface.co/grimulkan/Euryale-1.3-longLORA-70b-rope8-32k-fp16)",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "merge",
    "storywriting",
    "text adventure",
    "iMat",
    "endpoints_compatible",
    "region:us"
  ],
  "likes": 0,
  "downloads": 421,
  "gated": false,
  "private": false,
  "last_modified": "2024-04-18T07:28:35.000Z",
  "created_at": "2024-04-02T01:28:57.000Z",
  "pipeline_tag": "",
  "library_name": ""
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "660b5f59e230cb210ae0a5d7",
  "id": "InferenceIllusionist/Euryale-1.3-longLORA-70b-rope8-32k-iMat-GGUF",
  "modelId": "InferenceIllusionist/Euryale-1.3-longLORA-70b-rope8-32k-iMat-GGUF",
  "sha": "b335e1b4071dc19a3ac2d060d0a911cf948a0f77",
  "createdAt": "2024-04-02T01:28:57.000Z",
  "lastModified": "2024-04-18T07:28:35.000Z",
  "author": "InferenceIllusionist",
  "downloads": 421,
  "likes": 0,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 18
}

inferenceillusionist/euryale-1.3-longlora-70b-rope8-32k-imat-gguf overview

Repository Files & Downloads

Model Details Live

Metadata Inspector

More models in this shard