duyntnet/aya-expanse-32b-imatrix-gguf Q6_K GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

duyntnet/aya-expanse-32b-imatrix-gguf overview

Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. It focuses on pairing a highly performant pre-trained Command family of models with the result of a year’s dedicated research from Cohere For AI, including data arbitrage, multilingual preference training, safety tuning, and model merging. The result is a powerful multilingual large language model serving 23 languages. We cover 23 languages: Arabic, Chinese (simplified & traditional), Czech, Dutch, English, French, German, Greek, Hebrew, Hebrew, Hindi, Indonesian, Italian, Japanese, Korean, Persian, Polish, Portuguese, Romanian, Russian, Spanish, Turkish, Ukrainian, and Vietnamese This model card corresponds to the 32-billion version of the Aya Expanse model. We also released an 8-billion version which you can find here. Try Aya Expanse Before downloading the weights, you can try out Aya Expanse (32B) in our hosted Hugging Face Space. ### Usage Please install transformers from the source repository. python # pip install 'git+https://github.com/huggingface/transformers.git' from transformers import AutoTokenizer, AutoModelForCausalLM modelid = "CohereForAI/aya-expanse-32b" tokenizer = AutoTokenizer.frompretrained(modelid) model = AutoModelForCausalLM.frompretrained(modelid) # Format message with the chat template messages = [{"role": "user", "content": "Anneme onu ne kadar sevdiğimi anlatan bir mektup yaz"}] inputids = tokenizer.applychattemplate(messages, tokenize=True, addgenerationprompt=True, return_tensors="pt")

transformersggufimatrixaya-expanse-32btext-generationenarxiv:2408.14960arxiv:2407.02552arxiv:2406.18682arxiv:2410.10801license:otherregion:usconversational

duyntnet/aya-expanse-32b-imatrix-gguf visual

Downloads

Likes

Pipeline

text-generation

Library

transformers

Visibility

Public

Access

Open

Repository Files & Downloads

27 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
aya-expanse-32b-IQ1_M.gguf	GGUF	IQ1_M	7.81 GB	Download
aya-expanse-32b-IQ1_S.gguf	GGUF	IQ1_S	7.24 GB	Download
aya-expanse-32b-IQ2_M.gguf	GGUF	IQ2_M	10.81 GB	Download
aya-expanse-32b-IQ2_S.gguf	GGUF	IQ2_S	10.05 GB	Download
aya-expanse-32b-IQ2_XS.gguf	GGUF	IQ2_XS	9.61 GB	Download
aya-expanse-32b-IQ2_XXS.gguf	GGUF	IQ2_XXS	8.77 GB	Download
aya-expanse-32b-IQ3_M.gguf	GGUF	IQ3_M	14.20 GB	Download
aya-expanse-32b-IQ3_S.gguf	GGUF	IQ3_S	13.74 GB	Download
aya-expanse-32b-IQ3_XS.gguf	GGUF	IQ3_XS	13.08 GB	Download
aya-expanse-32b-IQ3_XXS.gguf	GGUF	IQ3_XXS	12.12 GB	Download
aya-expanse-32b-IQ4_NL.gguf	GGUF	IQ4_NL	17.47 GB	Download
aya-expanse-32b-IQ4_XS.gguf	GGUF	IQ4_XS	16.60 GB	Download
aya-expanse-32b-Q2_K.gguf	GGUF	Q2_K	11.93 GB	Download
aya-expanse-32b-Q2_K_S.gguf	GGUF	Q2_K_S	11.14 GB	Download
aya-expanse-32b-Q3_K_L.gguf	GGUF	Q3_K_L	16.36 GB	Download
aya-expanse-32b-Q3_K_M.gguf	GGUF	Q3_K_M	15.12 GB	Download
aya-expanse-32b-Q3_K_S.gguf	GGUF	Q3_K_S	13.70 GB	Download
aya-expanse-32b-Q4_0.gguf	GGUF	—	17.49 GB	Download
aya-expanse-32b-Q4_1.gguf	GGUF	—	19.19 GB	Download
aya-expanse-32b-Q4_K_M.gguf	GGUF	Q4_K_M	18.44 GB	Download
aya-expanse-32b-Q4_K_S.gguf	GGUF	Q4_K_S	17.55 GB	Download
aya-expanse-32b-Q5_0.gguf	GGUF	—	21.01 GB	Download
aya-expanse-32b-Q5_1.gguf	GGUF	—	22.71 GB	Download
aya-expanse-32b-Q5_K_M.gguf	GGUF	Q5_K_M	21.47 GB	Download
aya-expanse-32b-Q5_K_S.gguf	GGUF	Q5_K_S	20.95 GB	Download
aya-expanse-32b-Q6_K.gguf	GGUF	Q6_K	24.68 GB	Download
aya-expanse-32b-Q8_0.gguf	GGUF	—	31.97 GB	Download

Model Details Live

Model Slug

duyntnet/aya-expanse-32b-imatrix-gguf

Author

duyntnet

Pipeline Task

text-generation

Library

transformers

Created

2024-10-27

Last Modified

2024-10-27

Gated

Private

HF SHA

cb7a65b19143195c0fd1fe96657a8cadc2d94986

License

other

Language

Base Model

Unknown

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "license": "other",
    "language": [
      "en"
    ],
    "pipeline_tag": "text-generation",
    "inference": false,
    "tags": [
      "transformers",
      "gguf",
      "imatrix",
      "aya-expanse-32b"
    ],
    "frontmatter": {
      "license": "other",
      "language": [
        "en"
      ],
      "pipeline_tag": "text-generation",
      "inference": "false",
      "tags": [
        "transformers",
        "gguf",
        "imatrix",
        "aya-expanse-32b"
      ]
    },
    "hero_image_url": "",
    "summary": "Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. It focuses on pairing a highly performant pre-trained Command family of models with the result of a year’s dedicated research from Cohere For AI, including data arbitrage, multilingual preference training, safety tuning, and model merging. The result is a powerful multilingual large language model serving 23 languages. We cover 23 languages: Arabic, Chinese (simplified & traditional), Czech, Dutch, English, French, German, Greek, Hebrew, Hebrew, Hindi, Indonesian, Italian, Japanese, Korean, Persian, Polish, Portuguese, Romanian, Russian, Spanish, Turkish, Ukrainian, and Vietnamese This model card corresponds to the 32-billion version of the Aya Expanse model. We also released an 8-billion version which you can find here. **Try Aya Expanse** Before downloading the weights, you can try out Aya Expanse (32B) in our hosted Hugging Face Space. ### Usage Please install transformers from the source repository. ```python # pip install 'git+https://github.com/huggingface/transformers.git' from transformers import AutoTokenizer, AutoModelForCausalLM model_id = \"CohereForAI/aya-expanse-32b\" tokenizer = AutoTokenizer.from_pretrained(model_id) model = AutoModelForCausalLM.from_pretrained(model_id) # Format message with the chat template messages = [{\"role\": \"user\", \"content\": \"Anneme onu ne kadar sevdiğimi anlatan bir mektup yaz\"}] input_ids = tokenizer.apply_chat_template(messages, tokenize=True, add_generation_prompt=True, return_tensors=\"pt\")",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: other\nlanguage:\n- en\npipeline_tag: text-generation\ninference: false\ntags:\n- transformers\n- gguf\n- imatrix\n- aya-expanse-32b\n---\nQuantizations of https://huggingface.co/CohereForAI/aya-expanse-32b\n\n\n### Inference Clients/UIs\n* [llama.cpp](https://github.com/ggerganov/llama.cpp)\n* [KoboldCPP](https://github.com/LostRuins/koboldcpp)\n* [ollama](https://github.com/ollama/ollama)\n* [text-generation-webui](https://github.com/oobabooga/text-generation-webui)\n* [GPT4All](https://github.com/nomic-ai/gpt4all)\n* [jan](https://github.com/janhq/jan)\n---\n\n# From original readme\n\nAya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. It focuses on pairing a highly performant pre-trained [Command family](https://huggingface.co/CohereForAI/c4ai-command-r-plus) of models with the result of a year’s dedicated research from [Cohere For AI](https://cohere.for.ai/), including [data arbitrage](https://arxiv.org/pdf/2408.14960), [multilingual preference training](https://arxiv.org/abs/2407.02552), [safety tuning](https://arxiv.org/abs/2406.18682), and [model merging](https://arxiv.org/abs/2410.10801). The result is a powerful multilingual large language model serving 23 languages.\n\nWe cover 23 languages: Arabic, Chinese (simplified & traditional), Czech, Dutch, English, French, German, Greek, Hebrew, Hebrew, Hindi, Indonesian, Italian, Japanese, Korean, Persian, Polish, Portuguese, Romanian, Russian, Spanish, Turkish, Ukrainian, and Vietnamese\n\nThis model card corresponds to the 32-billion version of the Aya Expanse model. We also released an 8-billion version which you can find [here](https://huggingface.co/CohereForAI/aya-expanse-8B).\n\n- Developed by: [Cohere For AI](https://cohere.for.ai/) \n- Point of Contact: Cohere For AI: [cohere.for.ai](https://cohere.for.ai/)\n- License: [CC-BY-NC](https://cohere.com/c4ai-cc-by-nc-license), requires also adhering to [C4AI's Acceptable Use Policy](https://docs.cohere.com/docs/c4ai-acceptable-use-policy)\n- Model: Aya Expanse 32B\n- Model Size: 32 billion parameters\n\n**Try Aya Expanse**\n\nBefore downloading the weights, you can try out Aya Expanse (32B) in our hosted [Hugging Face Space](https://huggingface.co/spaces/CohereForAI/aya_expanse).\n\n\n### Usage\n\nPlease install transformers from the source repository.\n\n```python\n# pip install 'git+https://github.com/huggingface/transformers.git'\nfrom transformers import AutoTokenizer, AutoModelForCausalLM\n\nmodel_id = \"CohereForAI/aya-expanse-32b\"\ntokenizer = AutoTokenizer.from_pretrained(model_id)\nmodel = AutoModelForCausalLM.from_pretrained(model_id)\n\n# Format message with the chat template\nmessages = [{\"role\": \"user\", \"content\": \"Anneme onu ne kadar sevdiğimi anlatan bir mektup yaz\"}]\ninput_ids = tokenizer.apply_chat_template(messages, tokenize=True, add_generation_prompt=True, return_tensors=\"pt\")\n## <BOS_TOKEN><|START_OF_TURN_TOKEN|><|USER_TOKEN|>Anneme onu ne kadar sevdiğimi anlatan bir mektup yaz<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>\n\ngen_tokens = model.generate(\n    input_ids, \n    max_new_tokens=100, \n    do_sample=True, \n    temperature=0.3,\n    )\n\ngen_text = tokenizer.decode(gen_tokens[0])\nprint(gen_text)\n```",
    "related_quantizations": []
  },
  "tags": [
    "transformers",
    "gguf",
    "imatrix",
    "aya-expanse-32b",
    "text-generation",
    "en",
    "arxiv:2408.14960",
    "arxiv:2407.02552",
    "arxiv:2406.18682",
    "arxiv:2410.10801",
    "license:other",
    "region:us",
    "conversational"
  ],
  "likes": 0,
  "downloads": 81,
  "gated": false,
  "private": false,
  "last_modified": "2024-10-27T14:04:25.000Z",
  "created_at": "2024-10-27T04:24:52.000Z",
  "pipeline_tag": "text-generation",
  "library_name": "transformers"
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "671dc0944dea4d6f302ff52a",
  "id": "duyntnet/aya-expanse-32b-imatrix-GGUF",
  "modelId": "duyntnet/aya-expanse-32b-imatrix-GGUF",
  "sha": "cb7a65b19143195c0fd1fe96657a8cadc2d94986",
  "createdAt": "2024-10-27T04:24:52.000Z",
  "lastModified": "2024-10-27T14:04:25.000Z",
  "author": "duyntnet",
  "downloads": 81,
  "likes": 0,
  "gated": false,
  "private": false,
  "pipeline_tag": "text-generation",
  "library_name": "transformers",
  "siblings_count": 29
}

duyntnet/aya-expanse-32b-imatrix-gguf overview

Repository Files & Downloads

Model Details Live

Metadata Inspector

More models in this shard