duyntnet/llama-3-8b-synthia-v3.5-imatrix-gguf IQ3_XXS GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

duyntnet/llama-3-8b-synthia-v3.5-imatrix-gguf overview

Comprehensive model page for duyntnet/llama-3-8b-synthia-v3.5-imatrix-gguf

transformersggufimatrixLlama-3-8B-Synthia-v3.5text-generationenlicense:otherregion:usconversational

duyntnet/llama-3-8b-synthia-v3.5-imatrix-gguf visual

Downloads

Likes

Pipeline

text-generation

Library

transformers

Visibility

Public

Access

Open

Repository Files & Downloads

27 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
Llama-3-8B-Synthia-v3.5-IQ1_M.gguf	GGUF	IQ1_M	2.01 GB	Download
Llama-3-8B-Synthia-v3.5-IQ1_S.gguf	GGUF	IQ1_S	1.88 GB	Download
Llama-3-8B-Synthia-v3.5-IQ2_M.gguf	GGUF	IQ2_M	2.75 GB	Download
Llama-3-8B-Synthia-v3.5-IQ2_S.gguf	GGUF	IQ2_S	2.57 GB	Download
Llama-3-8B-Synthia-v3.5-IQ2_XS.gguf	GGUF	IQ2_XS	2.43 GB	Download
Llama-3-8B-Synthia-v3.5-IQ2_XXS.gguf	GGUF	IQ2_XXS	2.23 GB	Download
Llama-3-8B-Synthia-v3.5-IQ3_M.gguf	GGUF	IQ3_M	3.52 GB	Download
Llama-3-8B-Synthia-v3.5-IQ3_S.gguf	GGUF	IQ3_S	3.43 GB	Download
Llama-3-8B-Synthia-v3.5-IQ3_XS.gguf	GGUF	IQ3_XS	3.28 GB	Download
Llama-3-8B-Synthia-v3.5-IQ3_XXS.gguf	GGUF	IQ3_XXS	3.05 GB	Download
Llama-3-8B-Synthia-v3.5-IQ4_NL.gguf	GGUF	IQ4_NL	4.36 GB	Download
Llama-3-8B-Synthia-v3.5-IQ4_XS.gguf	GGUF	IQ4_XS	4.14 GB	Download
Llama-3-8B-Synthia-v3.5-Q2_K.gguf	GGUF	Q2_K	2.96 GB	Download
Llama-3-8B-Synthia-v3.5-Q2_K_S.gguf	GGUF	Q2_K_S	2.78 GB	Download
Llama-3-8B-Synthia-v3.5-Q3_K_L.gguf	GGUF	Q3_K_L	4.03 GB	Download
Llama-3-8B-Synthia-v3.5-Q3_K_M.gguf	GGUF	Q3_K_M	3.74 GB	Download
Llama-3-8B-Synthia-v3.5-Q3_K_S.gguf	GGUF	Q3_K_S	3.41 GB	Download
Llama-3-8B-Synthia-v3.5-Q4_0.gguf	GGUF	—	4.35 GB	Download
Llama-3-8B-Synthia-v3.5-Q4_1.gguf	GGUF	—	4.78 GB	Download
Llama-3-8B-Synthia-v3.5-Q4_K_M.gguf	GGUF	Q4_K_M	4.58 GB	Download
Llama-3-8B-Synthia-v3.5-Q4_K_S.gguf	GGUF	Q4_K_S	4.37 GB	Download
Llama-3-8B-Synthia-v3.5-Q5_0.gguf	GGUF	—	5.23 GB	Download
Llama-3-8B-Synthia-v3.5-Q5_1.gguf	GGUF	—	5.65 GB	Download
Llama-3-8B-Synthia-v3.5-Q5_K_M.gguf	GGUF	Q5_K_M	5.34 GB	Download
Llama-3-8B-Synthia-v3.5-Q5_K_S.gguf	GGUF	Q5_K_S	5.21 GB	Download
Llama-3-8B-Synthia-v3.5-Q6_K.gguf	GGUF	Q6_K	6.14 GB	Download
Llama-3-8B-Synthia-v3.5-Q8_0.gguf	GGUF	—	7.95 GB	Download

Model Details Live

Model Slug

duyntnet/llama-3-8b-synthia-v3.5-imatrix-gguf

Author

duyntnet

Pipeline Task

text-generation

Library

transformers

Created

2024-06-05

Last Modified

2024-06-06

Gated

Private

HF SHA

2885ddc86b5ca4b63e4075bbb7e8f270cdda5c7f

License

other

Language

Base Model

Unknown

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "license": "other",
    "language": [
      "en"
    ],
    "pipeline_tag": "text-generation",
    "inference": false,
    "tags": [
      "transformers",
      "gguf",
      "imatrix",
      "Llama-3-8B-Synthia-v3.5"
    ],
    "frontmatter": {
      "license": "other",
      "language": [
        "en"
      ],
      "pipeline_tag": "text-generation",
      "inference": "false",
      "tags": [
        "transformers",
        "gguf",
        "imatrix",
        "Llama-3-8B-Synthia-v3.5"
      ]
    },
    "hero_image_url": "",
    "summary": "",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: other\nlanguage:\n- en\npipeline_tag: text-generation\ninference: false\ntags:\n- transformers\n- gguf\n- imatrix\n- Llama-3-8B-Synthia-v3.5\n---\nQuantizations of https://huggingface.co/migtissera/Llama-3-8B-Synthia-v3.5\n\n\n# From original readme\n\n## Sample code to run inference\n\n```python\nimport torch, json\nfrom transformers import AutoModelForCausalLM, AutoTokenizer\n\nmodel_path = \"/home/migel/Tess-2.0-Llama-3-8B\"\noutput_file_path = \"/home/migel/conversations.jsonl\"\n\nmodel = AutoModelForCausalLM.from_pretrained(\n    model_path,\n    torch_dtype=torch.float16,\n    device_map=\"auto\",\n    load_in_4bit=False,\n    trust_remote_code=False,\n)\n\ntokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)\n\ndef generate_text(instruction):\n    tokens = tokenizer.encode(instruction)\n    tokens = torch.LongTensor(tokens).unsqueeze(0)\n    tokens = tokens.to(\"cuda\")\n\n    instance = {\n        \"input_ids\": tokens,\n        \"top_p\": 1.0,\n        \"temperature\": 0.75,\n        \"generate_len\": 1024,\n        \"top_k\": 50,\n    }\n\n    length = len(tokens[0])\n    with torch.no_grad():\n        rest = model.generate(\n            input_ids=tokens,\n            max_length=length + instance[\"generate_len\"],\n            use_cache=True,\n            do_sample=True,\n            top_p=instance[\"top_p\"],\n            temperature=instance[\"temperature\"],\n            top_k=instance[\"top_k\"],\n            num_return_sequences=1,\n            pad_token_id=tokenizer.eos_token_id,\n        )\n    output = rest[0][length:]\n    string = tokenizer.decode(output, skip_special_tokens=True)\n    return f\"{string}\"\n\nconversation = \"\"\"<|begin_of_text|><|start_header_id|>system<|end_header_id|>\\n\\nYou are Synthia, a helful, female AI assitant. You always provide detailed answers without hesitation.<|eot_id|><|start_header_id|>user<|end_header_id|>\\n\\n\"\"\"\n\nwhile True:\n    user_input = input(\"You: \")\n    llm_prompt = f\"{conversation}{user_input}<|eot_id|><|start_header_id|>assistant<|end_header_id|>\\n\\n\"\n    answer = generate_text(llm_prompt)\n    print(answer)\n\n    conversation = f\"{llm_prompt}{answer}<|eot_id|><|start_header_id|>user<|end_header_id|>\\n\\n\"\n\n    json_data = {\"prompt\": user_input, \"answer\": answer}\n\n    with open(output_file_path, \"a\") as output_file:\n        output_file.write(json.dumps(json_data) + \"\\n\")\n```",
    "related_quantizations": []
  },
  "tags": [
    "transformers",
    "gguf",
    "imatrix",
    "Llama-3-8B-Synthia-v3.5",
    "text-generation",
    "en",
    "license:other",
    "region:us",
    "conversational"
  ],
  "likes": 0,
  "downloads": 83,
  "gated": false,
  "private": false,
  "last_modified": "2024-06-06T02:38:19.000Z",
  "created_at": "2024-06-05T22:46:49.000Z",
  "pipeline_tag": "text-generation",
  "library_name": "transformers"
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "6660ead9f57dbfbdc33ce476",
  "id": "duyntnet/Llama-3-8B-Synthia-v3.5-imatrix-GGUF",
  "modelId": "duyntnet/Llama-3-8B-Synthia-v3.5-imatrix-GGUF",
  "sha": "2885ddc86b5ca4b63e4075bbb7e8f270cdda5c7f",
  "createdAt": "2024-06-05T22:46:49.000Z",
  "lastModified": "2024-06-06T02:38:19.000Z",
  "author": "duyntnet",
  "downloads": 83,
  "likes": 0,
  "gated": false,
  "private": false,
  "pipeline_tag": "text-generation",
  "library_name": "transformers",
  "siblings_count": 29
}

duyntnet/llama-3-8b-synthia-v3.5-imatrix-gguf overview

Repository Files & Downloads

Model Details Live

Metadata Inspector

More models in this shard