duyntnet/nexusraven-v2-13b-imatrix-gguf IQ2_XS GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.

Model Intelligence Sheet

duyntnet/nexusraven-v2-13b-imatrix-gguf overview

Quickstart You can run the model on a GPU using the following code. This should generate the following: If you would like to prevent the generation of the explanation of the function call (for example, to save on inference tokens), please set a stopping criteria of \. Please follow this prompting template to maximize the performance of RavenV2. ### Using with OpenAI FC Schematics If you currently have a workflow that is built around OpenAI's function calling and you want to try NexusRaven-V2, we have a package that helps you drop in NexusRaven-V2. ### Using With LangChain We've also included a small demo for using Raven with langchain!

transformersggufimatrixNexusRaven-V2-13Btext-generationenlicense:otherregion:us

duyntnet/nexusraven-v2-13b-imatrix-gguf visual

Downloads

172

Likes

Pipeline

text-generation

Library

transformers

Visibility

Public

Access

Open

Repository Files & Downloads

27 files detected

Direct downloads for all repository files

File	Type	Quantization	Size	Link
NexusRaven-V2-13B-IQ1_M.gguf	GGUF	IQ1_M	2.92 GB	Download
NexusRaven-V2-13B-IQ1_S.gguf	GGUF	IQ1_S	2.70 GB	Download
NexusRaven-V2-13B-IQ2_M.gguf	GGUF	IQ2_M	4.21 GB	Download
NexusRaven-V2-13B-IQ2_S.gguf	GGUF	IQ2_S	3.91 GB	Download
NexusRaven-V2-13B-IQ2_XS.gguf	GGUF	IQ2_XS	3.62 GB	Download
NexusRaven-V2-13B-IQ2_XXS.gguf	GGUF	IQ2_XXS	3.30 GB	Download
NexusRaven-V2-13B-IQ3_M.gguf	GGUF	IQ3_M	5.57 GB	Download
NexusRaven-V2-13B-IQ3_S.gguf	GGUF	IQ3_S	5.27 GB	Download
NexusRaven-V2-13B-IQ3_XS.gguf	GGUF	IQ3_XS	4.99 GB	Download
NexusRaven-V2-13B-IQ3_XXS.gguf	GGUF	IQ3_XXS	4.62 GB	Download
NexusRaven-V2-13B-IQ4_NL.gguf	GGUF	IQ4_NL	6.86 GB	Download
NexusRaven-V2-13B-IQ4_XS.gguf	GGUF	IQ4_XS	6.49 GB	Download
NexusRaven-V2-13B-Q2_K.gguf	GGUF	Q2_K	4.52 GB	Download
NexusRaven-V2-13B-Q2_K_S.gguf	GGUF	Q2_K_S	4.13 GB	Download
NexusRaven-V2-13B-Q3_K_L.gguf	GGUF	Q3_K_L	6.45 GB	Download
NexusRaven-V2-13B-Q3_K_M.gguf	GGUF	Q3_K_M	5.90 GB	Download
NexusRaven-V2-13B-Q3_K_S.gguf	GGUF	Q3_K_S	5.27 GB	Download
NexusRaven-V2-13B-Q4_0.gguf	GGUF	—	6.88 GB	Download
NexusRaven-V2-13B-Q4_1.gguf	GGUF	—	7.61 GB	Download
NexusRaven-V2-13B-Q4_K_M.gguf	GGUF	Q4_K_M	7.33 GB	Download
NexusRaven-V2-13B-Q4_K_S.gguf	GGUF	Q4_K_S	6.91 GB	Download
NexusRaven-V2-13B-Q5_0.gguf	GGUF	—	8.38 GB	Download
NexusRaven-V2-13B-Q5_1.gguf	GGUF	—	9.10 GB	Download
NexusRaven-V2-13B-Q5_K_M.gguf	GGUF	Q5_K_M	8.60 GB	Download
NexusRaven-V2-13B-Q5_K_S.gguf	GGUF	Q5_K_S	8.36 GB	Download
NexusRaven-V2-13B-Q6_K.gguf	GGUF	Q6_K	9.95 GB	Download
NexusRaven-V2-13B-Q8_0.gguf	GGUF	—	12.88 GB	Download

Model Details Live

Model Slug

duyntnet/nexusraven-v2-13b-imatrix-gguf

Author

duyntnet

Pipeline Task

text-generation

Library

transformers

Created

2024-05-09

Last Modified

2024-05-09

Gated

Private

HF SHA

63f87df681eab13f3fc0446a38a70574529f3068

License

other

Language

Base Model

Unknown

Metadata Inspector

Normalized metadata (stored in metadata_json)

{
  "metadata": {},
  "card_data": {
    "license": "other",
    "language": [
      "en"
    ],
    "pipeline_tag": "text-generation",
    "inference": false,
    "tags": [
      "transformers",
      "gguf",
      "imatrix",
      "NexusRaven-V2-13B"
    ],
    "frontmatter": {
      "license": "other",
      "language": [
        "en"
      ],
      "pipeline_tag": "text-generation",
      "inference": "false",
      "tags": [
        "transformers",
        "gguf",
        "imatrix",
        "NexusRaven-V2-13B"
      ]
    },
    "hero_image_url": "",
    "summary": "### Quickstart You can run the model on a GPU using the following code. ``python # Please pip install transformers accelerate from transformers import pipeline pipeline = pipeline( \"text-generation\", model=\"Nexusflow/NexusRaven-V2-13B\", torch_dtype=\"auto\", device_map=\"auto\", ) prompt_template = \\ ''' Function: def get_weather_data(coordinates): \"\"\" Fetches weather data from the Open-Meteo API for the given latitude and longitude. Args: coordinates (tuple): The latitude of the location. Returns: float: The current temperature in the coordinates you've asked for \"\"\" Function: def get_coordinates_from_city(city_name): \"\"\" Fetches the latitude and longitude of a given city name using the Maps.co Geocoding API. Args: city_name (str): The name of the city. Returns: tuple: The latitude and longitude of the city. \"\"\" User Query: {query} ''' prompt = prompt_template.format(query=\"What's the weather like in Seattle right now?\") result = pipeline(prompt, max_new_tokens=2048, return_full_text=False, do_sample=False, temperature=0.001)[0][\"generated_text\"] print (result) ` This should generate the following: ` Call: get_weather_data(coordinates=get_coordinates_from_city(city_name='Seattle')) Thought: The function call get_weather_data(coordinates=get_coordinates_from_city(city_name='Seattle')) answers the question \"What's the weather like in Seattle right now?\" by following these steps: 1. get_coordinates_from_city(city_name='Seattle'): This function call fetches the latitude and longitude of the city \"Seattle\" using the Maps.co Geocoding API. 2. get_weather_data(coordinates=...): This function call fetches the current weather data for the coordinates returned by the previous function call. Therefore, the function call get_weather_data(coordinates=get_coordinates_from_city(city_name='Seattle')) answers the question \"What's the weather like in Seattle right now?\" by first fetching the coordinates of the city \"Seattle\" and then fetching the current weather data for those coordinates. `` If you would like to prevent the generation of the explanation of the function call (for example, to save on inference tokens), please set a stopping criteria of \\. Please follow this prompting template to maximize the performance of RavenV2. ### Using with OpenAI FC Schematics If you currently have a workflow that is built around OpenAI's function calling and you want to try NexusRaven-V2, we have a package that helps you drop in NexusRaven-V2. ### Using With LangChain We've also included a small demo for using Raven with langchain!",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: other\nlanguage:\n- en\npipeline_tag: text-generation\ninference: false\ntags:\n- transformers\n- gguf\n- imatrix\n- NexusRaven-V2-13B\n---\nQuantizations of https://huggingface.co/Nexusflow/NexusRaven-V2-13B\n\n# From original readme\n\n### Quickstart\nYou can run the model on a GPU using the following code. \n```python\n# Please `pip install transformers accelerate`\nfrom transformers import pipeline\n\n\npipeline = pipeline(\n    \"text-generation\",\n    model=\"Nexusflow/NexusRaven-V2-13B\",\n    torch_dtype=\"auto\",\n    device_map=\"auto\",\n)\n\nprompt_template = \\\n'''\nFunction:\ndef get_weather_data(coordinates):\n    \"\"\"\n    Fetches weather data from the Open-Meteo API for the given latitude and longitude.\n\n    Args:\n    coordinates (tuple): The latitude of the location.\n\n    Returns:\n    float: The current temperature in the coordinates you've asked for\n    \"\"\"\n\nFunction:\ndef get_coordinates_from_city(city_name):\n    \"\"\"\n    Fetches the latitude and longitude of a given city name using the Maps.co Geocoding API.\n\n    Args:\n    city_name (str): The name of the city.\n\n    Returns:\n    tuple: The latitude and longitude of the city.\n    \"\"\"\n\nUser Query: {query}<human_end>\n\n'''\n\nprompt = prompt_template.format(query=\"What's the weather like in Seattle right now?\")\n\nresult = pipeline(prompt, max_new_tokens=2048, return_full_text=False, do_sample=False, temperature=0.001)[0][\"generated_text\"]\nprint (result)\n```\n\nThis should generate the following:\n```\nCall: get_weather_data(coordinates=get_coordinates_from_city(city_name='Seattle'))<bot_end>\nThought: The function call `get_weather_data(coordinates=get_coordinates_from_city(city_name='Seattle'))` answers the question \"What's the weather like in Seattle right now?\" by following these steps:\n\n1. `get_coordinates_from_city(city_name='Seattle')`: This function call fetches the latitude and longitude of the city \"Seattle\" using the Maps.co Geocoding API.\n2. `get_weather_data(coordinates=...)`: This function call fetches the current weather data for the coordinates returned by the previous function call.\n\nTherefore, the function call `get_weather_data(coordinates=get_coordinates_from_city(city_name='Seattle'))` answers the question \"What's the weather like in Seattle right now?\" by first fetching the coordinates of the city \"Seattle\" and then fetching the current weather data for those coordinates.\n```\n\nIf you would like to prevent the generation of the explanation of the function call (for example, to save on inference tokens), please set a stopping criteria of \\<bot_end\\>. \n\nPlease follow this prompting template to maximize the performance of RavenV2. \n\n### Using with OpenAI FC Schematics\n\n[If you currently have a workflow that is built around OpenAI's function calling and you want to try NexusRaven-V2, we have a package that helps you drop in NexusRaven-V2.](https://github.com/nexusflowai/nexusraven-pip)\n\n### Using With LangChain\n\nWe've also included a [small demo for using Raven with langchain](langdemo.py)! ",
    "related_quantizations": []
  },
  "tags": [
    "transformers",
    "gguf",
    "imatrix",
    "NexusRaven-V2-13B",
    "text-generation",
    "en",
    "license:other",
    "region:us"
  ],
  "likes": 2,
  "downloads": 172,
  "gated": false,
  "private": false,
  "last_modified": "2024-05-09T09:17:24.000Z",
  "created_at": "2024-05-09T04:48:16.000Z",
  "pipeline_tag": "text-generation",
  "library_name": "transformers"
}

Source payload excerpt (from Hugging Face API)

{
  "_id": "663c55907d5f1fbe088db69e",
  "id": "duyntnet/NexusRaven-V2-13B-imatrix-GGUF",
  "modelId": "duyntnet/NexusRaven-V2-13B-imatrix-GGUF",
  "sha": "63f87df681eab13f3fc0446a38a70574529f3068",
  "createdAt": "2024-05-09T04:48:16.000Z",
  "lastModified": "2024-05-09T09:17:24.000Z",
  "author": "duyntnet",
  "downloads": 172,
  "likes": 2,
  "gated": false,
  "private": false,
  "pipeline_tag": "text-generation",
  "library_name": "transformers",
  "siblings_count": 29
}

duyntnet/nexusraven-v2-13b-imatrix-gguf overview

Repository Files & Downloads

Model Details Live

Metadata Inspector

More models in this shard