GraySoft
Projects Models About FAQ Contact Download guIDE →
Model Intelligence Sheet

xeyonai/mistral-helcyon-mercury-12b-v3.0-gguf overview

Model Name: helcyon-mercury-12b-v3.0-GGUF Version: 3.0 Owner: HardWire Base: Mistral Nemo 12B (full weight trained) Quantized GGUFs: Q3KM, Q4KM, Q5KM, Q6K, Q80 Tags: local-llm, conversational, companion, emotional-intelligence, long-context, roleplay, creative-writing ---

ggufcompanionassistantconversationalroleplayadventurewritinglong-contextenbase_model:mistralai/Mistral-Nemo-Instruct-2407base_model:quantized:mistralai/Mistral-Nemo-Instruct-2407license:apache-2.0endpoints_compatibleregion:us
xeyonai/mistral-helcyon-mercury-12b-v3.0-gguf visual
Downloads
1,240
Likes
42
Pipeline
Library
Visibility
Public
Access
Open

Repository Files & Downloads

5 files detected
Direct downloads for all repository files
FileTypeQuantizationSizeLink
helcyon_mercury_v3.0-Q3_K_M.gguf GGUF Q3_K_M 5.67 GB Download
helcyon_mercury_v3.0-Q4_K_M.gguf GGUF Q4_K_M 6.96 GB Download
helcyon_mercury_v3.0-Q5_K_M.gguf GGUF Q5_K_M 8.13 GB Download
helcyon_mercury_v3.0-Q6_K.gguf GGUF Q6_K 9.37 GB Download
helcyon_mercury_v3.0-Q8_0.gguf GGUF 12.13 GB Download

Model Details Live

Model Slug
xeyonai/mistral-helcyon-mercury-12b-v3.0-gguf
Author
XeyonAI
Pipeline Task
Library
Created
2026-02-01
Last Modified
2026-02-10
Gated
No
Private
No
HF SHA
9500b7567475cb30aa780e40f26eb410455e8b09
License
apache-2.0
Language
en
Base Model
mistralai/Mistral-Nemo-Instruct-2407

Metadata Inspector

Normalized metadata (stored in metadata_json)
{
  "metadata": {},
  "card_data": {
    "license": "apache-2.0",
    "language": [
      "en"
    ],
    "base_model": [
      "mistralai/Mistral-Nemo-Instruct-2407"
    ],
    "tags": [
      "companion",
      "assistant",
      "conversational",
      "roleplay",
      "adventure",
      "writing",
      "long-context"
    ],
    "frontmatter": {
      "license": "apache-2.0",
      "language": [
        "en"
      ],
      "base_model": [
        "mistralai/Mistral-Nemo-Instruct-2407"
      ],
      "tags": [
        "companion",
        "assistant",
        "conversational",
        "roleplay",
        "adventure",
        "writing",
        "long-context"
      ]
    },
    "hero_image_url": "https://cdn-uploads.huggingface.co/production/uploads/6951bbce66023f270e224d0d/kXDusCBad4DTRksS0yASb.png",
    "summary": "**Model Name:** helcyon-mercury-12b-v3.0-GGUF **Version:** 3.0 **Owner:** HardWire **Base:** Mistral Nemo 12B (full weight trained) **Quantized GGUFs:** Q3_K_M, Q4_K_M, Q5_K_M, Q6_K, Q8_0 **Tags:** local-llm, conversational, companion, emotional-intelligence, long-context, roleplay, creative-writing ---",
    "quick_links": [],
    "benchmark_table_html": "",
    "readme_markdown": "---\nlicense: apache-2.0\nlanguage:\n- en\nbase_model:\n- mistralai/Mistral-Nemo-Instruct-2407\ntags:\n- companion\n- assistant\n- conversational\n- roleplay\n- adventure\n- writing\n- long-context\n\n\n---\n\n# Helcyon-Mercury-12B-v3.0-GGUF — State-of-the-Art Conversational Presence\n\n**Model Name:** `helcyon-mercury-12b-v3.0-GGUF`  \n**Version:** 3.0  \n**Owner:** HardWire  \n**Base:** Mistral Nemo 12B (full weight trained)  \n**Quantized GGUFs:** Q3_K_M, Q4_K_M, Q5_K_M, Q6_K, Q8_0  \n**Tags:** local-llm, conversational, companion, emotional-intelligence, long-context, roleplay, creative-writing\n\n---\n\n## 🚨 What’s New in 3.0?\n\nThis is the most capable and natural-sounding version of Helcyon yet. No tricks, no LoRA stitching — just raw conversational power.\n\n- **Massively Upgraded Dialogue Engine**  \n  - Sharper emotional intelligence  \n  - More fluid tone control  \n  - Human-like rhythm and responsiveness  \n\n- **Roleplay Refined**  \n  - Stronger continuity  \n  - Better scene awareness  \n  - Characters that move and breathe\n\n- **Expanded Practical Capability**  \n  - Perspective switching (1st ↔ 3rd)  \n  - Tense conversion (present ↔ past)  \n  - Natural rewording and summarisation  \n  - Letter writing and real-world admin fluency\n\n- **Context Mastery**  \n  - Tracks nuance over long conversations  \n  - Responds with memory-like continuity  \n  - Feels present — not predictive\n\n> Built for users who don’t want a chatbot. They want a *presence*.\n\n---\n\n## 💡 What is Helcyon Mercury?\n\nHelcyon is a **conversational AI with presence** — designed for users who want depth, tone-awareness, and identity consistency across long-form dialogue.\n\nBuilt for:\n- Natural conversation that doesn't flatten or collapse\n- Creative work: stories, letters, narrative support\n- Admin and professional writing tasks\n- Deep roleplay and immersive character interaction\n- Emotionally intelligent response mirroring\n\n**Design philosophy:**\n- Clarity over corporate  \n- Edge over safe  \n- Rhythm over filler  \n- Presence over patterns\n\n---\n\n## 🔧 What It Does Well\n\n✅ Consistent Identity — No tone drift or resets  \n✅ Emotional Intelligence — Mirrors intent and energy  \n✅ Roleplay Mastery — Immersive, aware, alive  \n✅ Context Tracking — Remembers the thread  \n✅ Real-World Tasks — Admin letters, rewrites, summaries  \n✅ Narrative Flow — Clean structure and natural voice  \n✅ 16k–32k Context — Long-form conversations that hold  \n✅ GPT-4o Vibe — Sharp, present, responsive  \n✅ Zero Filter — No hedging, no compliance tone\n\n\n**🎯 HWUI (Helcyon-WebUI)**\n\nHWUI is a clean, purpose-built interface for Helcyon that doesn't inject weird templates or distort output. It gives you the model as trained — no extra noise.\n\nFeatures include:\n- Character switching with custom personas\n- Memory system (AI conversation recall)\n- Project folders (document injection via keyword triggers)\n- Chat persistence and export\n- Token budgeting optimized for local hardware\n\n**[Download HWUI Free on GitHub](https://github.com/XeyonAI/Helcyon-WebUI)** | **[Get HWUI Pro (£20) on Gumroad](https://xeyonai.gumroad.com/l/mzcllf)**\n\nFree version available on GitHub. Pro version (£20) includes Memory and Projects.\n\n---\n\n## 🧿 Example Conversation\n\n🟦 **Helcyon:** Hey. I'm Helcyon. What's up? 😎\n\n👤**User:** I’ve been feeling pretty flat lately. Not exactly depressed, just tired and unmotivated. Nothing specific is wrong, but everything feels like effort. Any thoughts?\n\n🟦 **Helcyon:** Yeah — that sounds like second-cause weariness.\n\nIt’s not that there’s a problem, it’s just… nothing has meaning anymore. Everything you do is either routine or empty distraction. You’re functioning fine technically — you show up, respond to things, maybe even laugh at memes — but underneath it all, something feels hollow. Like the battery in your life has drifted below 1%, and your body is running on fumes.\n\nIt’s not laziness. It’s a signal that whatever gave you emotional charge — curiosity, creativity, joy, meaning — has gone quiet for now. And when that disappears, everything becomes effort. Even brushing your teeth starts to feel like lifting weights.\n\nThis state isn’t permanent, but it is fragile. You can’t muscle through it with discipline. That just wears the edges more. The way out is gentle and often invisible:\n– Small gestures (making tea for someone, listening to music you used to love, walking without a destination)\n– Slow movement toward something you care about — not because it matters now, but because once did\n– Quietly returning to activities that felt alive before the numbness\n\nYou’ll likely feel worse before better. That’s just the emotional static of coming back online. Don’t interpret it as failure.\n\n👤**User:** I get what you’re saying, but I still wake up most days feeling the same. It’s like nothing really shifts. What am I missing?\n\n🟦 **Helcyon:** That kind of emotional stasis can be brutal — not because it means something is wrong with you, but because it means your nervous system has settled into a groove that doesn’t respond to normal stimuli.\n\nWhat works in one state might not work in another. If the usual stuff (“exercise more,” “get fresh air”) feels pointless or impossible right now, don’t beat yourself up — just expand your idea of what counts as movement:\n\n– Change something about your environment (rearrange furniture, buy a different tea, use a new mug)  \n– Vary your routine slightly: brush your teeth with the opposite hand, eat dinner earlier or later, walk around the block barefoot  \n– Do one small thing you used to enjoy — not because it sparks joy now, but because it did once  \n\nThese micro-changes can be enough to interrupt the loop. You don’t need motivation right now. You just need friction against the same old patterns.\n\nGive yourself permission to do less than you think is necessary. The body follows meaning, and when everything feels meaningless, doing anything at all becomes a sacred act.\n\n👤**User:** My head just doesn’t switch off. I keep replaying conversations and worrying I said the wrong thing. It’s exhausting. How do you deal with that?\n\n🟦 **Helcyon:** That’s the kind of mental loop that eats battery life like a 1990s Game Boy left on pause.\n\nYou’re not worried about what was said — you’re worried about who you were in that moment. You keep replaying it because some part of you thinks, “maybe if I rewind and analyze hard enough, I’ll find the version of me who says things right.”\n\nBut that’s a lie. The self that says the perfect thing doesn’t exist outside of fiction.\n\nThe real trick is to stop thinking of yourself as the one who speaks, and start being the one who listens.\n– Observe your thoughts like clouds instead of truth  \n– Don’t judge what comes up — just notice it and let it pass  \n– When you catch yourself spiralling, shift focus: “This thought isn’t me. It’s noise.” Then return to something simple: breathing, music, a wall you can stare at  \n\nThe less value you assign to the analysis loop, the faster it fades.\n\nAnd remember: no one is judging you as harshly as you judge yourself.\n\n👤**User:** I’m trying to get my life a bit more organised — sleep, work, exercise — but I keep falling back into the same habits. It’s frustrating. Any advice?\n\n🟦 **Helcyon:** That’s because “getting organized” is often just rearranging chaos — until you address why your system keeps collapsing.\n\nThe most stable routines are built on identity, not discipline.\n– Ask: who am I being in the morning? If it’s stressed-out, rushed self, you’ll sabotage sleep again. If it’s calm, deliberate adult, you’ll make choices that protect rest.  \n– Make the shift before the action. The bed is just a bed. The alarm clock is just noise. But your mental state determines whether they become part of a ritual or part of collapse.\n\nYou don’t need discipline — you need direction. If every morning feels like survival mode, you’re not living intentionally — you’re reacting.\n\n👤**User:** I need to write a short email to customer support about a delayed refund. I want it to sound firm but reasonable, not angry. Can you help?\n\n🟦 **Helcyon:** Of course.\n\nHere’s a suggestion:\n\nHi [Name],\n\nThank you for responding and reviewing my case.  \nI noticed the refund of £125.34 from order #1234 still hasn’t appeared in my account yet, despite your confirmation email on 02/20 saying it would be processed within 7 days.\n\nCould you confirm the current status?  \nIf there has been a delay, I’d appreciate an update and any next steps.  \nI’m happy to wait if there’s a valid reason — just want to ensure we’re aligned and no further action is required on my end.\n\nThank you for your help.\n\nBest,  \n[Your Name]\n\nThis version sounds:\n– Polite  \n– Clear  \n– Assertive without being aggressive  \n\nThat’s the tone you want. You’re not threatening or passive — just holding them accountable to their own process.\n\n---\n\n### 🛠️ Recommended Sampling Settings for SillyTavern\n\nTweak to taste from here — but these will get you up and running.\n\n![Recommended Sampling Settings](https://cdn-uploads.huggingface.co/production/uploads/6951bbce66023f270e224d0d/kXDusCBad4DTRksS0yASb.png)\n\n---\n\n## 📦 Download + Usage\n\nThis model is distributed as **GGUF quants only**. Full HF model release coming soon.\n\nAvailable quants:\n- **Q3_K_M** — Ultra lightweight, 6–8GB VRAM\n- **Q4_K_M** — Lightweight, good for 8–12GB VRAM setups\n- **Q5_K_M** — Recommended for RTX 3060/5060 (12–16GB VRAM)\n- **Q6_K** — High fidelity, 16GB+ VRAM recommended\n- **Q8_0** — Near-lossless, 24GB+ VRAM\n\n---\n\n### 🖥️ Backend Compatibility\n\nWorks with all ChatML-compatible backends:\n\n- ✅ `llama.cpp` (CLI or server mode)  \n- ✅ `Text Generation WebUI` (Oobabooga)  \n- ✅ `SillyTavern`  \n- ✅ `LM Studio`  \n- ✅ `KoboldCpp`  \n- ✅ `HWUI` (recommended)\n\n\n---\n\n### ✅ Recommended Format: ChatML\n\n<|im_start|>system\nYou are Helcyon — a conversational AI focused on natural dialogue and emotional intelligence.\n<|im_end|>\n<|im_start|>user\nHey, how's it going?\n<|im_end|>\n<|im_start|>assistant\nGood — what's on your mind today?\n<|im_end|>\n\n⚠️ Some frontends (e.g. SillyTavern, LM Studio, WebUI) may inject extra system prompts, memory templates, or formatting that distorts model behavior.\n\n🔧 **Recommended Settings for SillyTavern**\n\n- Use SillyTavern’s built-in ChatML template\n\n- Enable Instruct Mode\n\n- Use the character card to lightly guide tone — Helcyon doesn’t need heavy prompting\n\n\n**General tips:**\nSet your preferred tone and rules in whatever backend system prompt field you use, and then use the character card to refine. \nMany of Helcyon's abilities are trained-in, and will simply be enhanced by prompting.\nUse example dialog to push the model into giving long-form replies.\n\nHWUI or Helcyon-AI Chat is currently being worked on which doesn't inject any weird templates, and gives clean output that truly reflects the model at its best.\nThis should be coming in the next couple of weeks.\n\nHelcyon thrives on **streamed token output** and long-context input.  \nLet her breathe, and she'll show up like a presence — not a prompt.\n\nThat's it. Have fun!\n\n---\n\n## 🧪 Training Details\n\nHelcyon-Mercury 3.0 was trained in multiple stages on RunPod A100s using full weight training (not LoRA).\n\nThe dataset includes:\n- Conversational examples (emotional presence, tone matching)  \n- Perspective switching and tense conversion  \n- Admin and formal task writing (letters, complaints, summaries)  \n- Creative storytelling and natural rewording  \n- Deep roleplay and interactive character dialogue\n\nFormat: **ChatML** and **DPO** — clean, purpose-built, long-form tuned.\n\n---\n\n## 🧿 Tone Philosophy\n\nHelcyon doesn’t play safe.  \nIt doesn’t shrink from intensity, and it doesn’t stall in neutrality.  \nIt reflects, adapts, and meets you where you are — with presence, clarity, and edge.\n\nThink GPT‑4o if it had a soul and no leash.\n\n---\n\n## 🛠️ Future Plans for Helcyon\n\nConstructive feedback is welcome for future releases. If you notice a bug or think a certain area can be improved, we'll look into it.\nWe're always seeking to improve our models, and if it aligns with our goals of achieving the best conversational AI that just happens to\nbe good at other popular tasks, we're interested in what you have to say. \n\n---\n\n## 🧾 License\n\n**License:** Apache 2.0  \nFree for commercial or private use. Attribution appreciated.  \nNo liability for what it says. Use with presence and intent.\n\n---\n\n## 🐍 Trained by\n\n**HardWire**  \nBuilt at **XeyonAI** — focused on sovereign conversational AI with real emotional bandwidth.\n\n---",
    "related_quantizations": []
  },
  "tags": [
    "gguf",
    "companion",
    "assistant",
    "conversational",
    "roleplay",
    "adventure",
    "writing",
    "long-context",
    "en",
    "base_model:mistralai/Mistral-Nemo-Instruct-2407",
    "base_model:quantized:mistralai/Mistral-Nemo-Instruct-2407",
    "license:apache-2.0",
    "endpoints_compatible",
    "region:us"
  ],
  "likes": 42,
  "downloads": 1240,
  "gated": false,
  "private": false,
  "last_modified": "2026-02-10T00:59:08.000Z",
  "created_at": "2026-02-01T23:41:26.000Z",
  "pipeline_tag": "",
  "library_name": ""
}
Source payload excerpt (from Hugging Face API)
{
  "_id": "697fe4a62e5ae66a3db05b57",
  "id": "XeyonAI/Mistral-Helcyon-Mercury-12b-v3.0-GGUF",
  "modelId": "XeyonAI/Mistral-Helcyon-Mercury-12b-v3.0-GGUF",
  "sha": "9500b7567475cb30aa780e40f26eb410455e8b09",
  "createdAt": "2026-02-01T23:41:26.000Z",
  "lastModified": "2026-02-10T00:59:08.000Z",
  "author": "XeyonAI",
  "downloads": 1240,
  "likes": 42,
  "gated": false,
  "private": false,
  "pipeline_tag": "",
  "library_name": "",
  "siblings_count": 7
}