gabriellarson/lucy-gguf Q3_K_L GGUF - Free GGUF Download is indexed on GraySoft with repository links, GGUF quant files, and Hugging Face metadata. This page helps you pick a local model for guIDE or other runtimes. See related models in the same shard below.
Model Intelligence Sheet
gabriellarson/lucy-gguf overview
Lucy: Edgerunning Agentic Web Search on Mobile with a 1.7B model. GitHub License Authors: Alan Dao, Bach Vu Dinh, Alex Nguyen !image/gif
Downloads
162
Likes
0
Pipeline
text-generation
Library
transformers
Visibility
Public
Access
Open
Repository Files & Downloads
26 files detected
Direct downloads for all repository files
| File | Type | Quantization | Size | Link |
|---|---|---|---|---|
| Lucy-1.7B-F16.gguf | GGUF | F16 | 3.21 GB | Download |
| Lucy-IQ1_M.gguf | GGUF | IQ1_M | 518.60 MB | Download |
| Lucy-IQ1_S.gguf | GGUF | IQ1_S | 491.88 MB | Download |
| Lucy-IQ2_M.gguf | GGUF | IQ2_M | 662.98 MB | Download |
| Lucy-IQ2_S.gguf | GGUF | IQ2_S | 627.35 MB | Download |
| Lucy-IQ2_XS.gguf | GGUF | IQ2_XS | 602.26 MB | Download |
| Lucy-IQ2_XXS.gguf | GGUF | IQ2_XXS | 563.13 MB | Download |
| Lucy-IQ3_M.gguf | GGUF | IQ3_M | 854.17 MB | Download |
| Lucy-IQ3_S.gguf | GGUF | IQ3_S | 827.08 MB | Download |
| Lucy-IQ3_XS.gguf | GGUF | IQ3_XS | 795.58 MB | Download |
| Lucy-IQ3_XXS.gguf | GGUF | IQ3_XXS | 719.41 MB | Download |
| Lucy-IQ4_NL.gguf | GGUF | IQ4_NL | 1005.58 MB | Download |
| Lucy-IQ4_XS.gguf | GGUF | IQ4_XS | 963.58 MB | Download |
| Lucy-Q2_K.gguf | GGUF | Q2_K | 741.76 MB | Download |
| Lucy-Q2_K_S.gguf | GGUF | Q2_K_S | 699.01 MB | Download |
| Lucy-Q3_K_L.gguf | GGUF | Q3_K_L | 957.01 MB | Download |
| Lucy-Q3_K_M.gguf | GGUF | Q3_K_M | 896.01 MB | Download |
| Lucy-Q3_K_S.gguf | GGUF | Q3_K_S | 827.08 MB | Download |
| Lucy-Q4_0.gguf | GGUF | — | 1007.83 MB | Download |
| Lucy-Q4_K_M.gguf | GGUF | Q4_K_M | 1.03 GB | Download |
| Lucy-Q4_K_S.gguf | GGUF | Q4_K_S | 1011.08 MB | Download |
| Lucy-Q5_0.gguf | GGUF | — | 1.15 GB | Download |
| Lucy-Q5_K_M.gguf | GGUF | Q5_K_M | 1.17 GB | Download |
| Lucy-Q5_K_S.gguf | GGUF | Q5_K_S | 1.15 GB | Download |
| Lucy-Q6_K.gguf | GGUF | Q6_K | 1.32 GB | Download |
| Lucy-Q8_0.gguf | GGUF | — | 1.71 GB | Download |
Model Details Live
Metadata Inspector
Normalized metadata (stored in metadata_json)
{
"metadata": {},
"card_data": {
"license": "apache-2.0",
"language": [
"en"
],
"base_model": [
"Menlo/Lucy"
],
"pipeline_tag": "text-generation",
"library_name": "transformers",
"frontmatter": {
"license": "apache-2.0",
"language": [
"en"
],
"base_model": [
"Menlo/Lucy"
],
"pipeline_tag": "text-generation",
"library_name": "transformers"
},
"hero_image_url": "https://cdn-uploads.huggingface.co/production/uploads/65713d70f56f9538679e5a56/PA6JCiYLPJX_WFO42ClTd.jpeg",
"summary": "# Lucy: Edgerunning Agentic Web Search on Mobile with a 1.7B model.   **Authors:** Alan Dao, Bach Vu Dinh, Alex Nguyen !image/gif",
"quick_links": [],
"benchmark_table_html": "",
"readme_markdown": "---\nlicense: apache-2.0\nlanguage:\n- en\nbase_model:\n- Menlo/Lucy\npipeline_tag: text-generation\nlibrary_name: transformers\n---\n# Lucy: Edgerunning Agentic Web Search on Mobile with a 1.7B model.\n\n[](https://github.com/menloresearch/deep-research) \n[](https://opensource.org/licenses/Apache-2.0)\n\n<div align=\"center\">\n <img src=\"https://cdn-uploads.huggingface.co/production/uploads/65713d70f56f9538679e5a56/PA6JCiYLPJX_WFO42ClTd.jpeg\" width=\"300\" alt=\"Lucy-128k\">\n</div>\n\n**Authors:** [Alan Dao](https://scholar.google.com/citations?user=eGWws2UAAAAJ&hl=en), [Bach Vu Dinh](https://scholar.google.com/citations?user=7Lr6hdoAAAAJ&hl=vi), [Alex Nguyen](https://github.com/nguyenhoangthuan99)\n\n\n\n\n\n## Overview\n\nLucy is a compact but capable 1.7B model focused on agentic web search and lightweight browsing. Built on [Qwen3-1.7B](https://huggingface.co/Qwen/Qwen3-1.7B), Lucy inherits deep research capabilities from larger models while being optimized to run efficiently on mobile devices, even with CPU-only configurations.\n\nWe achieved this through machine-generated task vectors that optimize thinking processes, smooth reward functions across multiple categories, and pure reinforcement learning without any supervised fine-tuning.\n\n## What Lucy Excels At\n\n- **🔍 Strong Agentic Search**: Powered by MCP-enabled tools (e.g., Serper with Google Search)\n- **🌐 Basic Browsing Capabilities**: Through Crawl4AI (MCP server to be released), Serper,...\n- **📱 Mobile-Optimized**: Lightweight enough to run on CPU or mobile devices with decent speed\n- **🎯 Focused Reasoning**: Machine-generated task vectors optimize thinking processes for search tasks\n\n## Evaluation\nFollowing the same MCP benchmark methodology used for [Jan-Nano](https://huggingface.co/Menlo/Jan-nano) and [Jan-Nano-128k](https://huggingface.co/Menlo/Jan-nano-128k), Lucy demonstrates impressive performance despite being only a 1.7B model, achieving higher accuracy than DeepSeek-v3 on [SimpleQA](https://openai.com/index/introducing-simpleqa/).\n\n\n\n## 🖥️ How to Run Locally\n\nLucy can be deployed using various methods including vLLM, llama.cpp, or through local applications like Jan, LMStudio, and other compatible inference engines. The model supports integration with search APIs and web browsing tools through the MCP.\n\n### Deployment\n\nDeploy using VLLM:\n```bash\nvllm serve Menlo/Lucy-128k \\\n --host 0.0.0.0 \\\n --port 1234 \\\n --enable-auto-tool-choice \\\n --tool-call-parser hermes \n```\n\nOr `llama-server` from `llama.cpp`:\n```bash\nllama-server ... \n```\n\n### Recommended Sampling Parameters\n\n```yaml\nTemperature: 0.7\nTop-p: 0.9\nTop-k: 20\nMin-p: 0.0\n```\n\n## 🤝 Community & Support\n\n- **Discussions**: [HuggingFace Community](https://huggingface.co/Menlo/Lucy-128k/discussions)\n\n## 📄 Citation\n\n**Paper (coming soon)**: *Lucy: edgerunning agentic web search on mobile with machine generated task vectors.*",
"related_quantizations": []
},
"tags": [
"transformers",
"gguf",
"text-generation",
"en",
"base_model:Menlo/Lucy",
"base_model:quantized:Menlo/Lucy",
"license:apache-2.0",
"endpoints_compatible",
"region:us",
"conversational"
],
"likes": 0,
"downloads": 162,
"gated": false,
"private": false,
"last_modified": "2025-07-18T05:35:02.000Z",
"created_at": "2025-07-18T05:19:35.000Z",
"pipeline_tag": "text-generation",
"library_name": "transformers"
}
Source payload excerpt (from Hugging Face API)
{
"_id": "6879d96793090553a9300909",
"id": "gabriellarson/Lucy-GGUF",
"modelId": "gabriellarson/Lucy-GGUF",
"sha": "6320540721053b8930adf80e0b35a9626ceeed38",
"createdAt": "2025-07-18T05:19:35.000Z",
"lastModified": "2025-07-18T05:35:02.000Z",
"author": "gabriellarson",
"downloads": 162,
"likes": 0,
"gated": false,
"private": false,
"pipeline_tag": "text-generation",
"library_name": "transformers",
"siblings_count": 28
}