--- datasets: - independently-platform/tasky language: - en - it base_model: - google/functiongemma-270m-it library_name: transformers --- # Tasky ## About the model This model is a fine-tuned **function-calling assistant** for a todo/task application. It maps user requests to one of four tools and produces valid tool arguments according to the schema in `AI-TRAINING-TOOLS.md`. - **Base model:** `google/functiongemma-270m-it` - **Primary languages:** English and Italian (with light spelling errors/typos to mimic real users) - **Task:** Structured tool selection + argument generation ## Intended Use Use this model to translate natural language task requests into tool calls for: - `create_tasks` - `search_tasks` - `update_tasks` - `delete_tasks` It is designed for **task/todo management** workflows and should be paired with strict validation of tool arguments before execution. ### Example **Input (user):** Aggiungi un task per pagare la bolletta della luce domani mattina **Expected output (model):** ```json { "tool_name": "create_tasks", "tool_arguments": "{\"tasks\":[{\"content\":\"pagare la bolletta della luce\",\"dueDate\":\"2026-01-13T09:00:00.000Z\"}]}" } ## Training Data Synthetic, bilingual tool-calling data built from the tool schema, including: - Multiple phrasings and paraphrases - Mixed English/Italian prompts - Light typos and user mistakes in user_content - Broad coverage of optional parameters Splits: - Train: 1,500 examples - Eval: 500 examples ## Training Procedure - Fine-tuning on synthetic tool-calling samples - Deduplicated examples - Balanced coverage of all tools and key parameters ## Evaluation Reported success rate: 99.5% on the 500‑example eval split vs 0% base model. Success was measured as exact match on the predicted tool name and the JSON arguments after normalization. ## Limitations - Trained for a specific tool schema; not a general-purpose assistant. - Outputs may include incorrect or incomplete tool arguments; validate before execution. - Language coverage is strongest in English and Italian. - Synthetic data may not capture all real-world user phrasing or ambiguity.