IQL Fire Rescue Operator Model
This model uses Implicit Q-Learning (IQL) to select optimal conversation policies for emergency response scenarios.
Model Description
- Task: Policy selection for fire rescue emergency conversations
- Algorithm: Implicit Q-Learning (IQL)
- Input: Conversation history (embedded using sentence-transformers)
- Output: Best policy and Q-values for all policies
Policies
provide_information- Share facts, data, risk levelsoffer_assistance- Provide help, resources, supportexpress_urgency- Convey time pressure, dangerask_question- Gather information from residentgive_direction- Provide specific instructionsacknowledge_concern- Validate resident's feelingsbuild_rapport- Establish trust and connection
Usage
import requests
API_URL = "https://api-inference.huggingface.co/models/YOUR_USERNAME/iql-fire-rescue"
headers = {"Authorization": f"Bearer YOUR_HF_TOKEN"}
def query(payload):
response = requests.post(API_URL, headers=headers, json=payload)
return response.json()
# Example
output = query({
"inputs": "Conversation state here",
"parameters": {"character": "bob"}
})
Training Data
Trained on emergency response conversations with multiple character profiles.
Citation
If you use this model, please cite:
@misc{iql-fire-rescue,
title={IQL-based Emergency Response Operator},
author={Your Name},
year={2025},
publisher={Hugging Face}
}
- Downloads last month
- 11