ConflictEnv Final Reasoning Model

This is the final fine-tuned model for the ConflictEnv executive assistant task. It has been trained using GRPO to handle complex scheduling conflicts with a focus on reasoning-first behavior.

Usage

Start prompts with Scenario: ... Details: ... and expect a <thought> block followed by a JSON action.

Downloads last month
30
Safetensors
Model size
2B params
Tensor type
F16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Space using purvansh01/conflict-env-final 1