Add v3 thinking control patch - task-aware system prompts + think efficiency reward 0f39df7 verified rtferraz commited on 14 days ago