Submitted by Fengji Zhang 5 A^2Search: Ambiguity-Aware Question Answering with Reinforcement Learning City University of Hong Kong 6 3