Add Phase 3.1 training: gen_weight 2.0, gen_len 32, scheduled sampling, beam search 206e1ad verified JorgeAV commited on Apr 25