WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models
Paper • 2112.06598 • Published • 1
gpt2-large transferred to Ukrainian using the method from the NAACL2022 paper WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.