
Multi-lingual language models (LM), such as mBERT, XLM-R, mT5, mBART, have been remarkably successful in enabling natural language tasks in low-resource languages through cross-lingual transfer from high-resource ones. In this work, we try to better understand how such models, specifically mT5, transfer *any* linguistic and semantic knowledge across languages, even though no explicit cross-lingual signals are provided during pre-training. Rather, only unannotated texts from each language are presented to the model separately and independently of one another, and the model appears to implicitlyResearch goal: How does mT5 compare to XLM-R and mBART in preventing target language collapse during zero-shot cross-lingual summarization across low-resource languages?Autonomous synthesis report generated by Assignee Research. Tribunal consensus score: 8.5/10.
