Kreutzer, J.; Caswell, I.; Wang, L.; Wahab, A.; van Esch, D.; Ulzii-Orshikh, N.; Tapo, A.; Subramani, N.; Sokolov, A.; Sikasote, C.; Setyawan, M.; Sarin, S.; Samb, S.; Sagot, B.; Rivera, C.; Rios, A.; Papadimitriou, I.; Osei, S.; Suarez, P. O.; Adeyemi, M.
With the success of large-scale pre-training and multilingual modeling in Natural Language Processing (NLP), recent years have seen a proliferation of large, Web-mined text datasets covering hundreds of languages. We ...