MasakhaNER: Named Entity Recognition for African LanguagesGoogle Research, Canada; Masakhane NLP.
Brandeis University, United States; Masakhane NLP.
Brandeis University, United States; Masakhane NLP.
Graduate School of Systems and Information Engineering, University of Tsukuba, Japan; Masakhane NLP.
Language Technologies Institute, Carnegie Mellon University, United States.
DeepMind, United Kingdom.
Duolingo, United States.
African Institute for Mathematical Sciences (AIMS-AMMI), Ethiopia; Masakhane NLP.
University of Porto, Nigeria; Bayero University, Kano, Nigeria.
Technical University of Munich, Germany; Masakhane NLP.
Makerere University, Kampala, Uganda; Masakhane NLP.
African Leadership University, Rwanda; Masakhane NLP.
University of Lagos, Nigeria; Masakhane NLP.
Masakhane NLP.
Masakhane NLP.
Max Planck Institute for Informatics, Germany; Masakhane NLP.
LT Group, Universität Hamburg, Germany.
University of Chinese Academy of Science, China; Masakhane NLP.
Lancaster University, United Kingdom; Masakhane NLP.
University of Electronic Science and Technology of China, China; Masakhane NLP.
Makerere University, Kampala, Uganda.
United States International University - Africa (USIU-A), Kenya; Masakhane NLP.
Niger-Volta LTI; Masakhane NLP.
Masakhane NLP.
Masakhane NLP.
Lancaster University, United Kingdom.
Masakhane NLP.
Makerere University, Kampala, Uganda.
Masakhane NLP.
Masakhane NLP.
African University of Science and Technology, Abuja, Nigeria.
Makerere University, Kampala, Uganda.
Masakhane NLP.
Masakhane NLP.
Makerere University, Kampala, Uganda.
Masakhane NLP.
Masakhane NLP.
Masakhane NLP.
Makerere University, Kampala, Uganda.
Makerere University, Kampala, Uganda.
University of Ibadan, Nigeria; Masakhane NLP.
Masakhane NLP.
Masakhane NLP.
Masakhane NLP.
Masakhane NLP.
Masakhane NLP.
Masakhane NLP.
Namibia University of Science and Technology, Namibia; Masakhane NLP.
Instadeep, Nigeria; Masakhane NLP.
Jacobs University Bremen, Germany; Masakhane NLP.
University of Waterloo, Canada; Masakhane NLP.
Masakhane NLP.
Masakhane NLP.
Masakhane NLP.
Masakhane NLP.
African Institute for Mathematical Sciences (AIMS-AMMI), Ethiopia; Masakhane NLP.
Show others and affiliations
2021 (English)In: Transactions of the Association for Computational Linguistics, E-ISSN 2307-387X, Vol. 9, p. 1116-1131Article in journal (Refereed) Published
Abstract [en]
We take a step towards addressing the under-representation of the African continent in NLP research by bringing together different stakeholders to create the first large, publicly available, high-quality dataset for named entity recognition (NER) in ten African languages. We detail the characteristics of these languages to help researchers and practitioners better understand the challenges they pose for NER tasks. We analyze our datasets and conduct an extensive empirical evaluation of state-of-the-art methods across both supervised and transfer learning settings. Finally, we release the data, code, and models to inspire future research on African NLP.
Place, publisher, year, edition, pages
MIT Press, 2021. Vol. 9, p. 1116-1131
Keywords [en]
NER, Low resource, NLP
National Category
Language Technology (Computational Linguistics)
Research subject
Machine Learning
Identifiers
URN: urn:nbn:se:ltu:diva-87532DOI: 10.1162/tacl_a_00416ISI: 000751952200066Scopus ID: 2-s2.0-85119703625OAI: oai:DiVA.org:ltu-87532DiVA, id: diva2:1603787
Funder
EU, Horizon 2020, 3081705
Note
Validerad;2021;Nivå 1;2021-10-25 (alebob)
2021-10-182021-10-182024-01-08Bibliographically approved