Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Article . 2008
License: CC BY
Data sources: ZENODO
ZENODO
Article . 2008
License: CC BY
Data sources: Datacite
ZENODO
Article . 2008
License: CC BY
Data sources: Datacite
versions View all 2 versions
addClaim

NLP for African Languages in Ghana: Challenges and Opportunities

Authors: Yawie, Bawumia; Prempeh, Adjo;

NLP for African Languages in Ghana: Challenges and Opportunities

Abstract

Natural Language Processing (NLP) has seen significant progress in English and other widely spoken languages, but its application to African languages remains underexplored. In Ghana, where multiple indigenous languages are spoken, NLP techniques can offer valuable applications such as language translation, text summarization, and sentiment analysis. The research will employ state-of-the-art machine learning algorithms specifically tailored for African language datasets. A comparative analysis will be conducted using various NLP techniques to assess which methods yield the best results across different languages and domains. Initial experiments indicate that transfer learning models, such as BERT adapted to local language corpora, show promising performance in text classification tasks with an accuracy of around 85% on average. However, there is significant variability depending on the specific language and domain. Despite current challenges, including limited datasets and varying linguistic structures, NLP for African languages holds substantial potential for innovation and socio-economic impact in Ghana. Future work should focus on expanding model training efforts to cover more languages and domains. Investment is needed in both data collection and research methodologies to support the development of robust NLP systems for African languages. Collaboration between academia, industry, and government can accelerate this process. Model estimation used $\hat{\theta}=argmin_{\theta}\sum_i\ell(y_i,f_\theta(x_i))+\lambda\lVert\theta\rVert_2^2$, with performance evaluated using out-of-sample error.

Related Organizations
Keywords

Machine Learning, Geographical Information Systems, Text Mining, N-grams, Multilingualism, African Linguistics, Semantics

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average