keyboard_arrow_up
Laying the groundwork for Natural Language Processing (NLP) in Ngiemboon: A Descriptive Study of its Part-of-Speech System

Authors

Patrice Yemmene 1,4, Prosper Djiaffeua 2 and Basile Difouo 3,4, 1 University of Wisconsin Milwaukee, Cameroon 2 University of Yaoundé I, Cameroon, 3 University of Maroua, Cameroon, 4 Laboratoire Technologies Educatives, Cameroun

Abstract

In this paper, we discuss the necessity for enhancing NLP capabilities for African under-resourced languages, particularly those spoken in Cameroon. We use the Ngiemboon language as a focal point for developing innovative tagging solutions. We lay the groundwork for creating a part-of-speech (POS) tag set for the Ngiemboon language, focusing on a descriptive study of its parts of speech. We establish NLP as an interdisciplinary field that automates language understanding and generation, highlighting various applications such as machine translation and chatbots. We emphasize the role of POS tagging as a fundamental step in NLP. We highlight the linguistic description of the language as a prerequisite for the development of POS. One aspect of linguistic description is the morphosyntactic analysis of the language, which is essential for understanding linguistic structures and enabling more complex language processing tasks. We emphasize the importance of a well-structured tag set, which should be informed by detailed linguistic analysis.

Keywords

Part of speech, Natural Language Processing (NLP), Under-resourced language

Full Text  Volume 15, Number 14