A word stemming algorithm for Hausa language

Muazzam, Bashir and Azilawati, Rozaimee and Wan Malini, Wan Isa (2015) A word stemming algorithm for Hausa language. Journal of Computer Engineering, 17 (3). pp. 25-31. ISSN 2278-8727

[img] Text
FH02-FIK-15-04221.pdf
Restricted to Registered users only

Download (296kB)

Abstract

Hausa, a highly inflected language, needs a worthy stemming approach for efficient information retrieval (IR). However, there is a limited or unavailable study to stemming in the language. Stemming refers to the systematic way of reducing a word to its base or root form. It is a crucial aspect in the field of natural language processing (NLP) such as text summarization and machine translation. As such, this study inspirationally presents an automatic word stemming system for Hausa language with a view to contributing to the field of electronic text processing, as well as NLP, in general. The proposed method is a modification of Porter’s algorithm to fit Hausa morphological rules. The system has an accuracy of 73.8% for implementation with 2573 words extracted from four different articles from Hausa Leadership newspaper. If immensely improved over time (employing more exceptional cases in future work), it would inspire the development of more tools for the language. Hence, the language would rapidly adopt the advancement in technology.

Item Type: Article
Uncontrolled Keywords: Hausa language, Information retrieval, Natural language processing, Stemming
Subjects: Q Science > QA Mathematics
Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Faculty of Informatics & Computing
Depositing User: Fatin Safura
Date Deposited: 31 Jan 2022 08:14
Last Modified: 31 Jan 2022 08:14
URI: http://eprints.unisza.edu.my/id/eprint/5000

Actions (login required)

View Item View Item