Muazzam, Bashir and Azilawati, Rozaimee and Wan Malini, Wan Isa (2015) A word stemming algorithm for Hausa language. Journal of Computer Engineering, 17 (3). pp. 25-31. ISSN 2278-8727
Abstract
Hausa, a highly inflected language, needs an effective stemming approach for efficient information retrieval (IR). However, little or no prior work on stemming exists for the language. Stemming is the systematic reduction of a word to its base or root form. It is a crucial step in natural language processing (NLP) tasks such as text summarization and machine translation. This study therefore presents an automatic word stemming system for the Hausa language, with a view to contributing to electronic text processing and to NLP in general. The proposed method is a modification of Porter's algorithm adapted to Hausa morphological rules. The system achieved an accuracy of 73.8% when evaluated on 2573 words extracted from four different articles from the Hausa Leadership newspaper. With further improvement, for example by handling more exceptional cases in future work, it could encourage the development of more tools for the language and help the language keep pace with advances in technology.
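The abstract does not list the stemmer's rules, so the following is only a minimal sketch of the general idea behind a Porter-style stemmer: ordered suffix-stripping rules, a minimum stem length, and an exception list. The two suffix rules, the `sudan` exception, the `MIN_STEM_LENGTH` threshold, and the function names are all illustrative assumptions and are not taken from the paper.

```python
# Illustrative sketch only: the rules below are assumptions, not the paper's rule set.

# Ordered (suffix, replacement) pairs, tried from top to bottom.
SUFFIX_RULES = [
    ("n", ""),  # assumed linker suffix, e.g. "littafin" -> "littafi"
    ("r", ""),  # assumed linker suffix, e.g. "motar" -> "mota"
]

# Hypothetical exception list: words that look suffixed but must stay intact.
EXCEPTIONS = {"sudan"}

# Assumed guard against over-stemming very short words.
MIN_STEM_LENGTH = 3


def stem(word):
    """Strip the first matching suffix unless the word is an exception."""
    word = word.lower()
    if word in EXCEPTIONS:
        return word
    for suffix, replacement in SUFFIX_RULES:
        stem_length = len(word) - len(suffix)
        if word.endswith(suffix) and stem_length >= MIN_STEM_LENGTH:
            return word[:stem_length] + replacement
    return word


def accuracy(pairs):
    """Fraction of (word, expected_stem) pairs that the stemmer gets right."""
    correct = sum(1 for word, expected in pairs if stem(word) == expected)
    return correct / len(pairs)
```

Accuracy in this sketch is simply the share of words whose produced stem matches a hand-labelled stem. The abstract reports a single accuracy figure, and whether it was computed this way is an assumption. Growing the exception list is one way the "more exceptional cases" mentioned in the abstract could be handled.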
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Hausa language, Information retrieval, Natural language processing, Stemming |
Subjects: | Q Science > QA Mathematics; Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | Faculty of Informatics & Computing |
Depositing User: | Fatin Safura |
Date Deposited: | 31 Jan 2022 08:14 |
Last Modified: | 31 Jan 2022 08:14 |
URI: | http://eprints.unisza.edu.my/id/eprint/5000 |