The Efficiency of IsiNdebele Part of Speech Tagger: A Quantitative Analysis

Muzi Matfunjwa; Nomsa Skosana

doi:10.51415/ajims.v8i1.3170

Authors

Muzi Matfunjwa South African Centre for Digital Language Resources (SADiLaR), North-West University https://orcid.org/0000-0003-4553-3225
Nomsa Skosana South African Centre for Digital Language Resources (SADiLaR), North-West University https://orcid.org/0000-0002-2833-5895

DOI:

https://doi.org/10.51415/ajims.v8i1.3170

Keywords:

accuracy, F1 score, part of speech tagger, precision, recall

Abstract

This study evaluates the performance of the isiNdebele part of speech tagger developed by the National Centre for Human Language Technologies as part of the Nguni core technologies. A sample of 522 words was randomly selected from government documents and isiNdebele literary works, and a mixed-methods approach was used to analyse the data. The raw data were processed automatically with the tagger, and its output was compared against the gold standard to calculate the tagger’s accuracy. Accuracy was 86% for nouns, 66% for verbs, 59% for adverbs, 90% for pronouns, 14% for adjectives, 33% for conjunctions, 83% for copulatives, 50% for relatives, 90% for possessives and 71% for demonstratives, and 0% for ideophones, interjections, prepositions, question words and auxiliary verbs. Recall and precision were calculated using Python 3.0, from which the researchers determined the F1 score. Recall, precision and F1 score were, respectively, 0.86, 0.55 and 0.67 for nouns; 0.66, 0.7 and 0.68 for verbs; 0.5, 0.46 and 0.48 for relatives; 0.63, 0.86 and 0.73 for adverbs; 0.9, 0.56 and 0.69 for possessives; 0.71, 0.86 and 0.78 for demonstratives; 0.14, 0.67 and 0.23 for adjectives; 0.9, 0.95 and 0.92 for pronouns; 0.83, 1.0 and 0.91 for copulatives; and 0.36, 0.83 and 0.5 for conjunctions. These findings underscore the importance of improving the isiNdebele part of speech tagger.
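The relationship between the reported recall, precision and F1 figures can be illustrated with a minimal Python sketch. It is not the authors' script: the function names `f1_score` and `per_tag_scores`, the `REPORTED` dictionary layout and the toy tag lists are assumptions introduced here for illustration. The sketch assumes the standard definition of F1 as the harmonic mean of precision and recall, and uses only the (recall, precision) pairs stated in the abstract.

```python
from collections import Counter


def f1_score(precision, recall):
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def per_tag_scores(gold, predicted):
    """Return {tag: (precision, recall, f1)} for two aligned tag lists.

    Hypothetical helper: `gold` holds the gold-standard tags and
    `predicted` the tagger output, one tag per word, in the same order.
    """
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")
    gold_counts = Counter(gold)
    predicted_counts = Counter(predicted)
    correct = Counter(g for g, p in zip(gold, predicted) if g == p)
    scores = {}
    for tag in set(gold_counts) | set(predicted_counts):
        precision = correct[tag] / predicted_counts[tag] if predicted_counts[tag] else 0.0
        recall = correct[tag] / gold_counts[tag] if gold_counts[tag] else 0.0
        scores[tag] = (precision, recall, f1_score(precision, recall))
    return scores


# (recall, precision) pairs exactly as stated in the abstract.
REPORTED = {
    "noun": (0.86, 0.55),
    "verb": (0.66, 0.70),
    "relative": (0.50, 0.46),
    "adverb": (0.63, 0.86),
    "possessive": (0.90, 0.56),
    "demonstrative": (0.71, 0.86),
    "adjective": (0.14, 0.67),
    "pronoun": (0.90, 0.95),
    "copulative": (0.83, 1.00),
    "conjunction": (0.36, 0.83),
}

# Recompute F1 from the reported pairs, rounded to two decimals.
recomputed_f1 = {
    tag: round(f1_score(precision, recall), 2)
    for tag, (recall, precision) in REPORTED.items()
}

# Toy data (invented, not from the study) showing the per-tag helper.
toy_gold = ["N", "V", "N", "ADJ"]
toy_predicted = ["N", "N", "N", "V"]
toy_scores = per_tag_scores(toy_gold, toy_predicted)

if __name__ == "__main__":
    for tag, f1 in recomputed_f1.items():
        print(f"{tag}: F1 = {f1}")
```

Under this definition, the recomputed values agree with the F1 scores listed in the abstract to two decimal places (for example, 0.67 for nouns and 0.23 for adjectives).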
