Search
Now showing items 1-6 of 6
Building Text and Speech Datasets for Low Resourced Languages: A Case of Languages in East Africa
(AfricaNLP, 2022)
Africa has over 2000 languages; however, those languages are not well repre sented in the existing Natural Language Processing ecosystem. African languages
lack essential digital resources to be engaged effectively in the ...
Online Assessment and Examinations practices at the eCampus of Maseno University in Response to COVID 19 lockdown
(EasyChair, 2022)
The COVID-19 pandemic resulted in an increasing demand for online learning and led to the demand for the
offer of online assessment at Maseno university. Prior to the COVID19 pandemic the institution had put in place
measures ...
Kencorpus: A Kenyan Language Corpus of Swahili, Dholuo and Luhya for Natural Language Processing Tasks
(arxiv.org, 2022)
Indigenous African languages are categorized as under-served in Artificial Intelligence and suffer poor digital inclusivity and information access. The challenge has been how to use machine learning and deep learning models ...
KenSwQuAD – A Question Answering Dataset for Swahili Low Resource Language
(arxiv.org, 2022)
This research developed a Kencorpus Swahili Question Answering Dataset KenSwQuAD from
raw data of Swahili language, which is a low resource language predominantly spoken in
Eastern African and also has speakers in other ...
Phonemic Representation and Transcription for Speech to Text Applications for Under-resourced Indigenous African Languages: The Case of Kiswahili
(arxiv.org, 2022)
Building automatic speech recognition (ASR) systems is a challenging task, especially for under resourced languages that need to construct corpora nearly from scratch and lack sufficient training
data. It has emerged ...
Phonemic Representation and Transcription for Speech to Text Applications for Under-resourced Indigenous African Languages: The Case of Kiswahili
(Cornell University, 2022)
Building automatic speech recognition (ASR) systems is a challenging task, especially for underresourced languages that need to construct corpora nearly from scratch and lack sufficient training
data. It has emerged that ...