<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:dc="http://purl.org/dc/elements/1.1/" version="2.0">
<channel>
<title>School of Computing &amp; Informatics</title>
<link>https://repository.maseno.ac.ke/handle/123456789/1312</link>
<description/>
<pubDate>Fri, 15 May 2026 14:08:05 GMT</pubDate>
<dc:date>2026-05-15T14:08:05Z</dc:date>
<item>
<title>Ensembles of machine learning models for detecting writing style changes at the sentence level</title>
<link>https://repository.maseno.ac.ke/handle/123456789/6246</link>
<description>Ensembles of machine learning models for detecting writing style changes at the sentence level
OLOO, Vivian Anyango
Establishing the exact number of authors collaborating in writing a document is the focus of writing styles change detection models. However, existing writing style change detection models fail to adequately detect writing style changes in documents where each author writes very short texts in form of sentences, which are randomly distributed in the document. In addition, a number of features have been used in detecting writing styles but few studies have determined their suitability for this task. For writing style change detection models to remain relevant, there is need for models that can detect writing styles changes at the sentence level. The aim of this study was to develop ensembles of machine learning models for detecting writing style changes at the sentence level. The specific objectives were; to design ensembles of machine learning models for detecting writing style changes in documents, to implement ensembles of machine learning models for detecting writing style changes, to determine optimal feature sets for detecting writing style changes, and to evaluate the effectiveness of the ensemble models on detecting writing style changes at the sentence level. The study variables were the ensembles of machine learning models, while the dependent variable was the detection of writing style changes at the sentence level. Other variables looked at were the feature sets, model evaluation at the sentence level and performance of the model on detecting writing style changes at the sentence level. Mixed research design was used in this study, where exploratory design was used to identify stylometric features for use in the study. Features whose importance scores were greater than zero were considered optimal and were used to carry out experiments. Under experimental design, four experiments were performed: first to select the optimal document features and second to select the optimal sentence level features using feature importance scores. The third experiment was designed to classify documents as either single authored or multi-authored. The last experiment was used to detect the number of writing style changes in documents classified as multi-authored. The Pan at Clef 2019 style change date set was used to train, validate and test the models. The corpus consisted of 5088 documents out of which 50% was used for training, 25% for validation and 25% for testing. Half of the documents were single authored while the other half were multi-authored. Results show that 19 features were optimal at the document level while twenty two features were optimal sentence level. The models were able to classify single authored documents and multi-authored documents with an accuracy of 0.91 and an F1score of 0.90. Similarly, the study achieved an Ordinal Classification Index of 0.731 in detecting the number of writing style changes in multi-authored documents outperforming state-of-the-art models which achieved 0.808. The better performance is attributed to the use of optimal feature sets, ensembles learning models and sentence level representation. The main contribution of this study is ensembles of machine learning models able to detect writing style changes at the sentence level. In addition, the study identified two sets of features; the optimal document and sentence level feature sets which can be used for writing style change detection with improved performance.
PhD Thesis
</description>
<pubDate>Mon, 01 Jan 2024 00:00:00 GMT</pubDate>
<guid isPermaLink="false">https://repository.maseno.ac.ke/handle/123456789/6246</guid>
<dc:date>2024-01-01T00:00:00Z</dc:date>
</item>
<item>
<title>A reinforcement learning multi-agent systems architecture for guaranteeing advancing conversations in task-oriented dialog systems</title>
<link>https://repository.maseno.ac.ke/handle/123456789/5533</link>
<description>A reinforcement learning multi-agent systems architecture for guaranteeing advancing conversations in task-oriented dialog systems
MUGOYE, Kevin.  Sindu
The conversational capabilities of a dialog system have a direct impact on the tasks it can accomplish. Solving all conversational issues in dialog systems have the potential to make them serve in complex domains. While this is not achievable, addressing fundamental aspects in conversation is desired to make a task-oriented dialog system (TODS) serve in new domains where they are needed, besides increasing their usefulness. One such aspect is the ability to advance a conversation logically. The primary aim of this study was to develop a novel architecture that will guarantee advancing conversations in TODS. To realize this aim, theories and literature were interrogated that informed the formulation of an agent-based architecture for dialog management. Then implementation of the architecture previously realized in a dialog system prototype. Followed by training the dialog system on initial domain-specific data. And evaluating its performances in a specific domain. The study used exploratory methodology to provide the theories that justified the construction of the multi-agent system (MAS_DM) architecture, while the experimental design was explored to synthesize and train the prototype.The design involved the fusion of agent-based architecture with reinforcement learning technique to enable tracking of context, structure and policy without depending on handcrafted rules. MAS_DM architecture explores learning agents in an unknown environment, where each agent is endowed with the ability to learn and select a policy. Learning and policy selection is sustained through reinforcement learning, eliminating the need for handcrafted rules. The architectural model was evaluated and validated in a prototype Chabot system. The Chatbot system was trained and tested in the maternal healthcare domain and was evaluated by human users. In this context, each user filled out an online questionnaire after successful interaction with the Chatbot. The evaluation parameters were coherence, task success, general performance, user satisfaction and goal achievement. This evaluation adheres to the specifications of Goal Question Metrics and PARAdigm for DIalog System Evaluation frameworks. The key findings were that Chabot’s ability to advance the conversation scored 0.8903, and achieved an overall performance score of 0.553. It achieved a task success rate of 0.936. with a user satisfaction score of 0.775. Based on global acceptable measures, interpreted this task success as substantial, coherence score as substantial, user satisfaction as excellent and the overall performance as good. Where machine learning is involved kappa statistic values above 0.40 are considered exceptional. The results suggest that it is reasonable to conclude that the MAS_DM architecture can be trusted to guarantee conversations that advance logically. The study contributes to the body of knowledge of conversational artificial intelligence by; - developing a novel agent-based architectural model for TODS, demonstrating the practicability of combining multi-agent systems and machine learning toward solving conversational issues and enhancing the capability of TODS.
PhD Thesis
</description>
<pubDate>Sat, 01 Jan 2022 00:00:00 GMT</pubDate>
<guid isPermaLink="false">https://repository.maseno.ac.ke/handle/123456789/5533</guid>
<dc:date>2022-01-01T00:00:00Z</dc:date>
</item>
<item>
<title>Script Acquisition: A Crowdsourcing and Text mining approach</title>
<link>https://repository.maseno.ac.ke/handle/123456789/5395</link>
<description>Script Acquisition: A Crowdsourcing and Text mining approach
WANZARE, Lilian Diana Awuor
According to Grice’s (1975) theory of pragmatics, people tend to omit basic information&#13;
when participating in a conversation (or writing a narrative) under the assumption that left&#13;
out details are already known or can be inferred from commonsense knowledge by the&#13;
hearer (or reader). Writing and understanding of texts makes particular use of a specific&#13;
kind of common-sense knowledge, referred to as script knowledge. Schank and Abelson&#13;
(1977) proposed Scripts as a model of human knowledge represented in memory that stores&#13;
the frequent habitual activities, called scenarios, (e.g. eating in a fast food restaurant, etc.),&#13;
and the different courses of action in those routines.&#13;
This thesis addresses measures to provide a sound empirical basis for high-quality script&#13;
models. We work on three key areas related to script modeling: script knowledge acquisition, script induction and script identification in text. We extend the existing repository&#13;
of script knowledge bases in two different ways. First, we crowdsource a corpus of 40&#13;
scenarios with 100 event sequence descriptions (ESDs) each, thus going beyond the size of&#13;
previous script collections. Second, the corpus is enriched with partial alignments of ESDs,&#13;
done by human annotators. The crowdsourced partial alignments are used as prior knowledge to guide the semi-supervised script-induction algorithm proposed in this dissertation.&#13;
We further present a semi-supervised clustering approach to induce script structure from&#13;
crowdsourced descriptions of event sequences by grouping event descriptions into paraphrase sets and inducing their temporal order. The proposed semi-supervised clustering&#13;
model better handles order variation in scripts and extends script representation formalism,&#13;
Temporal Script graphs, by incorporating "arbitrary order" equivalence classes in order to&#13;
allow for the flexible event order inherent in scripts.&#13;
In the third part of this dissertation, we introduce the task of scenario detection, in which&#13;
we identify references to scripts in narrative texts. We curate a benchmark dataset of annotated narrative texts, with segments labeled according to the scripts they instantiate. The&#13;
dataset is the first of its kind. The analysis of the annotation shows that one can identify scenario references in text with reasonable reliability. Subsequently, we proposes a benchmark&#13;
model that automatically segments and identifies text fragments referring to given scenarios. The proposed model achieved promising results, and therefore opens up research on&#13;
script parsing and wide coverage script acquisition
Phd Thesis(Donation)
</description>
<pubDate>Tue, 01 Jan 2019 00:00:00 GMT</pubDate>
<guid isPermaLink="false">https://repository.maseno.ac.ke/handle/123456789/5395</guid>
<dc:date>2019-01-01T00:00:00Z</dc:date>
</item>
<item>
<title>Compact operators on sequence and function spaces: Characterisations and duality results</title>
<link>https://repository.maseno.ac.ke/handle/123456789/4342</link>
<description>Compact operators on sequence and function spaces: Characterisations and duality results
AYWA, Shem
Since the early seventies numerous papers appeared in which the authors considered the&#13;
question "For which Banach spaces X and Y is the space K(X, Y) of compact linear&#13;
operators uncomplemented in the space L(X, Y) of bounded linear operators?". The&#13;
most general answer to this question has close connection with the question whether the&#13;
scalar sequence space Co of null sequences embeds isomorphically into K(X, Y). In one of&#13;
the early papers ([28J (1979)) J. Johnson followed a path through dual spaces of spaces of&#13;
operators when he proved that there is a projection on L(X, Y)* with range isomorphic to&#13;
K(X, Y)* and kernel the annihilator of K(X, Y) when Y has the bounded approximation&#13;
property. Johnson then applied his result to consider L(X, Y) as an isomorphic subspace&#13;
of K(X, Y)** and could then derive necessary and sufficient conditions for K(X, Y) to&#13;
be reflexive, provided that either X or Y has the bounded approximation property. It&#13;
turns out that the spaces X and Y have to be reflexive and that the property K(X, Y) =&#13;
L(X, Y) necessarily has to hold for K(X, Y) to be reflexive in this case.&#13;
The questions described in the previous paragraph led to research into different directions.&#13;
Much work went into the study of K(X, Y) as a subspace of L(X, Y) and the question&#13;
when (i.e. for which Banach spaces) is the equality K(X, Y) = L(X, Y) true? For&#13;
instance an extensive investigation on when K(X, Y) is an M-ideal in L(X, Y) was done.&#13;
And some researchers (for example in the papers [2J and [6]) considered the question on&#13;
the equality of K(X, Y) and L(X, Y) in the context of scalar sequence spaces and Banach&#13;
function spaces - i.e. where either X or both X and Yare such spaces. Also, especially&#13;
in recent papers (for example in [10], [22], [25], [26J and [27]), the same questions and the&#13;
question about projections from L(X, Y)* onto K(X, Y)* were considered in the setting&#13;
of Banach spaces which fail the approximation property. These studies also extended&#13;
to similar research activities in the setting of locally convex spaces (for example in the&#13;
papers [8], [19J and [20]).&#13;
The objective in the present thesis is to contribute to the above mentioned study, in the&#13;
following ways:&#13;
(a) In line with recent developments we want to show the existence of a suitable&#13;
projection onto the space K(X. Y)*, which will allow us to find necessary and sufficient&#13;
conditions for the reflexivity of K(X, Y) without relying on the presence of the bounded&#13;
approximation property on X or Y. The idea is to put recent work of others in connection&#13;
with continuous dual spaces of spaces of bounded linear operators in a suitable framework&#13;
and to improve the present known results and techniques.&#13;
(b) Use techniques from the theory of vector sequence spaces to simplify proofs of&#13;
existing results in connection with the equality K(X, Y) = L(X, Y) when X is a Banach&#13;
scalar sequence space and then to extend the existing results to include more general&#13;
sequence spaces X.&#13;
(c) Exposing that recent studies in connection with "absolutely summing multipliers"&#13;
and " sequences in the range of a vector measure" are intertwined, we intend to extend&#13;
the concept of "absolutely summing multiplier" to more general types of "summing multipliers"&#13;
and to apply our work to consider properties of Banach space valued operators&#13;
on scalar sequence spaces.&#13;
What are our contributions in this thesis?&#13;
* Firstly, we introduce an operator ideal approach which seems to provide a natural&#13;
setting in which to consider the existence of projections from L(X, Y)* onto K(X, Y)* and&#13;
derive necessary and sufficient conditions for the reflexivity of K(X, Y) in the absence&#13;
of the approximation property. Thus we simplify the proofs of existing results in the&#13;
literature and also generalise these results to such an extend that at least the well known&#13;
spaces without the approximation property are included.&#13;
* Secondly, in line with modern trend to provide proofs for theorems about operators&#13;
on Banach spaces which do not rely on the approximation properties, we expand the&#13;
concept of conjugate ideal to introduce the operator dual space of spaces of bounded&#13;
linear operators. It turns out that the operator dual space is a handy tool to study&#13;
inclusion theorems for spaces of operators. Also, using operator dual techniques, we are&#13;
able to prove existing characterisations of continuous dual spaces of important classes of&#13;
operators without relying on the continuity of the trace functional with respect to the&#13;
nuclear norm - thus the proofs do not depend on the approximation property.&#13;
* Thirdly, we provide a direct and easy proof of a known result which provides necessary&#13;
and sufficient condition for all weak p-summable sequences in a Banach space to be norm&#13;
null. Our proof uses sequence space arguments only, thereby allowing us to extend the&#13;
proof to more general sequence spaces, including certain Orlicz sequence spaces. Applying&#13;
these results, together with some known characterisations of operators on sequence spaces&#13;
in terms of vector sequence spaces, we succeed on the one hand to provide easier proofs&#13;
for existing results about necessary and sufficient conditions for the equalities K(1!.P, X) =&#13;
L(1!.P,X) and K(co, X) = L(co, X) and on the other hand to obtain further improvements.&#13;
* An absolutely summing multiplier of a Banach space X is a scalar sequence (ai) such&#13;
that (aixi) is absolutely summable in X for all weakly absolutely summable sequences (Xi)&#13;
in X. Recently there were several papers by Spanish mathematicians about sequences in&#13;
the range of a vector measure. We expose the fact that these concepts are intertwined and&#13;
thereby show that various results in one of the papers about sequences in the range of a&#13;
vector measure can easily be obtained, using the absolutely summing multiplier concept.&#13;
In the last chapter of the thesis we generalise the idea of absolutely summing multiplier&#13;
to that of p-summing multiplier, A- summing multiplier and even more general, (A, E)-&#13;
summing multiplier and use these concepts to obtain results about Banach space valued&#13;
bounded linear operators on A.
Donation
</description>
<pubDate>Fri, 01 Jan 1999 00:00:00 GMT</pubDate>
<guid isPermaLink="false">https://repository.maseno.ac.ke/handle/123456789/4342</guid>
<dc:date>1999-01-01T00:00:00Z</dc:date>
</item>
<item>
<title>Utility-Based Agent for Vehicle Driver Behaviour Modelling</title>
<link>https://repository.maseno.ac.ke/handle/123456789/4007</link>
<description>Utility-Based Agent for Vehicle Driver Behaviour Modelling
IMENDE, James Obuhuma
Knowledge on driver behaviour is a major factor that can possibly aid in future strategies for minimising if not fully controlling road fatalities. The behaviour of human vehicle drivers is the main cause of road accidents and is also the factor which has so far proved to be the most difficult to establish and model. Studies conducted on driver behaviour modelling have been limited by five factors: study methodology; vehicle model compatibility; cost; overestimation of critical driving events; and scope for driver behaviour monitoring. Probabilistic reasoning and intelligence, which are critical in modelling under stochastic environments are lacking in the applied methodologies. Fortunately, a combination of computing and communication technologies now makes it possible to model the behaviour of drivers operating in complex environments. The main objective of the research was to model human vehicle driver behaviour using a utility-based agent. To realise this objective, the research identified parameters that describe the behaviour of a human vehicle driver operating under diverse environments, formulated a vehicle driver behaviour dataset and developed and evaluated a vehicle driver agent that can operate in a complex environment. A sample of 30 drivers was used, with tonnes of data collected and analysed. Vehicle position coordinates, speed, direction, altitude, time and a reflected signal signifying the presence of an obstacle were collected using the Global Positioning System (GPS) comprising of satellites, GPS receivers and a server. Data analysis generated a driver behaviour dataset that was used in the preparation of the driver agent through three main phases: training, validation and testing. The driver agent was founded on Mixture Models with Bayesian inferencing techniques that performed driver behavioural pattern recognition and predictive analyses. The agent’s actions under dynamic conditions were evaluated against sets of performance standards, yielding mean success rates of over 68% accuracies and over 70% F-scores, +/- 5. This was an indicator of the appropriateness of the data collection tools and techniques, data analysis algorithms and the driver behaviour dataset. The significance of the study is three-fold. First, the function of the system could be extended to providing advisory services to drivers in real-time. Second, data gathered from the system could be used by road safety stakeholders to vet drivers and to diagnose causes of road accidents. Finally, the resulting knowledge-base could establish standards of rationality in driving and/or formulate rules for use in driverless vehicle control systems.
</description>
<pubDate>Wed, 01 Jan 2020 00:00:00 GMT</pubDate>
<guid isPermaLink="false">https://repository.maseno.ac.ke/handle/123456789/4007</guid>
<dc:date>2020-01-01T00:00:00Z</dc:date>
</item>
<item>
<title>A Model of the Relationships Between Availability Mechanisms and Outage Sources in Cloud Computing</title>
<link>https://repository.maseno.ac.ke/handle/123456789/3635</link>
<description>A Model of the Relationships Between Availability Mechanisms and Outage Sources in Cloud Computing
MBOGHOLI, JOHN SAUL MSAGHA
ABSTRACT&#13;
The use of cloud computing has been growing exponentially since its inception. Availability of the cloud, however, has been a problem for users and Cloud Service Providers (CSPs) alike; outages have been on the rise. This problem could be attributed to the fact that engineers building Availability Mechanisms (AMs) and those studying outage causes do not work collectively. The general objective of the study was to develop and evaluate an availability mechanism model for service outages in cloud computing environments. The study specifically sought to: identify the causes of outages in cloud computing environments, identify availability mechanisms in use in cloud computing environments, formulate a model that establishes correspondences between AMs and outages, and evaluate the performance of the model by measuring its service availability levels in cloud computing environments in relation to the settings of the cloud computing system parameters. A model was developed called the Ferris Wheel of Availability (FWA) model. The model was developed by relating AMs to outage causes, with AMs being conjugate in nature in relation to the respective outage causes. There were seven categories of AMs and seven categories of outage causes; AMs were categorized as cluster management, component redundancy, limit detection policy, checkpointing, node management, Active-X variant and fault tolerance. Outage causes were categorized as configuration issues, hardware issues, resource exhaustion, security issues, node failures, network issues and natural disasters. Testing of the model was done using CloudSim, a discrete, deterministic simulator that allows users to set up their customized configurations and run them in it. The simulator was configured to run each outage cause individually and the applicable AMs were then injected simultaneously and output recorded. Each outage cause had two AMs, and the findings confirmed the effectiveness of the proposed model structure in increasing service availability at infrastructure level. Key findings were that checkpointing is not effective as an AM against resource exhaustion, and that effective management of a cluster results in effective management of the nodes in it. It was not conclusive as to whether limit detection policy was effective as an AM against security issues. The study also suggested that limit detection policy be renamed limit prevention policy. The study introduced a new availability parameter called execution availability and recommended its use together with service availability in predicting overall availability at infrastructure level. The key contributions of the study were: development of the FWA model that establishes correspondences between AMs and outage causes since a model that establishes these correspondences had not been developed before; discovery of the relationships between AMs and outage causes based on simulation tests and consequent analysis; and introduction of an availability parameter called execution availability that measures the ratio of tasks allocated versus tasks executed. It is recommended to study the feasibility of merging two or more simulators to achieve results which were inconclusive using one simulator; an extension to the simulator in use may also be investigated. The use of the FWA model at CSP level is also recommended as it assists analysts and developers to build for availability from the very foundation as opposed to adapting a wait-and-see attitude in countering outages as they occur. The outcome of the study points to suggest that the application of the FWA model in a cloud computing infrastructure has the potential to increase availability in the cloud.
</description>
<pubDate>Wed, 01 Jan 2020 00:00:00 GMT</pubDate>
<guid isPermaLink="false">https://repository.maseno.ac.ke/handle/123456789/3635</guid>
<dc:date>2020-01-01T00:00:00Z</dc:date>
</item>
</channel>
</rss>
