COSTECH Integrated Repository

Comparative study of pagerank and hits algorithms for reciprocal link prediction in online social networks

Show simple item record

dc.creator Pallangyo, Brian Somi
dc.date 2020-09-30T08:33:38Z
dc.date 2020-09-30T08:33:38Z
dc.date 2020
dc.date.accessioned 2022-10-20T13:46:58Z
dc.date.available 2022-10-20T13:46:58Z
dc.identifier Pallangyo, B. S. (2020) Comparative study of pagerank and hits algorithms for reciprocal link prediction in online social networks (Master’s dissertation). The University of Dodoma, Dodoma.
dc.identifier http://hdl.handle.net/20.500.12661/2492
dc.identifier.uri http://hdl.handle.net/20.500.12661/2492
dc.description Dissertation (MSc Computer Science)
dc.description Online Social Networks (OSN) provides active space for digital human interaction and are used daily. Human engagement is reflected by exploiting the dynamics of OSN, where the fundamental problem is to infer future interactions on the network, called link prediction. Most studies have employed classical algorithms which consider node similarity but neglected the link analysis algorithms which consider topological structure. This study focused on the comparative study of predicting reciprocal interaction from para-social interaction using algorithms. Particularly, this study selected PageRank and HITS, which are considered famous link analysis algorithms with high order heuristics. Network simulation was performed to understand the performance of the algorithms when used to predict reciprocal link formation by employing machine learning techniques. For the experiment, two datasets were used to ensure the reliability of the results. Initially, the publicly available secondary dataset of Twitter was used followed by primary dataset crawled from Mayocoo, both of which are directed networks. The resulting networks from both datasets adhere to power-law distribution. Resource allocation was used as the baseline for the study after outperforming Adamic-Adar, Jaccard Coefficient, and Preferential Attachment. The result of this study showed that both PageRank and HITS surpassed the baseline in performance of prediction. Thus, PageRank has an accuracy improvement of 1.8% with precision and recall of 4.8% and 1.1%, respectively. Furthermore, this improvement comes with a balance of 3% (f1-measure). When HITS is used, there is an improvement accuracy by 5%, with 15.1% (precision), 7.9% (recall) and 11.5% (f1-measure). These empirical results demonstrate that HITS outperforms PageRank in prediction performance. Also, the results from the computational test showed that PageRank uses less computational resources compared to HITS. This study suggests the use of link analysis algorithms over classical algorithms for reciprocal link prediction in OSN. Furthermore, the use of HITS is recommended when prediction performance is vital compared to computational cost, otherwise, PageRank in cases were computational resources are minimal.
dc.language en
dc.publisher The University of Dodoma
dc.subject Pagerank
dc.subject Hits algorithms
dc.subject Reciprocal link
dc.subject Prediction
dc.subject Online social networks
dc.subject Social networks
dc.subject OSN
dc.subject Algorithm
dc.subject Reciprocal interaction
dc.subject Digital human interactions
dc.title Comparative study of pagerank and hits algorithms for reciprocal link prediction in online social networks
dc.type Dissertation


Files in this item

Files Size Format View
Pallangyo, Brian.pdf 2.797Mb application/pdf View/Open

This item appears in the following Collection(s)

Show simple item record

Search COSTECH


Advanced Search

Browse

My Account