Predicting Influential Blogger’s by a Novel, Hybrid and Optimized Case Based Reasoning Approach With Balanced Random Forest Using Imbalanced Data

Bloggers possess the capability of understanding and influencing mass psychology to a wide community of fans and followers by posting their online valuable content. Their dominance over audience can be used as a helping hand in the corporate world which desires to disseminate their product or servic...

Full description

Bibliographic Details
Main Authors: Yousra Asim, Ahmad Kamran Malik, Basit Raza, Ahmad R. Shahid, Nafees Qamar
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9311724/
_version_ 1818734586033602560
author Yousra Asim
Ahmad Kamran Malik
Basit Raza
Ahmad R. Shahid
Nafees Qamar
author_facet Yousra Asim
Ahmad Kamran Malik
Basit Raza
Ahmad R. Shahid
Nafees Qamar
author_sort Yousra Asim
collection DOAJ
description Bloggers possess the capability of understanding and influencing mass psychology to a wide community of fans and followers by posting their online valuable content. Their dominance over audience can be used as a helping hand in the corporate world which desires to disseminate their product or services among diversified people belonging to varying localities, and is always on the lookout for suitable and quick ways to grasp public access. Due to this reason, influential bloggers are preferred in the online market to initiate marketing campaigns which is a thought-provoking task due to loads of blogger communities. The novelty of this paper lies in the proposed Framework for Influential Blogger Prediction based on Blogger and Blog Features (IBP-BBF) using Case-Based Reasoning (CBR) which is not only capable of handling labeled data but also unstructured data (blogs) and imbalanced data in an optimized way. Detailed labelled and unstructured data are collected by online survey of 129 bloggers and text mining of their 32,200 blogs respectively. The classification results are compared and validated with state-of-the-art machine learning techniques by using standard evaluation measures respectively in the context of imbalanced data. The results show that the proposed IBP-BBF framework through CBR modeling outperforms existing techniques in classifying and adapting the influential blogger prediction. The IBP-BBF framework performed better as compared to baseline imbalanced data classification techniques. It is found that the Balanced Random Forest contributes towards the performance of CBR approach than Balanced Bagging Classifier and RUSBoost classifier. By using the CBR approach, baseline techniques can be optimized for influential blogger identification in a better way.
first_indexed 2024-12-18T00:07:43Z
format Article
id doaj.art-9c94d8527c1746d7a138384fc7592f06
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-18T00:07:43Z
publishDate 2021-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-9c94d8527c1746d7a138384fc7592f062022-12-21T21:27:45ZengIEEEIEEE Access2169-35362021-01-0196836685410.1109/ACCESS.2020.30486109311724Predicting Influential Blogger’s by a Novel, Hybrid and Optimized Case Based Reasoning Approach With Balanced Random Forest Using Imbalanced DataYousra Asim0https://orcid.org/0000-0003-1521-6579Ahmad Kamran Malik1https://orcid.org/0000-0001-5569-5629Basit Raza2https://orcid.org/0000-0001-6711-2363Ahmad R. Shahid3https://orcid.org/0000-0002-7520-6770Nafees Qamar4Department of Computer Science, COMSATS University Islamabad (CUI), Islamabad, PakistanDepartment of Computer Science, COMSATS University Islamabad (CUI), Islamabad, PakistanDepartment of Computer Science, COMSATS University Islamabad (CUI), Islamabad, PakistanDepartment of Computer Science, COMSATS University Islamabad (CUI), Islamabad, PakistanDepartment of Health Administration, Governors State University, University Park, IL, USABloggers possess the capability of understanding and influencing mass psychology to a wide community of fans and followers by posting their online valuable content. Their dominance over audience can be used as a helping hand in the corporate world which desires to disseminate their product or services among diversified people belonging to varying localities, and is always on the lookout for suitable and quick ways to grasp public access. Due to this reason, influential bloggers are preferred in the online market to initiate marketing campaigns which is a thought-provoking task due to loads of blogger communities. The novelty of this paper lies in the proposed Framework for Influential Blogger Prediction based on Blogger and Blog Features (IBP-BBF) using Case-Based Reasoning (CBR) which is not only capable of handling labeled data but also unstructured data (blogs) and imbalanced data in an optimized way. Detailed labelled and unstructured data are collected by online survey of 129 bloggers and text mining of their 32,200 blogs respectively. The classification results are compared and validated with state-of-the-art machine learning techniques by using standard evaluation measures respectively in the context of imbalanced data. The results show that the proposed IBP-BBF framework through CBR modeling outperforms existing techniques in classifying and adapting the influential blogger prediction. The IBP-BBF framework performed better as compared to baseline imbalanced data classification techniques. It is found that the Balanced Random Forest contributes towards the performance of CBR approach than Balanced Bagging Classifier and RUSBoost classifier. By using the CBR approach, baseline techniques can be optimized for influential blogger identification in a better way.https://ieeexplore.ieee.org/document/9311724/Blogger classificationcase based reasoning (CBR)machine learning (ML)imbalanced datatext mining
spellingShingle Yousra Asim
Ahmad Kamran Malik
Basit Raza
Ahmad R. Shahid
Nafees Qamar
Predicting Influential Blogger’s by a Novel, Hybrid and Optimized Case Based Reasoning Approach With Balanced Random Forest Using Imbalanced Data
IEEE Access
Blogger classification
case based reasoning (CBR)
machine learning (ML)
imbalanced data
text mining
title Predicting Influential Blogger’s by a Novel, Hybrid and Optimized Case Based Reasoning Approach With Balanced Random Forest Using Imbalanced Data
title_full Predicting Influential Blogger’s by a Novel, Hybrid and Optimized Case Based Reasoning Approach With Balanced Random Forest Using Imbalanced Data
title_fullStr Predicting Influential Blogger’s by a Novel, Hybrid and Optimized Case Based Reasoning Approach With Balanced Random Forest Using Imbalanced Data
title_full_unstemmed Predicting Influential Blogger’s by a Novel, Hybrid and Optimized Case Based Reasoning Approach With Balanced Random Forest Using Imbalanced Data
title_short Predicting Influential Blogger’s by a Novel, Hybrid and Optimized Case Based Reasoning Approach With Balanced Random Forest Using Imbalanced Data
title_sort predicting influential blogger x2019 s by a novel hybrid and optimized case based reasoning approach with balanced random forest using imbalanced data
topic Blogger classification
case based reasoning (CBR)
machine learning (ML)
imbalanced data
text mining
url https://ieeexplore.ieee.org/document/9311724/
work_keys_str_mv AT yousraasim predictinginfluentialbloggerx2019sbyanovelhybridandoptimizedcasebasedreasoningapproachwithbalancedrandomforestusingimbalanceddata
AT ahmadkamranmalik predictinginfluentialbloggerx2019sbyanovelhybridandoptimizedcasebasedreasoningapproachwithbalancedrandomforestusingimbalanceddata
AT basitraza predictinginfluentialbloggerx2019sbyanovelhybridandoptimizedcasebasedreasoningapproachwithbalancedrandomforestusingimbalanceddata
AT ahmadrshahid predictinginfluentialbloggerx2019sbyanovelhybridandoptimizedcasebasedreasoningapproachwithbalancedrandomforestusingimbalanceddata
AT nafeesqamar predictinginfluentialbloggerx2019sbyanovelhybridandoptimizedcasebasedreasoningapproachwithbalancedrandomforestusingimbalanceddata