Predicting Influential Blogger’s by a Novel, Hybrid and Optimized Case Based Reasoning Approach With Balanced Random Forest Using Imbalanced Data
Bloggers possess the capability of understanding and influencing mass psychology to a wide community of fans and followers by posting their online valuable content. Their dominance over audience can be used as a helping hand in the corporate world which desires to disseminate their product or servic...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2021-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9311724/ |
_version_ | 1818734586033602560 |
---|---|
author | Yousra Asim Ahmad Kamran Malik Basit Raza Ahmad R. Shahid Nafees Qamar |
author_facet | Yousra Asim Ahmad Kamran Malik Basit Raza Ahmad R. Shahid Nafees Qamar |
author_sort | Yousra Asim |
collection | DOAJ |
description | Bloggers possess the capability of understanding and influencing mass psychology to a wide community of fans and followers by posting their online valuable content. Their dominance over audience can be used as a helping hand in the corporate world which desires to disseminate their product or services among diversified people belonging to varying localities, and is always on the lookout for suitable and quick ways to grasp public access. Due to this reason, influential bloggers are preferred in the online market to initiate marketing campaigns which is a thought-provoking task due to loads of blogger communities. The novelty of this paper lies in the proposed Framework for Influential Blogger Prediction based on Blogger and Blog Features (IBP-BBF) using Case-Based Reasoning (CBR) which is not only capable of handling labeled data but also unstructured data (blogs) and imbalanced data in an optimized way. Detailed labelled and unstructured data are collected by online survey of 129 bloggers and text mining of their 32,200 blogs respectively. The classification results are compared and validated with state-of-the-art machine learning techniques by using standard evaluation measures respectively in the context of imbalanced data. The results show that the proposed IBP-BBF framework through CBR modeling outperforms existing techniques in classifying and adapting the influential blogger prediction. The IBP-BBF framework performed better as compared to baseline imbalanced data classification techniques. It is found that the Balanced Random Forest contributes towards the performance of CBR approach than Balanced Bagging Classifier and RUSBoost classifier. By using the CBR approach, baseline techniques can be optimized for influential blogger identification in a better way. |
first_indexed | 2024-12-18T00:07:43Z |
format | Article |
id | doaj.art-9c94d8527c1746d7a138384fc7592f06 |
institution | Directory Open Access Journal |
issn | 2169-3536 |
language | English |
last_indexed | 2024-12-18T00:07:43Z |
publishDate | 2021-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj.art-9c94d8527c1746d7a138384fc7592f062022-12-21T21:27:45ZengIEEEIEEE Access2169-35362021-01-0196836685410.1109/ACCESS.2020.30486109311724Predicting Influential Blogger’s by a Novel, Hybrid and Optimized Case Based Reasoning Approach With Balanced Random Forest Using Imbalanced DataYousra Asim0https://orcid.org/0000-0003-1521-6579Ahmad Kamran Malik1https://orcid.org/0000-0001-5569-5629Basit Raza2https://orcid.org/0000-0001-6711-2363Ahmad R. Shahid3https://orcid.org/0000-0002-7520-6770Nafees Qamar4Department of Computer Science, COMSATS University Islamabad (CUI), Islamabad, PakistanDepartment of Computer Science, COMSATS University Islamabad (CUI), Islamabad, PakistanDepartment of Computer Science, COMSATS University Islamabad (CUI), Islamabad, PakistanDepartment of Computer Science, COMSATS University Islamabad (CUI), Islamabad, PakistanDepartment of Health Administration, Governors State University, University Park, IL, USABloggers possess the capability of understanding and influencing mass psychology to a wide community of fans and followers by posting their online valuable content. Their dominance over audience can be used as a helping hand in the corporate world which desires to disseminate their product or services among diversified people belonging to varying localities, and is always on the lookout for suitable and quick ways to grasp public access. Due to this reason, influential bloggers are preferred in the online market to initiate marketing campaigns which is a thought-provoking task due to loads of blogger communities. The novelty of this paper lies in the proposed Framework for Influential Blogger Prediction based on Blogger and Blog Features (IBP-BBF) using Case-Based Reasoning (CBR) which is not only capable of handling labeled data but also unstructured data (blogs) and imbalanced data in an optimized way. Detailed labelled and unstructured data are collected by online survey of 129 bloggers and text mining of their 32,200 blogs respectively. The classification results are compared and validated with state-of-the-art machine learning techniques by using standard evaluation measures respectively in the context of imbalanced data. The results show that the proposed IBP-BBF framework through CBR modeling outperforms existing techniques in classifying and adapting the influential blogger prediction. The IBP-BBF framework performed better as compared to baseline imbalanced data classification techniques. It is found that the Balanced Random Forest contributes towards the performance of CBR approach than Balanced Bagging Classifier and RUSBoost classifier. By using the CBR approach, baseline techniques can be optimized for influential blogger identification in a better way.https://ieeexplore.ieee.org/document/9311724/Blogger classificationcase based reasoning (CBR)machine learning (ML)imbalanced datatext mining |
spellingShingle | Yousra Asim Ahmad Kamran Malik Basit Raza Ahmad R. Shahid Nafees Qamar Predicting Influential Blogger’s by a Novel, Hybrid and Optimized Case Based Reasoning Approach With Balanced Random Forest Using Imbalanced Data IEEE Access Blogger classification case based reasoning (CBR) machine learning (ML) imbalanced data text mining |
title | Predicting Influential Blogger’s by a Novel, Hybrid and Optimized Case Based Reasoning Approach With Balanced Random Forest Using Imbalanced Data |
title_full | Predicting Influential Blogger’s by a Novel, Hybrid and Optimized Case Based Reasoning Approach With Balanced Random Forest Using Imbalanced Data |
title_fullStr | Predicting Influential Blogger’s by a Novel, Hybrid and Optimized Case Based Reasoning Approach With Balanced Random Forest Using Imbalanced Data |
title_full_unstemmed | Predicting Influential Blogger’s by a Novel, Hybrid and Optimized Case Based Reasoning Approach With Balanced Random Forest Using Imbalanced Data |
title_short | Predicting Influential Blogger’s by a Novel, Hybrid and Optimized Case Based Reasoning Approach With Balanced Random Forest Using Imbalanced Data |
title_sort | predicting influential blogger x2019 s by a novel hybrid and optimized case based reasoning approach with balanced random forest using imbalanced data |
topic | Blogger classification case based reasoning (CBR) machine learning (ML) imbalanced data text mining |
url | https://ieeexplore.ieee.org/document/9311724/ |
work_keys_str_mv | AT yousraasim predictinginfluentialbloggerx2019sbyanovelhybridandoptimizedcasebasedreasoningapproachwithbalancedrandomforestusingimbalanceddata AT ahmadkamranmalik predictinginfluentialbloggerx2019sbyanovelhybridandoptimizedcasebasedreasoningapproachwithbalancedrandomforestusingimbalanceddata AT basitraza predictinginfluentialbloggerx2019sbyanovelhybridandoptimizedcasebasedreasoningapproachwithbalancedrandomforestusingimbalanceddata AT ahmadrshahid predictinginfluentialbloggerx2019sbyanovelhybridandoptimizedcasebasedreasoningapproachwithbalancedrandomforestusingimbalanceddata AT nafeesqamar predictinginfluentialbloggerx2019sbyanovelhybridandoptimizedcasebasedreasoningapproachwithbalancedrandomforestusingimbalanceddata |