Bot crawler to retrieve data from Facebook based on the selection of posts and the extraction of user profiles

Introduction: Data can currently be found within organizations and outside of them, they are growing exponentially. Today, the information available on the Internet and social networks has become a generator of value, through the effective analysis of a specific situation, using techniques and metho...

Full description

Bibliographic Details
Main Authors: Ariel Guillermo Sánchez Paipilla, Mónica Katherine Durán Vaca, Javier Antonio Ballesteros Ricaurte, Angela María González Amarillo, Pedro Nel López
Format: Article
Language:English
Published: Universidad de la Costa 2022-09-01
Series:Inge-Cuc
Subjects:
Online Access:https://revistascientificas.cuc.edu.co/ingecuc/article/view/4419
_version_ 1797668269932412928
author Ariel Guillermo Sánchez Paipilla
Mónica Katherine Durán Vaca
Javier Antonio Ballesteros Ricaurte
Angela María González Amarillo
Pedro Nel López
author_facet Ariel Guillermo Sánchez Paipilla
Mónica Katherine Durán Vaca
Javier Antonio Ballesteros Ricaurte
Angela María González Amarillo
Pedro Nel López
author_sort Ariel Guillermo Sánchez Paipilla
collection DOAJ
description Introduction: Data can currently be found within organizations and outside of them, they are growing exponentially. Today, the information available on the Internet and social networks has become a generator of value, through the effective analysis of a specific situation, using techniques and methodologies with which content-based solutions can be proposed, and thus achieve, execute timely, intelligent and assertive decision-making processes. Objective: The main objective of this work is to development of a Bot Crawler, which allows extracting information from Facebook without access restrictions, or request for credentials, based on web crawling and scraping techniques, through the selection of HTML tags, to track and be able to define patterns. Method: The development of this project consisted of four main stages: A) Teamwork with SCRUM, B) Comparison of web data extraction techniques, C) Extraction and validation of permissions to access the data in Facebook, D) Development of the bor crawler. Results:  Briefly, mention the main results of the research Conclusions: As a result of this process, a graphical interface is created that allows checking the process of obtaining data derived from user profiles of this social network.
first_indexed 2024-03-11T20:26:43Z
format Article
id doaj.art-d4ffeb3558d043359253fda734fc94e9
institution Directory Open Access Journal
issn 0122-6517
2382-4700
language English
last_indexed 2024-03-11T20:26:43Z
publishDate 2022-09-01
publisher Universidad de la Costa
record_format Article
series Inge-Cuc
spelling doaj.art-d4ffeb3558d043359253fda734fc94e92023-10-02T14:16:52ZengUniversidad de la CostaInge-Cuc0122-65172382-47002022-09-0118210111310.17981/ingecuc.18.2.2022.083435Bot crawler to retrieve data from Facebook based on the selection of posts and the extraction of user profilesAriel Guillermo Sánchez Paipilla0Mónica Katherine Durán Vaca1Javier Antonio Ballesteros RicaurteAngela María González Amarillo2Pedro Nel López3Universidad Pedagógica y Tecnológica de ColombiaUniversidad Pedagógica y Tecnológica de ColombiaUniversidad Nacional Abierta y a DistanciaUniversidad Pedagógica y Tecnológica de ColombiaIntroduction: Data can currently be found within organizations and outside of them, they are growing exponentially. Today, the information available on the Internet and social networks has become a generator of value, through the effective analysis of a specific situation, using techniques and methodologies with which content-based solutions can be proposed, and thus achieve, execute timely, intelligent and assertive decision-making processes. Objective: The main objective of this work is to development of a Bot Crawler, which allows extracting information from Facebook without access restrictions, or request for credentials, based on web crawling and scraping techniques, through the selection of HTML tags, to track and be able to define patterns. Method: The development of this project consisted of four main stages: A) Teamwork with SCRUM, B) Comparison of web data extraction techniques, C) Extraction and validation of permissions to access the data in Facebook, D) Development of the bor crawler. Results:  Briefly, mention the main results of the research Conclusions: As a result of this process, a graphical interface is created that allows checking the process of obtaining data derived from user profiles of this social network.https://revistascientificas.cuc.edu.co/ingecuc/article/view/4419web scrapingweb crawlinghtmlsocial networkingdata
spellingShingle Ariel Guillermo Sánchez Paipilla
Mónica Katherine Durán Vaca
Javier Antonio Ballesteros Ricaurte
Angela María González Amarillo
Pedro Nel López
Bot crawler to retrieve data from Facebook based on the selection of posts and the extraction of user profiles
Inge-Cuc
web scraping
web crawling
html
social networking
data
title Bot crawler to retrieve data from Facebook based on the selection of posts and the extraction of user profiles
title_full Bot crawler to retrieve data from Facebook based on the selection of posts and the extraction of user profiles
title_fullStr Bot crawler to retrieve data from Facebook based on the selection of posts and the extraction of user profiles
title_full_unstemmed Bot crawler to retrieve data from Facebook based on the selection of posts and the extraction of user profiles
title_short Bot crawler to retrieve data from Facebook based on the selection of posts and the extraction of user profiles
title_sort bot crawler to retrieve data from facebook based on the selection of posts and the extraction of user profiles
topic web scraping
web crawling
html
social networking
data
url https://revistascientificas.cuc.edu.co/ingecuc/article/view/4419
work_keys_str_mv AT arielguillermosanchezpaipilla botcrawlertoretrievedatafromfacebookbasedontheselectionofpostsandtheextractionofuserprofiles
AT monicakatherineduranvaca botcrawlertoretrievedatafromfacebookbasedontheselectionofpostsandtheextractionofuserprofiles
AT javierantonioballesterosricaurte botcrawlertoretrievedatafromfacebookbasedontheselectionofpostsandtheextractionofuserprofiles
AT angelamariagonzalezamarillo botcrawlertoretrievedatafromfacebookbasedontheselectionofpostsandtheextractionofuserprofiles
AT pedronellopez botcrawlertoretrievedatafromfacebookbasedontheselectionofpostsandtheextractionofuserprofiles