TY - JOUR
T1 - OSN crawling schedulers and their implications on k-plexes detection
AU - Pérez-Solà, Cristina
AU - Herrera-Joancomartí, Jordi
PY - 2013/6/1
Y1 - 2013/6/1
N2 - Web crawlers are complex applications that explore the Web for different purposes. Web crawlers can be configured to crawl online social networks (OSNs) to obtain relevant data about their global structure. Before a web crawler can be launched to explore the Web, a large amount of settings have to be configured. These settings define the crawler's behavior and they have a big impact on the collected data. Both the amount of collected data and the quality of the information that it contains are affected by the crawler settings and, therefore, by properly configuring these web crawler settings we can target specific goals to achieve with our crawl. In this paper, we review the configuration choices that an attacker who wants to obtain information from an OSN by crawling it has to make to conduct his attack. We analyze different scheduler algorithms for web crawlers and evaluate their performance in terms of how useful they are to pursue a set of different adversary goals. © 2013 Wiley Periodicals, Inc.
AB - Web crawlers are complex applications that explore the Web for different purposes. Web crawlers can be configured to crawl online social networks (OSNs) to obtain relevant data about their global structure. Before a web crawler can be launched to explore the Web, a large amount of settings have to be configured. These settings define the crawler's behavior and they have a big impact on the collected data. Both the amount of collected data and the quality of the information that it contains are affected by the crawler settings and, therefore, by properly configuring these web crawler settings we can target specific goals to achieve with our crawl. In this paper, we review the configuration choices that an attacker who wants to obtain information from an OSN by crawling it has to make to conduct his attack. We analyze different scheduler algorithms for web crawlers and evaluate their performance in terms of how useful they are to pursue a set of different adversary goals. © 2013 Wiley Periodicals, Inc.
U2 - 10.1002/int.21594
DO - 10.1002/int.21594
M3 - Article
SN - 0884-8173
VL - 28
SP - 583
EP - 605
JO - International Journal of Intelligent Systems
JF - International Journal of Intelligent Systems
IS - 6
ER -