作者: Jukka Ruohonen , Sanja Scepanovic , Sami Hyrynsalmi , Igor Mishkovski , Tuomas Aura
关键词: Popularity 、 The Internet 、 World Wide Web 、 Big data 、 Computer science 、 Theoretical definition 、 Open data 、 Web crawler 、 Malware 、 Snapshot (computer storage)
摘要: This short empirical paper investigates a snapshot of about two million files from continuously updated big data collection maintained by F-Secure for security intelligence purposes. By further augmenting the with open covering half files, examines questions: (a) what is shape probability distribution characterizing relative share malware to all distributed web-facing Internet domains, and (b) shaping popularity files? A bimodal proposed as an answer former question, while graph theoretical definition concept indicates long-tailed, extreme value distribution. With these questions – answers thereto, contributes attempts understand large-scale characteristics at grand population level whole Internet.