SQL or NoSQL? Contrasting Approaches to the Storage, Manipulation and Analysis of Spatio-temporal Online Social Network Data

作者: Adrian Tear

DOI: 10.1007/978-3-319-09144-0_16

关键词:

摘要: Researchers are now accessing millions of Online Social Network (OSN) interactions. These available at no or low cost through Application Programming Interfaces (APIs) data custodians including DataSift and GNIP. Records held in Extensible Markup Language (XML) JavaScript Object Notation (JSON) well structured but often inconveniently formatted for use popular Relational Database Management Systems (RDBMS) Geographic Information (GIS) software. In contrast, emerging NoSQL (Not-only Structured Query Language) technologies specially designed to ‘ingest’ unstructured data. Extract/Transform/Load (ETL) procedures the storage subsequent analysis two OSN datasets SQL/NoSQL databases examined. The fixed model relational approach may prove problematic when loading unpredictable document-based structures arising from extended periods collection. Although far obsolete spatial community seems likely benefit experimentation with new software explicitly handling spatio-temporal Big Data.

参考文章(48)
Shashi Shekhar, Michael R. Evans, Viswanath Gunturi, KwangSoo Yang, Daniel Cintra Cugler, Benchmarking Spatial Big Data Specifying Big Data Benchmarks. pp. 81- 93 ,(2014) , 10.1007/978-3-642-53974-9_8
Rolf Kiefer, Christiane Biedermann, Public Relations (PR) Springer Berlin Heidelberg. pp. 117- 131 ,(2008) , 10.1007/978-3-540-36358-3_8
Raymondus Kosala, Erwin Adi, Steven, Harvesting real time traffic information from Twitter Procedia Engineering. ,vol. 50, pp. 1- 11 ,(2012) , 10.1016/J.PROENG.2012.10.001
J. Manyika, Michael Chui, Brad Brown, Jacques Bughin, Richard Dobbs, Charles Roxburgh, Angela Hung Byers, Big data: The next frontier for innovation, competition, and productivity ,(2011)
danah boyd, Kate Crawford, CRITICAL QUESTIONS FOR BIG DATA Information, Communication & Society. ,vol. 15, pp. 662- 679 ,(2012) , 10.1080/1369118X.2012.678878
Desmond F. D'Souza, Alan Cameron Wills, Objects, Components, and Frameworks With Uml: The Catalysis Approach ,(1998)
Bernie C. Till, Justin Longo, A. Rod Dobell, Peter F. Driessen, Self-organizing maps for latent semantic analysis of free-form text in support of public policy analysis Wiley Interdisciplinary Reviews-Data Mining and Knowledge Discovery. ,vol. 4, pp. 71- 86 ,(2014) , 10.1002/WIDM.1112
Fred Morstatter, Kathleen M. Carley, Jürgen Pfeffer, Huan Liu, Is the Sample Good Enough? Comparing Data from Twitter's Streaming API with Twitter's Firehose international conference on weblogs and social media. pp. 400- 408 ,(2013)
Stefan Stieglitz, Christian Kaufhold, Automatic Full Text Analysis in Public Social Media - Adoption of a Software Prototype to Investigate Political Communication international conference ambient systems networks and technologies. ,vol. 5, pp. 776- 781 ,(2011) , 10.1016/J.PROCS.2011.07.104