Facilitating Twitter data analytics: Platform, language and functionality

作者: Ke Tao , Claudia Hauff , Geert-Jan Houben , Fabian Abel , Guido Wachsmuth

DOI: 10.1109/BIGDATA.2014.7004259

关键词: Social webSet (abstract data type)Computer scienceAnalyticsData analysisWorkflowUse caseData collectionWorld Wide WebInterface (Java)

摘要: Conducting analytics over data generated by Social Web portals such as Twitter is challenging, due to the volume, variety and velocity of data. Commonly, adhoc pipelines are used that solve a particular use case. In this paper, we generalize across range typical Twitter-data cases determine set common characteristics. Based on investigation, present our Analytical Platform (TAP), generic platform for conducting analytical tasks with The provides domain-specific Analysis Language (TAL) interface its functionality stack. TAL includes analysis tools ranging from collection semantic enrichment, machine learning. With these tools, it becomes possible create customize workflows in build applications make results. We showcase applicability building Twinder-a search engine streams.

参考文章(32)
Fabian Abel, Claudia Hauff, Geert-Jan Houben, Ke Tao, Leveraging user modeling on the social web with linked data international conference on web engineering. pp. 378- 385 ,(2012) , 10.1007/978-3-642-31753-8_31
Fabian Abel, Qi Gao, Geert-Jan Houben, Ke Tao, Semantic Enrichment of Twitter Posts for User Profile Construction on the Social Web The Semanic Web: Research and Applications. pp. 375- 389 ,(2011) , 10.1007/978-3-642-21064-8_26
Cecilia Mascolo, Anastasios Noulas, Massimiliano Pontil, Salvatore Scellato, Exploiting semantic annotations for clustering geographic areas and users in location-based social networks international conference on weblogs and social media. ,(2011)
Qi Gao, Fabian Abel, Geert-Jan Houben, GeniUS: generic user modeling library for the social semantic web international semantic technology conference. pp. 160- 175 ,(2011) , 10.1007/978-3-642-29923-0_11
Erhard Rahm, Hong Hai Do, Data Cleaning: Problems and Current Approaches. IEEE Data(base) Engineering Bulletin. ,vol. 23, pp. 3- 13 ,(2000)
Ke Tao, Fabian Abel, Qi Gao, Geert-Jan Houben, TUMS: twitter-based user modeling service international semantic web conference. pp. 269- 283 ,(2011) , 10.1007/978-3-642-25953-1_22
Paul S. Earle, Daniel C. Bowden, Michelle R. Guy, Twitter earthquake detection: earthquake monitoring in a social world Annals of Geophysics. ,vol. 54, pp. 708- 715 ,(2012) , 10.4401/AG-5364
Ke Tao, Fabian Abel, Claudia Hauff, Geert-Jan Houben, Ujwal Gadiraju, Groundhog day Proceedings of the 22nd international conference on World Wide Web - WWW '13. pp. 1273- 1284 ,(2013) , 10.1145/2488388.2488499