INforE: Interactive Cross-platform Analytics for Everyone

作者: Nikos Giatrakos , David Arnu , Theodoros Bitsakis , Antonios Deligiannakis , Minos Garofalakis

DOI: 10.1145/3340531.3417435

关键词: Big dataInteractivityScale (chemistry)AnalyticsWorkflowComputer scienceCross-platformMultimediaStreaming dataData stream mining

摘要: We present INforE, a prototype supporting non-expert programmers in performing optimized, cross-platform, streaming analytics at scale. INforE offers: a) new extension to the RapidMiner Studio for graphical design of Big Data workflows, (b) novel optimizer instruct execution workflows across platforms and clusters, (c) synopses data engine interactivity scale via use summaries, (d) distributed, online mining machine learning module. To our knowledge is first holistic approach settings. demonstrate fields life science financial analysis.

参考文章(10)
Mu Li, Scaling Distributed Machine Learning with the Parameter Server international conference on big data. ,vol. 2014, pp. 3- ,(2014) , 10.1145/2640087.2644155
Vanja Josifovski, Alexander J. Smola, Bor-Yiing Su, David G. Andersen, Amr Ahmed, James Long, Eugene J. Shekita, Jun Woo Park, Mu Li, Scaling distributed machine learning with the parameter server operating systems design and implementation. pp. 583- 598 ,(2014) , 10.5555/2685048.2685095
Ionel Gog, Malte Schwarzkopf, Natacha Crooks, Matthew P. Grosvenor, Allen Clement, Steven Hand, Musketeer: all for one, one for all in data processing systems european conference on computer systems. pp. 2- ,(2015) , 10.1145/2741948.2741968
Minos Garofalakis, Johannes Gehrke, Rajeev Rastogi, Data Stream Management: A Brave New World Data-Centric Systems and Applications. pp. 1- 9 ,(2016) , 10.1007/978-3-540-28608-0_1
Data Stream Management Springer Berlin Heidelberg. ,(2016) , 10.1007/978-3-540-28608-0
Minlan Yu, Jianshu Chen, Hongqiang Harry Liu, Shivaram Venkataraman, Ming Zhang, Omid Alipourfard, Cherrypick: adaptively unearthing the best cloud configurations for big data analytics networked systems design and implementation. pp. 469- 482 ,(2017)
Róbert Pálovics, András A. Benczúr, Levente Kocsis, Online Machine Learning in Big Data Streams. arXiv: Distributed, Parallel, and Cluster Computing. ,(2018)
Ji Lucas, Yasser Idris, Bertty Contreras-Rojas, Jorge-Arnulfo Quiane-Ruiz, Sanjay Chawla, RheemStudio: Cross-Platform Data Analytics Made Easy 2018 IEEE 34th International Conference on Data Engineering (ICDE). pp. 1573- 1576 ,(2018) , 10.1109/ICDE.2018.00179
Antonis Kontaxakis, Nikos Giatrakos, Antonios Deligiannakis, A Synopses Data Engine for Interactive Extreme-Scale Analytics conference on information and knowledge management. pp. 2085- 2088 ,(2020) , 10.1145/3340531.3412154
Katerina Doka, Nikolaos Papailiou, Dimitrios Tsoumakos, Christos Mantas, Nectarios Koziris, IReS: Intelligent, Multi-Engine Resource Scheduler for Big Data Analytics Workflows international conference on management of data. pp. 1451- 1456 ,(2015) , 10.1145/2723372.2735377