Examine Manipulated Datasets with Topology Data Analysis: A Case Study

作者: Yun Guo , Daniel Sun , Guoqiang Li , Shiping Chen

DOI: 10.1007/978-3-030-01950-1_21

关键词:

摘要: Learning and mining technologies have been broadly applied to reveal the value of tremendous data impact decision-making. Usually, correctness decisions roots in truth for these technologies. Data fraud presents everywhere, even if were true, could be maliciously manipulated by cyber-attackers. Methods long exploited examine authenticity, but are less effective when only values without violating scopes definitions. Then made from wrong or hijacked. It has concluded that manipulation is latest technique “the art war cyberspace.” Examining each instance its source exhaustive impossible, example recollecting national consensus. In this paper, through a case study on banknotes, we exploit Topological Analysis (TDA) examining data. A fraction records examined integrally other than individually. The possibility using TDA verify efficiently then evaluated. We first test above detection, discuss limitations state art. Although not so matured, it reported many applications, now our work evidences usage anomalies.

参考文章(29)
Clément Maria, Jean-Daniel Boissonnat, Marc Glisse, Mariette Yvinec, The Gudhi library: Simplicial complexes and persistent homology international congress on mathematical software. pp. 167- 174 ,(2014) , 10.1007/978-3-662-44199-2_28
Ulrike Tillmann, Ulrike Tillmann, Mason A. Porter, Mason A. Porter, Peter Grindrod, Heather A. Harrington, Nina Otter, Nina Otter, A roadmap for the computation of persistent homology arXiv: Algebraic Topology. ,(2015) , 10.1140/EPJDS/S13688-017-0109-5
Richard J. Bolton, David J. Hand, Foster Provost, Leo Breiman, Richard J. Bolton, David J. Hand, Statistical Fraud Detection: A Review Statistical Science. ,vol. 17, pp. 235- 255 ,(2002) , 10.1214/SS/1042727940
Facundo Mémoli, Gunnar E. Carlsson, Gurjeet Singh, Topological Methods for the Analysis of High Dimensional Data Sets and 3D Object Recognition eurographics. pp. 91- 100 ,(2007) , 10.2312/SPBG/SPBG07/091-100
Florian T. Pokorny, Majd Hawasly, Subramanian Ramamoorthy, Topological trajectory classification with filtrations of simplicial complexes and persistent homology The International Journal of Robotics Research. ,vol. 35, pp. 204- 223 ,(2016) , 10.1177/0278364915586713
D. DeWoskin, J. Climent, I. Cruz-White, M. Vazquez, C. Park, J. Arsuaga, Applications of computational homology to the analysis of treatment response in breast cancer patients Topology and its Applications. ,vol. 157, pp. 157- 164 ,(2010) , 10.1016/J.TOPOL.2009.04.036
Pek Y Lum, Gurjeet Singh, Alan Lehman, Tigran Ishkanov, Mikael Vejdemo-Johansson, Muthu Alagappan, John Carlsson, Gunnar Carlsson, None, Extracting insights from the shape of complex data using topology Scientific Reports. ,vol. 3, pp. 1236- 1236 ,(2013) , 10.1038/SREP01236
Stephen C. Johnson, Hierarchical clustering schemes Psychometrika. ,vol. 32, pp. 241- 254 ,(1967) , 10.1007/BF02289588
Kelin Xia, Xin Feng, Yiying Tong, Guo Wei Wei, Persistent homology for the quantitative prediction of fullerene stability. Journal of Computational Chemistry. ,vol. 36, pp. 408- 422 ,(2015) , 10.1002/JCC.23816
Subhrajit Bhattacharya, Robert Ghrist, Vijay Kumar, Persistent Homology for Path Planning in Uncertain Environments IEEE Transactions on Robotics. ,vol. 31, pp. 578- 590 ,(2015) , 10.1109/TRO.2015.2412051