Mining sequences for patterns with non-repeating symbols

作者: Michal Walicki , Diogo R. Ferreira

DOI: 10.1109/CEC.2010.5585995

关键词:

摘要: Finding the case id in unlabeled event logs is arguably one of hardest challenges process mining research. While this problem can be addressed with greedy approaches, these usually converge to sub-optimal solutions. In paper, we describe an approach perform complete search over space. We formulate as a matter finding minimal set patterns contained sequence, where interleaved but do not have repeating symbols. show that for practical purposes it possible reduce space maximal disjoint occurrences patterns. Experimental results suggest that, whenever finds solution, one.

参考文章(8)
Diogo R. Ferreira, Daniel Gillblad, Discovering Process Models from Unlabelled Event Logs business process management. ,vol. 5701, pp. 143- 158 ,(2009) , 10.1007/978-3-642-03848-8_11
Diogo Ferreira, Marielba Zacarias, Miguel Malheiros, Pedro Ferreira, Approaching process mining with sequence clustering: experiments and findings business process management. pp. 360- 374 ,(2007) , 10.1007/978-3-540-75183-0_26
A. Tiwari, C.J. Turner, B. Majeed, A review of business process mining: state‐of‐the‐art and future trends Business Process Management Journal. ,vol. 14, pp. 5- 22 ,(2008) , 10.1108/14637150810849373
T. Calders, C. W. Günther, M. Pechenizkiy, A. Rozinat, Using minimum description length for process mining acm symposium on applied computing. pp. 1451- 1455 ,(2009) , 10.1145/1529282.1529606
J. Rissanen, Paper: Modeling by shortest data description Automatica. ,vol. 14, pp. 465- 471 ,(1978) , 10.1016/0005-1098(78)90005-5
W.M.P. van der Aalst, B.F. van Dongen, J. Herbst, L. Maruster, G. Schimm, A.J.M.M. Weijters, Workflow mining: a survey of issues and approaches data and knowledge engineering. ,vol. 47, pp. 237- 267 ,(2003) , 10.1016/S0169-023X(03)00066-1
Donald E. Knuth, Dancing links ,(2000)