Systems and methods for trie-based automated discovery of patterns in computer logs

Authors: Maciolek Przemyslaw, Cincunegui Daniel, Koszyka Krzysztof

Keywords: Analytics, Matching (statistics), Tokenization (data security), Computer science, Computer data storage, Trie, Wildcard character, Data patterns, Metadata, Data mining

Abstract: Systems and methods for tokenization of log records for efficient data storage, querying, and analytics can utilize a trie pattern conversion of the log files, storing pattern IDs, free parameters, and metadata instead of the entire log record. New patterns can be discovered automatically by counting the occurrences of tokens matching wildcards in existing patterns.
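
The idea in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the class names, whitespace tokenization, the `*` wildcard symbol, and the `promote_after` threshold are all assumptions made for illustration. Each log line is matched against a trie of token patterns and stored as a pattern ID plus the free parameters that matched wildcards. A token that keeps recurring in the same wildcard slot is promoted into a new, more specific pattern.

```python
from collections import Counter, defaultdict

WILDCARD = "*"  # assumed wildcard symbol; the abstract does not name one


class TrieNode:
    def __init__(self):
        self.children = {}      # token -> TrieNode
        self.pattern_id = None  # set on the node where a pattern ends


class PatternTrie:
    def __init__(self, promote_after=3):
        self.root = TrieNode()
        self.patterns = []  # pattern_id -> list of pattern tokens
        # Hypothetical threshold: occurrences of one token in one wildcard
        # slot before a more specific pattern is created.
        self.promote_after = promote_after
        # (pattern_id, token position) -> Counter of tokens seen in that slot
        self.wildcard_counts = defaultdict(Counter)

    def add_pattern(self, tokens):
        """Insert a pattern and return its ID (existing ID if already known)."""
        node = self.root
        for tok in tokens:
            node = node.children.setdefault(tok, TrieNode())
        if node.pattern_id is None:
            node.pattern_id = len(self.patterns)
            self.patterns.append(list(tokens))
        return node.pattern_id

    def match(self, tokens):
        """Return (pattern_id, params) or None, preferring literal edges."""
        def walk(node, i, params):
            if i == len(tokens):
                if node.pattern_id is None:
                    return None
                return node.pattern_id, params
            tok = tokens[i]
            if tok != WILDCARD and tok in node.children:
                found = walk(node.children[tok], i + 1, params)
                if found is not None:
                    return found
            if WILDCARD in node.children:  # fall back to the wildcard edge
                return walk(node.children[WILDCARD], i + 1, params + [tok])
            return None

        return walk(self.root, 0, [])

    def encode(self, line):
        """Encode a line as (pattern_id, params) and update discovery counts."""
        found = self.match(line.split())
        if found is None:
            return None
        pattern_id, params = found
        pattern = self.patterns[pattern_id]
        slots = [i for i, t in enumerate(pattern) if t == WILDCARD]
        for slot, value in zip(slots, params):
            counts = self.wildcard_counts[(pattern_id, slot)]
            counts[value] += 1
            if counts[value] == self.promote_after:
                specialised = list(pattern)
                specialised[slot] = value  # replace the wildcard with the token
                self.add_pattern(specialised)
        return pattern_id, params

    def decode(self, pattern_id, params):
        """Rebuild the line (original whitespace is normalised to one space)."""
        values = iter(params)
        return " ".join(
            next(values) if t == WILDCARD else t
            for t in self.patterns[pattern_id]
        )
```

In this sketch only the pair returned by `encode` would need to be stored per record, alongside whatever metadata the system keeps. A real system would likely use a more careful tokenizer and a promotion rule that considers how many distinct values a slot has seen, so that high-cardinality fields such as IP addresses remain wildcards.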
