作者: Maciolek Przemyslaw , Cincunegui Daniel , Koszyka Krzysztof
DOI:
关键词: Analytics 、 Matching (statistics) 、 Tokenization (data security) 、 Computer science 、 Computer data storage 、 Trie 、 Wildcard character 、 Data patterns 、 Metadata 、 Data mining
摘要: Systems and methods for tokenization of log records efficient data storage, querying, analytics can utilize a trie pattern conversion the files, storing IDs, free parameters, metadata instead entire record. New patterns be discovered automatically by counting occurrences tokens matching wildcards existing patterns.