作者: Denis Kleyko , Abbas Rahimi , Ross W. Gayler , Evgeny Osipov
DOI: 10.1007/S00521-019-04397-1
关键词: False positive rate 、 Autoscaling 、 Data structure 、 Algorithm 、 Set (abstract data type) 、 Minification 、 Probabilistic logic 、 Mathematics 、 Bloom filter 、 False positive paradox
摘要: A Bloom filter is a simple data structure supporting membership queries on set. The standard does not support the delete operation, therefore, many applications use counting to enable deletion. This paper proposes generalization of approach, called "autoscaling filters", which allows adjustment its capacity with probabilistic bounds false positives and true positives. In essence, autoscaling binarized an adjustable binarization threshold. We present mathematical analysis performance as well give procedure for minimization positive rate.