作者: Chang Liu , Hui Wang
DOI:
关键词: Algorithm 、 Mathematics 、 Longest common subsequence problem 、 Similarity measure 、 Probabilistic logic 、 Neighbourhood (mathematics) 、 Bayes classifier
摘要: The longest common subsequence (LCS) is a well known and popular method for measuring similarity between sequences. In this paper we consider all subsequences (ACS) as measure of sequence with the view that information captured. ACS inspired derived from generic measure, neighbourhood counting metric (NCM). close connection NCM probability Bayes classifier helps gain an insight probabilistic perspective into ACS. We also design algorithm to calculate carry out experiment in framework k-nearest neighbours on gene classification task. shows LCS have little difference small k values, but differ significantly large values. ACS's performance remains steady gets larger, LCS's drops sharply larger. Such property may be useful some tasks.