摘要: A method for determining statistically significant token sequences lends itself use in the recognition of broken wrappers as well construction new wrapper rules. When rules are needed underlying wrapped data has changed, training examples used to recognized rule candidates that culled with a bias would be probably more successful. The resulting candidate set is clustered according feature characteristics, then compared examples. Those most similar create