作者: Ron Kohavi , Daniel A Sommerfield
DOI:
关键词:
摘要: 57 ABSTRACT A system and method determines how well various attributes in a record discriminate different values of a chosen label attribute. An attribute is considered a relevant attribute if it discriminates different values of a chosen label attribute either alone or in conjunction with other attributes. Accord ing to the present invention, a label attribute is Selected by a user from a Set of records, with each record having a plurality of attributes. Next, one or more first important attributes considered important by the user are Selected. The present invention then generates one or more Second impor tant attributes. The Second important attributes together with the user chosen first important attributes discriminate well between different values of the label attribute. A measure called “purity”(a number from 0 to 100) informs how well each attribute discriminates the different label attributes. The purity measure allows the …