作者: Wu QingQiang , Liu Hua , Liu KunHong
DOI:
关键词:
摘要: Leukemia's types and their relationships to literatures are introduced, based on which data set about Leukemia for classification is constructed with original sources, such as Cancer Gene Census, PubMed gene2pubmed. The imbalanced the research object. Based introduction of current methods set, problems sampling in analyzed, mixed-sampling method proposed classify set. multi-class problem transferred a two-class problems. Area Under Receiver Operating Characteristic (ROC) Curve (AUC) used evaluate method. Then, experiments performed verify efficiency stability eight methods, results comparatively analyzed. It can be found that achieves best performance. At last, work this paper concluded look forward future work.