作者: Sjsu ScholarWorks , Sami Khuri , Glenn Jahnke
DOI:
关键词:
摘要: MRCRAIG: MAPREDUCE AND ENSEMBLE CLASSIFIERS FOR PARALLELIZING DATA CLASSIFICATION PROBLEMS by Glenn Jahnke In this paper, a novel technique for parallelizing data-classification problems is applied to finding genes in sequences of DNA. The involves various ensemble classification methods such as Bagging and Select Best. It then distributes the classifier training prediction using MapReduce. A sequence voting algorithm evaluated method, well compared against Best method.