作者: Yaser I. Suleiman , Michael Zoll , Subhransu Basu , Thomas Herter , Thomas Breidt
DOI:
关键词: End user 、 Artificial intelligence 、 Machine learning 、 Selection (genetic algorithm) 、 Set (abstract data type) 、 Domain (software engineering) 、 Training set 、 Computer science
摘要: Described is an improved approach to implement selection of training data for machine learning, by presenting a designated set specific indicators where these correspond metrics that end users are familiar with and easily understood ordinary DBAs within their knowledge domain. Selection would correlate automatically corresponding other metrics/signals less understandable user. Additional analysis the selected can then be performed identify correct any statistical problems data.