作者: Mark Huckvale , Aimee Webb
DOI: 10.1007/978-3-319-25789-1_11
关键词:
摘要: The estimation of the age a speaker from his or her voice has both forensic and commercial applications. Previous studies have shown that human listeners are able to estimate within 10 years on average, while recent machine systems seem show superior performance with average errors as low 6 years. However used highly non-uniform test sets, for which knowledge distribution offers considerable advantage system. In this study we compare same data chosen be uniformly distributed in age. We case accuracy is more similar 9.8 8.6 respectively, although if panels consulted, can improved value closer 7.5 Both machines difficulty accurately predicting ages older speakers.