作者: Hinrich Schütze , Tobias Schnabel
DOI: 10.18419/OPUS-3064
关键词:
摘要: Most systems in natural language processing experience a substantial loss performance when the data that system is tested with differs significantly from has been trained on. Systems for part-of-speech (POS) tagging, example, are typically on newspaper texts but often applied to of other domains such as medical texts. Domain adaptation (DA) techniques seek improve so they able achieve consistently good - independent at hand. We investigate robustness domain representations and methods across target using tagging case study. We find there no single representation method works equally well all domains. In particular, large differences between more similar source those less similar.