Authors: A. Honrado, R. Leon, R. O'Donnel, D. Sinclair
DOI: 10.1109/SPIRE.2000.878189
Keywords:
Abstract: The paper describes a word stemming algorithm for the Spanish language. Experiments in document retrieval on English text suggest that word stemming based on morphological analysis does not generally or consistently outperform ad-hoc, hand-tuned algorithms such as the algorithm proposed by M. Porter (1980). It is difficult, however, to produce a Porter-style algorithm for a Romance language such as Spanish, due to the greater grammatical complexity and the fact that inflection often causes changes in the root of words, not just their endings (as is mostly the case with English). In general terms, the difficulty consists of producing an algorithm which can cope with the additional morphology whilst preserving the simplicity of the algorithm. One such algorithm is presented. The algorithm combines dictionary look-ups with some 300 intermediate reduction rules.