作者: Sihem Amer-Yahia , Mary Fernández , Divesh Srivastava , Yu Xu
DOI: 10.1016/B978-012722442-8/50024-0
关键词:
摘要: Phrase matching is a common IR technique to search text and identify relevant documents in document collection. XML presents new challenges as may be interleaved with arbitrary markup, thwarting techniques that require strict contiguity or close proximity of keywords. We present for phrase permits dynamic specification both the matched markup ignored. develop an effective algorithm our utilizes inverted indices on words tags. describe experimental results comparing indexed-nested loop illustrate algorithm's efficiency.