作者: Arul Prakash Asirvatham , Kranthi Kumar
DOI:
关键词:
摘要: The web is a huge repository of information and there need for categorizing documents to facilitate the search retrieval pages. Existing algorithms rely solely on text content pages classification. However, has lot contained in structure, images, video etc present document. In this paper, we propose method automatic classification into few broad categories based structure document characteristics images it.