Mobile Application for Archaeological Site Image Content Retrieval and Automated Generating Image Descriptions with Neural Network

作者: Sathit Prasomphan , Jai E. Jung

DOI: 10.1007/S11036-016-0805-6

关键词:

摘要: This paper presents a novel algorithm for generating descriptions of stupa image such as stupa's era, architecture and other description in mobile application by using key points generated from SIFT algorithms learning the with artificial neural network. Neural network was used being classifier description. We have presented new approach to feature extraction based on analysis descriptors an image. The were tested dataset Phra Nakhon Si Ayutta province, Sukhothai province Bangkok. experimental results show that proposed framework can efficiently give correct compared traditional method.

参考文章(15)
M. Hodosh, P. Young, J. Hockenmaier, Framing image description as a ranking task: data, models and evaluation metrics Journal of Artificial Intelligence Research. ,vol. 47, pp. 853- 899 ,(2013) , 10.1613/JAIR.3994
Hao Su, Fan Wang, Eric Yi, Leonidas Guibas, 3D-Assisted Feature Synthesis for Novel Views of an Object 2015 IEEE International Conference on Computer Vision (ICCV). pp. 2677- 2685 ,(2015) , 10.1109/ICCV.2015.307
Ilya Sutskever, Wojciech Zaremba, Oriol Vinyals, Recurrent Neural Network Regularization arXiv: Neural and Evolutionary Computing. ,(2014)
Ali Farhadi, Mohsen Hejrati, Mohammad Amin Sadeghi, Peter Young, Cyrus Rashtchian, Julia Hockenmaier, David Forsyth, Every Picture Tells a Story: Generating Sentences from Images Computer Vision – ECCV 2010. pp. 15- 29 ,(2010) , 10.1007/978-3-642-15561-1_2
Andrej Karpathy, Li Fei-Fei, Deep visual-semantic alignments for generating image descriptions computer vision and pattern recognition. pp. 3128- 3137 ,(2015) , 10.1109/CVPR.2015.7298932
D.G. Lowe, Object recognition from local scale-invariant features international conference on computer vision. ,vol. 2, pp. 1150- 1157 ,(1999) , 10.1109/ICCV.1999.790410
Richard Socher, Andrej Karpathy, Quoc V. Le, Christopher D. Manning, Andrew Y. Ng, Grounded Compositional Semantics for Finding and Describing Images with Sentences Transactions of the Association for Computational Linguistics. ,vol. 2, pp. 207- 218 ,(2014) , 10.1162/TACL_A_00177
Jason J. Jung, Exploiting geotagged resources for spatial clustering on social network services Concurrency and Computation: Practice and Experience. ,vol. 28, pp. 1356- 1367 ,(2016) , 10.1002/CPE.3634
David G. Lowe, Distinctive Image Features from Scale-Invariant Keypoints International Journal of Computer Vision. ,vol. 60, pp. 91- 110 ,(2004) , 10.1023/B:VISI.0000029664.99615.94
Jason J. Jung, Big Bibliographic Data Analytics by Random Walk Model Mobile Networks and Applications. ,vol. 20, pp. 533- 537 ,(2015) , 10.1007/S11036-014-0555-2