LayoutLM: Pre-training of Text and Layout for Document Image Understanding

作者: Ming Zhou , Furu Wei , Lei Cui , Shaohan Huang , Minghao Li

DOI: 10.1145/3394486.3403172

关键词:

摘要: … Business documents are files that provide details … files, or they may be in scanned form that comes from written or printed on paper. Some common examples of business documents …

参考文章(30)
Shaoqing Ren, Kaiming He, Ross Girshick, Jian Sun, Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 39, pp. 1137- 1149 ,(2017) , 10.1109/TPAMI.2016.2577031
Jaekyu Ha, R.M. Haralick, I.T. Phillips, Document page decomposition by the bounding-box project international conference on document analysis and recognition. ,vol. 2, pp. 1119- 1122 ,(1995) , 10.1109/ICDAR.1995.602115
Hao Wei, Micheal Baechler, Fouad Slimane, Rolf Ingold, Evaluation of SVM, MLP and GMM Classifiers for Layout Analysis of Historical Documents international conference on document analysis and recognition. pp. 1220- 1224 ,(2013) , 10.1109/ICDAR.2013.247
D. Lewis, G. Agam, S. Argamon, O. Frieder, D. Grossman, J. Heard, Building a test collection for complex document information processing Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '06. pp. 665- 666 ,(2006) , 10.1145/1148170.1148307
M. Shilman, P. Liang, P. Viola, Learning nongenerative grammatical models for document analysis international conference on computer vision. ,vol. 2, pp. 962- 969 ,(2005) , 10.1109/ICCV.2005.140
F. Lebourgeois, Z. Bublinski, H. Emptoz, A fast and efficient method for extracting text paragraphs and graphics from unconstrained documents international conference on pattern recognition. pp. 272- 276 ,(1992) , 10.1109/ICPR.1992.201771
A. Simon, J.-C. Pret, A.P. Johnson, A fast algorithm for bottom-up document layout analysis IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 19, pp. 273- 277 ,(1997) , 10.1109/34.584106
L. O'Gorman, The document spectrum for page layout analysis IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 15, pp. 1162- 1173 ,(1993) , 10.1109/34.244677
S. Marinai, M. Gori, G. Soda, Artificial neural networks for document analysis and recognition IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 27, pp. 23- 35 ,(2005) , 10.1109/TPAMI.2005.4
Jaekyu Ha, R.M. Haralick, I.T. Phillips, Recursive X-Y cut using bounding boxes of connected components international conference on document analysis and recognition. ,vol. 2, pp. 952- 955 ,(1995) , 10.1109/ICDAR.1995.602059