Chemical structure recognition: a rule-based approach

作者: Noureddin M. Sadawi , Alan P. Sexton , Volker Sorge

DOI: 10.1117/12.912185

关键词: Translation (geometry)Process (engineering)Principal (computer security)Computer scienceRule-based systemCurrent (mathematics)Data miningGraph (abstract data type)Embedding

摘要: In chemical literature much information is given in the form of diagrams depicting molecules. order to access this information have be recognised and translated into a processable format. We present an approach that models the principal recognition steps for molecule strictly rule based system, providing rules identify main components - atoms bonds as well resolve possible ambiguities. The result process translation into a graph representation can used further processing. show effectiveness our by describing its embedding full system experimental evaluation demonstrates how current implementation outperforms leading open source currently available.

参考文章(15)
Joe R. McDaniel, Jason R. Balmuth, Automatic Interpretation of Chemical Structure Diagrams graphics recognition. pp. 148- 158 ,(1995) , 10.1007/3-540-61226-2_13
Igor V. Filippov, Marc C. Nicklaus, Optical Structure Recognition Software To Recover Chemical Information: OSRA — An Open Source Solution Journal of Chemical Information and Modeling. ,vol. 49, pp. 740- 743 ,(2009) , 10.1021/CI800067R
David Weininger, SMILES, a chemical language and information system. 1. introduction to methodology and encoding rules Journal of Chemical Information and Computer Sciences. ,vol. 28, pp. 31- 36 ,(1988) , 10.1021/CI00057A005
DAVID H DOUGLAS, THOMAS K PEUCKER, ALGORITHMS FOR THE REDUCTION OF THE NUMBER OF POINTS REQUIRED TO REPRESENT A DIGITIZED LINE OR ITS CARICATURE Cartographica: The International Journal for Geographic Information and Geovisualization. ,vol. 10, pp. 112- 122 ,(1973) , 10.3138/FM57-6770-U75U-7727
Aniko T. Valko, A. Peter Johnson, CLiDE Pro: the latest generation of CLiDE, a tool for optical chemical structure recognition. Journal of Chemical Information and Modeling. ,vol. 49, pp. 780- 787 ,(2009) , 10.1021/CI800449T
Joe R. McDaniel, Jason R. Balmuth, Kekule: OCR-optical chemical (structure) recognition Journal of Chemical Information and Computer Sciences. ,vol. 32, pp. 373- 378 ,(1992) , 10.1021/CI00008A018
P. Ibison, M. Jacquot, F. Kam, A. G. Neville, R. W. Simpson, C. Tonnelier, T. Venczel, A. P. Johnson, Chemical literature data extraction: The CLiDE Project annual conference on computers. ,vol. 33, pp. 338- 344 ,(1993) , 10.1021/CI00013A010
J.-Y. Ramel, G. Boissier, H. Emptoz, Automatic reading of handwritten chemical formulas from a structural representation of the image international conference on document analysis and recognition. pp. 83- 86 ,(1999) , 10.1109/ICDAR.1999.791730
Koji Nakagawa, Akio Fujiyoshi, Masakazu Suzuki, Ground-truthed dataset of chemical structure images in Japanese published patent applications Proceedings of the 8th IAPR International Workshop on Document Analysis Systems - DAS '10. pp. 455- 462 ,(2010) , 10.1145/1815330.1815389
Tom Y. Ouyang, Randall Davis, Recognition of hand drawn chemical diagrams national conference on artificial intelligence. pp. 846- 851 ,(2007)