Schema Discovery Through Statistical Transduction

Authors: Srinivasan Parthasarathy, Deepak S. Turaga, Venkata N. Pavuluri

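Abstract: A method, system, and computer program product derive a data schema for application to a data set. One or more processors generate a directed acyclic weighted graph that encodes the types and semantic types used by a data set, and assign estimated frequencies to each component of the graph, where the estimated frequencies predict the likelihood of a particular element of the graph being used by any data set. The processors traverse through paths in the graph with a predetermined portion of the data set to determine a data schema that correctly defines data from the data set, identify errors in the data set, and then apply the data schema to clean the data set so that it is properly formatted.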
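
The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the type graph, its edge weights, the regular-expression matchers, the `min_support` threshold, and all function names below are assumptions made purely for illustration. The sketch walks a small weighted directed acyclic graph of types using a sample of the rows, picks the most specific type that the sample supports for each column, flags values that violate the resulting schema, and drops the offending rows.

```python
import re

# Hypothetical type graph (an assumption, not taken from the source).
# Edges run from a general type to a more specific one; each weight is an
# assumed prior frequency for that specific type.
TYPE_GRAPH = {
    "string": [("number", 0.5), ("date", 0.2)],
    "number": [("integer", 0.6)],
    "integer": [],
    "date": [],
}

# Hypothetical matchers deciding whether a raw string value fits a type.
MATCHERS = {
    "string": lambda v: True,
    "number": lambda v: re.fullmatch(r"-?\d+(\.\d+)?", v) is not None,
    "integer": lambda v: re.fullmatch(r"-?\d+", v) is not None,
    "date": lambda v: re.fullmatch(r"\d{4}-\d{2}-\d{2}", v) is not None,
}


def infer_type(values, min_support=0.8, root="string"):
    """Descend from the root to the most specific type the sample supports."""
    node = root
    while True:
        best = None
        for child, weight in TYPE_GRAPH[node]:
            # Fraction of sampled values that match the candidate type.
            support = sum(MATCHERS[child](v) for v in values) / len(values)
            if support >= min_support:
                score = support * weight
                if best is None or score > best[0]:
                    best = (score, child)
        if best is None:
            return node  # no child is supported well enough; stop here
        node = best[1]


def derive_schema(rows, sample_fraction=0.5):
    """Infer one type per column from the leading portion of the rows."""
    sample = rows[: max(1, int(len(rows) * sample_fraction))]
    return {col: infer_type([r[col] for r in sample]) for col in rows[0]}


def find_errors(rows, schema):
    """Return (row_index, column) pairs whose value violates the schema."""
    return [
        (i, col)
        for i, row in enumerate(rows)
        for col, type_name in schema.items()
        if not MATCHERS[type_name](row[col])
    ]


def clean(rows, schema):
    """Drop every row that contains at least one schema violation."""
    bad_rows = {i for i, _ in find_errors(rows, schema)}
    return [row for i, row in enumerate(rows) if i not in bad_rows]


rows = [
    {"id": "1", "joined": "2020-01-05", "score": "3.5"},
    {"id": "2", "joined": "2020-02-11", "score": "4"},
    {"id": "3", "joined": "2021-07-30", "score": "2.25"},
    {"id": "x4", "joined": "2021-08-01", "score": "5.0"},
]
schema = derive_schema(rows, sample_fraction=0.75)
errors = find_errors(rows, schema)
cleaned = clean(rows, schema)
```

Here the schema is derived from the first three rows only, so the malformed `id` in the fourth row is reported as an error rather than influencing the inferred type. Dropping bad rows is the simplest possible cleaning policy; repairing or quarantining them would be equally reasonable choices.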
