Some Alternatives to Parikh Matrices Using String Kernels

作者: Chris Watkins , Alexander Clark

DOI:

关键词:

摘要: We describe methods of representing strings as real valued vectors or matrices; we show how to integrate two separate lines enquiry: string kernels, developed in machine learning, and Parikh matrices [8], which have been studied intensively over the last few years a powerful tool study combinatorics words. In field there is widespread use analogous mappings into high dimensional feature spaces based on occurrences subwords factors. this paper one can kernels construct alternatives matrices, that overcome some limitations matrix construction. These are morphisms from free monoid rings real-valued under multiplication: subsequence kernel other gap-weighted kernel. For latter demonstrate for many values gap-weight hyperparameter resulting morphism injective.

参考文章(13)
Arto Salomaa, On Languages Defined by Numerical Parameters. Formal Models, Languages and Applications. pp. 320- 336 ,(2007)
Carlos Martín-Vide, Alexandru Mateescu, Adrian Atanasiu, Codifiable Languages and the Parikh Matrix Mapping. Journal of Universal Computer Science. ,vol. 7, pp. 783- 793 ,(2001)
Nello Cristianini, John Shawe-Taylor, Kernel Methods for Pattern Analysis ,(2004)
Leonid Kontorovich, Corinna Cortes, Mehryar Mohri, Learning linearly separable languages algorithmic learning theory. pp. 288- 303 ,(2006) , 10.1007/11894841_24
Alexander Clark, Christophe Costa Florêncio, Chris Watkins, Mariette Serayet, Planar Languages and Learnability Grammatical Inference: Algorithms and Applications. pp. 148- 160 ,(2006) , 10.1007/11872436_13
D. Haussler, Convolution kernels on discrete structures Tech. Rep.. ,(1999)
Alexander Clark, Christophe Costa Florêncio, Chris Watkins, Languages as Hyperplanes: Grammatical Inference with String Kernels Lecture Notes in Computer Science. pp. 90- 101 ,(2006) , 10.1007/11871842_13
Carlos Martín-Vide, Alexandru Mateescu, Adrian Atanasiu, On the Injectivity of the Parikh Matrix Mapping Fundamenta Informaticae. ,vol. 49, pp. 289- 299 ,(2002)
Omer Egecioglu, Oscar H. Ibarra, A MATRIX Q-ANALOGUE OF THE PARIKH MAP IFIP TCS. pp. 125- 138 ,(2004) , 10.1007/1-4020-8141-3_12
Alexandru Mateescu, Arto Salomaa, Kai Salomaa, Sheng Yu, A sharpening of the Parikh mapping Theoretical Informatics and Applications. ,vol. 35, pp. 551- 564 ,(2001) , 10.1051/ITA:2001131