Features for Generic Corpus Querying

作者: Thomas Eckart , Uwe Quasthoff , Christoph Kuras

DOI:

关键词:

摘要: The availability of large corpora for more and languages enforces generic querying standard interfaces. This development is especially relevant in the context integrated research environments like CLARIN or DARIAH. paper focuses on several applications implementation details basis a unified corpus format, unique POS tag set, prepared data word similarities. All described are already will be near future accessible via well-documented RESTful Web services. target group all kinds interested persons with varying level experience programming query languages.

参考文章(6)
Uwe Quasthoff, Christian Wolff, Gerhard Heyer, Martin Läuter, Thomas Wittig, Learning relations using collocations OL'01 Proceedings of the 2nd International Conference on Ontology Learning - Volume 38. pp. 19- 24 ,(2001)
Adam Kilgarriff, Vít Baisa, Jan Bušta, Miloš Jakubíček, Vojtěch Kovář, Jan Michelfeit, Pavel Rychlý, Vít Suchomel, The Sketch Engine: ten years on Lexicography ASIALEX. ,vol. 1, pp. 7- 36 ,(2014) , 10.1007/S40607-014-0009-9
Ted Dunning, Accurate methods for the statistics of surprise and coincidence Computational Linguistics. ,vol. 19, pp. 61- 74 ,(1993)
Pavel Rychlý, Manatee/Bonito - A Modular Corpus Manager RASLAN. pp. 65- 70 ,(2007)
Adam Kilgarriff, Pavel Rychlý, Milos Husak, Michael Rundell, Katy McAdam, GDEX: Automatically Finding Good Dictionary Examples in a Corpus Proceedings of the XIII EURALEX International Congress (Barcelona, 15-19 July 2008), 2008, ISBN 978-84-96742-67-3, págs. 425-432. pp. 425- 432 ,(2008)