Czech Dataset for Semantic Textual Similarity

Authors: Lukáš Svoboda, Tomáš Brychcín

DOI: 10.1007/978-3-030-00794-2_23

Keywords: Task (project management), Natural language processing, Czech, Meaning (linguistics), Sentence, Computer science, SemEval, Similarity (psychology), Artificial intelligence

Abstract: Semantic textual similarity is the core shared task at the International Workshop on Semantic Evaluation (SemEval). It focuses on comparing the meaning of sentences. So far, most research has been devoted to English.
