Toward Better Loanword Identification in Uyghur Using Cross-lingual Word Embeddings

作者: Yating Yang , Chenggang Mi , Tonghai Jiang , Lei Wang , Xi Zhou

DOI:

关键词: PronunciationPersianBLEULoanwordFeature (machine learning)Identification (information)ArabicLanguage modelMachine translationComputer scienceNatural language processingVocabularyWord (computer architecture)Artificial intelligence

摘要: To enrich vocabulary of low resource settings, we proposed a novel method which identify loanwords in monolingual corpora. More specifically, we first use cross-lingual word …

参考文章(23)
Matthew D. Zeiler, ADADELTA: An Adaptive Learning Rate Method arXiv: Learning. ,(2012)
Chenggang Mi, Yating Yang, Lei Wang, Xiao Li, Kamali Dalielihan, Detection of loan words in uyghur texts international conference natural language processing. ,vol. 496, pp. 103- 112 ,(2014) , 10.1007/978-3-662-45924-9_10
Sharon Peperkamp, A Psycholinguistic Theory of Loanword Adaptations Annual Meeting of the Berkeley Linguistics Society. ,vol. 30, pp. 341- 352 ,(2004) , 10.3765/BLS.V30I1.919
Yoshua Bengio, Greg Corrado, Stephan Gouws, BilBOWA: Fast Bilingual Distributed Representations without Word Alignments international conference on machine learning. pp. 748- 756 ,(2015)
Shigeko Shinohara, Loanword-specific grammar in Japanese adaptations of Korean words and phrases Journal of East Asian Linguistics. ,vol. 24, pp. 149- 191 ,(2015) , 10.1007/S10831-014-9129-3
Thang Luong, Ilya Sutskever, Quoc Le, Oriol Vinyals, Wojciech Zaremba, Addressing the Rare Word Problem in Neural Machine Translation Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). ,vol. 1, pp. 11- 19 ,(2015) , 10.3115/V1/P15-1002
Ilya Sutskever, Tomas Mikolov, Greg S Corrado, Kai Chen, Jeff Dean, Distributed Representations of Words and Phrases and their Compositionality neural information processing systems. ,vol. 26, pp. 3111- 3119 ,(2013)
Yulia Tsvetkov, Chris Dyer, Cross-lingual bridges with models of lexical borrowing Journal of Artificial Intelligence Research. ,vol. 55, pp. 63- 93 ,(2016) , 10.1613/JAIR.4786
Ivan Titov, Alexandre Klementiev, Binod Bhattarai, Inducing Crosslingual Distributed Representations of Words international conference on computational linguistics. pp. 1459- 1474 ,(2012)
Yulia Tsvetkov, Chris Dyer, Lexicon Stratification for Translating Out-of-Vocabulary Words Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers). ,vol. 2, pp. 125- 131 ,(2015) , 10.3115/V1/P15-2021