Paraphrasing Predicates from Written Language to Spoken Language Using the Web.

作者: Sadao Kurohashi , Masashi Okamoto , Nobuhiro Kaji

DOI:

关键词:

摘要: There are a lot of differences between expressions used in written language and spoken language. It is one the reasons why speech synthesis applications prone to produce unnatural speech. This paper represents method paraphrasing unsuitable for into suitable ones. Those two can be distinguished based on occurrence probability corpora which automatically collected from Web. Experimental results indicated effectiveness our method. The precision was 94%, accuracy learning paraphrases 76 %.

参考文章(19)
Hitoshi Isahara, Sadaoki Furui, Kikuo Maekawa, Hanae Koiso, Spontaneous Speech Corpus of Japanese language resources and evaluation. ,(2000)
Ulf Hermjakob, Abdessamad Echihabi, Daniel Marcu, Natural Language Based Reformulation Resource and Wide Exploitation for Question Answering. text retrieval conference. ,(2002)
Eiichiro Sumita, Toshiyuki Takezawa, Fumiaki Sugaya, Hirofumi Yamamoto, Seiichi Yamamoto, Toward a Broad-coverage Bilingual Corpus for Speech Translation of Travel Conversations in the Real World language resources and evaluation. ,(2002)
Yusuke Shinyama, Satoshi Sekine, Kiyoshi Sudo, Ralph Grishman, Automatic paraphrase acquisition from news articles international conference on human language technology research. pp. 313- 318 ,(2002)
Tomohiro Fukuhara, Toyoaki Nishida, Shunsuke Uemura, Public Opinion Channel: A System for Augmenting Social Intelligence of a Community Lecture Notes in Computer Science. pp. 51- 58 ,(2001) , 10.1007/3-540-45548-5_7
Daisuke Kawahara, Sadao Kurohashi, Japanese case frame construction by coupling the verb and its closest case component Proceedings of the first international conference on Human language technology research - HLT '01. pp. 1- 7 ,(2001) , 10.3115/1072133.1072195
George Tambouratzis, Stella Markantonatou, Nikolaos Hairetakis, Marina Vassiliou, Dimitrios Tambouratzis, George Carayannis, Discriminating the registers and styles in the modern Greek language Proceedings of the workshop on Comparing corpora -. pp. 35- 42 ,(2000) , 10.3115/1117729.1117735
Philip Edmonds, Graeme Hirst, Near-synonymy and lexical choice Computational Linguistics. ,vol. 28, pp. 105- 144 ,(2002) , 10.1162/089120102760173625
Ivan Bulyko, Mari Ostendorf, Andreas Stolcke, Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology companion volume of the Proceedings of HLT-NAACL 2003--short papers - NAACL '03. pp. 7- 9 ,(2003) , 10.3115/1073483.1073486
Akiyo Nadamoto, Hiroyuki Kondo, Katsumi Tanaka, WebCarousel: Restructuring web search results for passive viewing in mobile environments database systems for advanced applications. pp. 164- 165 ,(2001) , 10.1109/DASFAA.2001.6044759