Deep Reinforcement Learning with a Combinatorial Action Space for Predicting and Tracking Popular Discussion Threads.

Authors: Mari Ostendorf, Li Deng, Jianfeng Gao, Xiaodong He, Lihong Li

Keywords: Bellman equation, Task (computing), Reinforcement learning, Natural language, Computer science, Action (philosophy), Benchmark (computing), Artificial intelligence, Space (commercial competition)

Abstract: We introduce an online popularity prediction and tracking task as a benchmark for reinforcement learning with a combinatorial, natural language action space. A specified number of discussion threads predicted to be popular are recommended, chosen from a fixed window of recent comments to track. Novel deep architectures are studied for effective modeling of the value function associated with actions comprised of interdependent sub-actions. The proposed model, which represents dependence between sub-actions through a bi-directional LSTM, gives the best performance across different experimental configurations and domains, and it also generalizes well with varying numbers of recommendation requests.
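
To make the phrase "combinatorial action space" concrete, the following is a minimal sketch, not the paper's model. It assumes an action is an unordered choice of k comments out of N candidates, and it uses a hypothetical toy value function (`toy_q`, with made-up scores and a made-up redundancy penalty) in place of the learned bi-directional LSTM value function. The penalty is there only to show why sub-actions cannot be scored independently.

```python
from itertools import combinations
from math import comb


def best_action(q_value, n_candidates, k):
    """Brute-force argmax of q_value over all k-subsets of candidate indices."""
    return max(combinations(range(n_candidates), k), key=q_value)


def toy_q(action):
    """Hypothetical value function (an assumption, not taken from the paper).

    Sums made-up per-comment scores, then subtracts a penalty for each pair
    of adjacent indices in the action, standing in for interdependence
    between sub-actions.
    """
    scores = [0.9, 0.8, 0.3, 0.7, 0.1]  # illustrative values only
    total = sum(scores[i] for i in action)
    adjacent_pairs = sum(1 for a, b in zip(action, action[1:]) if b - a == 1)
    return total - 0.5 * adjacent_pairs


if __name__ == "__main__":
    n, k = 5, 2
    print("number of actions:", comb(n, k))
    print("best action:", best_action(toy_q, n, k))
```

Exhaustive enumeration is only feasible for small N and k, since the number of actions grows as the binomial coefficient C(N, k). In this toy setup, the two highest-scoring comments individually (indices 0 and 1) are not the best joint choice once the penalty is applied.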
