Lexical and Discourse Analysis of Online Chat Dialog

作者: Cassio V.S. Prazeres , Maria da Graca C. Pimentel , Cesar A.C. Teixeira

DOI: 10.1109/ICSC.2007.54

关键词:

摘要: One of the ultimate goals natural language processing (NLP) systems is understanding meaning what being transmitted, irrespective medium (e.g., written versus spoken) or form static documents dynamic dialogues). Although much work has been done in traditional domains such as speech and text, little yet newer communication enabled by Internet, e.g., online chat instant messaging. This part due to fact that there are no annotated corpora available broader research community. The purpose this build a corpus, tagged with lexical (token part-of-speech labels), syntactic (post parse tree), discourse classification) information. Such corpus can then be used develop more complex, statistical-based NLP applications perform tasks author profiling, entity identification, social network analysis.

参考文章(27)
José Luis Ambite, Peter Dolog, Gustaf Neumann, Susanne Busse, Andreas Harth, Matthew Weathers, Nicola Henze, Stefan Decker, Wolfgang Nejdl, Michael Sintek, Uwe Zdun, Andreas Billig, Andreas Leicher, TRIPLE - an RDF Rule Language with Context and Use Cases Rule Languages for Interoperability. ,(2005)
Faisal M. Khan, Tianhao Wu, Todd A. Fisher, William M. Pottenger, Lori A. Shuler, Posting Act Tagging Using Transformation-Based Learning. Foundations of Data Mining and knowledge Discovery. pp. 319- 331 ,(2005)
Peter Buneman, Sanjeev Khanna, Tan Wang-Chiew, Why and Where: A Characterization of Data Provenance international conference on database theory. pp. 316- 330 ,(2001) , 10.1007/3-540-44503-X_20
Jorge Pérez, Marcelo Arenas, Claudio Gutierrez, Semantics and complexity of SPARQL international semantic web conference. pp. 30- 43 ,(2006) , 10.1007/11926078_3
Jane Lin, Automatic Author Profiling of Online Chat Logs Monterey, California. Naval Postgraduate School. ,(2007)
Wang-Chiew Tan, Peter Buneman, Sanjeev Khanna, Data Provenance: Some Basic Issues foundations of software technology and theoretical computer science. pp. 87- 93 ,(2000) , 10.1007/3-540-44450-5_6
Alaa M. Khamis, Francisco J. Rodríguez, Miguel A. Salichs, Remote Interaction with Mobile Robots Autonomous Robots. ,vol. 15, pp. 267- 281 ,(2003) , 10.1023/A:1026268504593
Y. Cui, J. Widom, Practical lineage tracing in data warehouses international conference on data engineering. pp. 367- 378 ,(2000) , 10.1109/ICDE.2000.839437
Jerry R. Hobbs, Feng Pan, An ontology of time for the semantic web ACM Transactions on Asian Language Information Processing. ,vol. 3, pp. 66- 85 ,(2004) , 10.1145/1017068.1017073