ALCIDE: Extracting and visualising content from large document collections to support humanities studies

作者: Giovanni Moretti , Rachele Sprugnoli , Stefano Menini , Sara Tonelli

DOI: 10.1016/J.KNOSYS.2016.08.003

关键词:

摘要: Abstract The application of research practices and methodologies from the Information Communication Technologies to Humanities studies is having a great impact on way humanities being conducted. However, although many applications have been developed automatically analyse document collections historical or literary domain, they often fail provide real support scholars because their inherent complexity: technical skills are required use them inspect output. On other hand, some systems more user-friendly, but present basic analyses limited needs specific community. In order overcome aforementioned limitations, we ALCIDE ( Analysis Language Content Digital Environment ), web-based platform designed assist in navigating analysing large quantities textual data such as sources works. This suite tools combines advanced text processing techniques with intuitive visualisations output serve broad range questions, which no comparable tool can address single platform. Textual corpora be inspected compared along five semantic dimensions: who, where, when, what how. Such dimensions different combinations allow targeting key questions disciplines, shown cases presented.

参考文章(51)
Emanuele Pianta, Roberto Zanoli, Christian Girardi, The TextPro Tool Suite language resources and evaluation. ,(2008)
Andrea Esuli, Stefano Baccianella, Fabrizio Sebastiani, SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining. language resources and evaluation. ,(2010)
Benito Mussolini, Alfredo Rocco, Scritti e discorsi politici A. Giuffrè. ,(1938)
T McEnery, P Rayson, DE Archer, SL Piao, The UCREL Semantic Analysis System European Language Resources Association. ,(2004)
Marco Guerini, Lorenzo Gatti, Marco Turchi, Sentiment Analysis: How to Derive Prior Polarities from SentiWordNet empirical methods in natural language processing. pp. 1259- 1269 ,(2013)
Peter Wittenburg, Tamás Váradi, Kimmo Koskenniemi, Steven Krauwer, Martin Wynne, CLARIN: Common language resources and technology infrastructure language resources and evaluation. ,(2008)
Jens Lehmann, Robert Isele, Max Jakob, Anja Jentzsch, Dimitris Kontokostas, Pablo N. Mendes, Sebastian Hellmann, Mohamed Morsey, Patrick van Kleef, Sören Auer, Christian Bizer, DBpedia - A Large-scale, Multilingual Knowledge Base Extracted from Wikipedia Social Work. ,vol. 6, pp. 167- 195 ,(2015) , 10.3233/SW-140134
Samuele Poy, The Public Debate on an Intractable Issue: Local Minimum Wages in Italy Rivista Italiana di Politiche Pubbliche. pp. 87- 114 ,(2015) , 10.1483/79369
Gunther Maier, None, OpenStreetMap, the Wikipedia Map REGION. ,vol. 1, pp. 3- 10 ,(2014) , 10.18335/REGION.V1I1.70
Serge Heiden, The TXM Platform: Building Open-Source Textual Analysis Software Compatible with the TEI Encoding Scheme pacific asia conference on language information and computation. ,vol. 2, pp. 389- 398 ,(2010)