作者: Antonio Moreno-Sandoval , Esteban Moro
DOI: 10.1016/J.SBSPRO.2015.07.452
关键词: Small data 、 Data science 、 Big data 、 Engineering 、 Corpus linguistics 、 Information quality 、 Field (computer science) 、 Content analysis 、 Predictive analytics 、 Data processing
摘要: Abstract Big data is a broad term for sets so large and complex that traditional processing applications are inadequate. A new field, Predictive Analytics, trying to extract value from those big (unstructured) data. In Corpus Linguistics, researchers usually deal with small this paper, we compare the amount quality of information respect single topic (flu) in Twitter MultiMedica (a corpus medicine texts).