Improving the Reproducibility of PAN's Shared Tasks: - Plagiarism Detection, Author Identification, and Author Profiling.

作者: Tim Gollub , Efstathios Stamatatos , Martin Potthast , Benno Stein , Paolo Rosso

DOI:

关键词:

摘要: This paper reports on the PAN 2014 evaluation lab which hosts three shared tasks plagiarism detection, author identification, and profiling. To improve reproducibility of in general, PAN’s particular, Webis group developed a new web service called TIRA, facilitates software submissions. Unlike many other labs, asks participants to submit running softwares instead their run output. deal with organizational overhead involved handling submissions, TIRA experimentation platform helps significantly reduce workload for both organizers, whereas submitted are kept state. year, we addressed matter responsibility successful execution order put back charge executing at our site. In sum, 57 have been lab; together 58 submissions last this forms largest collection date, all readily available further analysis. The report concludes brief summary each task.

参考文章(56)
Alexander F. Gelbukh, Grigori Sidorov, Miguel A. Sanchez-Perez, A Winning Approach to Text Alignment for Text Reuse Detection at PAN 2014. CLEF (Working Notes). pp. 1004- 1011 ,(2014)
Michael Völske, Martin Potthast, Matthias Hagen, Benno Stein, Exploratory Search Missions for TREC Topics. EuroHCIR. pp. 7- 10 ,(2013)
Vlado Keselj, Evangelos E. Milios, Magdalena Jankowska, CNG Text Classification for Authorship Profiling Task Notebook for PAN at CLEF 2013. CLEF (Working Notes). ,(2013)
Kyle Williams, C. Lee Giles, Hung Hsuan Chen, Sagnik Ray Choudhury, Unsupervised Ranking for Plagiarism Source Retrieval Notebook for PAN at CLEF 2013. cross language evaluation forum. ,vol. 1180, pp. 1021- 1026 ,(2013)
Luis Villaseñor-Pineda, Manuel Montes-y-Gómez, Hugo Jair Escalante, Luis Enrique, A. Pastor López-Monroy, Using Intra-Profile Information for Author Profiling Notebook for PAN at CLEF 2014 ,(2014)
Alberto Barrón-Cedeno, Andreas Eiselt, Martin Potthast, Benno Stein, Paolo Rosso, Overview of the 1st international competition on plagiarism detection SEPLN 2009 - 3rd Workshop on Uncovering Plagiarism, Authorship and Social Software Misuse, PAN 2009 and 1st International Competition on Plagiarism Detection. ,vol. 502, pp. 1- 9 ,(2009)
Patrick Juola, Authorship Attribution ,(2008)
Efstathios Stamatatos, Patrick Juola, Overview of the Author Identification Task at PAN 2013 CLEF (Working Notes). ,(2013)
Guido Zarrella, John D. Burger, John Henderson, George Kim, Discriminating Gender on Twitter empirical methods in natural language processing. pp. 1301- 1309 ,(2011)
Tim Gollub, Martin Potthast, Anna Beyer, Matthias Busse, Francisco Rangel, Paolo Rosso, Efstathios Stamatatos, Benno Stein, Recent Trends in Digital Text Forensics and Its Evaluation cross language evaluation forum. pp. 282- 302 ,(2013) , 10.1007/978-3-642-40802-1_28