Developing an optimised activity type annotation method based on classification accuracy and entropy indices

Authors: Wim Ectors, Sofie Reumers, Won Do Lee, Keechoo Choi, Bruno Kochan

DOI: 10.1080/23249935.2017.1331275

Keywords: Data mining, Annotation, Computer science, Big data, Original data, Activity classification, Machine learning, Entropy (information theory), Inference, Statistical classification, Artificial intelligence

Abstract: The generation of substantial amounts of travel- and mobility-related data has spawned the emergence of the era of big data. However, this data generally lacks activity-travel information such as trip purpose. This deficiency led to the development of trip purpose inference (activity type imputation/annotation) techniques, the performance of which depends on the available input data and the (number of) activity type classes to infer. Aggregating activity types strongly increases the inference accuracy and is usually left to the discretion of the researcher. As this is open for interpretation, it undermines the reported inference accuracy. This study developed an optimised activity type classification methodology by identifying classes of activity types with an optimal balance between improving model accuracy and preserving activity type information from the original data set. A sensitivity analysis was performed. Additionally, several machine learning algorithms are experimented with. The proposed method may be app...
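The trade-off described in the abstract can be illustrated with a small sketch. The abstract does not state how the entropy index is defined, so the code below assumes plain Shannon entropy over the activity type labels as a stand-in for "information preserved"; it is not the authors' formulation. The labels, the merge mapping, and the function names `shannon_entropy` and `aggregate` are all hypothetical.

```python
import math
from collections import Counter


def shannon_entropy(labels):
    """Shannon entropy (in bits) of the empirical label distribution."""
    counts = Counter(labels)
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def aggregate(labels, mapping):
    """Relabel activities; labels missing from the mapping are kept unchanged."""
    return [mapping.get(label, label) for label in labels]


# Hypothetical activity diary: 4 home, 2 work, 1 shopping, 1 leisure.
labels = ["home"] * 4 + ["work"] * 2 + ["shopping"] + ["leisure"]

# Hypothetical aggregation: merge two rare classes into a single "other" class.
merged = aggregate(labels, {"shopping": "other", "leisure": "other"})

h_original = shannon_entropy(labels)  # entropy of the original class set
h_merged = shannon_entropy(merged)    # entropy after aggregation
retained = h_merged / h_original      # share of label entropy that survives

print(f"original: {h_original:.2f} bits, merged: {h_merged:.2f} bits, "
      f"retained: {retained:.1%}")
```

Merging classes can only lower or preserve this entropy, while a classifier typically finds the coarser label set easier to predict. An optimisation of the kind the abstract describes would weigh that accuracy gain against the share of label information that is lost.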
