Internet Traffic Volumes Are Not Gaussian -- They Are Log-Normal: An 18-Year Longitudinal Study With Implications for Modelling and Prediction (Complete Version).

作者: Richard Clegg , George Parisis , Mohammed Alasmar , Nickolay Zakhleniuk

DOI:

关键词: Statistical modelPercentileComputer scienceLog-normal distributionService-level agreementWeibull distributionStatisticsDistribution (mathematics)Internet trafficGaussian

摘要: Getting good statistical models of traffic on network links is a well-known, often-studied problem. A lot attention has been given to correlation patterns and flow duration. The distribution the amount per unit time an equally important but less studied We study large number traces from many different networks including academic, commercial residential using state-of-the-art techniques. show that obeys log-normal which better fit than Gaussian commonly claimed in literature. also investigate alternative heavy-tailed (the Weibull) its performance worse log-normal. examine anomalous exhibit poor for all distributions tried this often due outages or hit maximum capacity. demonstrate data we look at stationary if consider samples 15- minute long even 1-hour long. This gives confidence can use estimation modelling purposes. utility our findings two contexts: predicting proportion will exceed level (for service agreement link capacity estimation) 95th percentile pricing. predictor Weibull both contexts.

参考文章(40)
Ricardo de Oliveira Schmidt, Ramin Sadre, Anna Sperotto, Hans van den Berg, Aiko Pras, Impact of Packet Sampling on Link Dimensioning IEEE Transactions on Network and Service Management. ,vol. 12, pp. 392- 405 ,(2015) , 10.1109/TNSM.2015.2436365
Ricardo de O. Schmidt, Hans van den Berg, Aiko Pras, Measurement-based network link dimensioning integrated network management. pp. 1071- 1077 ,(2015) , 10.1109/INM.2015.7140435
Xiaowei Yang, Designing traffic profiles for bursty Internet traffic global communications conference. ,vol. 3, pp. 2149- 2154 ,(2002) , 10.1109/GLOCOM.2002.1189012
Denis Kwiatkowski, Peter C.B. Phillips, Peter Schmidt, Yongcheol Shin, Testing the null hypothesis of stationarity against the alternative of a unit root: How sure are we that economic time series have a unit root? Journal of Econometrics. ,vol. 54, pp. 159- 178 ,(1992) , 10.1016/0304-4076(92)90104-Y
Xenofontas Dimitropoulos, Paul Hurley, Andreas Kind, Marc Ph. Stoecklin, On the 95-Percentile Billing Method Lecture Notes in Computer Science. pp. 207- 216 ,(2009) , 10.1007/978-3-642-00975-4_21
Jin Cao, William S. Cleveland, Dong Lin, Don X. Sun, On the nonstationarity of Internet traffic Proceedings of the 2001 ACM SIGMETRICS international conference on Measurement and modeling of computer systems - SIGMETRICS '01. ,vol. 29, pp. 102- 112 ,(2001) , 10.1145/378420.378440
José Luis García-Dorado, José Alberto Hernández, Javier Aracil, Jorge E. López de Vergara, Sergio Lopez-Buedo, Characterization of the busy-hour traffic of IP networks based on their intrinsic features Computer Networks. ,vol. 55, pp. 2111- 2125 ,(2011) , 10.1016/J.COMNET.2011.02.015
K. Thompson, G.J. Miller, R. Wilder, Wide-area Internet traffic patterns and characteristics IEEE Network. ,vol. 11, pp. 10- 23 ,(1997) , 10.1109/65.642356
David A. Dickey, Wayne A. Fuller, Distribution of the Estimators for Autoregressive Time Series with a Unit Root Journal of the American Statistical Association. ,vol. 74, pp. 427- 431 ,(1979) , 10.1080/01621459.1979.10482531
Ignacio Castro, Rade Stanojevic, Sergey Gorinsky, Using tuangou to reduce IP transit costs IEEE ACM Transactions on Networking. ,vol. 22, pp. 1415- 1428 ,(2014) , 10.1109/TNET.2013.2278236