作者: Herodotos Herodotou
DOI:
关键词:
摘要: Hadoop MapReduce is now a popular choice for performing large-scale data analytics. This technical report describes detailed set of mathematical performance models describing the execution job on Hadoop. The describe dataflow and cost information at fine granularity phases within map reduce tasks execution. can be used to estimate jobs as well find optimal configuration settings use when running jobs.