摘 要:当今社会,随着网络技术的广泛普及,互联网中的数据正以前所未有的速度增长和积累。大数据已经走入我们的生活,提高对海量数据的应用效率成为现在社会发展的焦点。Map Reduce并行编程环境,已经在大数据处理领域得到了广泛的应用。同时,学术界也对Map Reduce的相关运算做出了巨大贡献,有效地推动了Map Reduce的发展。 本论文主要围绕基于Map Reduce模式的大数据聚集方法,提出聚集运算,用Map Reduce函数进行分组运算,统计,求最大值最小值。聚集运算不仅能够提高运算效率,而且能减少远算时间。37817 毕业论文关键词:Map Reduce;聚集运算;求和;最大值或最小值
Study of on the Data Aggregation Method Based on the Map Reduce
Abstract: With the development of computer network and Internet widely available, the data of Internet with hitherto unknown speed growth and accumulation, the big data has been entered into our life. To improve the efficiency of massive data has become the focus of the development of modern society. Map Reduce parallel programming environment, has been widely used in the field of big data processing. At the same time, academic circles also made a great contribution to related algorithms of the Map Reduce, effectively promoted the development of Map Reduce. This paper focuses on the data aggregation method based on the Reduce model of Map, and uses the Map Reduce function to find the grouping operation, statistics, and the maximum and minimum value. put forward the aggregation algorithm. Aggregation algorithm not only can improve the computational efficiency, but also can effectively reduce the operation time. To this end, according to the features and advantages of Map Reduce large data put forward algorithm.
Key Words: Map Reduce; Clustering algorithm; Count; Maximum or Minimum
目 录