当前位置：和泉文库 > 计算机 > 浏览文档

南京大学：《并行处理技术——分布式与并行计算 Distributed and Parallel computing（并行计算——结构、算法、编程）》课程教学资源（课件讲稿）专题一 MapReduce的概念、原理与应用

一、MapReduce的应用背景二、MapReduce的概念三、MapReduce的原理四、MapReduce的实现五、MapReduce的性能

文件格式：PDF，文件大小：5.44MB，售价：9.24元

文档详细内容（约32页）

专题l:MapReduce的概念、原理与应用谢磊博士南京大学计算机科学与技术系

专题1： MapReduce的概念、原理与应用谢磊博士南京大学计算机科学与技术系

主要内容：一、MapReduce的应用背景二 MapReduce的概念三、MapReduce的原理四、MapReduce的实现五、MapReduce的性能六、参考文献

一、MapReduce的应用背景三、MapReduce的原理主要内容：二、MapReduce的概念五、MapReduce的性能四、MapReduce的实现六、参考文献

MapReduce的应用背景-l Google have implemented hundreds of special- purpose computations that process large amounts of raw data, such as crawled documents,web request logs,etc

MapReduce的应用背景-1 • Google have implemented hundreds of specialpurpose computa6ons that process large amounts of raw data, – such as crawled documents, web request logs, etc

MapReduce的应用背景-1 Google's data center compute various kinds of derived data. Various representations Inverted indices of the graph structure of web documents Summaries of the number of pages The set of most crawled per host frequent queries in a given

MapReduce的应用背景-1 • Google’s data center compute various kinds of derived data. Inverted indices Various representa6ons of the graph structure of web documents Summaries of the number of pages crawled per host The set of most frequent queries in a given

MapReduce的应用背景-2 Most such computations are conceptually straightforward. However, the input data is usually large and the computations have to be distributed across hundreds or thousands of machines in order to finish in a reasonable amount of time. 数据总量 ·100~1000PB 数据处理量 ·10~100PB/天 oml 网页 ·千亿万亿索引 ·百亿千亿更新量 ·十亿~百亿天 orig 请求 ·十亿~百亿/天 X C 日志 100TB~1PB/天

MapReduce的应用背景-2 • Most such computa6ons are conceptually straighDorward. However, – the input data is usually large – and the computa6ons have to be distributed across hundreds or thousands of machines in order to finish in a reasonable amount of 6me. • The issues of – how to parallelize the computa6on, – distribute the data, – and handle failures • conspire to obscure the original simple computa6on with large amounts of complex code to deal with these issues

点击进入文档下载页（PDF格式）

共32页，可试读12页，点击继续阅读 ↓↓

您可能感兴趣的文档

南京大学：《并行处理技术——分布式与并行计算 Distributed and Parallel computing（并行计算——结构、算法、编程）》课程教学资源（课件讲稿）第九章并行数值算法（稠密矩阵运算）
南京大学：《并行处理技术——分布式与并行计算 Distributed and Parallel computing（并行计算——结构、算法、编程）》课程教学资源（课件讲稿）第八章并行数值算法（基本通信操作）
南京大学：《并行处理技术——分布式与并行计算 Distributed and Parallel computing（并行计算——结构、算法、编程）》课程教学资源（课件讲稿）第七章并行算法的一般设计过程
南京大学：《并行处理技术——分布式与并行计算 Distributed and Parallel computing（并行计算——结构、算法、编程）》课程教学资源（课件讲稿）第六章并行算法的基本设计技术
南京大学：《并行处理技术——分布式与并行计算 Distributed and Parallel computing（并行计算——结构、算法、编程）》课程教学资源（课件讲稿）第五章并行算法的一般设计方法
南京大学：《并行处理技术——分布式与并行计算 Distributed and Parallel computing（并行计算——结构、算法、编程）》课程教学资源（课件讲稿）第四章并行算法的设计基础
南京大学：《并行处理技术——分布式与并行计算 Distributed and Parallel computing（并行计算——结构、算法、编程）》课程教学资源（课件讲稿）第三章并行计算硬件结构基础（并行计算性能评测）
南京大学：《并行处理技术——分布式与并行计算 Distributed and Parallel computing（并行计算——结构、算法、编程）》课程教学资源（课件讲稿）专题二云计算的概念、技术与应用
南京大学：《并行处理技术——分布式与并行计算 Distributed and Parallel computing（并行计算——结构、算法、编程）》课程教学资源（课件讲稿）第二章并行计算硬件结构基础——当代并行机系统（SMP、MPP和Cluster）
南京大学：《并行处理技术——分布式与并行计算 Distributed and Parallel computing（并行计算——结构、算法、编程）》课程教学资源（课件讲稿）第一章并行计算硬件结构基础（并行计算机系统及其结构模型）
南京大学：《并行处理技术——分布式与并行计算 Distributed and Parallel computing（并行计算——结构、算法、编程）》课程教学资源（课件讲稿）引论 Introduction（谢磊）
计算机科学与技术（参考文献）Focus and Shoot - Efficient Identification over RFID Tags in the Specified Area
南京大学：《并行处理技术——分布式与并行计算 Distributed and Parallel computing（并行计算——结构、算法、编程）》课程教学资源（课件讲稿）专题三边缘智能（边缘计算时代的人工智能）
南京大学：《物联网技术导论 Introduction of Internet of Things》课程教学资源（课件讲稿）课程介绍 Introduction（谢磊）
南京大学：《物联网技术导论 Introduction of Internet of Things》课程教学资源（课件讲稿）第一章物联网概述
南京大学：《物联网技术导论 Introduction of Internet of Things》课程教学资源（课件讲稿）第二章智能感知技术概述
南京大学：《物联网技术导论 Introduction of Internet of Things》课程教学资源（课件讲稿）第三章传感器感知技术
南京大学：《物联网技术导论 Introduction of Internet of Things》课程教学资源（课件讲稿）第六章自动识别技术与RFID（RFID防冲突协议与无源感知技术）
南京大学：《物联网技术导论 Introduction of Internet of Things》课程教学资源（课件讲稿）第四章非传感器感知技术
南京大学：《物联网技术导论 Introduction of Internet of Things》课程教学资源（课件讲稿）专题——RFID的识别与估算机制
南京大学：《物联网技术导论 Introduction of Internet of Things》课程教学资源（课件讲稿）专题——从识别到感知（基于RFID的可标记无源感知机制研究）
南京大学：《物联网技术导论 Introduction of Internet of Things》课程教学资源（课件讲稿）第七章传感器技术（传感器网络）
南京大学：《物联网技术导论 Introduction of Internet of Things》课程教学资源（课件讲稿）第八章定位系统（定位技术）
南京大学：《物联网技术导论 Introduction of Internet of Things》课程教学资源（课件讲稿）专题——物联网定位机制（概念、原理与前沿技术）以及基于位置的服务

点击购买下载（PDF）

下载及服务说明

购买前请先查看本文档预览页，确认内容后再进行支付；
如遇文件无法下载、无法访问或其它任何问题，可发送电子邮件反馈，核实后将进行文件补发或退款等其它相关操作；
邮箱：

文档浏览记录