Hadoop Operations (Hadoop操作手册)

Excerpt

[

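If you need to maintain large, complex Hadoop clusters, Hadoop Operations (reprint edition) is indispensable. As Hadoop has become the industry standard for large-scale data processing in the data center, demand for operations-focused material has grown sharply. Sammer, a principal solution architect at Cloudera, shows you the details of running Hadoop in production, from planning, installing, and configuring the system to providing ongoing maintenance.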

]

Summary

[

    
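Rather than enumerating every possible scenario, Hadoop Operations (reprint edition) is a pragmatic operations guide that describes the steps involved in significant deployments.

This book covers:
    An overview of HDFS and MapReduce: why they exist and how they work;
    Planning a Hadoop deployment, from hardware and OS selection to network requirements;
    Setup and configuration details, organized around a list of critical properties;
    Managing resources by sharing a cluster across multiple groups;
    Runbooks for the most common cluster maintenance tasks;
    Monitoring Hadoop clusters, and learning troubleshooting from real-world examples;
    Using basic tools and techniques to handle backup and catastrophic failure.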

]

Table of Contents

Preface

1. Introduction

2. HDFS
    Goals and Motivation
    Design
    Daemons
    Reading and Writing Data
    The Read Path
    The Write Path
    Managing Filesystem Metadata
    Namenode High Availability
    Namenode Federation
    Access and Integration
    Command-Line Tools
    FUSE
    REST Support

3. MapReduce
    The Stages of MapReduce
    Introducing Hadoop MapReduce
    Daemons
    When It All Goes Wrong
    YARN

4. Planning a Hadoop Cluster
    Picking a Distribution and Version of Hadoop
    Apache Hadoop
    Cloudera's Distribution Including Apache Hadoop
    What Should I Use?
    Hardware Selection
    Master Hardware Selection
    Worker Hardware Selection
    Cluster Sizing
    Blades, SANs, and Virtualization
    Operating System Selection and Preparation
    Deployment Layout
    Software
    Hostnames, DNS, and Identification
    Users, Groups, and Privileges
    Kernel Tuning
    vm.swappiness
    vm.overcommit_memory
    Disk Configuration
    Choosing a Filesystem
    Mount Options
    Network Design
    Network Usage in Hadoop: A Review
    1 Gb versus 10 Gb Networks
    Typical Network Topologies

5. Installation and Configuration
    Installing Hadoop
    Apache Hadoop
    CDH
    Configuration: An Overview
    The Hadoop XML Configuration Files
    Environment Variables and Shell Scripts
    Logging Configuration
    HDFS
    Identification and Location
    Optimization and Tuning
    Formatting the Namenode
    Creating a /tmp Directory
    Namenode High Availability
    Fencing Options
    Basic Configuration
    Automatic Failover Configuration
    Format and Bootstrap the Namenodes
    Namenode Federation
    MapReduce
    Identification and Location
    Optimization and Tuning
    Rack Topology
    Security

6. Identity, Authentication, and Authorization
    Identity
    Kerberos and Hadoop
    Kerberos: A Refresher
    Kerberos Support in Hadoop
    Authorization
    HDFS
    MapReduce
    Other Tools and Systems
    Tying It Together

7. Resource Management
    What Is Resource Management?
    HDFS Quotas
    MapReduce Schedulers
    The FIFO Scheduler
    The Fair Scheduler
    The Capacity Scheduler
    The Future

8. Cluster Maintenance
    Managing Hadoop Processes
    Starting and Stopping Processes with Init Scripts
    Starting and Stopping Processes Manually
    HDFS Maintenance Tasks
    Adding a Datanode
    Decommissioning a Datanode
    Checking Filesystem Integrity with fsck
    Balancing HDFS Block Data
    Dealing with a Failed Disk
    MapReduce Maintenance Tasks
    Adding a Tasktracker
    Decommissioning a Tasktracker
    Killing a MapReduce Job
    Killing a MapReduce Task
    Dealing with a Blacklisted Tasktracker

9. Troubleshooting
    Differential Diagnosis Applied to Systems

Cover

Hadoop Operations (Hadoop操作手册)

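Title: Hadoop Operations (Hadoop操作手册)

Author: Sammer (USA)

Pages: 282

List price: ¥59.0

Publisher: Southeast University Press (东南大学出版社)

Publication date: 2013-06-01

ISBN: 9787564142582

PDF e-book size: 101 MB, high-resolution scan, complete edition

Baidu Cloud download: http://www.chendianrong.com/pdf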
