Ubuntu 12.04建立hadoop单机版环境（ubuntu20.04安装hadoop）-大数据-知优网

在11月初的时候，我们了解了Ubuntu 12.04 搭建 hadoop 集群版环境的方法，今天再来看看在单机版环境中，Ubuntu12.04搭建hadoop是如何实现的。

在11月初的时分，咱们了解了Ubuntu 12.04 树立 hadoop 集群版环境的办法，今日再来看看在单机版环境中，Ubuntu12.04树立hadoop是怎么完成的。

　　一. 你要装置Ubuntu这一步省掉；

　　二. 在Ubuntu下创立hadoop用户组和用户;

　　1. 创立hadoop用户组：

sudo addgroup hadoop

　　如图：

Ubuntu 12.04建立hadoop单机版环境（ubuntu20.04安装hadoop） hadoop 12.04 第1张

　　2. 创立hadoop用户：

sudo adduser -ingroup hadoop hadoop

　　如图：

Ubuntu 12.04建立hadoop单机版环境（ubuntu20.04安装hadoop） hadoop 12.04 第2张

　　3. 给hadoop用户增加权限，翻开/etc/sudoers文件：

sudo gedit /etc/sudoers

　　按回车键后就会翻开/etc/sudoers文件了，给hadoop用户赋予root用户相同的权限。

　　在root ALL=(ALL:ALL) ALL下增加hadoop ALL=(ALL:ALL) ALL，

hadoop ALL=(ALL:ALL) ALL

　　如图：

Ubuntu 12.04建立hadoop单机版环境（ubuntu20.04安装hadoop） hadoop 12.04 第3张

　　三. 在Ubuntu下装置JDK

　　运用如下指令履行即可：

sudo apt-get install openjdk-6-jre

　　如图：

Ubuntu 12.04建立hadoop单机版环境（ubuntu20.04安装hadoop） hadoop 12.04 第4张

　　四. 修正机器名

　　每逢Ubuntu装置成功时，咱们的机器名都默以为：ubuntu ，但为了今后集群中能够简单分辩各台服务器，需求给每台机器取个不同的姓名。机器名由 /etc/hostname文件决议。

　　1. 翻开/etc/hostname文件：

sudo gedit /etc/hostname

　　2. 将/etc/hostname文件中的ubuntu改为你想取的机器名。这儿我取"dubin-ubuntu"。重启体系后才会收效。

　　五. 装置ssh服务

　　这儿的ssh和三大结构:spring,struts,hibernate没有什么关系，ssh能够完成长途登录和办理，详细能够参阅其他相关材料。

　　装置openssh-server，

sudo apt-get install ssh openssh-server

　　这时假定您现已装置好了ssh，您就能够进行第六步了哦~

　　六、树立ssh无暗码登录本机

　　首先要转换成hadoop用户，履行以下指令：

su - hadoop

　　如图：

Ubuntu 12.04建立hadoop单机版环境（ubuntu20.04安装hadoop） hadoop 12.04 第5张

　　ssh生成密钥有rsa和dsa两种生成方法，默许情况下选用rsa方法。

　　1. 创立ssh-key，，这儿咱们选用rsa方法：

ssh-keygen -t rsa -P ""

　　如图：

Ubuntu 12.04建立hadoop单机版环境（ubuntu20.04安装hadoop） hadoop 12.04 第6张

　　（注：回车后会在~/.ssh/下生成两个文件：id_rsa和id_rsa.pub这两个文件是成对呈现的）

　　2. 进入~/.ssh/目录下，将id_rsa.pub追加到authorized_keys授权文件中，开端是没有authorized_keys文件的：

cd ~/.ssh
cat id_rsa.pub >> authorized_keys

　　如图：

　　（完成后就能够无暗码登录本机了。）

　　3. 登录localhost：

ssh localhost

　　如图：

　　( 注：当ssh长途登录到其它机器后，现在你操控的是长途的机器，需求履行退出指令才干从头操控本地主机。)

　　4. 履行退出指令：

exit

　　七. 装置hadoop

　　咱们选用的hadoop版别是：hadoop-0.20.203（http://www.apache.org/dyn/closer.cgi/hadoop/common/ ），由于该版别比较稳定。

　　1. 假定hadoop-0.20.203.tar.gz在桌面，将它复制到装置目录 /usr/local/下：

sudo cp hadoop-0.20.203.0rc1.tar.gz /usr/local/

　　2. 解压hadoop-0.20.203.tar.gz：

cd /usr/local
sudo tar -zxf hadoop-0.20.203.0rc1.tar.gz

　　3. 将解压出的文件夹改名为hadoop：

sudo mv hadoop-0.20.203.0 hadoop

　　4. 将该hadoop文件夹的属主用户设为hadoop：

sudo chown -R hadoop:hadoop hadoop

　　5. 翻开hadoop/conf/hadoop-env.sh文件：

sudo gedit hadoop/conf/hadoop-env.sh

　　6. 装备conf/hadoop-env.sh（找到#export JAVA_HOME=...,去掉#，然后加上本机jdk的途径）：

export JAVA_HOME=/usr/lib/jvm/java-6-openjdk

　　7. 翻开conf/core-site.xml文件：

sudo gedit hadoop/conf/core-site.xml

　　修改如下：

<?xmlversion="1.0"?>
<?xml-stylesheettype="text/xsl"href="configuration.xsl"?> 
<!--Putsite-specificpropertyoverridesinthisfile.-->
 
<configuration>
<property> 
<name>fs.default.name</name> 
<value>hdfs://localhost:9000</value> 
</property> 
</configuration>

　　8. 翻开conf/mapred-site.xml文件：

sudo gedit hadoop/conf/mapred-site.xml

　　修改如下：

<?xmlversion="1.0"?>
<?xml-stylesheettype="text/xsl"href="configuration.xsl"?> 
<!--Putsite-specificpropertyoverridesinthisfile.--> 
<configuration> 
<property> 
<name>mapred.job.tracker</name> 
<value>localhost:9001</value> 
</property> 
</configuration>

　　9. 翻开conf/hdfs-site.xml文件：

sudo gedit hadoop/conf/hdfs-site.xml

　　修改如下：

<configuration>
<property>
<name>dfs.name.dir</name>
<value>/usr/local/hadoop/datalog1,/usr/local/hadoop/datalog2</value>
</property>
<property>
<name>dfs.data.dir</name>
<value>/usr/local/hadoop/data1,/usr/local/hadoop/data2</value>
</property>
<property>
<name>dfs.replication</name>
<value>2</value>
</property>
</configuration>