Changes between Version 1 and Version 2 of HowToSetupHadoopCluster


Ignore:
Timestamp:
Jan 18, 2010, 10:05:18 PM (14 years ago)
Author:
claudiu.gheorghe
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • HowToSetupHadoopCluster

    v1 v2  
    33'''Step 2'''. After that, you can step further by following [http://www.michael-noll.com/wiki/Running_Hadoop_On_Ubuntu_Linux_(Multi-Node_Cluster) the guide for dual-node setup]. [[BR]]
    44'''Step 3'''. For extending the cluster and add the N-th slave to the cluster, you must do the following:
    5     '''Step 3.1''' Follow step 1 on the slave machine. [[BR]]
    6     '''Step 3.2''' Set the hostname of the machine to a suggestive string, let's say ''slave-N''. [[BR]]
    7     '''Step 3.3''' Add an entry to the master's /etc/hosts file like
     5    '''Step 3.1''' Follow step 1 on the ''slave-N'' machine. [[BR]]
     6    '''Step 3.2''' Copy all the contents of the <HADOOP_DIR>/conf/ directory from a working slave configuration.[[BR]]
     7    '''Step 3.3''' Set the hostname of the machine to a suggestive string, let's say ''slave-N''. [[BR]]
     8    '''Step 3.4''' Add an entry to the master's /etc/hosts file like
    89{{{
    910...
     
    1112...
    1213}}}
    13      where 10.10.10.10 is the ip of the slave-N machine. [[BR]]
    14     '''Step 3.4''' Add the same entry added in master's /etc/hosts file to each other slave's /etc/hosts, so that every slave can resolve the name ''slave-N''.[[BR]]
    15     '''Step 3.5''' Put the master's ssh public keys to ''slave-N''`s ~/.ssh directory, and check that
     14     where 10.10.10.10 is the ip of the ''slave-N'' machine. [[BR]]
     15    '''Step 3.5''' Add the same entry added in master's /etc/hosts file to each other slave's /etc/hosts, so that every slave can resolve the name ''slave-N''.[[BR]]
     16    '''Step 3.6''' Put the master's ssh public keys to ''slave-N''`s ~/.ssh directory, and check that
    1617{{{
    1718#ssh slave-N
    1819}}}
    1920    works without asking a passwsord. [[BR]]
    20     '''Step 3.6''' Add a line with ''slave-N'' in the master's <HADOOP_DIR>/conf/slaves file.
     21    '''Step 3.7''' Add a line with ''slave-N'' in the master's <HADOOP_DIR>/conf/slaves file.
    2122
    2223Make sure you install on each machine a Linux distribution that easily permits changing the computer's hostname. We have used Ubuntu 8.04 and Ubuntu 9.10 and it worked fine. We also tried with Fedora 10, but we didn't succeeded.