Changes between Version 12 and Version 13 of PDAD_Performance


Timestamp:
Jan 15, 2010, 4:36:46 PM (14 years ago)
Author:
claudiu.gheorghe
Comment:

--

  • PDAD_Performance

    v12 v13  
    99Because the main idea of a distributed system is to use commodity hardware, we used at most four computers from ED202, running Ubuntu 8.04, Hadoop 0.20.1 and Pig 0.5.0. They were interconnected through a Gigabit switch, so as to get the most out of the available infrastructure.
    1010
     11First, we give an overview of the Hadoop architecture, to clarify the setup decisions. There are four types of entities in a Hadoop cluster: NameNode, JobTracker, DataNode and TaskTracker. TBD
    1112
    1213The first test scenario uses two nodes. For the Hadoop framework, one of them is the master, holding at the same time the master roles (NameNode - keeps the structure of the file system, JobTracker - keeps track of the jobs' execution), TaskTracker - keeps track of the tasks...
    1314
    14 [[Image(testing.png)]
     15[[Image(testing.png)]]
    1516
    1617// TODO continue Claudiu for the whole testing infrastructure
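
The two-node scenario described in the diff above implies that every node has to know where the NameNode and the JobTracker run. The following is a minimal sketch of how the corresponding Hadoop 0.20 site files could be generated. The property names `fs.default.name`, `mapred.job.tracker` and `dfs.replication` are the ones used by Hadoop 0.20; the host name `master`, the ports 9000 and 9001, the replication factor and the helper names `site_xml` and `cluster_config` are assumptions made for illustration and do not come from the page.

```python
from xml.sax.saxutils import escape

# Assumed values: the wiki page names neither the hosts nor the ports.
NAMENODE_PORT = 9000
JOBTRACKER_PORT = 9001


def site_xml(properties):
    """Render a Hadoop *-site.xml document from a dict of properties."""
    lines = ['<?xml version="1.0"?>', "<configuration>"]
    for name, value in properties.items():
        lines.append("  <property>")
        lines.append(f"    <name>{escape(name)}</name>")
        lines.append(f"    <value>{escape(str(value))}</value>")
        lines.append("  </property>")
    lines.append("</configuration>")
    return "\n".join(lines) + "\n"


def cluster_config(master, replication):
    """Return the three site files, keyed by file name (hypothetical helper)."""
    return {
        # Where clients and DataNodes find the NameNode.
        "core-site.xml": site_xml(
            {"fs.default.name": f"hdfs://{master}:{NAMENODE_PORT}"}
        ),
        # Where TaskTrackers find the JobTracker.
        "mapred-site.xml": site_xml(
            {"mapred.job.tracker": f"{master}:{JOBTRACKER_PORT}"}
        ),
        # How many copies of each HDFS block are kept.
        "hdfs-site.xml": site_xml({"dfs.replication": replication}),
    }


if __name__ == "__main__":
    for filename, content in cluster_config("master", 2).items():
        print(f"--- conf/{filename}")
        print(content)
```

In this sketch the same three files would be copied to the `conf/` directory of every node, so that the daemons on each node resolve the master the same way.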