Changes between Version 12 and Version 13 of PDAD_Performance
- Timestamp:
- Jan 15, 2010, 4:36:46 PM (14 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
PDAD_Performance
v12 v13 9 9 As the main idea of a distributed system is to use commodity hardware, we have used a maximum of four computers from ED202, running Ubuntu 8.04, Hadoop 0.20.1, Pig 0.5.0. They were interconnected using a Gigabit switch, so as to obtain the maximum from infrastructure's point of view. 10 10 11 First, we have to overview the Hadoop architecture, to clear-up the setup decisions. There are 4 types of entities in a Hadoop cluster: NameNode, JobTracker, DataNode, TaskTracker. TBD 11 12 12 13 The first test scenario uses two nodes. For the Hadoop Framework, one of them is master, having in the same time master attributions (namenode - keeps the structure of the file system, jobTracker - keeps track of the jobs' execution), taskTracker - keeps track of the tasks... 13 14 14 [[Image(testing.png)] 15 [[Image(testing.png)]] 15 16 16 17 // TODO continue Claudiu for the whole testing infrastructure