Changes between Version 14 and Version 15 of PDAD_Performance


Ignore:
Timestamp:
Jan 17, 2010, 9:39:16 PM (14 years ago)
Author:
claudiu.gheorghe
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • PDAD_Performance

    v14 v15  
    4141In the 2-cluster node, MapReduce's tunning gave results and it performed much better than Pig. [[Image(mr_vs_pig.png)]]
    4242
    43 The framework is indeed suitable for large data processing, as shown in the charts below. Increasing the replication factor would have probably increased the throughput for the seti dataset [[Image(throughput_2.png)]] [[Image(time_char_1.png)]] [[Image(time_chart.png)]].
     43The framework is indeed suitable for large data processing, as shown in the charts below. The biggest throughput (17,8MB/s) is achieved on the biggest data set, while the smaller test is the less efficient.
     44
     45[[Image(throughput.png)]] [[Image(time_char_1.png)]] [[Image(size_chart.png)]].
    4446
    4547Here are [http://spreadsheets.google.com/ccc?key=0Av7LR4rlPvTEdGFRQUdfSklFR29pR0NGRmowZ0otZGc&hl=en some detailed tests results] with the number of tasks, maps and reduces.