Changes between Version 2 and Version 3 of PDAD_Applications
- Timestamp:
- Jan 14, 2010, 1:57:55 AM (14 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
PDAD_Applications
v2 v3 10 10 d. Having a classification for job duration as short, medium and long, find out the most frequent (>1000) causes of failures for each of these categories. It uses event_trace table, looking at event_start_time and event_end_time columns and also event_trace.event_end_reason code table. 11 11 e. For each category from event_trace.event_end_reason code ranges, which of event_trace.event_end_reason code definitions is the most frequent? 12 f. Number of failures for each geographical location. This application uses 12 f. Number of failures for each geographical location. This application uses the event_trace table, more precisely the columns platform_id and node_id so as to identify the nodes that failed, and then maps them with the location by using the table node, having among others the columns node.platform_id and node.node_id. 13 13 14 == MapReduce == 14 15 16 == Pig == 15 17 16 and the approach used for each of the three mediums. 17 18 19 20 21 Obs: pt Pig, comment-urile din input, desi nu afecteaza functionarea script-ului, nu sunt ignorate iar rezultatele ce implica un COUNT (precum media) nu vor fi corecte 22 Sol: o functie custom de citire sau scoaterea comment-urilor din fisierul de input 18 == MPI ==