Changes between Version 2 and Version 3 of PDAD_Applications


Ignore:
Timestamp:
Jan 14, 2010, 1:57:55 AM (14 years ago)
Author:
cristina.basescu
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • PDAD_Applications

    v2 v3  
    1010 d. Having a classification for job duration as short, medium and long, find out the most frequent (>1000) causes of failures for each of these categories. It uses event_trace table, looking at event_start_time and event_end_time columns and also event_trace.event_end_reason code table.
    1111 e. For each category from event_trace.event_end_reason code ranges, which of event_trace.event_end_reason code definitions is the most frequent?
    12  f. Number of failures for each geographical location. This application uses
     12 f. Number of failures for each geographical location. This application uses the event_trace table, more precisely the columns platform_id and node_id so as to identify the nodes that failed, and then maps them with the location by using the table node, having among others the columns node.platform_id and node.node_id.
    1313
     14== MapReduce ==
    1415
     16== Pig ==
    1517
    16 and the approach used for each of the three mediums.
    17 
    18 
    19 
    20 
    21 Obs: pt Pig, comment-urile din input, desi nu afecteaza functionarea script-ului, nu sunt ignorate iar rezultatele ce implica un COUNT (precum media) nu vor fi corecte
    22 Sol: o functie custom de citire sau scoaterea comment-urilor din fisierul de input
     18== MPI ==