Changes between Version 1 and Version 2 of PDAD


Ignore:
Timestamp:
Nov 29, 2009, 11:24:22 PM (14 years ago)
Author:
cristina.basescu
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • PDAD

    v1 v2  
    11= PDAD: Parallel Data Analysis Diff =
    22
    3  * Acronym: '''PDAD'''
     3 * Acronym: '''PDAD''' (who's your 'parallel' daddy)
    44 * SVN: https://svn-batch.grid.pub.ro/svn/PP2009/proiecte/pdad
    55
     
    2020
    2121== Proposed Data Analysis Applications ==
    22  *
     22 * '''Image filters and metadata processing''' - this is a scenario where people upload pictures on an website and want to apply a filter (such as blurr, sharpen, emboss etc) on them, while the company would like to make statistics regarding the pictures' metadata, such as camera type, shutter speed, ambient light levels, whether the flash was used, etc. This is a typical map-reduce application, especially for the metadata phase: map jobs extract the necessary metadata information and group it, for example, by producer, and the reduce jobs count the number of occurences. For the filter phase, the map job applies the filter, while the reduce job is an idempotent one.
     23// ''TODO'': find source for downloading data
     24
     25 * '''Inverted-index for e-mails'''
     26// ''TODO'' Add description
     27
     28 * Semantic web - Reccomendation system
     29// ''TODO'' Add description
     30