| 1 | = PDAD: Parallel Data Analysis Diff = |
| 2 | |
| 3 | * Acronym: '''PDAD''' |
| 4 | * SVN: https://svn-batch.grid.pub.ro/svn/PP2009/proiecte/pdad |
| 5 | |
| 6 | * Team members: Cristina Basescu - cristina.basescu, Claudiu-Dan Gheorghe - cluster_account_name2 |
| 7 | * Project description: compare data analysis performed using (a) Hadoop's MapReduce (b) Hadoop's Pig (c) MPI |
| 8 | |
| 9 | == Technologies and Languages == |
| 10 | * [http://hadoop.apache.org/ Hadoop Framework] |
| 11 | * [http://hadoop.apache.org/mapreduce/ MapReduce subproject] |
| 12 | * [http://hadoop.apache.org/pig/ Pig subproject] |
| 13 | * MPI |
| 14 | |
| 15 | == Project Activity == |
| 16 | * Oct 25 - install Hadoop framework and get familiar with MapReduce and Pig; run examples |
| 17 | * Nov 2 - project roadmap |
| 18 | * Nov 22 - ideas for data analysis applications to implement |
| 19 | * Dec 3 - decide on two data analysis applications; start implementation |
| 20 | |
| 21 | == Proposed Data Analysis Applications == |
| 22 | * |