= PDAD: Parallel Data Analysis Diff = * Acronym: '''PDAD''' (who's your ''parallel'' daddy) * SVN: https://svn-batch.grid.pub.ro/svn/PP2009/proiecte/PDAD * Original repository that shows the project's evolution, on !GoogleCode: http://code.google.com/p/pdad/ * SVN project members: svn checkout https://pdad.googlecode.com/svn/trunk/ pdad --username google_username * SVN non-members: svn checkout http://pdad.googlecode.com/svn/trunk/ pdad-read-only * Team members: Cristina Basescu - cristina.basescu, Claudiu-Dan Gheorghe - claudiu.gheorghe * Project description: compare data analysis performed using (a) [http://hadoop.apache.org/mapreduce Hadoop's MapReduce] (b) [http://hadoop.apache.org/pig/ Hadoop's Pig] (c) [http://www.mcs.anl.gov/research/projects/mpich2/ MPI] * [https://svn-batch.grid.pub.ro/svn/PP2009/proiecte/PDAD/trunk/PDAD.pdf Presentation] == Contents == * [https://ncit-cluster.grid.pub.ro/trac/PP2009/wiki/PDAD_Introduction Introduction] * Motivation * Goals * Hadoop Framework * MPI * [https://ncit-cluster.grid.pub.ro/trac/PP2009/wiki/PDAD_Applications Applications] * !MapReduce * Pig * MPI * [https://ncit-cluster.grid.pub.ro/trac/PP2009/wiki/PDAD_Performance Performance Analysis] * Testing Infrastructure * Parameters * Results * [https://ncit-cluster.grid.pub.ro/trac/PP2009/wiki/PDAD_Conclusions Conclusions]