Chris Bunch @ UCSB

MapScale: A Cloud Environment for Scientific Computing


Chris Bunch (CS), Brian Drawert (CSE), Matthew Norman (CS)


As we have shown in our first homework assignment, the MapReduce framework presents an novel opportunity to parallelize scientific computing applications. We thus aim at implementing each of the eight benchmarks that make up the NAS Parallel Benchmarks with the MapReduce framework. After this has been accomplished, we will evaluate their performance for varying numbers of nodes as provided to us by our underlying cloud infrastructure (Eucalyptus) and platform (AppScale). If time permits, we will also evaluate the performance of our system on Amazon's Elastic Compute Cloud and note the performance differences between the two infrastructures. We are also interested in implementing some of the less difficult benchmarks across several languages and runtimes in order to explore performance differences across this spectrum.

Progress Report (as of 5/18): We have distributed the eight NAS Parallel Benchmarks amongst the three of us in the following fashion:


Chris Bunch
Brian Drawert
Matthew Norman

The writeup for the paper has also been started, since the introduction and other informative sections on the technologies involved can be completed without having finished our performance analysis and optimizations. We are certainly open to suggestions and recommendations on the various algorithms, as we suspect some of these algorithms are sufficiently complex to require multiple rounds of MapReduce to yield their final answer.

Presentation Slides

Paper: MapScale: A Cloud Environment for Scientific Computing