Federated Computing for the Masses – Aggregating Resources to Tackle Large-scale Engineering Problems

Thumbnail Image
Date
2014-01-01
Authors
Diaz-Montes, Javier
Xie, Yu
Rodero, Ivan
Zola, Jaroslaw
Parashar, Manish
Major Professor
Advisor
Committee Member
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract

The complexity of many problems in science and engineering requires computational capacity exceeding what average user can expect from a single computational center. While many of these problems can be viewed as a set of independent tasks, their collective complexity easily requires millions core-hours on any state-of-the-art HPC resource, and throughput that cannot be sustained by a single multi-user queuing system. In this paper we explore the use of aggregated HPC resources to solve large-scale engineering problems. We show it is possible to build a computational federation that is easy to use by end-users, and is elastic, resilient and scalable. We argue that the fusion of federated computing and real-life engineering problems can be brought to average user if relevant middleware is provided. We report on the use of federation of 10 distributed heterogeneous HPC resources to perform a large-scale interrogation of the parameter space in the microscale fluid flow problem.

Series Number
Journal Issue
Is Version Of
Versions
Series
Type
article
Comments

This is a manuscript of an article published as Diaz-Montes, Javier, Yu Xie, Ivan Rodero, Jaroslaw Zola, Baskar Ganapathysubramanian, and Manish Parashar. "Federated Computing for the Masses--Aggregating Resources to Tackle Large-Scale Engineering Problems." Computing in Science & Engineering 16, no. 4 (2014): 62-72. DOI:10.1109/MCSE.2013.134. Posted with permission.

Rights Statement
Copyright
Wed Jan 01 00:00:00 UTC 2014
Funding
DOI
Supplemental Resources
Collections