Skip to main navigation Skip to search Skip to main content

Combining In-Situ and In-Transit Processing to Enable Extreme-Scale Scientific Analysis

  • Janine C. Bennett
  • , Hasan Abbasi
  • , Peer Timo Bremer
  • , Ray Grout
  • , Attila Gyulassy
  • , Tong Jin
  • , Scott Klasky
  • , Hemanth Kolla
  • , Manish Parashar
  • , Valerio Pascucci
  • , Philippe Pebay
  • , David Thompson
  • , Hongfeng Yu
  • , Fan Zhang
  • , Jacqueline Chen
  • Sandia National Laboratories
  • Oak Ridge National Laboratory
  • Lawrence Livermore National Laboratory
  • University of Utah
  • Rutgers - The State University of New Jersey, New Brunswick
  • Kitware, Inc

Research output: Contribution to conferencePaperpeer-review

171 Scopus Citations

Abstract

With the onset of extreme-scale computing, I/O constraints make it increasingly difficult for scientists to save a sufficient amount of raw simulation data to persistent storage. One potential solution is to change the data analysis pipeline from a post-process centric to a concurrent approach based on either in-situ or in-transit processing. In this context computations are considered in-situ if they utilize the primary compute resources, while in-transit processing refers to offloading computations to a set of secondary resources using asynchronous data transfers. In this paper we explore the design and implementation of three common analysis techniques typically performed on large-scale scientific simulations: topological analysis, descriptive statistics, and visualization. We summarize algorithmic developments, describe a resource scheduling system to coordinate the execution of various analysis workflows, and discuss our implementation using the DataSpaces and ADIOS frameworks that support efficient data movement between in-situ and in-transit computations. We demonstrate the efficiency of our lightweight, flexible framework by deploying it on the Jaguar XK6 to analyze data generated by S3D, a massively parallel turbulent combustion code. Our framework allows scientists dealing with the data deluge at extreme scale to perform analyses at increased temporal resolutions, mitigate I/O costs, and significantly improve the time to insight.

Original languageAmerican English
Number of pages9
DOIs
StatePublished - 2012
Event2012 24th International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2012 - Salt Lake City, UT, United States
Duration: 10 Nov 201216 Nov 2012

Conference

Conference2012 24th International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2012
Country/TerritoryUnited States
CitySalt Lake City, UT
Period10/11/1216/11/12

NLR Publication Number

  • NREL/CP-2C00-59071

Fingerprint

Dive into the research topics of 'Combining In-Situ and In-Transit Processing to Enable Extreme-Scale Scientific Analysis'. Together they form a unique fingerprint.

Cite this