Skip to Content

Support Architecture for Large-Scale Subsurface Analysis (SALSSA)

Overview

In this SciDAC SAP, we have developed a user environment that integrates data management, workflow, and visualization tools to support model execution and analysys. The framework has been used to run both the Subsurface Transport Over Multiple Phases (STOMP) and Smoothed Particle Hydrodynamics (SPH) codes though the framework itself is generic and can be applied to other models. We are leveraging technologies developed by SciDAC centers including the Kepler workflow engine (SDM Center) for job execution and data staging, and the Visit visualization system from VACET. The framework includes a data and provenance tracking system to keep track of the simulations, the inputs and outputs, and the analyses. Large output files can be sent to an archive or referred to via their URIs.

SALSSA can be thought of as both an activity tracking system and a dashboard that summarizes modeling and analysis activity.

Shown in this figure are a provenance graph of simulations, a tablular view of simulations, the set of tools available, an SPH editor that can generate multiple parameter realizations, job launching and archiving dialogs, a detailed view of an individual SPH run, and some visualizations of SPH data.

The SALSSA framework has been applied to the validation of the SPH model. In this validation study, approximately 50 simulations were performed to initially test the model, then explore the combination of parameters that give the stable solution and finally to assess the affects of changes to the parameter space.

Key Capabilities

Please see our latest poster for more information.

SALSSA Software Releases

Several development versions of the SALSSA data management and workflow environment have been released. These releases are available for download and each is briefly described in the release summaries below. SALSSA releases previous to version 3.0 require a data management system to be installed and because of this are only available on site for project staff and collaborators. SALSSA, starting with version 3.0, can be run without a shared data managment system.

SALSSA 3.0 (May 2011)

This release added the capability for task parallel job execution. A set of simulations can now be submitted as a single job where all simulations are run independenly. A benefit of this approach is that a single job is placed in the queue and latency of waiting for individual jobs to get their turn in the queue is avoided. A spreadsheet interface for SPH execution was created, suporting the setup of a multi-simulation SPH job within the SALSSA Organizer. Also new to this version is the way in which SALSSA software is provided. izpack is now being used to package SALSSA software and generate a single installer application for each supported platform, greatly simplifying the distribution and installation process. Other features developed for this release include the capability to run SALSSA in an offline mode removing the requirement for a shared data management server, execution on NERSC systems, upgrade to the Kepler workfow software, improvements to archive/unarchive, and documentation was created describing how to add your own simulation code to SALSSA, see Adding a New Code.

SALSSA 2.4 (Jan 2010)

This release added context panels for individual simulations. From these panels, a user can easily get a realtime plot of the simulation progress, see a summary of the input parameters, job status, and output file locations. Other features developed for this release include a Parameter study editor for the SPH code, the capability to reconnect to jobs in case of failure of you shut your system down, and support for archiving large simulation outputs.

SALSSA 2.2 (Feb 09)

This version of SALSSA was used to perform the Smoothed Particle Hydrodynamics Validation Study.

New features include:

SALSSA 2.0 (Sep 08)

This prototype provides a graph-based view of activities. The activities can be chained together to an arbitrary depth and complexity. Jobs can be run to multiple workstations. The data management and provenance is based fully on rdf relationships. SALSSA v2.0 has been tested by running several of the simulations described on the Benchmark Application page, and has been provided to Idaho National Laboratory collaborators for use in performing additional design simulations.

SALSSA 1.0 (Jan 08)

An initial prototype with the capability to set up STOMP parameter studies, run jobs to workstations, and view the runs, jobs, and job files.

Additional Information

Project Contributors

Past Contributors