05.06.2013 Views

PNNL-13501 - Pacific Northwest National Laboratory

PNNL-13501 - Pacific Northwest National Laboratory

PNNL-13501 - Pacific Northwest National Laboratory

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Study Control Number: PN99078/1406<br />

Visualization and Active Guidance of Scientific Computations<br />

Pak Chung Wong<br />

We developed the concept of a data signature, which captures the essence of a large, multidimensional scientific data set<br />

in a compact format, and used it to conduct analysis as if using the original. Our results have shown that the technology<br />

can be applied to a wide variety of computational sciences and engineering studies—such as climate, combustion,<br />

imaging, and computational fluid dynamics.<br />

Project Description<br />

The purpose of this project is to develop the concept of a<br />

data signature that captures the essence of a multidimensional<br />

scientific data set in a compact format, and<br />

use it to conduct analysis as if using the original. A data<br />

signature, because of its compact design, always requires<br />

fewer computational resources than its original. We<br />

developed two novel approaches based on feature<br />

extraction and wavelet transforms for creating data<br />

signatures of very large time-dependent data sets. We<br />

successfully developed our first signature design (featurebased)<br />

on multiple parallel and desktop computer<br />

platforms. This system was applied to different<br />

simulation studies, including global climate and<br />

combustion. The experimental results were documented<br />

in two peer-reviewed journal and conference papers.<br />

Introduction<br />

Today, as data sets used in computations grow in size and<br />

complexity, the technologies developed over the years to<br />

manage scientific data sets have become less efficient and<br />

effective. Many frequently used operations, such as<br />

Eigenvector computation, could quickly exhaust our<br />

desktop workstations once the data size reached certain<br />

limits. While building new machines with more resources<br />

might conquer the data size problems, the complexity of<br />

today’s computations requires a new breed of projection<br />

techniques to support analysis of the data and verification<br />

of the results. Our solution is the data signature, that<br />

captures the essence of a scientific data set in a compact<br />

format and is then used to conduct analysis as if using the<br />

original.<br />

A data signature can be described as a mathematical data<br />

vector that captures the essence of a large data set in a<br />

fraction of its original size. It is designed to characterize<br />

a portion of a data set, such as an individual time frame of<br />

a scientific simulation. These signatures enable us to<br />

conduct analysis at a higher level of abstraction and yet<br />

still reflect the intended results as if using the original<br />

data.<br />

Approach<br />

To date, we have investigated designing signatures for<br />

scalar fields, tensor fields, and a combination of them<br />

with multiple parameters. The construction of a data<br />

signature is based on one or more of the following<br />

features and approaches:<br />

• velocity gradient tensors<br />

• critical points and their eigenvalues<br />

• orthogonal and non-orthogonal edges<br />

• covariance matrices<br />

• intensity histograms<br />

• content segmentation<br />

• conditional probability.<br />

The selected features often depend on data type.<br />

Information such as velocity gradient and critical points is<br />

suitable for vector and tensor field data. Edge detection,<br />

covariance, and histogram can be applied to scalar data.<br />

For large scientific simulations, we sometimes rely on a<br />

multi-resolution approach to determine the desirable size<br />

of the signatures.<br />

Results and Accomplishments<br />

In theory, a data signature can never be as descriptive as<br />

its full-size original. In practice, however, a well-defined<br />

data signature can be as good or better from many<br />

perspectives because it brings out specific details and<br />

eliminates less important information from consideration.<br />

The concept is useful when we study the characteristics of<br />

scientific simulations, such as global climate modeling.<br />

Computer Science and Information Technology 169

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!