HDF5 VOL Status Report – Exascale Computing Project

As part of our work with the Exascale Computing Project (ECP), Scot Breitenfeld, Hyo-Kyung Lee, and Larry Knox prepared this status report on the HDF5 VOL. This report provides an overview of the HDF5 VOL connectors created for the ECP.

A Kind of Magic: Storing Computations in HDF5

The purpose of this introduction is to highlight and celebrate a community contribution the impact of which we are just beginning to understand. Its principal author, Mr. Lucas C. Villa Real, calls it HDF5-UDF and describes it as “a mechanism to generate HDF5 dataset values on-the-fly using user-defined functions (UDFs).” This matter- of-fact characterization is quite accurate, but I would like to provide some context for what this means for us users of HDF5.

Citations for HDF Data and Software

The topic of software citation has been discussed in many forums recently and several major discovery repositories (e.g. zenodo and DataCite) support metadata for software in addition to datasets and other resource types. HDF5 stradles the boundary between the dataset and software worlds. It is most commonly thought of and referred to as a data format, but, as in any case, data written in the HDF formats can not be read without HDF software. So, the answer to the question: is it a format or is it software? is clearly both.

Scroll to Top