Dataset structure

The dataset is an essential concept of the ASpecD framework and in turn the radiometry package, as it abstracts the different measurement data file formats and combines both, numerical data and metadata, in an easily accessible way. Even more, the general structure of a dataset allows to compare data of entirely different origin (read: experimental method), as long as their axes are compatible.

Developers of the radiometry package frequently need to get an overview of the structure of the dataset and its different subclasses. Whereas the API documentation of each class provides a lot of information, a simple and accessible presentation of the dataset structure is often what is needed.

Therefore, the structure of each of the dataset classes is provided below in YAML format, automatically generated from the actual source code.

eve dataset

Entity containing both, numerical data as well as the corresponding metadata that represent all possible contents of an eve HDF5 file. For implementation details, see the API documentation of radiometry.dataset.EveDataset and radiometry.metadata.EveDatasetMetadata.

Important

While for the time being, the data model described here aims at representing all information currently contained in eve HDF5 files, it is and remains an abstraction from the file format and aims at a generalised description of radiometry data obtained with synchrotron radiation. Furthermore, the idea behind this data model is to contain all relevant metadata for reproducibility and traceability. Hence, it will by far exceed the fields (currently) contained in an eveH5 file.

data:
  calculated: false
  data:
    type: numpy.ndarray
    dtype: float64
    array: []
  axes:
  - quantity: ''
    symbol: ''
    unit: ''
    label: ''
    values:
      type: numpy.ndarray
      dtype: float64
      array: []
    index: []
  - quantity: ''
    symbol: ''
    unit: ''
    label: ''
    values:
      type: numpy.ndarray
      dtype: float64
      array: []
    index: []
_origdata:
  calculated: false
  data:
    type: numpy.ndarray
    dtype: float64
    array: []
  axes:
  - quantity: ''
    symbol: ''
    unit: ''
    label: ''
    values:
      type: numpy.ndarray
      dtype: float64
      array: []
    index: []
  - quantity: ''
    symbol: ''
    unit: ''
    label: ''
    values:
      type: numpy.ndarray
      dtype: float64
      array: []
    index: []
device_data: {}
metadata:
  measurement:
    start: null
    end: null
    purpose: ''
    operator: ''
    labbook_entry: ''
  sample:
    name: ''
    id: null
    loi: ''
  temperature_control:
    temperature:
      unit: ''
      dimension: ''
      name: ''
      value: 0.0
    controller: ''
  experiment:
    comment: ''
    procedure:
      filename: ''
      description: ''
  setup:
    name: ''
    software:
      engine_version: ''
      xml_schema_version: ''
history: []
_history_pointer: -1
analyses: []
annotations: []
representations: []
id: ''
label: ''
references: []
tasks: []
_package_name: radiometry

DeviceData

Device data are a subclass of aspecd.dataset.Data containing additional metadata. Device data are stored as a dictionary in the dataset structure, where the keys refer to the “name” of the respective device, and the values are instances of the radiometry.dataset.DeviceData class. Therefore, the structure of the radiometry.dataset.DeviceData class is not visible in the dataset structure shown above, but documented below:

calculated: false
metadata:
  name: ''
  type: ''
  xmlid: ''
  access: ''
  label: ''
data:
  type: numpy.ndarray
  dtype: float64
  array: []
axes:
- quantity: ''
  symbol: ''
  unit: ''
  label: ''
  values:
    type: numpy.ndarray
    dtype: float64
    array: []
  index: []
- quantity: ''
  symbol: ''
  unit: ''
  label: ''
  values:
    type: numpy.ndarray
    dtype: float64
    array: []
  index: []