You are here: Home iREAD CARMEN data conversion

CARMEN data conversion

Return to the main index.

Recommendations and Notes

The following recommendations are made:

  1. Experimental data in the CARMEN system will only be stored within an NDF format so all services and tools which cannot use this format will need to use a converter or wrapper / un-wrapper (depending on the nature of the data) to access and store data in the NDF format.
  2. On upload of experimental data the converters will convert it to one (or more) of the five NDF native formats - assuming data converters are available.  Data in its original format will also be wrapped as external data within the same NDF instance to maintain interoperability with third party / legacy tools and ensure that no original information is lost.
  3. It is recommended that newly developed CARMEN tools / services will consume and produce data in an NDF format – however, the writer of tools or service will be also free to use bespoke formats, bearing in mind that if this is a permanent situation than a converter to / from NDF will be required for every bespoke format to ensure interoperability with other services.  (If the tool or service use experimatal data then it will have to consume data in an NDF format or use an existing converter.)
  4. The CARMEN system will be able to use newly developed CARMEN tools and services and third party / legacy tools and services written in R, Java, Matlab, C, C++, modelling languages.  This list may be extended.

 Notes:

  1. It is possible that the conversion "services" may also be available as libraries to for use in the service frameworks.
  2. As mentioned prviously the NDF is a container of several data formats so it will be possible for situations to occur where a tool or service (which can read input data from a particular NDF content slot) cannot use an NDF instance because the instance does not contain data in the appropriate slot.  For example, a tool which can read and use time series data can only use NDF instances containing time series data - clearly not all NDF instances will contain time series data.

iRODS aspects- conclusions

For upload and download activities the use of iRODS rules and micro-services offers significant advantages since conversion and wrapping activities could be performed automatically.  There may / may not be issues if there is required to be user input on conversion / wrapping and converter / un-wrapping.  This requires further investigation.

 

For service and data interaction scenarios it is likely that the use of a work flow engine to invoke data conversion, wrapping, etc. is preferable to using iRODS rules and micro-services since it will be more flexible.

 

Document Actions

Please refer to the legal disclaimer covering content on this site.