Metadata

The metadata for each station are defined in a YAML file. The metadata YAML file expects a standardized set of keys.

There are 7 metadata keys for which is mandatory to specify the value :

  • the data_source must be the same as the data_source where the metadata are located.

  • the campaign_name must be the same as the campaign_name where the metadata are located.

  • the station_name must be the same as the name of the metadata YAML file without the .yml extension.

  • the sensor_name must be one of the implemented sensor configurations. See disdrodb.available_sensor_name(). If the sensor which produced your data is not within the available sensors, you first need to add the sensor configurations. For this task, read the section Add new sensor configs TODO ADD LINK.

  • the raw_data_format must be either ‘txt’ or ‘netcdf’. ‘txt’ if the source data are text/ASCII files. ‘netcdf’ if source data are netCDFs.

  • the platform_type must be either ‘fixed’ or ‘mobile’. If ‘mobile’, the DISDRODB L0 processing accepts latitude/longitude/altitude coordinates to vary with time.

  • the reader name is essential to enable to select the correct reader when processing the station.

Note

The reader key value must be defined with the following pattern: <READER_DATA_SOURCE>/<READER_NAME>. * <READER_DATA_SOURCE> is the parent directory within the disdrodb software where the reader is defined. Typically it coincides with the <DATA_SOURCE> of the DISDRODB archive. * <READER_NAME> is the name of the python file where the reader is defined. * For example, to use the GPM IFLOODS reader (defined at disdrodb.l0.reader.GPM.IFLOODS.py) to process the data, you specify the reader name GPM/IFLOODS.

To check the validity of the metadata YAML files, run the following code:

from disdrodb.l0 import check_archive_metadata_compliance, check_archive_metadata_geolocation

disdrodb_dir = "<...>/DISDRODB"
check_archive_metadata_compliance(disdrodb_dir)
check_archive_metadata_geolocation(disdrodb_dir)

The list of the standard metadata keys and their description is provided here below:

Mandatory keys

Keys

Description

data_source

Station data source.

campaign_name

Station campaign name.

station_name

Name of the stations (and the metadata file).

sensor_name

Sensor name. It defines the processing chain in DISDRODB.

raw_data_format

File format of the raw data. Either ‘txt’ or ‘netcdf’.

platform_type

Type of station. Either ‘fixed’ or ‘mobile’.

Station description

Keys

Description

title

Station dataset title

description

Station dataset description

project_name

Full project/campaign name of the station

keywords

Keywords related to the station and the campaign

summary

Summary information of the station

comment

Comment on the station measurements

history

History of the raw data file

station_id

ID of the station

location

Village, town or region where the disdrometer is located

country

Country where the disdrometer is located

continent

Continent where the disdrometer is located

Deployment info

Keys

Description

latitude

WGS84 latitude in degree north [-90,90]. If the disdrometer is moving, specify -9999

longitude

WGS84 longitude in degree east [-180,180]. If the disdrometer is moving, specify -9999

altitude

Elevation above the sea level in meters. If the disdrometer is moving, specify -9999

deployment_status

Deployment status. Possible values: ‘terminated’ or ‘ongoing’

deployment_mode

Deployment mode. Possible values: ‘land’, ‘ship’, ‘truck’, ‘cable’

platform_type

Platform type. Possible values: ‘ fixed’ or ‘mobile’

platform_protection

Platform protection. Possible values: ‘N/A’, ‘shielded’, ‘unshielded’

platform_orientation

Platform orientation in 0-360 degrees from the North direction (clockwise)

Sensor Info

Keys

Description

sensor_long_name

Sensor long name

sensor_manufacturer

Sensor manufacturer. Examples: Thies Clima, OTT Hydromet, Vaisala, Campbell, …

sensor_wavelength

Sensor wavelength

sensor_serial_number

Sensor serial number

firmware_iop

TO BE DEFINED [Available for OTT Parsivels]

firmware_dsp

Digital signal processor firmware [Available for OTT Parsivels]

firmware_version

Firmware version TO BE DEFINED

sensor_beam_length

TO BE DEFINED

sensor_beam_width

TO BE DEFINED

sensor_nominal_width

TO BE DEFINED

measurement_interval

Number of seconds over which measurements are taken

calibration_sensitivity

TO BE DEFINED

calibration_certification_date

Sensor calibration date(s)

calibration_certification_url

Sensor calibration certification url

Source information

Keys

Description

source

Source information

source_convention

Raw data file convention (i.e. ARM v1.XXX, NASA v1.XX, …)

source_processing_date

Date of source raw data file creation

Data Attribution

Keys

Description

contributors

People contributing to the disdrometer dataset

authors

People responsible and to to be contacted for questions

authors_url

Web url to contact the authors

contact

People to contact to request further information

contact_information

Email address of the contact people

acknowledgements

Ackwnowledgements

references

Literature references describing the usage of the sensor

documentation

Further documentation describing the sensor/campaign/network

website

Website reporting sensor information

institution

Institution providing funding or operating the sensor

source_repository

Repository where the original raw file can be retrieved

license

Data license

doi

Digital Object Identifier of the sensor/campaign/network dataset