METACLIP Project overview

The objective of METAdata for CLImate Products (METACLIP) is to encode the needed metadata to ensure the traceability and reproducibility of any kind climate product (data files, plots, maps…), thus requiring a comprehensive framework to track the operations undertaken through often complex data workflows. According to the terminology used in the project, the main elements in a climate data workflow are:

  1. Data sources . The provenance of the input data need to be clearly identified (source, version, model documentation etc.). These applies to any type of data source (observations, operative/retrospective forecasts, reanalysis, climate projections…). Links to [doc][OWL file][Visual schema]
  2. Transformations. Any operations transforming the original data source that do not entail a second dataset (may entail a different subset of the same dataset though; e.g. temporal/spatial aggregation, ensemble means, spatial interpolation/regridding, calculation of climate indices and anomalies …). Links to [doc][OWL file][Visual schema]
  3. Calibration (bias correction, downscaling, other forms of statistical adjustment). Links to [doc][OWL file][Visual Schema]
  4. Validation/Verification (bias, RMSE, AUC …) Links to [doc][OWL file][Visual schema]
  5. Outcomes (map, QQ-plot, time series, plumes…) [In construction]

The main blocks of the climate data workflow (represented in the figure below) are represented by different classes and properties encoded in RDF ontologies stored in the different directories of this repo.


Example Figure. Schematic representation of a data workflow to generate a verification map (Area under the ROC Curve, based on tercile categories) of a seasonal forecasting system (ECMWF System-4) of mean JJA global temperature. The verifying reference is the ECMWF ERA-Interim reanalysis. All the necessary metadata for the reconstruction of the figure is encoded in RDF (Resource Description Framework) and embedded in the final outcome (in this case a jpeg file, but any other type may serve as well). See this demo for a graphical representation of the metadata schema associated with this figure.


Key strengths

The main advantages of The Climate Ontology approach are next summarised:

Linked International Projects and Initiatives

The Climate Ontology Project is aligned with currently on-going initiatives facing the problems of data provenance and metadata encoding of climate products: