This section collects the atmospheric datasets on which the Complete Data Fusion (CDF) algorithm has been applied and verified during the EMM project. Each “tested dataset” is a retrieval product (instrument + provider + processing chain) for which we have checked whether the file content and format comply with the requirements of the CDF algorithm, and we have run the verification tests described in the dedicated sections of this website.

For each dataset we report the FAIR references (persistent identifiers, access URLs, documentation, citation), the general characteristics of the product, the results of the completeness and auto-consistency tests, and the mapping between the quantities required by the CDF algorithm and the actual variable names inside the data files.

These are data quality tests. The auto-consistency tests documented here are not merely a verification of algorithmic compatibility. A dataset that passes the CDF auto-consistency test demonstrates that the error covariance matrices and averaging kernels stored in the product file are internally consistent and mathematically well-formed — in other words, that the characterisation of the retrieval uncertainty is reliable. A failure indicates that the uncertainty information in the file is inconsistent or incomplete, which would compromise any data fusion application regardless of the algorithm used. The test results should therefore be interpreted as indicators of data quality, not just as a check of CDF readiness.

How to read this index. The Master overview table lists all five datasets; for IASI/EUMETSAT the six constituents are shown as separate rows. Separate columns give CDF(2022) and CDF(2015) auto-consistency test outcomes. The Constituent × Instrument cross-reference table shows which datasets cover each atmospheric constituent. The Test summary boxes collect the overall results at a glance.

Master overview

Dataset Provider Instrument / Platform Constituents Format CDF(2022) CDF(2015)
GOME-2 / AC-SAF AC SAF (EUMETSAT) GOME-2 / Metop-A,B,C O3 HDF5 ✓ passes ✓ passes
IASI / EUMETSAT EUMETSAT IASI / Metop-A,B,C T (Temperature) netCDF / EPS binary ✓ passes
GS basis expansion required
✓ passes
GS basis expansion required
H2O (Water vapour) ✓ passes
GS basis expansion required
✓ passes
GS basis expansion required; profile space: 10.88%
O3 (EPS netCDF) ~ partial
PC space confirmed; profile space partial
~ partial
PC space confirmed; profile space: 6.44%
O3 (FORLI native) ✓ passes ~ partial
max |Δx| = 3.71%
CO ✓ passes ~ partial
max |Δx| = 0.89%
HNO3 ✓ passes ~ partial
max |Δx| = 3.29%
IASI / AERIS AERIS (IASI Portal) IASI / Metop-A,B,C O3 (FORLI) netCDF ✓ passes
legacy & CDR formats
✗ fails
legacy: max |Δx| = 3523%; CDR: max |Δx| = 243%
MIPAS / IFAC IFAC-CNR MIPAS / Envisat O3 netCDF ✓ passes ✓ passes
OMPS / NASA NASA (Earthdata) OMPS / Suomi-NPP O3 HDF5 ~ partial
passes Variant A (AK from file);
fails Variant B (AK from Jacobian)
✗ fails
both variants (Var. A: max |Δx| = 209%)

For T and H2O, the covariance \mathbf{S}_a is rank-deficient in profile space (PC-based retrieval). A Gram–Schmidt orthonormal basis expansion is required to make the inversion well-defined. The expansion procedure differs between the two constituents (see the dedicated page for details).

Legend — auto-consistency test outcome: ✓ passes = max(|Δx|) and |ΔDOF| both negligible (errors indicate no issues with the uncertainty characterisation) · ~ partial = results not fully satisfactory: errors not negligible in some configurations, test confirmed only in a subset of spaces (e.g. PC space only), or uncertainty about the correct file reading · ✗ fails = errors clearly not negligible, indicating problems with the uncertainty characterisation in the product file.

Constituent × Instrument cross-reference

Find which datasets cover a given atmospheric constituent. Click the column header to open the corresponding page. A indicates that CDF(2022) passes for this constituent; ~ indicates results are not fully satisfactory (partial, uncertain, or PC space only); indicates the constituent is not provided by this dataset.

Constituent GOME-2
AC-SAF
IASI
EUMETSAT
IASI
AERIS
MIPAS
IFAC
OMPS
NASA
O3 (profile / total column) ~
O3 (EPS, PC-based) ~
T (Temperature)
H2O (Water vapour)
CO
HNO3

Legend: CDF(2022) passes · ~ CDF(2022) passes for some variants only · constituent not provided by this dataset.

Test summary

2 / 5
Fully compatible
GOME-2 and MIPAS: both CDF(2022) and CDF(2015) pass without reservations
CDF(2022)
Passes for most products
All IASI/EUMETSAT constituents pass (GS expansion required for T and H2O); IASI/AERIS passes; OMPS passes Variant A only
Partial / uncertain
Not fully satisfactory
IASI/EUMETSAT O3-EPS: test confirmed only in PC space. OMPS: CDF(2022) passes Variant A, but correct file reading is uncertain
CDF(2015)
Fails for 2 datasets
IASI/AERIS: legacy format 3523%, CDR 243%. OMPS: both variants fail (Var. A: 209%, Var. B: 13%)

Related pages

  • CDF Prerequisites — the six requirements that input products must satisfy for CDF application
  • CDF Tests — description of the completeness and auto-consistency tests applied to each dataset
  • CDF Algorithm — overview of the Complete Data Fusion method
  • Datasets catalogue — the general project catalogue of available datasets