Data

Training data

DC1 does not prescribe a fixed training dataset. Participants can use reanalyses, observations, model outputs, and their own preprocessing pipelines.

Evaluation references

The default DC1 configurations evaluate submissions against:

GLORYS (gridded reference fields)
SARAL (altimetry)
Jason-3 (altimetry)
SWOT (wide-swath altimetry)
Argo profiles (surface extraction of TEMP/PSAL)

Depending on the selected YAML profile and parameters, additional datasets may be available.

Evaluation period

Start: 2024-01-01
End: 2025-01-01 (profile-dependent during testing)

Surface-only rule

Even when source datasets are 3-D, DC1 scoring is performed on surface values only.

Where dataset definitions live

Dataset sources and metric assignments are defined in:

dc1/config/dc1_wasabi.yaml
dc1/config/dc1_edito.yaml

These files contain:

connection settings (public or credentialed endpoints)
per-dataset keep_variables and eval_variables
matching tolerances
metric lists (rmsd, mae, etc.)

Practical note

The local run workflow does not require manual download of all references. The evaluation pipeline fetches and caches required data as needed under the output directory.