Cell-scale atmospheric moisture flows dataset reconciled with ERA5 reanalysis

Atmospheric moisture flows from high-resolution tracking data are reconciled within the atmospheric hydrological balance on the annual basis through the Iterative Proportional Fitting (IPF) procedure, paving the way for future applications across multiple tracking models and forcing data.

Published in Earth & Environment

Cell-scale atmospheric moisture flows dataset reconciled with ERA5 reanalysis
Like

Share this post

Choose a social network to share with, or copy the URL to share elsewhere

This is a representation of how your post may appear on social media. The actual post will vary between social networks

Explore the Research

Zenodo
Zenodo Zenodo

RECON - Cell-scale atmospheric moisture flows dataset reconciled with ERA5 reanalysis

The RECON dataset provides moisture flow volumes, in cubic meters, from evaporation sources to precipitation targets and vice versa. It offers global coverage at a resolution of 0.5° for an average year based on the period 2008–2017. It is a post-processed version of the Lagrangian (forward trajectory-based) tracking model UTrack dataset (DOI UTrack dataset: 10.1594/PANGAEA.912710, DOI Utrack support paper: 10.5194/essd-12-3177-2020), by means of Iterative Proportional Fitting procedure and ERA5 preprocessing.Data are stored in integers that need to be transformed into cubic meters, as explained in the supplement material pdf file. More information on the generation of the dataset, authors of the dataset, input variable information and data extraction with simple python scripts are provided in our official GitHub repository. We provide the following files: RECON_moisture_flows_0.5.nc.7z: is a compressed/packed version of the NetCDF4 RECON dataset. To get the NetCDF dataset file, follow the instructions in the supplement material pdf file ERA5_m_0.5_volumes_corrected.nc where m is the month: our own edited version of monthly averaged ERA5 data, that have been used to retrieve moisture volume flows from the Utrack dataset, as explained in our official GitHub repository RECON_ERA5_avgYear_0.5_volumes.nc: our own edited version of yearly averaged ERA5 data, that have been used to postprocess moisture volume flows retrieved from the UTrack dataset in order to get the RECON dataset, as explained in our official GitHub repository By sharing these ERA5 files, we provide means for reproducibility of our postprocessing framework.

 A key part of the global hydrological cycle is comprised of the moisture flows through the atmosphere, which connect locations where the moisture evaporates (upwind locations) with the locations where it subsequently precipitates (downwind locations). 

Moisture recycling, the phenomenon for which the moisture that origins from land vegetation re-preciptates over land, connects land- and atmospheric conditions up to thousands of kilometres away. Globally, moisture recycling is so important for global precipitation patterns that roughly half of all terrestrial precipitation has come from evapotranspiration from land, the other half being from the ocean.

Today, high attention is posed on the effects of land use and vegetation changes on moisture recycling. Indeed, land-use changes that affect evapotranspiration flows, such as deforestation, can affect precipitation regimes, the severity of droughts and hydrological flows in downwind regions.

Therefore, reconstructing evapotranspiration-to-precipitation connections has involved a broad range of researchers who has developed the so-called atmospheric moisture tracking models to  develop these vapour flows reconstructions.

These models typically use atmospheric reanalysis data to simulate the atmospheric branch of the hydrological cycle. However, despite the large interest, it is often not feasible for broad researchers to perform these simulations. As with all methods, becoming familiar with them requires significant time investment, but an additional constraint on widespread use of atmospheric moisture tracking is its heavy data demand. This data demand has increased considerably with the largest generation of atmospheric reanalysis data, ERA5, which allows for obtaining detailed global moisture flows. Models steadily advanced as cutting-edge data becomes increasingly available. This is the case with the UTrack Lagrangian model, which takes advantage of state-of-art climate reanalysis data and, by testing several combinations of model assumptions, optimally generates highly detailed evaporation footprints while avoiding unnecessary complexity. Despite these cutting-edge model advancements and the wide applications already achieved, less attention has been given to ensuring the consistency of tracked moisture volumes with reanalysis data of precipitation and evaporation simultaneously to ensure the closure of the annual hydrological cycle. Model error and assumptions, as well as possible discrepancies in ERA5 data, may lead to inconsistencies that could impede internally consistent descriptions of the global hydrological cycle.  Indeed, uncertainty related to a set of modelling assumptions and data resolution still poses an issue for the moisture tracking community.

To address this gap, our study -openly accessible at Scientific Data (https://doi.org/10.1038/s41597-025-04964-3)- proposes a reconciliation framework based on the Iterative Proportional Fitting (IPF) procedure, a rigorous mathematical framework for refining tracking model outputs, thereby reducing uncertainties arising from modeling assumptions and data resolution constraints.  The study further includes a pre-processing of the ERA5 reanalysis data to address the existing annual unbalance between ERA5 precipitation and evaporation. Overall, the entire reconciliation framework ensures that the total tracked atmospheric moisture matches the total precipitation at the sink and the total evaporation at the source on an annual basis and in each cell.

The outcome is a new dataset of moisture flow volumes from sources of evaporation to fates of precipitation at 0.5°, named RECON, with global coverage and centred over 2008-2017, which aligns coherently with annual precipitation and evaporation volumes from ERA5 reanalysis. The reconciled dataset is available at https://zenodo.org/records/14191920.

The reconciled cell-grid dataset provided within this study offers post-processed atmospheric moisture portions of evaporation at the source precipitating at the sink which closes the atmospheric hydrological balance on the annual basis. This marks a significant advancement in enhancing the reliability of the UTrack dataset and paves the way for future applications across multiple tracking models and forcing data.

A first application of the reconciliation approach is  a country-ocean and subcontinental analysis of transboundary atmospheric  water flows which reveals that 45% of total terrestrial precipitation  originates from land evaporation. The country-ocean and subcontinental datasets are available at 10.5281/zenodo.10400695 and published in Communications Earth & Environment (https://doi.org/10.1038/s43247-025-02289-y).

Please sign in or register for FREE

If you are a registered user on Research Communities by Springer Nature, please sign in

Follow the Topic

Earth Sciences
Physical Sciences > Earth and Environmental Sciences > Earth Sciences
Mathematical Applications in Environmental Science
Physical Sciences > Earth and Environmental Sciences > Environmental Sciences > Mathematical Applications in Environmental Science
Atmospheric Science
Physical Sciences > Earth and Environmental Sciences > Earth Sciences > Atmospheric Science
Water
Physical Sciences > Earth and Environmental Sciences > Environmental Sciences > Water

Related Collections

With collections, you can get published faster and increase your visibility.

Clinical informatics

This Scientific Data Collection presents descriptions of a series of datasets for use in clinical informatics fields. Datasets in clinical informatics are vital for improving healthcare quality, efficiency, and patient outcomes.

Publishing Model: Open Access

Deadline: Sep 19, 2025

Text and speech corpora for natural language processing and corpus linguistics

This Collection presents a series of annotated text and speech corpora alongside linguistic models tailored for CL and NLP applications. These resources aim to enrich the arsenals of CL and NLP users and facilitate interdisciplinary research.

Publishing Model: Open Access

Deadline: Jul 24, 2025