SCDNA: a serially complete precipitation and temperature dataset in North America from 1979 to 2018
Station-based serially complete datasets (SCDs) of precipitation and temperature observations are important for hydrometeorological studies. We developed a SCD for North America (SCDNA) of precipitation, minimum temperature, and maximum temperature from 1979 to 2018. Raw meteorological station data were obtained from the Global Historical Climate Network Daily (GHCN-D), the Global Surface Summary of the Day (GSOD), Environment and Climate Change Canada (ECCC), and a compiled station database in Mexico (Livneh et al. 2015).
There are three types of missing values that are infilled/reconstructed by this dataset:
1. Missing value during the observation period when the station still works.
2. Missing value beyond the observation period (reconstruction period) before the station is deployed or after the station ceases working.
3. Station measurements that fail quality control checks are treated as missing values and imputed.
This dataset is useful for various purposes of applications that require:
1. Quality-controlled actual station observations from multiple datasets in North America;
2. Station observations without missing values in the observation period;
3. Serially complete station observations. Users should be cautious when using this dataset for trend analysis because it is possible that trends are not well reconstructed.
Three types of dataset files are provided:
“SCDNA_v1.1.nc4”. This NetCDF file contains basic information (ID, location, elevation) and the final variables of stations. For each variable (precipitation, minimum temperature, and maximum temperature), this file provides the serially complete data, the estimation flag indicating whether a value is from observation or estimation, and accuracy index (KGE) of estimated data.
“SCD_complete_part1.zip” to “SCD_complete_part10.zip”. These ten compressed files contain complete data for the production of the SCD, including quality flags, estimates from 16 strategies (quantile mapping, interpolation, machine learning, and multiple-strategy merging), corrected/uncorrected SCD estimates, accuracy indices, etc.
"overlap_station.zip". This file contains the list and data of stations that have the same latitude and longitude records due to various reasons, such as the same stations from different sources, naming rules, recording bias, etc, network design, etc.
We recommend that users download "SCDNA_v1.1.nc4" for quick and direct application, and adopt the second type for in-depth investigation of different strategies and potential methodology improvement. Please refer to Readme.txt for more details.
The codes used to produce this dataset are available on GitHub (
https://github.com/tgq14/GapFill).
This dataset was created as part of the spatial meteorological forcing data (SMFD) theme of the GWF Core Modelling and Forecasting Team.
Tang G., Clark M. P., Newman A. J., Wood A. W., Papalexiou S. M., Vionnet V., Whitfield P. H. (2020). SCDNA: a serially complete precipitation and temperature dataset in North America from 1979 to 2018 (Version 1.1) [Dataset]. Zenodo.
http://doi.org/10.5281/zenodo.3735533Tang, G., Clark, M. P., Newman, A. J., Wood, A. W., Papalexiou, S. M., Vionnet, V., and Whitfield, P. H. (2020). SCDNA: a serially complete precipitation and temperature dataset for North America from 1979 to 2018. Earth Syst. Sci. Data, 12, 2381–2409,
https://doi.org/10.5194/essd-12-2381-2020.