HUN GW Uncertainty Analysis v01

Created 18/06/2018

Updated 28/06/2022

Abstract

The dataset was derived by the Bioregional Assessment Programme from multiple source datasets. The source datasets are identified in the Lineage field in this metadata statement. The processes undertaken to produce this derived dataset are described in the History field in this metadata statement.

This dataset contains all the scripts used to carry out the uncertainty analysis for the maximum drawdown and time to maximum drawdown at the groundwater receptors in the Hunter bioregion and all the resulting posterior predictions. This is described in product 2.6.2 Groundwater numerical modelling (Herron et al. 2016). See History for a detailed explanation of the dataset contents.

References:

Herron N, Crosbie R, Peeters L, Marvanek S, Ramage A and Wilkins A (2016) Groundwater numerical modelling for the Hunter subregion. Product 2.6.2 for the Hunter subregion from the Northern Sydney Basin Bioregional Assessment. Department of the Environment, Bureau of Meteorology, CSIRO and Geoscience Australia, Australia.

Dataset History

This dataset uses the results of the design of experiment runs of the groundwater model of the Hunter subregion to train emulators to (a) constrain the prior parameter ensembles into the posterior parameter ensembles and to (b) generate the predictive posterior ensembles of maximum drawdown and time to maximum drawdown. This is described in product 2.6.2 Groundwater numerical modelling (Herron et al. 2016).

A flow chart of the way the various files and scripts interact is provided in HUN_GW_UA_Flowchart.png (editable version in HUN_GW_UA_Flowchart.gliffy).

R-script HUN_DoE_Parameters.R creates the set of parameters for the design of experiment in HUN_DoE_Parameters.csv. Each of these parameter combinations is evaluated with the groundwater model (dataset HUN GW Model v01). Associated with this spreadsheet is file HUN_GW_Parameters.csv. For each parameter, this file specifies whether it is included in the sensitivity analysis, whether it is tied to another parameter, the initial value and range, the transformation, and the type of prior distribution with its mean and covariance structure.

The results of the design of experiment model runs are summarised in files HUN_GW_dmax_DoE_Predictions.csv, HUN_GW_tmax_DoE_Predictions.csv, HUN_GW_DoE_Observations.csv and HUN_GW_DoE_mean_BL_BF_hist.csv. These contain, respectively, the maximum additional drawdown for each receptor, the time to maximum additional drawdown for each receptor, the simulated equivalents to observed groundwater levels, and the simulated equivalents to observed SW-GW fluxes. The files are generated with post-processing scripts in dataset HUN GW Model v01 from the model output (as exemplified in dataset HUN GW Model simulate ua999 pawsey v01).

Spreadsheets HUN_GW_dmax_Predictions.csv and HUN_GW_tmax_Predictions.csv capture additional information on each prediction: the name of the prediction, the transformation, the min, max and median of the design of experiment, a boolean indicating whether the prediction is to be included in the uncertainty analysis, the layer it is assigned to, and which objective function to use to constrain the prediction.

Spreadsheet HUN_GW_Observations.csv has additional information on each observation: the name of the observation, a boolean indicating whether to use the observation, the min and max of the design of experiment, a metadata statement describing the observation, the spatial coordinates, the observed value and the number of observations at this location (from dataset HUN bores v01). It also gives the distance of each bore to the nearest blue line network and the distance to each prediction (both in km). Spreadsheet HUN_GW_mean_BL_BF_hist.csv has similar information, but on the SW-GW flux. The observed values are from dataset HUN Groundwater Flowrate Time Series v01.

These files are used in script HUN_GW_SI.py to generate sensitivity indices (based on the Plischke et al. (2013) method) for each group of observations and predictions. These indices are saved in spreadsheets HUN_GW_dmax_SI.csv, HUN_GW_tmax_SI.csv, HUN_GW_hobs_SI.py and HUN_GW_mean_BF_hist_SI.csv.

Script HUN_GW_dmax_ObjFun.py calculates the objective function values for the design of experiment runs. Each prediction has a tailored objective function, which is a weighted sum of the residuals between observations and their simulated equivalents, with weights based on the distance between observation and prediction. In addition, there is an objective function for the baseflow rates. The results are stored in HUN_GW_DoE_ObjFun.csv and HUN_GW_ObjFun.csv.
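A distance-weighted objective function of this kind can be sketched as follows. This is a minimal illustration, not the contents of HUN_GW_dmax_ObjFun.py: the exponential weighting, the length scale of 10 km, the use of absolute residuals and the function names are all assumptions made for the example.

```python
import numpy as np

def distance_weights(distances_km, length_scale_km=10.0):
    """Hypothetical weighting: observations nearer the prediction get more weight.

    The exponential decay and the 10 km length scale are assumptions for
    illustration only. Weights are normalised to sum to one.
    """
    d = np.asarray(distances_km, dtype=float)
    w = np.exp(-d / length_scale_km)
    return w / w.sum()

def objective_function(observed, simulated, distances_km, length_scale_km=10.0):
    """Weighted sum of absolute residuals between observed and simulated values."""
    residuals = np.abs(np.asarray(observed, dtype=float)
                       - np.asarray(simulated, dtype=float))
    weights = distance_weights(distances_km, length_scale_km)
    return float(np.sum(weights * residuals))

# Two observations: one 1 km and one 50 km from the prediction location.
value = objective_function(observed=[10.0, 20.0],
                           simulated=[11.0, 20.0],
                           distances_km=[1.0, 50.0])
```

With this weighting, the misfit at the nearby observation dominates the objective function value, which is the intent of tailoring the objective function to each prediction.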

The latter files are used in script HUN_GW_dmax_CreatePosteriorParameters.R to carry out the Monte Carlo sampling of the prior parameter distributions with the Approximate Bayesian Computation methodology described in Herron et al. (2016), by generating and applying emulators for each objective function. The script uses the scripts in dataset R-scripts for uncertainty analysis v01. It is run on the high performance computing cluster with batch file HUN_GW_dmax_CreatePosterior.slurm. The result is a set of posterior parameter combinations for each objective function, stored in directory PosteriorParameters with filename convention HUN_GW_dmax_Posterior_Parameters_OO_$OFName$.csv, where $OFName$ is the name of the objective function. Python script HUN_GW_PosteriorParameters_Percentiles.py summarises these posterior parameter combinations and stores the results in HUN_GW_PosteriorParameters_Percentiles.csv.
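The general idea of Approximate Bayesian Computation by rejection sampling with an emulator can be sketched in a few lines. The example below is a toy illustration under stated assumptions, not a port of the R scripts: the uniform prior, the two-parameter toy emulator, the threshold value and all function names are hypothetical, and it assumes that a lower objective function value means a better fit.

```python
import numpy as np

def abc_rejection(sample_prior, emulator, threshold, n_samples, rng):
    """Keep prior samples whose emulated objective function is within the threshold.

    Returns the accepted parameter combinations and the acceptance rate.
    """
    prior = sample_prior(n_samples, rng)   # shape (n_samples, n_params)
    objfun = emulator(prior)               # shape (n_samples,)
    accepted = objfun <= threshold         # assumption: lower value = better fit
    return prior[accepted], float(accepted.mean())

def sample_prior(n, rng):
    """Hypothetical prior: two independent uniform parameters."""
    return rng.uniform(low=[-2.0, 0.0], high=[2.0, 1.0], size=(n, 2))

def toy_emulator(params):
    """Stand-in for a trained emulator of the objective function (hypothetical)."""
    return np.abs(params[:, 0] - 0.5) + params[:, 1]

rng = np.random.default_rng(0)
posterior, acceptance_rate = abc_rejection(
    sample_prior, toy_emulator, threshold=0.5, n_samples=10_000, rng=rng
)
```

Because the emulator is cheap to evaluate, many more prior samples can be screened this way than could be run through the groundwater model itself.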

The same set of spreadsheets is used to test convergence of the emulator performance with script HUN_GW_emulator_convergence.R and batch file HUN_GW_emulator_convergence.slurm to produce spreadsheet HUN_GW_convergence_objfun_BF.csv.

The posterior parameter distributions are sampled with script HUN_GW_dmax_tmax_MCsampler.R and the associated .slurm batch file. The script creates and applies an emulator for each prediction. The emulators and results are stored in directory Emulators. This directory is not part of this dataset but can be regenerated by running the scripts on the high performance computing cluster. A single emulator and associated output is included for illustrative purposes.

Script HUN_GW_collate_predictions.csv collates all posterior predictive distributions in spreadsheets HUN_GW_dmax_PosteriorPredictions.csv and HUN_GW_tmax_PosteriorPredictions.csv. These files are further summarised in spreadsheet HUN_GW_dmax_tmax_excprob.csv with script HUN_GW_exc_prob. For each prediction, this spreadsheet contains the coordinates, the layer, the number of samples in the posterior parameter distribution, the 5th, 50th and 95th percentiles of dmax and tmax, the probability of exceeding 1 cm and 20 cm drawdown, the maximum dmax value from the design of experiment, the threshold of the objective function and the acceptance rate.
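The percentile and exceedance-probability summaries described above can be computed from a posterior ensemble as in the sketch below. It is an illustration only: the function name and dictionary keys are hypothetical, and it assumes that dmax is expressed in metres, so that 1 cm and 20 cm correspond to 0.01 and 0.20.

```python
import numpy as np

def summarise_prediction(dmax_m):
    """Summarise a posterior ensemble of maximum drawdown for one prediction.

    Assumes dmax values are in metres (an assumption for this example).
    """
    d = np.asarray(dmax_m, dtype=float)
    p05, p50, p95 = np.percentile(d, [5, 50, 95])
    return {
        "n_samples": int(d.size),
        "p05": float(p05),
        "p50": float(p50),
        "p95": float(p95),
        # Fraction of ensemble members exceeding each drawdown threshold.
        "prob_exceed_1cm": float(np.mean(d > 0.01)),
        "prob_exceed_20cm": float(np.mean(d > 0.20)),
    }

summary = summarise_prediction([0.0, 0.005, 0.05, 0.3])
```

The exceedance probability is simply the fraction of posterior samples above the threshold, so its resolution is limited by the number of accepted samples for that prediction.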

The script HUN_GW_dmax_tmax_MCsampler.R is also used to evaluate parameter distributions HUN_GW_dmax_Posterior_Parameters_HUN_OF_probe439.csv and HUN_GW_dmax_Posterior_Parameters_Mackie_OF_probe439.csv. These are two different parameter distributions for a single prediction, of which the latter represents local information. The corresponding dmax values are stored in HUN_GW_dmax_probe439_HUN.csv and HUN_GW_dmax_probe439_Mackie.csv.

Dataset Citation

Bioregional Assessment Programme (XXXX) HUN GW Uncertainty Analysis v01. Bioregional Assessment Derived Dataset. Viewed 13 March 2019, http://data.bioregionalassessments.gov.au/dataset/c25db039-5082-4dd6-bb9d-de7c37f6949a.

Dataset Ancestors

Derived From HUN GW Model code v01
Derived From Hydstra Groundwater Measurement Update - NSW Office of Water, Nov2013
Derived From Groundwater Economic Elements Hunter NSW 20150520 PersRem v02
Derived From NSW Office of Water - National Groundwater Information System 20140701
Derived From Travelling Stock Route Conservation Values
Derived From HUN GW Model v01
Derived From NSW Wetlands
Derived From Climate Change Corridors Coastal North East NSW
Derived From Communities of National Environmental Significance Database - RESTRICTED - Metadata only
Derived From Climate Change Corridors for Nandewar and New England Tablelands
Derived From National Groundwater Dependent Ecosystems (GDE) Atlas
Derived From Fauna Corridors for North East NSW
Derived From R-scripts for uncertainty analysis v01
Derived From Asset database for the Hunter subregion on 27 August 2015
Derived From Hunter CMA GDEs (DRAFT DPI pre-release)
Derived From Estuarine Macrophytes of Hunter Subregion NSW DPI Hunter 2004
Derived From Birds Australia - Important Bird Areas (IBA) 2009
Derived From Camerons Gorge Grassy White Box Endangered Ecological Community (EEC) 2008
Derived From Asset database for the Hunter subregion on 16 June 2015
Derived From Spatial Threatened Species and Communities (TESC) NSW 20131129
Derived From Gippsland Project boundary
Derived From Bioregional Assessment areas v04
Derived From Asset database for the Hunter subregion on 24 February 2016
Derived From Natural Resource Management (NRM) Regions 2010
Derived From Gosford Council Endangered Ecological Communities (Umina woodlands) EEC3906
Derived From NSW Office of Water Surface Water Offtakes - Hunter v1 24102013
Derived From National Groundwater Dependent Ecosystems (GDE) Atlas (including WA)
Derived From Bioregional Assessment areas v03
Derived From HUN groundwater flow rate time series v01
Derived From Asset list for Hunter - CURRENT
Derived From NSW Office of Water Surface Water Entitlements Locations v1_Oct2013
Derived From Species Profile and Threats Database (SPRAT) - Australia - Species of National Environmental Significance Database (BA subset - RESTRICTED - Metadata only)
Derived From HUN GW Model simulate ua999 pawsey v01
Derived From Northern Rivers CMA GDEs (DRAFT DPI pre-release)
Derived From GEODATA TOPO 250K Series 3, File Geodatabase format (.gdb)
Derived From Ramsar Wetlands of Australia
Derived From Native Vegetation Management (NVM) - Manage Benefits
Derived From GEODATA TOPO 250K Series 3
Derived From NSW Catchment Management Authority Boundaries 20130917
Derived From Geological Provinces - Full Extent
Derived From HUN bores v01
Derived From NSW Office of Water Surface Water Licences Processed for Hunter v1 20140516
Derived From GW Element Bores with Unknown FTYPE Hunter NSW Office of Water 20150514
Derived From Atlas of Living Australia NSW ALA Portal 20140613
Derived From Bioregional Assessment areas v02
Derived From Bioregional Assessment areas v05
Derived From HUN GW Model Mines raw data v01
Derived From National Heritage List Spatial Database (NHL) (v2.1)
Derived From Climate Change Corridors (Dry Habitat) for North East NSW
Derived From Groundwater Entitlement Hunter NSW Office of Water 20150324
Derived From Asset database for the Hunter subregion on 20 July 2015
Derived From Darling River Hardyhead Predicted Distribution in Hunter River Catchment NSW 2015
Derived From Asset database for the Hunter subregion on 22 September 2015
Derived From Commonwealth Heritage List Spatial Database (CHL)
Derived From Australia World Heritage Areas
Derived From Lower Hunter Spotted Gum Forest EEC 2010
Derived From New South Wales NSW Regional CMA Water Asset Information WAIT tool databases, RESTRICTED Includes ALL Reports
Derived From Threatened migratory shorebird habitat mapping DECCW May 2006
Derived From Surface Geology of Australia, 1:1 000 000 scale, 2012 edition
Derived From Bioregional Assessment areas v06
Derived From NSW Office of Water - GW licence extract linked to spatial locations for North and South Sydney v2 20140228
Derived From HUN AssetList Database v1p2 20150128
Derived From New South Wales NSW - Regional - CMA - Water Asset Information Tool - WAIT - databases
Derived From Climate Change Corridors (Moist Habitat) for North East NSW
Derived From Operating Mines OZMIN Geoscience Australia 20150201
Derived From NSW Office of Water - National Groundwater Information System 20141101v02
Derived From Victoria - Seamless Geology 2014
Derived From Groundwater Economic Assets Hunter NSW 20150331 PersRem
Derived From Australia - Species of National Environmental Significance Database
Derived From Monitoring Power Generation and Water Supply Bores Hunter NOW 20150514
Derived From Bioregional Assessment areas v01
Derived From NSW Office of Water GW licence extract linked to spatial locations for NorthandSouthSydney v3 13032014
Derived From Australia, Register of the National Estate (RNE) - Spatial Database (RNESDB) Internal
Derived From Asset database for the Hunter subregion on 12 February 2015
Derived From NSW Office of Water Groundwater Entitlements Spatial Locations
Derived From NSW Office of Water Groundwater Licence Extract, North and South Sydney - Oct 2013
Derived From HUN Alluvium (1:1m Geology)
Derived From Directory of Important Wetlands in Australia (DIWA) Spatial Database (Public)
Derived From Collaborative Australian Protected Areas Database (CAPAD) 2010 (Not current release)
Derived From NSW Office of Water combined geodatabase of regulated rivers and water sharing plan regions

Files and APIs

ZIP

HUN GW Uncertainty Analysis v01
Creative Commons Attribution 2.5 Australia

Additional Info

Field	Value
Title	HUN GW Uncertainty Analysis v01
Language	eng
Licence	Creative Commons Attribution 2.5 Australia
Landing Page	https://data.gov.au/data/dataset/3b9239f2-561b-47f4-b5f5-eb3bea4bdd47
Contact Point	Bioregional Assessment Program bioregionalassessments@environment.gov.au
Reference Period	18/06/2018
Geospatial Coverage	POLYGON ((0 0, 0 0, 0 0, 0 0))
Data Portal	Data.gov.au