Automatic relevance determination for Gaussian process regression with functional inputs

dc.contributor.advisor Niemi, Jarad
dc.contributor.advisor Caragea, Petruţa
dc.contributor.advisor Morris, Max D
dc.contributor.advisor Dutta, Somak
dc.contributor.advisor Qiu, Yumou
dc.contributor.author Damiano, Luis
dc.contributor.department Statistics en_US
dc.date.accessioned 2023-06-20T22:19:04Z
dc.date.available 2023-06-20T22:19:04Z
dc.date.issued 2023-05
dc.date.updated 2023-06-20T22:19:04Z
dc.description.abstract We introduce the novel automatic dynamic relevance determination (ADRD) framework for Gaussian process regression with functional inputs, an adaptation of automatic relevance determination (ARD) priors for vector inputs. In this framework, relevance varies smoothly over the input index space resulting in smooth and parsimonious relevance profiles learned from data whose posterior can be inspected for scientific interpretation and used in downstream analyses. An ADRD model requires us to specify a weight function form that is appropriate for a given application. We explore two strategies to design the weights, namely setting up a parametric form and generating them via a basis expansion. First, we introduce the asymmetric double and squared exponential weight functions for unimodal, smoothly decaying predictive relevance profiles. Second, we present a general form for the basis expansion of the weights and explore, specifically, the Fourier, B-spline, and adaptive spline expansions. We establish an equivalence between the ADRD and ARD weights and propose an adaptation to permutation feature importance. Both motivate different exploratory tools to elicit a weight function form from data. We also discuss a fully Bayesian estimation framework via MCMC, including a set of weakly informative priors for the model parameters, as well as statistics for model validation. In two simulation studies, we show that a well specified model is able to recover the true weight function. Moreover, we present two applications to scientific data generated by an atmospheric radiative transfer computer model and a soil erosion computer model. We show empirically that, compared to ARD, ADRD generates smoother weight patterns and produces information useful for scientific interpretation and downstream analyses with a drastic reduction in the number of model parameters without compromising on prediction accuracy.
dc.format.mimetype PDF
dc.identifier.orcid 0000-0001-9107-0706
dc.identifier.uri https://dr.lib.iastate.edu/handle/20.500.12876/azJ4x0Gv
dc.language.iso en
dc.language.rfc3066 en
dc.subject.disciplines Statistics en_US
dc.subject.keywords Automatic relevance determination en_US
dc.subject.keywords computer experiments en_US
dc.subject.keywords functional input en_US
dc.subject.keywords Gaussian process en_US
dc.subject.keywords metamodeling en_US
dc.subject.keywords surrogate en_US
dc.title Automatic relevance determination for Gaussian process regression with functional inputs
dc.type article en_US
dc.type.genre dissertation en_US
dspace.entity.type Publication
thesis.degree.discipline Statistics en_US
thesis.degree.grantor Iowa State University en_US
thesis.degree.level dissertation $
thesis.degree.name Doctor of Philosophy en_US
File
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Damiano_iastate_0097E_20802.pdf
Size:
2.56 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
0 B
Format:
Item-specific license agreed upon to submission
Description: