Some Bayes methods for biclustering and vector data with binary coordinates

dc.contributor.advisor Stephen B. Vardeman
dc.contributor.author Chakraborty, Abhishek
dc.contributor.department Statistics (LAS)
dc.date 2019-11-04T21:45:23.000
dc.date.accessioned 2020-06-30T03:18:18Z
dc.date.available 2020-06-30T03:18:18Z
dc.date.copyright Thu Aug 01 00:00:00 UTC 2019
dc.date.embargo 2001-01-01
dc.date.issued 2019-01-01
dc.description.abstract <p>We consider Bayes methods for two problems that share a common need to partition index sets encoding commonalities between observations. The first is a biclustering problem. The second is inference for mixture models for $p$-vectors with binary coordinates.</p> <p>Standard one-way clustering methods form homogeneous groups in a set of objects. Biclustering methods simultaneously cluster rows and columns of a rectangular dataset in such a way that responses are homogeneous for all row-cluster by column-cluster groups. Assuming that data entries follow a normal distribution with a bicluster-specific mean term and a common variance, we propose a Bayes methodology for biclustering and corresponding Markov Chain Monte Carlo (MCMC) algorithms. Our proposed method not only identifies homogeneous biclusters, but also generates plausible predictions for missing/unobserved entries in the potential rectangular dataset as illustrated through simulation studies and applications to real datasets.</p> <p>In the second problem, we propose a tractable symmetric distribution for modeling multivariate vectors of 0's and 1's on $p$ dimensions that allows for nontrivial amounts of variation around some central value. We then consider Bayesian analysis of mixture models where the component distributions have this above form. Inferences are made from the posterior samples generated by MCMC algorithms. We also extend our proposed Bayesian mixture model analysis to datasets with missing entries. Model performance is illustrated through simulation studies and applications to real datasets.</p>
dc.format.mimetype application/pdf
dc.identifier archive/lib.dr.iastate.edu/etd/17420/
dc.identifier.articleid 8427
dc.identifier.contextkey 15681376
dc.identifier.s3bucket isulib-bepress-aws-west
dc.identifier.submissionpath etd/17420
dc.identifier.uri https://dr.lib.iastate.edu/handle/20.500.12876/31603
dc.language.iso en
dc.source.bitstream archive/lib.dr.iastate.edu/etd/17420/Chakraborty_iastate_0097E_18226.pdf|||Fri Jan 14 21:22:49 UTC 2022
dc.subject.disciplines Statistics and Probability
dc.subject.keywords Bayes
dc.subject.keywords Biclustering
dc.subject.keywords Dirichlet process
dc.subject.keywords MCMC
dc.subject.keywords Mixture models
dc.title Some Bayes methods for biclustering and vector data with binary coordinates
dc.type dissertation
dc.type.genre dissertation
dspace.entity.type Publication
relation.isOrgUnitOfPublication 264904d9-9e66-4169-8e11-034e537ddbca
thesis.degree.discipline Statistics
thesis.degree.level dissertation
thesis.degree.name Doctor of Philosophy
File
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Chakraborty_iastate_0097E_18226.pdf
Size:
10.74 MB
Format:
Adobe Portable Document Format
Description: