Bayesian Influence Diagnostic Methods for Parametric Regression Models

Cho, Hyunsoon

Download PDF

Request Version for Screen Reader

Last Modified

October 21, 2019

Creator

Cho, Hyunsoon
- Affiliation: Gillings School of Global Public Health, Department of Biostatistics

Abstract

The goals of assessing the influence of individual observations in statistical analysis are not only to identify influential observations such as outliers and high leverage points, but also to determine the importance of each observation in the analysis for a better model fit. Thus, assessing the influence of individual observations on a model, choosing an appropriate dimensionality of a model and selecting the best model for a given dataset are very important and highly relevant problems in any formal statistical analysis. Recently, Bayesian methodologies have been getting enormous attention in biomedical research due to the potential advantages of fitting a vast array of complex models posed by modern data. As the demand for Bayesian data analysis and modeling increases, we need good diagnostic methods for model assessment and selection. In this dissertation, we develop Bayesian diagnostic measures based on case-deletion to assess the influence of each observation to model fit and model complexity. First, we propose Bayesian case influence diagnostics for complex survival models. In detail, we develop case deletion influence diagnostics for both the joint and marginal posterior distributions based on the Kullback-Leibler divergence. Second, we introduce three types of Bayesian case influence measures based on case deletion, namely the Φ-divergence, Cook's posterior mode distance and Cook's posterior mean distance to evaluate the effects of deleting a set of observations in general Bayesian parametric models. We also examine the statistical properties of these three Bayesian case influence measures and their applications to identification of influential sets and model complexity. In any deletion diagnostic, "size matters" issue persists and it is a fundamental issue of influence analysis, because the size of the deletion diagnostic is associated with the size of the perturbation. For Cook's distance, that is Cook's distance is a monotonic function of the size of perturbation. Thus, we develop a scaled version of Cook's distance to address the size issue for deletion diagnostics in general parametric models.

Date of publication

December 2009

DOI

https://doi.org/10.17615/9sde-5372

Resource type

Dissertation

Rights statement

In Copyright

Note

"... in partial fulfillment of the requirements for the degree of Doctor of Philosophy in the Department of Biostatistics, Gillings School of Global Public Health."

Advisor

Ibrahim, Joseph

Degree

Doctor of Philosophy

Degree granting institution

University of North Carolina at Chapel Hill

Graduation year

2009

Language

English

Publisher

University of North Carolina at Chapel Hill

Place of publication

Chapel Hill, NC

Access right

Open access

Date uploaded

March 18, 2013

Relations

Parents:

Items

Thumbnail	Title	Date Uploaded	Visibility	Actions
		2019-04-10	Public	Download

Bayesian Influence Diagnostic Methods for Parametric Regression Models

Downloadable Content

Relations

Items