A joint project of the Graduate School, Peabody College, and the Jean & Alexander Heard Library

Title page for ETD etd-03152019-155008


Type of Document Master's Thesis
Author Stanley, Brooklyn
Author's Email Address brooklynostanley@gmail.com
URN etd-03152019-155008
Title Outcome Misclassification in Logistic Regression: Examining Hospitalization Risk and its Association with Health Literacy
Degree Master of Science
Department Biostatistics
Advisory Committee
Advisor Name Title
Jonathan Schildcrout Committee Chair
Robert Greevy Committee Member
Keywords
  • misclassification
  • hospitalization
  • logistic regression
Date of Defense 2019-03-14
Availability unrestricted
Abstract
In a cohort of patients seeking primary care, the Mid-South Coronary Heart Disease Cohort Study (MSCHDCS) sought to examine the association between health literacy and hospitalization risk in the year following enrollment. The hospital admission outcome data were originally collected from the Vanderbilt University Medical Center (VUMC); however, researchers eventually realized hospitalizations might be missed, and so they expanded hospital admissions to also include the surrounding Vanderbilt Health Affiliated Network (VHAN). By including admissions to non-VUMC hospitals, new hospitalizations were identified, and so by only using VUMC admissions data, many outcomes were misclassified. The goal of this research is to explore the potential impact outcome misclassification can have in settings similar to MSCHDCS where the hospitalizations might be missed due to inadequate outcome measurement. We explore the impact of non-differential and differential misclassification on naïve analyses and demonstrate the impact that misclassification can have on the results when it is dependent on the variables of interest, as in the MSCHDCS data. In the presence of suspected differential misclassification, we propose to collect validation data on a subset of patients and describe methods that can be used in conjunction with this data to obtain unbiased coefficient estimates. While older methods depend on prior knowledge of the sensitivity and specificity for the misclassification, the validation data removes this limitation and allows for adjustment with more flexible, non-parametric methods. With the highlighted adjustment methods, non-biased estimates were obtained in a simulation study as well as in the example data.
Files
  Filename       Size       Approximate Download Time (Hours:Minutes:Seconds) 
 
 28.8 Modem   56K Modem   ISDN (64 Kb)   ISDN (128 Kb)   Higher-speed Access 
  Stanley.pdf 383.38 Kb 00:01:46 00:00:54 00:00:47 00:00:23 00:00:02

Browse All Available ETDs by ( Author | Department )

If you have more questions or technical problems, please Contact LITS.