Saturday, April 5, 2014

Preventing Chronic Disease | Models for Count Data With an Application to Healthy Days Measures: Are You Driving in Screws With a Hammer? - CDC

Models for Count Data With an Application to Healthy Days Measures: Are You Driving in Screws With a Hammer?

Hong Zhou, MS, MPH; Paul Z. Siegel, MD, MPH; John Barile, PhD; Rashid S. Njai, PhD; William W. Thompson, PhD; Charlotte Kent, PhD; Youlian Liao, MD

Suggested citation for this article: Zhou H, Siegel PZ, Barile J, Njai RS, Thompson WW, Kent C, et al. Models for Count Data With an Application to Healthy Days Measures: Are You Driving in Screws With a Hammer? Prev Chronic Dis 2014;11:130252. DOI: http://dx.doi.org/10.5888/pcd11.130252.

PEER REVIEWED

Abstract

Introduction
Count data are often collected in chronic disease research, and sometimes these data have a skewed distribution. The number of unhealthy days reported in the Behavioral Risk Factor Surveillance System (BRFSS) is an example of such data: most respondents report zero days. Studies have either categorized the Healthy Days measure or used linear regression models. We used alternative regression models for these count data and examined the effect on statistical inference.
Methods
Using responses from participants aged 35 years or older from 12 states that included a homeownership question in their 2009 BRFSS, we compared 5 multivariate regression models — logistic, linear, Poisson, negative binomial, and zero-inflated negative binomial — with respect to 1) how well the modeled data fit the observed data and 2) how model selections affect inferences.
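For orientation, the three count models named above differ mainly in how they treat the variance and the zeros. The block below writes them out in a common textbook form (the NB2 parameterization). The symbols \mu, \alpha, \pi, \beta, \gamma, x, and z are notation assumed here for illustration, not taken from the article, and the authors' software may parameterize the models differently.

```latex
\begin{align*}
% Poisson: the variance is forced to equal the mean
P(Y = y) &= \frac{e^{-\mu}\mu^{y}}{y!},
  & \operatorname{Var}(Y) &= \mu \\[4pt]
% Negative binomial (NB2): dispersion \alpha > 0 lets the variance exceed the mean
P_{\mathrm{NB}}(Y = y) &= \frac{\Gamma(y + 1/\alpha)}{\Gamma(1/\alpha)\, y!}
  \left(\frac{1}{1 + \alpha\mu}\right)^{1/\alpha}
  \left(\frac{\alpha\mu}{1 + \alpha\mu}\right)^{y},
  & \operatorname{Var}(Y) &= \mu + \alpha\mu^{2} \\[4pt]
% Zero-inflated NB: with probability \pi the response is an "excess" zero
P(Y = 0) &= \pi + (1 - \pi)\left(\frac{1}{1 + \alpha\mu}\right)^{1/\alpha} \\
P(Y = y) &= (1 - \pi)\, P_{\mathrm{NB}}(Y = y), \qquad y = 1, 2, \dots \\[4pt]
% Two linked regressions: one for the count mean, one for the excess-zero odds
\log \mu &= x^{\top}\beta,
  & \operatorname{logit}(\pi) &= z^{\top}\gamma
\end{align*}
```

In this form the zero-inflated model carries two sets of coefficients: \gamma describes the odds of an excess zero, and \beta describes the expected count among the remaining respondents.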
Results
Most respondents (66.8%) reported zero mentally unhealthy days. The distribution was highly skewed (variance = 58.7, mean = 3.3 d). Zero-inflated negative binomial regression provided the best-fitting model, followed by negative binomial regression. A significant independent association between homeownership and number of mentally unhealthy days was not found in the logistic, linear, or Poisson regression model but was found in the negative binomial model. The zero-inflated negative binomial model showed that homeowners had 24% higher odds than nonhomeowners of having excess zero mentally unhealthy days (adjusted odds ratio, 1.24; 95% confidence interval, 1.08–1.43), but it did not show an association between homeownership and the number of unhealthy days.
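The summary figures in the paragraph above already hint at why a Poisson model fits poorly. The sketch below is a back-of-the-envelope check, not the authors' analysis: it uses only the unadjusted mean, variance, and zero share quoted above, ignores covariates and the 30-day ceiling of the survey item, and estimates the negative binomial dispersion by the method of moments. All function and variable names are hypothetical.

```python
import math

# Unadjusted summary figures quoted in the Results paragraph.
MEAN_DAYS = 3.3        # mean number of mentally unhealthy days
VARIANCE = 58.7        # variance of mentally unhealthy days
OBSERVED_ZERO = 0.668  # share of respondents reporting zero days


def dispersion_ratio(mean, variance):
    """Variance-to-mean ratio; a Poisson distribution has a ratio of 1."""
    return variance / mean


def poisson_zero_prob(mean):
    """P(Y = 0) for a Poisson distribution with the given mean."""
    return math.exp(-mean)


def nb_alpha_moments(mean, variance):
    """Method-of-moments dispersion, solving variance = mean + alpha * mean**2."""
    return (variance - mean) / mean ** 2


def nb_zero_prob(mean, alpha):
    """P(Y = 0) for a negative binomial (NB2) with the given mean and dispersion."""
    return (1.0 + alpha * mean) ** (-1.0 / alpha)


ratio = dispersion_ratio(MEAN_DAYS, VARIANCE)
alpha = nb_alpha_moments(MEAN_DAYS, VARIANCE)
p0_poisson = poisson_zero_prob(MEAN_DAYS)
p0_nb = nb_zero_prob(MEAN_DAYS, alpha)

print(f"variance / mean ratio    : {ratio:.1f}")
print(f"zeros expected, Poisson  : {p0_poisson:.1%}")
print(f"zeros expected, NB2      : {p0_nb:.1%}")
print(f"zeros observed           : {OBSERVED_ZERO:.1%}")
```

Under these assumptions the variance is roughly 18 times the mean, a Poisson distribution with the same mean would put only about 4% of respondents at zero, and a moment-matched negative binomial about 57%, still short of the observed 66.8%. That gap is consistent with the finding that the zero-inflated model fit best, although the article's ranking rests on fitted regression models rather than on this marginal calculation.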
Conclusion
Our comparison of regression models indicates the importance of examining data distribution and selecting models with appropriate assumptions. Otherwise, statistical inferences might be misleading.

Author Information

Corresponding Author: Hong Zhou, MS, MPH, Division of Health Informatics and Surveillance, Center for Surveillance, Epidemiology and Laboratory Services, Centers for Disease Control and Prevention, 1600 Clifton Rd NE, Mailstop E91, Atlanta, GA 30333. Telephone: 404-498-6293. E-mail: HZhou1@cdc.gov.
Author Affiliations: Paul Z. Siegel, Rashid S. Njai, Charlotte Kent, Youlian Liao, William W. Thompson, Centers for Disease Control and Prevention, Atlanta, Georgia; John Barile, University of Hawaii at Manoa, Manoa, Hawaii.

