The goal is to replicate (and extend) some of the analysis results in the research question
that has been studied
by K.C. Madhava et al. (2017) in their paper Association between screen time and depression among US adults
which used the 2011-2012 National Health and Nutrition Examination survey (NHANES) sample data where it
concluded that there was an association between people who spent more than 4 hours per day on TV/computer
with experiencing moderate or severe depression. Various survey sampling techniques are implemented throughout
the analysis.
The most important (and difficult) part is to obtain the variance estimates for the interested
quantities (population proportions).
The study of this report is conducted using the sample from the 2011-2012 National Health and Nutrition Examination survey (NHANES), a large cross-sectional U.S. survey conducted by the Centers for Disease Control (CDC) and National Center for Health Statistics (NCHS). The sampling design consists of four stages which is quite complex and is summarized as follows:
It is noted that the estimates and their standard errors are significantly different between the weighted and unweighted analyses. This makes sense as the NHANES sampling design uses unequal probability selection where the sample weights are different to reflect the number of people in the population that they represent. In this case, using an unweighted analysis will be inappropriate as equal sample probability increases the bias of the estimates as well as decreases the precision of the estimates. This is particularly true for categories that involves a lot of oversampling.
Last updated on Nov 1, 2019