期刊名称:International Journal of Statistics and Probability
印刷版ISSN:1927-7032
电子版ISSN:1927-7040
出版年度:2013
卷号:2
期号:4
页码:1
DOI:10.5539/ijsp.v2n4p1
出版社:Canadian Center of Science and Education
摘要:Generalized linear mixed models (GLMM) are commonly used to analyze hierarchical data. Unlike linear mixed models, they do not automatically provide parametric marginal regression functions, while such functions are needed for population-averaged inferences. This issue has received considerable attention and here three approaches to address it are reviewed, expanded, and compared: (1) the closed-form expressions of the marginal moments and distributions for a variety of GLMMs, derived by Molenberghs et al. (2010), as well as an extension that accommodates overdispersion; (2) the marginalized multilevel models of Heagerty (1999); (3) the bridge distribution of Wang and Louis (2003), a form for the random-effects distribution that allows the conditional and hierarchical mean to be described by the same link function. Our derivations are for the identity link function, the log link, and a collection of links for binary data. We highlight a number of useful connections: (a) it is shown that the bridge distribution for data with a mean on the unit interval is unique; (b) the three approaches are different for unit-interval data with the logit link, but are connected for the probit link; for the latter, there exist closed forms; (c) further results are derived for the bridge distribution in the case of unit-interval data and a Student's $t$ link; (d) in contrast to the unit-interval case, it is shown how large classes of distributions act as bridge distributions when an identity or a logarithmic link is adopted; (e) for these links, the three approaches are either identical or closely connected; (f) it is underscored for a random-intercepts model and logarithmic link, that the data contain no information about the particular distribution for the random intercept, given that the same fit to the data can be ascribed to an entire class of random-intercept distribution; (g) the implications of the difference between the unit-interval case on the one hand and the identity and logarithmic cases on the other, regarding sensitivity to model assumptions, are discussed.