Sometimes, Sample Proportions Are Continuous Rather Than Of The Binomial

Sometimes, sample proportions are continuous rather than of the binomial form (number of successes)/(number of trials). Each observation is any real number between 0 and 1, such as the proportion of a tooth surface that is covered with plaque. For independent responses {y_i}, Aitchison and Shen (1980) and Bartlett (1937) modeled logit (Y_i) ∼ N(β_i, σ²). Then Y_i itself is said to have a logistic-normal distribution.

a. Expressing a N(β, σ²) variate as β + σZ, where Z is standard normal, show that Y_i = exp(β_i + σZ)/[1 + exp(β_i + σZ)].

b. Show that for small σ,

c. Letting µ_i = e^βi/(1 + e^βi), when σ is close to 0 show that E(Y_i) ≈ µ_i, var(Y_i) ≈ [µ_i(1–µ_i)]² σ².

d. For independent continuous proportions {y_i}, let µ_i = E(Y_i). For a GLM, it is sensible to use an inverse cdf link for µ_i, but it is unclear how to choose a for Y_i. The approximate moments for the logistic-normal motivate a quasi-likelihood approach (Wedder-burn 1974) with variance function υ(µ_i) = ϕ[µ_i(1 – µ_i)]² for unknown ϕ. Explain why this provides similar results as fitting a normal regression model the sample logits assuming constant variance. (The QL approach has the advantage of not requiring adjustment of 0 or 1 observations for which sample logits don’t exist.)

e. Wedderburn (1974) gave an example with response the proportion of a leaf showing a type of blotch. Envision an approximation of binomial from based on cutting each leaf into a large number of small regions of the same size and observing for each region whether it is mostly covered with blotch. Explain why this suggests that υ(µ_i) = ϕµ_i(1 – µ_i). What violation of the binomial assumptions might make this questionable? [The parametric family of beta distributions has variance function of this form. Barndorff-Nielsen and Jorgensen (1991) proposed a having υ(µ_i) = ϕ[µ_i(1 – µ_i)]³].

Leave a Comment

Contact Us

Leave a Comment

Contact Us

WE HAVE A GIFT FOR YOU!

15% OFF