The Analysis of Covariance

4/25/2002

This lab is intended to raise some important questions with respect to the Analysis of Covariance and standard t tests. It is based on an actual study: Rodriguez, N., Ryan, S. W., Vande Kemp, H., and Foy, D. W. (1997) Posttraumatic stress disorder in adult female survivors of childhood sexual abuse: A comparison study. Journal of Consulting and Clinical Psychology, 65, 53-59. The first part of the study is a fairly close replication of their results, but the "treatment" part is pure fabrication.

The data can be found at sexabuse.sav.

Rodriguez et al. collected data on childhood sexual abuse from 45 adult women who reported abuse, and 31 women who did not report abuse. We will treat these as our two groups. The dependent variable was a measure of lifetime PTSD, based on the Structured Clinical Interview for DSM-III-R (SCID). This measure is labeled as PTSD. They also collected data on childhood physical abuse, using a measure known as AEIII, and this variable in denoted PhysAbus.

Be careful about one problem. When I speak of Sexual Abuse I am talking about a dichotomous variable, where 1 = Yes, 2 = No. When I speak about Physical Abuse, I am speaking about a continuous variable where higher scores mean more abuse. That is the nature of the data that they collected.

The first part of this lab asks you to investigate these variables. Cast your answers in some form of analysis of variance or regression analysis..

Now comes the fictitious part. Assume that we intervened with these women and provided some form of therapy. Therapy might not actually be appropriate for the non-abused women, but we intervened anyway. I have given you another dependent variable called PostPTSD that represents a hypothetical set of scores that might reasonably be obtained after a very effective form of therapy.

It is important to understand how I generated the PostPTSD data. I assumed that there would be no mean change in the non-abused group, so I just added a random number to every value. These numbers were N(0,3), so some people went up and some went down. For the abused group, I assumed that the effect of therapy was to reduce their pre-therapy score to 60% of what it was initially (plus or minus the same kind of error applied to the non-abused group.) How would you have created PostPTSD for this group?

I have deliberately led you into a confusing situation--but one that is perfectly conceivable. If these were my data I could see several ways to deal with them, and they may not all be equally good. Run each of these analyses and draw conclusions.

Which analysis was most appropriate, and why?

Last revised: 04/20/02