The word confounding has been used to refer to at least three distinct concepts. In the oldest and most widespread usage, confounding is a source of bias in estimating causal effects. This bias is sometimes informally described as mixing of effects of extraneous factors (called confounders) with the effect of interest. This usage predominates in nonexperimental research, especially in epidemiology and sociology. In a second and more recent usage originating in statistics, confounding is a synonym for change in an effect measure upon stratification or adjustment for extraneous factors (a phenomenon called noncollapsibility or Simpson's paradox). In a third usage, originating in the experimental-design literature, confounding refers to inseparability to main effects and interactions under a particular design. The three concepts are closely related and are not always distinguished from one another. In particular, the concepts of confounding as a bias in effect estimation and as noncollapsibility are often treated as equivalent, even though they are not. Only the former concept will be described here.
A classic discussion of confounding in which explicit reference is made is to "confounded effects"
In Mill's time, the world experiment referred to an observation in which some circumstances were under the control of the observer, as it still is used in ordinary English, rather than to the notion of a comparative trial. Nonetheless, Mill's requirement suggests that a comparison is to be made between the outcome of one's "experiment" (which is essentially, an uncontrolled trial) and what one would expect the outcome to be if the agents one wished to study had been absent. If the outcome is not as one would expect in the absence of the study agents, then Mill's requirement ensures that the unexpected outcome was not brought about by extraneous "circumstances" (factors). If, however, those circumstances do bring about the unexpected outcome, and that outcome is mistakenly attributed to effects of the study agents, then the mistake is one of confounding (or confusion) of the extraneous effects with the agent effects.
Much of the modern literature follows the same informal conceptualization give by Mill. Terminology is now more specific, with "treatment" used to refer to an agent administered by the investigator and "exposure" often used to denote an unmanipulated agent. The chief development beyond Mill is that the expectation for the outcome in the absence of the study exposure is now almost always explicitly derived from observation of a control group that is untreated or unexposed. For example, D. Clayton and M. Hills (1993) state of observational studies:
there is always the possibility that an important influence on the outcome … differs systematically between the comparison [exposed and unexposed] groups. It is then possible [that] part of the apparent effect of exposure is due to these differences, [in which case] the comparison of the exposure groups is said to be confounded [emphasis in the original].
In fact, confounding is also possible in randomized experiments owing to systematic improprieties in treatment allocation, administration, and compliance. A further and somewhat controversial point that confounding (as per Mill's original definition) can also occur perfect randomized trials due to random differences between comparison groups.
Various mathematical formalizations of confounding have been proposed for use in statistical analyses. Perhaps the one closest to Mill's concept is based on the counterfactual model for casual effects. Suppose one wishes to consider how a health-status (outcome) measure of a population would change in response to an intervention (population treatment). More precisely, suppose one's objective is to determine the effect that applying a treatment x1 had or would have an outcome measure µ relative to applying treatment x0 to a specific target population A. For example, A could be a cohort of breast-cancer patients, treatment x1 could be a new hormone therapy, x0 could be a placebo therapy, and the measure µ could be a five-year survival probability. The treatment x1 is sometimes called the index treatment; and x0 is sometimes called the control or reference treatment (which if often a standard or placebo treatment).
The counterfactual model posits that, in population A, µ will equal µA1 if x1 is applied, µA0 is applied; the casual effect of x1 relative to x0 is defined as the change from µA0 to µA1, which might be measured as µA1 − µA0 or µA1/µA0. If A is given treatment x1 then µ will equal µA1 and µA1 will be observable, but µA0 will be unobserved. Suppose, however, we expect µA0 to equal µB0, where µB0 is the value of the outcome µ observed or estimated for a population B that was administered treatment x0. The latter population is sometimes called the control or reference population. Confounding is said to be present if in fact µA0 [.notequal] µB0, for then there must be some difference between populations A and B (other than treatment) that is affecting µ.
If confounding is present, a naïve (crude) association measure obtained by substituting µB0 for µA0 is an effect measure will not equal the effect measure, and the association measure is said to be
The preceding counterfactual approach to confounding gradually emerged through attempts to separate effect measures into a component due to the effect of interest and a component due to the effect of interest and a component due to extraneous effects. One noteworthy aspect of this approach is that confounding depends on the outcome measure. For example, suppose populations A and B have a different five-year survival probability µ under placebo treatment x0; that is, suppose µB0 [.notequal] µA0, so that µA1 − µB0 is confounded for the actual effect µA1 − µB0 of treatment on five-year survival. It is then still possible that ten-year survival, µ, under the placebo would be identical in both populations; that is, µA0 could equal µB0, so that µA1 − µB0 is not confounded for the actual effect of treatment on ten-year survival. (We should generally expect no confounding for 200-year survival, since no treatment is likely to raise the 200-year survival probability of human patients above zero.)
A second noteworthy point is that confounding depends on the target population of inference. The preceding example, with A as the target, had different five-year survivals µA0 and µA0 for A and B under placebo therapy, and hence µA1 − µB0 was confounded for the effect µA1 − µA0 of treatment on population A. A lawyer or ethicist may also be interested in what effect the hormone treatment would have had on population B. Writing µB1 for the (unobserved) outcome of B under treatment, this effect on B may measured by µB1 − µB0. Substituting µA1 for the unobserved µB1 yields µA1− µB0. This measure of association is confounded for µB1 − µB0 (the effect of treatment x1 on five-year survival in population B) if and only if µA1 [.notequal] µB1. Thus, the same measure of association, µA1, may be confounded for the effect of treatment on neither, one, or both of populations A and B, and may or may not be confounded for the effect of treatment on other targets.
|
|
Author Info: SANDER GREENLAND, The Gale Group Inc., Macmillan Reference USA, New York, Gale Encyclopedia of Public Health, 2002 |