Chapter 17

Experimental and Quasi-Experimental Research

Experimental Research attempts to establish cause and effect relationships. That is, an independent variable is manipulated to judge its effects on a dependent variable.

Three criteria are used to establish cause and effect:

      The cause must precede the effect in time.

      The cause and effect must be correlated with each other

      The correlation between cause and effect cannot be explained by another variable.

Cause and Effect are not established by Statistics

Statistical techniques can only reject the Null Hypothesis, and identify the percentage of the variance accounted for by the independent variable or the effect size.

Both of these procedures are necessary but not sufficient to establish C&E.

Cause and Effect can only be established by the application of logical thinking to well-designed experiments.

The Logical Process

This logical process establishes that no other reasonable explanation exists for the changes in the dependent variable except the manipulation of the independent variable.

To facilitate the logical process:

    Select a good theoretical framework

    Use appropriate participants

    Apply appropriate experimental design

    Properly select and control the independent variable.

 

5) Appropriately select and measure the dependent variable

6) Use the correct statistical model and analysis

7) Interpret the results correctly.

Review the following terms

Independent variable – the cause

Dependent variable – the effect, the variable measured.

Categorical variable –A kind of independent variable that cannot be manipulated; age, sex, race, etc.

Control variable – a factor that could possibly influence the results and that is kept out of the study

Extraneous variable - a factor that could possibly influence the relationship between the independent and dependent variables, but is not controlled or included.

Sources of Invalidity

Internal validity is the basic minimum without which any experiment is uninterpretable.

Did in fact the experimental treatments make a difference in this specific experimental instance?

External validity asks the question of generalizability.

To what populations, settings, or treatment variables can this effect be generalized?

Internal Validity

Gaining internal validity involves controlling all variables so that the researcher can eliminate all rival hypotheses as explanations for the outcomes observed.

By controlling and constraining the research setting to gain internal validity, generalizability is in jeopardy.

 

The researcher must decide:  Is it more important to be certain that the manipulation of the independent variable caused the observed changes in the dependent variable, or is it more important to be able to generalize the results to other populations, settings, etc.

No single experiment can meet all design considerations.

Is internal or external validity more important?

Threats to Internal Validity

History – events occurring during the experiment that are not part of the treatment

Maturation – processes within the participants that operate as a result of time passing (e.g., aging, fatigue, hunger)

Testing – the effects of one test on subsequent administrations of the same test

Instrumentation – changes in instrument calibration, including lack of agreement within and between observers.

 

Statistical regression – the fact that groups selected on the basis of extreme scores are not as extreme on subsequent testing

Selection bias – choosing comparison groups in a nonrandom manner

Experimental mortality – loss of participants from comparison groups for nonrandom reasons

Selection-maturation interaction – the passage of time affecting one group but not the other in nonequivalent group designs.

Expectancy – experimenters’ anticipating that certain participants will perform better.

Threats to External Validity

Reactive of interactive effects of testing – The pretest may make the participant more aware of or sensitive to the upcoming treatment.  The treatment is not as effective without the pretest.

Interaction of Selection Bias and Experimental treatment – When a group is selected on some characteristic, the treatment may work only on groups possessing that characteristic

 

Reactive Effects of Experimental Arrangements – Treatments that are effective in very constrained situations may not be effective in less constrained settings.  Hawthorne effect is when people’s performances change when attention is paid to them.

Multiple-treatment Interference – When participants receive more than one treatment, the effects of previous treatments may influence subsequent ones.   Two groups may be better

Controlling Threats to Internal Validity

To make the threats to internal validity less the experimental and control groups must be as much alike as possible.

Randomization – allows the assumption that the groups do not differ at the beginning of the experiment. Controls for history before the experiment, for maturation, for statistical regression, selection biases, and selection-maturation interaction (the latter 3 occur when randomization does not occur).

 

A matched-group technique may also control for equality of groups, by matching subjects on some characteristic, but this may also mean that they are not similar on another characteristic.

In within subjects design the group serves as both the control and the experimental group.

Placebos, Blind Setups, and Double-blind Setups

Placebo is used to evaluate whether the observed effect is produced by the treatment or is a psychological effect. 

Blind setup is when the participant does not know whether they are receiving the experimental treatment or the control.

Double-blind is when neither the experimenter of the subject know whether the subject is receiving the experimental treatment or the control.

 

All these techniques are useful  in controlling the Hawthorne effect, the halo effect, and the Avis effect (where the subjects in the control group try harder just because they are not suppose to get the treatment).

Uncontrolled threats to Internal Validity

Not controlled by randomization:

      Reactive or Interactive Effects can only be controlled by eliminating the pretest, or use another design; either the pretest-posttest randomized-groups, or Solomon Four-group

      Instrumentation cannot be controlled or evaluated by any design

      Experimental mortality cannot be controlled by any design

Controlling Threats to External Validity

     External validity is generally controlled by selecting the participants, treatments, experimental situation, and tests to represent some larger population.

     Randomization is the key to controlling most threats to external validity.

     Randomize the selection of subjects, treatment levels, experimental situations, and dependent variables.

 

     How do the participants perceive the study? 

Types of Designs

Preexperimental Designs

True Experimental Designs

Quasi-experimental designs

Notations:

    Each line is a group of participants

    R signifies random assignment of participants to groups

    O signifies an observation or a test

    T signifies that a treatment is applied; blank space in a line indicates a Control

Preexperimental Designs

     Control very few sources of invalidity

     None has random selection of subjects

One-shot study:  all subjects receive a treatment followed by a test to evaluate the treatment. Cannot attribute the level of performance (O) to the treatment.

                   T  O

One-Group Pretest-Posttest Design:

This is weak but better than the one-shot design:

 

O1  T  O2

 

This design does not tell why the subjects improved. 

Static Group Comparison

Compares two groups, one of which receives the treatment and one of which does not.

 

T     O1

----------

       O2

The groups were not equivalent before the study.

 

True Experimental Designs

Groups are randomly formed, allowing the assumption that the groups are equivalent.

Randomized Group Design:

 

R   T   O1    or    R    T1    O1

R        O2                R    T2   O2

                                       R           O3

 

Randomized Factorial Design

     R     A1     O1

B1  R     A2     O2

     R     A3     O3

---------------------

     R     A1     O4

B2  R     A2     O5

     R     A3     O6

Independent variable A has 3 levels and B has 2 levels. A 3X2 ANOVA, but not a true experimental design since B levels are not randomized.

 

Pretest-Posttest Randomized-Groups Design

     Both groups are randomly formed and both get a pretest and a posttest

 

R   O1   T   O2

 

R   O3         O4

 

Purpose is to determine the amount of change  produced by the treatment

 

There are three ways to do a statistical analysis:

     A 2-factor ANOVA where the treatment vs. notreatment is one factor and pretest vs posttest is the other.

     ANCOVA – Analysis of Covariance uses the pretest scores of each group to adjust the posttest scores.

     Difference scores – the pretest score is subtracted from the posttest and a simple ANOVA is performed.

Difference Scores

      Tend to be unreliable

     The level of initial values applies: participants who start lower in performance can improve more easily than those who begin with high scores. 

     Initial scores are negatively correlated with the difference scores

Solomon Four-Group Design

A True Experimental Design which specifically evaluates the threat to external validity: reactive or interactive effects of testing.

R  O1  T   O2

R  O3       O4

R       T    O5

R             O6

 

This combines the randomized-groups and the pretest-posttest randomized-groups designs.

This determines whether the pretest increased the sensitivity of the participants to the treatment.

Is O2 > O4; is O5  > O6 ;  replication of the treatment effect.

Is O2  - O1 > O4  - O3 ; an assessment of the amount of change due to the treatment

 

Is O4  >  O6 ; an evaluation of the testing effect.

Is O2  >  O5 ;  an assessment of whether the pretest interacts with the treatment

This design is powerful but inefficient, since it requires twice as many subjects

The best alternative is the 2X2 ANOVA

                 No T      T

Pretest          O4        O2

Nopretest      O6        O5

 

Quasi-Experimental Designs

The Purpose is to fit the design to settings more like the real world while still controlling as many of the threats to internal validity as possible.

Time-Series Design attempts to show that the changes from the treatment differ from the times when the treatment was not administered.

O1  O2  O3  O4  T  O5   O6  O7  O8

 

 

Reversal Design is used increasingly in school settings.

O1  O2  T1  O3  O4  T2  O5  O6

 

 

Nonequivalent-Control-Group Design is frequently used in real-world settings where groups cannot be randomly formed

O1   T   O2

------------

O3        O4

This is a pretest-posttest design without randomization

O1 and O3 are compared and if there is no significant difference they are deemed equivalent, even though they may differ on other variables.  If they differ ANCOVA is used to adjust O2 and  O4

 

Ex Post Facto Design is a static group comparison where the treatment is not under the control of the experimenter.

Example:  athletes vs. nonathletes; fit vs. unfit; male vs. female

Purpose is to search for variables that differ between the groups

It is asking the question, “Did these variables influence the way these groups became different?”  This question cannot be answered by this design, but it may increase the insight into the characteristics that could be manipulated in a different experimental design.

Switched-Replication Design

Can be either true or quasi-experimental depending on whether levels are random or intact groups.

If they are randomly assigned to the groups it is a true experimental design.

If the levels are intact groups then the design is quasi-experimental

The number of trials must be one greater than the number of levels