Chapter 8:  Differences Among Groups

l   Techniques described in this chapter are not used by themselves to establish cause and effect, but only to evaluate the influence of the independent variable.

l   Cause and effect are not established by statistics but by theory, logic and the total nature of the experimental situation.

 

The purpose of the statistical test is to evaluate the Null Hypothesis;  Ho.

In other words, do the levels of a treatment differ significantly at a probability level?

The tests cannot tell why they are different, though.

Besides showing that groups are different you want to show the size of the difference, and the strength of the association between the independent and dependent variables

 

t and the F ratios are used to determine the difference between groups.

w2 is  used to estimate the degree of association (or % variance accounted for) between the independent and dependent variables. (similar to r2 for the correlations)

Effect size (the standardized difference between groups) is also use to estimate meaningfulness.

Four Assumptions for the use of the t and F distributions

l    Observations are drawn from a normally  distributed  population

l    Observations represent random samples from populations

l    The numerator and denominator are estimates of the same population variance

l    The numerator and denominator of F (or t) ratios are independent.

The t and F tests are only slightly influenced by violations of these assumptions, but do not neglect them.

t test between a Sample and a Population Mean

         M - m

T = ------------------     equation 8.1

                _____

       sm / Ö  n

 sm = the standard deviation for the sample mean

 n = the number of obeservations in the sample

Example 8.1

Population: N = 10,000

Fitness class: n = 32

Sample Mean: M = 81

Population Mean: m = 76

Standard Deviatio sm = 9                              __

Put data in equation  8.1: t = (81 – 76)/ (9/Ö32

t = 5/ 1.59 = 3.14

 

For significance of t = 3.14, check Appendix A.5

For df = n-1= 32 –1 = 31

Independent t test

 

        

 

Differences between two sample means

l   Example: Two levels of intensity of training 40% and 70% of VO2max on performing the 12-minute run.

 

How meaningful is the effect of training intensities?

 

Differences between groups: t = 13.81

Number of subjects in group 1: n1= 15

Number of subjects in group 2: n2 = 15

 w2 = (13.812 – 1)/(13.812 + 15 + 15 – 1)

 w2 = 189.72/219.72 = .86

86% of the total variance is accounted for by the difference in the treatments.

Effect Size: The standardized difference between the means

 

An effect size of 0.8 or greater is large, ~0.5 is moderate,  0.2 or less is low.

Dependent t Test

Two groups of scores are related in some manner.

   One group of subjects is tested twice

   Two groups of subjects are matched on one or more characteristics and are no longer independent

 

 

df = N – 1, N is the number of paired observations

 

One-tailed vs. Two-tailed Tests

l   Appendix A.5

l   If we do not know which mean will exceed the other a two-tailed test is used.

l   But if our hypothesis says that one will be equal to or greater than the other, we could use a one-tailed test.

l   Easier to get significance using one-tailed

Power: The probability of rejecting a Null Hypothesis when it is false

 

l   Increasing difference between the means:

   Using stronger, more concentrated treatments, i.e., 12 week treatment instead of a 6 week treatment

2. Reducing the Variances of the scores:

    If the denominator becomes smaller the power is increased.

    Apply the treatments consistently.  The more consistent the application of the treatments the more similar the subject’s response to the dependent variable.

 

3. Number of participants in each group:

      The larger the number of subjects the greater smaller the denominator and the greater the t and the power is increased.

     The larger the N, the lower the t-ratio needed for significance.

  Summary: power is obtained by using strong treatments, administering those treatments consistently, using as many participants as feasible, varying alpha, or using an appropriate research design and statistical analysis

Analysis of Variance (ANOVA)

Using t-tests is a good method to determine differences between two groups, but when using more than two groups a ANOVA is used.

The groups represent levels of an independent variable.

Cardiovascular training at three levels of intensity: 40%, 60% and 80% of VO2max

 

A t-test to compare groups 1&2, 1&3, and 2&3, could also be used.

This would violate the assumption that at the .05 level we expect 1 chance in 20 that we got a sampling error and a significant difference when there was none.

With each group appearing in 2 t-tests we have increased our chances of committing a type I error, rejecting the Ho when it is true.

ANOVA allows all the groups to be compared simultaneously.

 

 

 

Dividing the Between Groups MS by the Within Groups MS gives the F-ratio.

In Appendix A.6, the F-ratio is determined by finding the numerator df column, and the denominator df row.

MSb df = 2; MSw df = 12. 

(p < .05) we need an F-ratio ³ 3.88

(p < .01) we need an F-ratio ³ 6.93

Follow-up (post hoc) tests

Several follow-up tests protect the type I error

These include: Scheffe’, Tukey, Newman-Kuels, Duncan.  These are in order of the most conservative to the most liberal.  Conservative means it is more difficult to get significance, and liberal means it is easier to get significance.

 

Factorial ANOVA

Manipulate more than one independent variable and statistically evaluate the effects on a dependent variable.

There are more studies in the literature using the Factorial ANOVA than the simple (one-way) ANOVA

Components of a Factorial ANOVA

Main Effects are tests of each independent variable when the other is disregarded. 

In table 8.2 there are two levels of intensity; high and low, and there are two levels of fitness of participants; high fitness and low fitness

We are also interested in the interaction between levels of each factor.