|
2 X 5 factorial
design |
A factorial design
with one variable having two levels and the other having five levels. |
|
|
|
|
Alpha (a) |
The probability of a
Type I error. |
|
|
|
|
Abscissa |
Horizontal axis. |
|
|
|
|
Additive law of
probability |
The rule giving the
probability of the occurrence of one or more mutually exclusive events. |
|
|
|
|
Adjacent values |
Actual data points
that are no more extreme than the inner fences. |
|
|
|
|
Adjusted
correlation (radj) |
A correction to the
computed correlation coefficient to adjust for the number of predictors
relative to the sample size. |
|
|
|
|
Adjusted means |
Means that have been
adjusted for differences on a covariate. |
|
|
|
|
All subsets
regression |
The result of a
stepwise multiple regression when the program chooses that set of variables
that has the best correlation with the critierion. |
|
|
|
|
Alternative
hypothesis (H1) |
The hypothesis that is
adopted when H0 is rejected. Usually the same as the research
hypothesis. |
|
|
|
|
Analysis of
variance (ANOVA) |
A statistical
technique for testing for differences in the means of several groups. |
|
|
|
|
Analysis of
covariance |
An analysis of
variance in which the data are adjusted (or controlled) for the presence of
one or more other variables. |
|
|
|
|
Analytic view |
Definition of probability
in terms of analysis of possible outcomes. |
|
|
|
|
Array |
The set of Y values associated
with a given X,
or the set of X
values associated with a given Y. |
|
|
|
|
Asymmetric
relationships |
Log-linear models
where at least one variable is treated as an independent variable and at
least one variable is treated as a dependent variable. |
|
|
|
|
Backward
elimination |
A stepwise regression
procedure in which we start with all predictors and them eliminate those that
do not contribute significantly or up to some predetermined standard. |
|
|
|
|
Behrens-Fisher
problem |
An old name given to
the problem of how to compare two independent means when we can not assume
homogeneity of variance. |
|
|
|
|
Bernoulli trial |
A outcome with one of
two mutually exclusive outcomes--such as pass/fail. |
|
|
|
|
b (Beta) |
The probability of a
Type II error. |
|
|
|
|
Betweensubjects designs |
Designs in which
different subjects serve under the different treatment levels. |
|
|
|
|
Bimodal |
A distribution having
two distinct peaks. |
|
|
|
|
Binomial
distribution |
The distribution in
which each of a number of independent trials results in one of two mutually
exclusive outcomes. |
|
|
|
|
Biserial
correlation |
The correlation
between a continuous variable and a dichotomous variable, where we assume an
underlying normality to the dichotomous variable. Rarely used. |
|
|
|
|
Bivariate normal
model |
A regression model in
which both X
and Y
are subject to random error. |
|
|
|
|
Bonferroni
inequality |
An inequality on which
the Bonferrone test is based. It states that the probability of the
occurrence of one or more events can never exceed the sum of their individual
probabilities. |
|
|
|
|
Bonferroni test |
A multiple comparison
procedure in which the familywise error rate is divided by the number of
comparisons. |
|
|
|
|
Box-and-whisker
plot |
A graphical
representation of the dispersion of a sample. |
|
|
|
|
Boxplot |
A graphical
representation of the dispersion of a sample. |
|
|
|
|
Carryover effect |
The effect of previous
trials (conditions) on a subject's performance on subsequent trials. |
|
|
|
|
Categorical data |
Data representing
counts or number of observations in each category. |
|
|
|
|
Cell |
The combination of a
particular row and column‹the set of observations obtained under identical
treatment conditions. |
|
|
|
|
Censored data |
Data which have been
categorized into two or more groups on the basis of a cutoff score on some
criterion variable. Often a consideration in logistic regression. |
|
|
|
|
Centering |
The process of
converting data to deviation scores. |
|
|
|
|
Central limit
theorem |
The theorem that
specifies the nature of the sampling distribution of the mean. |
|
|
|
|
Chi-square
distribution |
The distribution of
the chi-square (c2) statistic. |
|
|
|
|
Chi-square test |
A statistical test
often used for analyzing categorical data. |
|
|
|
|
Coefficient of
variation (CV) |
The standard deviation
divided by the mean. |
|
|
|
|
Collinearity |
The condition in which
the independent variables are (usually highly) correlated with each other. |
|
|
|
|
Column totals |
The total number of
observations occurring in a column of a contingency table. |
|
|
|
|
Combinations |
The number of ways
objects can be selected without regard to order. |
|
|
|
|
Combinatorics |
The branch of
mathematics dealing with the number of different ways objects can be selected
or arranged. |
|
|
|
|
Compound symmetry |
The condition with
constant variances on the main diagonal of a matrix, and constant covariances
off the main diagonal. |
|
|
|
|
Concordant pairs |
A pair of observations
that are ordered in the same direction on two variables. |
|
|
|
|
Conditional
distribution |
The distribution of Y for a fixed level of X. |
|
|
|
|
Conditional means |
The means for one
variable at individual levels of a second variable. |
|
|
|
|
Conditional odds |
The odds of success given some level of another
variable. |
|
|
|
|
Conditional
probability |
The probability of one
event given
the occurrence of some other event. |
|
|
|
|
Confidence interval |
An interval, with
limits at either end, with a specified probability of including the parameter
being estimated. |
|
|
|
|
Confidence limits |
An interval, with
limits at either end, with a specified probability of including the parameter
being estimated. |
|
|
|
|
Confounded |
Two variables are said
to be confounded when they are varied simultaneously and their effects cannot
be separated. |
|
|
|
|
Constant |
A number that does not
change in value in a given situation. |
|
|
|
|
Contingency table |
A twodimensional table in which each observation is classified
on the basis of two variables simultaneously. |
|
|
|
|
Contingency
coefficient |
A coefficient, based
on chi-square, reflecting the degree of relationship exhibited in a
contingency table. |
|
|
|
|
Continuous
variables |
Variables that take on
any
value. |
|
|
|
|
Contrast |
A comparison between
two levels (or two sets of levels) of the independent variable following an
analysis of variance. |
|
|
|
|
Cook's D |
A measure of the
influence of an observation in multiple regression. |
|
|
|
|
Correlation (r) |
Relationship between
variables. |
|
|
|
|
Correlation
coefficient |
A measure of the
relationship between variables. |
|
|
|
|
Correlational
measures |
A measure of the
degree of relationship between two variables that are each at least ordinal. |
|
|
|
|
Count data |
Data representing
counts or number of observations in each category. |
|
|
|
|
Counterbalancing |
An arrangement of
treatment conditions designed to balance out practice effects. |
|
|
|
|
Covariance (sxy
or covxy) |
A statistic
representing the degree to which two variables vary together. |
|
|
|
|
Covariance matrix (S) |
A matrix of variances
and covariances among variables. |
|
|
|
|
Covariate |
A variable whose
influence is controlled in the analysis of covariance. |
|
|
|
|
Cramér's phi (Fc) |
The extension of the
phi coefficient to the case of larger contingency tables. |
|
|
|
|
Criterion variable |
The variable to be
predicted. |
|
|
|
|
Critical value |
The value of a test
statistic at or beyond which we will reject H0 . |
|
|
|
|
Cross-correlation |
The correlation
between one predictor and all other predictors. |
|
|
|
|
Cross-validation |
The result of taking a
regression equation from one set of data, applying it to a new set of data,
and examining the correlation between the predicted and obtained values on
the new set of data. |
|
|
|
|
Curvilinear
relationship |
A situation that is
best represented by something other than a straight line. |
|
|
|
|
Deciles |
Points that divide the
distribution into tenths. |
|
|
|
|
Decision tree |
Graphical
representation of decisions involved in the choice of statistical procedures. |
|
|
|
|
Decision making |
A procedure for making
logical decisions on the basis of sample data. |
|
|
|
|
Degrees of freedom
(df) |
The number of
independent pieces of information remaining after estimating one or more
parameters. |
|
|
|
|
Delta (d) |
A value used in
referring to power tables that combines gamma and the sample size. |
|
|
|
|
Density |
Height of the curve
for a given value of X- closely related to the probability of an observation
in an interval around X. |
|
|
|
|
Dependent variables |
The variable being
measured. The data or score. |
|
|
|
|
Depth |
Cumulative frequency
counting in from the nearer end. |
|
|
|
|
Design matrix |
A matrix of coded or
dummy variables representing group membership. |
|
|
|
|
dferror |
Degrees of freedom
associated with SSerror = k(n - 1). |
|
|
|
|
dfgroup |
Degrees of freedom
associated with SSgroup = k - 1. |
|
|
|
|
dftotal |
Degrees of freedom
associated with SStotal = N - 1. |
|
|
|
|
Deviation scores |
Data in which the mean
has been subtracted from each observation. |
|
|
|
|
Descriptive
statistics |
Statistics which
describe the sample data without drawing inferences about the larger
population. |
|
|
|
|
Dichotomous
variables |
Variables that can
take on only two different values. |
|
|
|
|
Difference scores |
The set of scores
representing the difference between the subjects' performance on two
occasions. Also known as "gain scores." |
|
|
|
|
Directional test |
A test that rejects
extreme outcomes in only one specified tail of the distribution. |
|
|
|
|
Discordant pairs |
A pair of observations
that are ordered in opposite directions on two variables. |
|
|
|
|
Discriminant
analysis |
A procedure for
developing a procedure for optimally discriminating between two groups. This
technique often being replaced with logistic regression. |
|
|
|
|
Discrete variables |
Variables that take on
a small set of possible values. |
|
|
|
|
Disordinal
interaction |
An interaction in
which group differences reverse their sign at some level of the other
variable. |
|
|
|
|
Dispersion |
The degree to which
individual data points are distributed around the mean. |
|
|
|
|
Distance |
The vertical distance
between a point and the regression line. Usually known as the
"residual." |
|
|
|
|
Distributionfree tests |
Statistical tests that
do not rely on parameter estimation or precise distributional assumptions. |
|
|
|
|
Dotplot |
A distribution that
represents the frequencies of individual points by stacking dots about the
axis--similar to a histogram. |