|
KIN 610 Quantitative Analysis of Research Key Terms & Definitions Notes from Lectures, Textbook, & Blackboard.Com Weblinks (Last Updated On: 11-3-01) Text: Statistics in Kinesiology by William J. Vincent Instructor: Dr. Ann Maliszewski Website As:
www.ronjones.org/csun610notes.htm
Absolute Value:
The positive value. Drop negative
if present.
Alpha: Area
under normal curve for rejection of null hypothesis (H0). ANCOVA:
(Analysis of Covariance) The adjustment of dependent variable mean values to
account for the influence of one or more covariates that are not controlled by
the research design.
ANOVA:
(Analysis of Variance) An F value that represents the ratio of
between-group and within-group variance. Used
when you want to test for statistical significance but some factor wasn’t
equivalent during pre-test. You
need to adjust to make sure post-test values are truly equivalent.
ANOVA/Factorial Analysis
(or 2-Way): Analysis of variance
performed on more than one factor (e.g. the effects of gender (factor A) and
treatment (factor B) analyzed simultaneously).
Factorial ANOVA permits evaluation of the interaction of the factors on
the DV.
Ø
If no interaction, then the lines
would be parallel. If interaction
exists, then the lines cross i.e. menopause for women during 66-75 age group was
example given in class. Ø
If interaction is significant, then
don’t even look at main effects. If no significance, then you are only looking at “main”
effects. Ø
Perform Post Hoc to determine where
the significant differences occur.
Ø
Are differences similar over time?
i.e. interaction Ø
If no significant interaction
between the two levels, then look at main effects. Ø
Perform Post Hoc to determine where
the significant differences occur. ANOVA/One-Way:
Compares two or more means.
ANOVA/Repeated Measures:
Two or more means from the same people so data IS related now.
ANOVA Table:
(Subpoint of Step Wise Regression Analysis) Does the next variable add strength
to the F ratio (or equation)? Axes:
Manipulating can alter the appearance of the table.
Need to make as “realistic” as possible. Best Fit Line:
Line on scatter plot that best indicates the relationship between plotted
values; line on scatter plot that balances positive and negative residual values
so that they sum to zero. Between-Within:
Factorial ANOVA comparing independent groups (between) measured tow or more
times (within) that is sometimes called a “mixed model.” Bias:
The factors operating on a sample so it is not representative of the population
from which it was drawn. Bimodal: A
distribution of values with more than one mode. Bonferroni Adjustment: Adjustment
of p value (probability of error) to correct for a familywise error rate when
making multiple comparisons on the same set of subjects.
Central Tendency:
Values that describe the middle, or central, characteristics of a set of
data. The three values of
central tendency are the mode, median, and mean plus Confidence Interval
that describes your confidence in the probability that statement is true.
Coefficient of
Variability:
Condition:
Equivalent in meaning to “level or variables” and “treatment.” Correlation: A
numerical coefficient between +1.00 and –1.00 that indicates the extent to
which two variables are related (determines relationship) or associated; the
extent to which the direction and size of deviations from the mean in one
variable are related to the direction and size of deviations from the mean in
another variable.
Correlation Coefficient: Reflects
the slope of the line. Correlation Graph:
How does factor Y and X change relevant to one another?
Correlation “r”
Value: Use absolute value of “r”
to determine strength. The closer
to “r” the stronger the relationship.
Confidence Interval:
(LOC) The amount of confidence that can be placed in a conclusion; a value
expressed as a percentage that establishes the probability that a statement is
correct.
Continuous Variables: A
variable that theoretically can assume any value such as: distance, force, and
time. Correlation:
Examines the relationship between two variables measured on the SAME
person i.e. bivariate. Cumulative Frequency: Data:
Information gathered by measurement. Decile:
One-tenth of the range of values. Degrees of Freedom:
The number of values in a data set that are free to vary when restrictions are
imposed on the set.
Dependent Variable:
Depends on something you are manipulating i.e. depends on the independent
variable. 1.
The effect or consequent of the IV; also called the yield. 2.
Variable whose value is partially determined by the effects of other
variables. It is not free to assume
any value. It is usually the
variable that is measured in the research design.
Descriptive Statistics:
Mean, mode, median, range, standard deviation, variance, sum of squares.
Deviations:
å
(X-mean) Discrete Variables:
A variable that is limited in its assessment to certain values, usually integers
i.e. the data is not continuous; there are gaps between values in the range of
data. Example is gender.
EXCEL Symbols:
EXCEL Formulas &
Directions: *Note: [ ] symbols=cell
boundaries
External Validity:
The ability to generalize the results of an experiment to the population from
which the samples were drawn. Factor:
A component in the design of a study that is combined with other factors to
answer multiple questions about the data; a virtual variable that is the result
of a combination of two or more variables in a factor analysis design.
Factorial ANOVA:
*(See ANOVA/Factorial Analysis) Analysis of variance performed on more than one
factor (e.g. the effects of gender (factor A) and treatment (factor B) analyzed
simultaneously). Factorial ANOVA
permits evaluation of the interaction of the factors on the DV. F Value:
Determined by ANOVA. Figures:
Figure titles go on bottom. Fisher Post Hoc Test:
Not used much. Frequency Distribution:
History of scores. Frequency: Graph:
A diagrammatic representation of quantities designed to show their relative
values; visual representation of data. Grouped Frequency
Distribution: An ordered listing of
the values of a variable organized into groups with a frequency column
indicating the number of cases included in each group.
Groupings:
General rule is 6-15 intervals. <6
loses meaning. Histogram:
A graph plotting blocked scores against frequency; it is commonly known as a bar
graph. Independent Variable:
The part of the experiment that the researcher is manipulating; also
called the experimental or treatment variable. ·
The one the researcher is trying
to understand. ·
A Categorical Variable (also
called a Moderator Variable) is a kind of independent variable except that it
cannot be manipulated, for example, age, race, or sex.
·
Variable that is free to vary and
that is not dependent on the influence of another variable; a variable in the
research design that is permitted to exert influence over other variables (i.e.
the DV) in the study. The IV is
usually controlled by the research design.
Integer:
A whole number; a natural number, the negatives of these numbers, and zero. Interval Level of
Measurement:
Interval Size:
The numerical size of each group in a group frequency distribution i.e. the
number of data points in the group. Kurtosis:
A measure of the vertical deviation form normality (amount of peakedness or
flatness) in the plot of a data set. Labeling Figures &
Tables: Don’t label as “histogram, etc.” if it is obviously a
histogram, etc.
Leptokurtic:
Curve that is more peaked than mesokurtic (normal) curve.
Level or Variables:
Equivalent in meaning to ”condition” and “treatment.” Level of Confidence:
See “Confidence Interval” Levels of Measurement:
Nominal, ordinal, interval, ratio. Line of Best Fit:
Determined by Scattergram points i.e. an “estimate.” MANOVA:
(Multiple Analysis of Variance) Simultaneous analysis of tow or more dependent
variables in a research design using analysis of variance.
Mean:
The arithmetic average score in a distribution.
Measurement:
The process of comparing to a standard. Median:
The 50th percentile or the score that falls midway in the
range of ordered values.
Mesokurtic:
The typical, bell-shaped, normal curve. Mode:
The score in a distribution of values that occurs the most often.
Multiple Regression
Analysis: Many IVs predicting just one
DV where each variable you are entering into the regression is based on the
strength of the outcome variable.
N:
# of cases or “total” population. n:
# of those “sampled” or the
sub sample. Nominal Level of
Measurement: (categorical)
Normal Curve:
A curve, which has known characteristics, formed by the bilaterally symmetrical,
bell-shaped distribution of values around the mean. Normal Distribution:
Peak occurs in middle of bell-shaped curve at 50th percentile. Null Hypothesis:
(H subscript 0) Hypothesis that predicts absence of a relationship among
subjects or no differences between or among subjects.
It is typically the hypothesis that is tested statistically.
One-Tailed Test:
Test of research hypothesis wherein the difference between two mean values is
predicted to be significant. It
uses only one tail of the normal curve. Order of Operations:
Ordinal Level of
Measurement: (categorical)
Ordinal Scale:
A nonparametric listing of data based on order without consideration of the
absolute value of each data point i.e. listing from highest to lowest or first,
second, third, etc.
Outlier:
A value in a data set that lies beyond the limits of the typical scores.
Paired T-Test:
If not sure about direction, use two-tailed test.
If you use one-tailed test, must justify why you suspect this. Parameter:
A characteristic of a population. Parametric:
Data that meet the assumptions of normality. Percentile:
A point or position on a continuous scale of 100 theoretical divisions such that
a certain fraction of the population of scores lies at or below that point.
Platykurtic:
A curve that is more flat than a mesokurtic (normal) curve.
Population:
Group of people, places, or things that have at least one common characteristic.
Post Hoc Tests:
(p. 161) Identifies the groups that differ significantly; goes beyond just
indicating that a difference exist at all.
Tells where the means occur and how significant they are.
Power in Statistics:
“Power” in your study is
whether you have the power to find significance.
Probability of Error:
The probability that a statement is incorrect; a value expressed as a decimal
that establishes the probability that a statement is incorrect. P Value:
Tells the significance of a relationship.
Quartile: One-fourth of the range of
values.
Quintile:
One-fifth of the range of values. R=RangeRandom Sample:
A sample taken from a population where every member of the population has an
equal chance of being selected in the sample.
Range:
The numerical distance from the highest to the lowest score.
Rank Order Distribution:
An ordered listing of data in a single column. Ratio Level of
Measurement:
Ratio Scale:
A parametric scale of measurement based on order, with equal units of
measurement and an established zero point i.e. data based on time, distance,
force, or counting events. Real Limits:
The assumed upper and lower values for a group in a grouped frequency
distribution that include all possible values on a continuous scale. Regression:
A method of predicting values on the Y-variable based on a value on one or more
X-variables and the relationship between the variables; a statistical term
meaning prediction.
Regression Analysis:
Predicts one variable from another.
Reliability:
A measure of the consistency of the data when measurements are taken more than
once under the same conditions. Repeated Measures:
Measuring the same set of subjects more than once as in a pre-post comparison;
same as within-subjects design. Repeated T-Test:
Must use Bonferroni Adjustment to compensate for multiple tests. R Value:
Tells the strength of the relationship.
Sample:
A portion or fraction of a population. Sampling Error:
The amount of error in the estimate of a population parameter based on a sample. Scale
“Identification” of Variables: (nominal,
ordinal, ratio, or interval) Scale “Types” of
Variables: (discrete or continuous) Scattergram:
Graph plotting X against Y.
Scatter Plot: A
scattering of individual points that produces a visual picture of the
relationship between the variables. Scheffe’s Confidence
Interval (I): Post hoc test conducted
after a significant ANOVA to determine the significance of all possible
combinations of cell contrasts.
Significant:
A statistical term meaning that a relationship or a mean difference is not due
solely to chance. Simple Frequency
Distribution: An ordered listing of
the values of the variable with a frequency column that indicates the number of
cases for each value. Skewed:
A plot of values that is not normal i.e. a disproportionate number of subjects
fall toward one end of the scale—the curve is not bilaterally symmetrical. Skewness:
A measure of lateral deviation from normality (bilateral symmetry) in the plot
of a data set that reflects asymmetry.
Sorting: Standard Deviation:
A measure of the spread, or dispersion, of the values in a parametric data set
standardized to the scale of the unit of measurement of the data; the square
root of the average of the squared deviations around the mean.
Standard Error of the
Mean: The numeric value that indicates
the amount of error in the prediction of a Y value in bivariate or multivariate
regression.
Standard Score:
(p. 71) A score that is derived
from raw data and that has a standard basis for comparison i.e. it has a known
central tendency and a known variability.
Stanine:
A standard score based on the division of the normal curve into nine sections,
each of which is one-half of a standard deviation wide, with a mean of 5 and a
range of 1 to 9. Stratified Sample:
A series of samples taken from various subgroups of a population so that the
subgroups are represented in the total sample in the same proportion that they
are found in the population. Statement Sentences:
(Examples)
Statistic:
A characteristic of a sample. Statistics: A mathematical
technique by which data are organized, treated, and presented for interpretation
and evaluation. Step Wise Regression
Analysis (subform of Multiple Regression Analysis):
Adds one variable at a time but only the ones that add strongly to the equation.
Sum of Squares:
The sum of the squares of the deviation from a mean of a set of scores.
Reflects deviation around the mean.
Symmetrical Distribution:
If median and mean are close, probably a symmetrical distribution. Tables:
Tails:
If you have a good idea of what way the hypothesis will go, then you’d use a
one-tailed test. If you aren’t
sure which way the hypothesis will go, then you’d need to use a two-tailed
test. Tally: Terminal Statistic: A
statistic that does not provide information that can be used in further analysis
of the data. Titles:
Table titles go on top. Figure
titles go on bottom. Treatment:
Equivalent in meaning to “condition” and “level or variables.” T Score:
A standard score with a mean of 50 and a standard deviation of 10.
T-Test:
Compares two means i.e. evaluates significant difference between group means.
Tukey’s Honestly
Significant Difference (HSD): Post hoc
test conducted after a significant ANOVA to determine the significance of
pairwise cell contrasts.
Two-Tailed Test:
Test of null hypothesis wherein a difference between tow mean values is
predicted to be zero. It uses both
tails of the normal curve. Type I Error:
Rejection of null hypothesis when it is really true.
Type II Error:
Acceptance of the null hypothesis when it is really false.
Unpaired T-Test:
Compares two different populations that don’t have correlated data i.e. two
different schools. Validity:
The soundness or correctness of a test or instrument in measuring what it is
designed to measure i.e. the truthfulness of the test or instrument. Variability:
Scatter of scores.
Variable:
A characteristic of a person, place, or thing that can assume more than one
value.
Variance:
A measure of the spread, or dispersion, of the values in a parametric
data set; the average of the squared deviations around the mean.
o
Variability>¯Confidence (clear separation in scatter plot between points) o
¯Variability>Confidence (no clear area of separation so can’t be as confident if
you wanted to just pull out one sample as an example of the whole population) X Axis: Y Axis: Frequency Y Intercept:
Point where the extension of the best fit line intercepts the Y-axis.
Z Score:
(Z % areas on p. 70) A standard score with a mean of 0 and a SD of 1.
o
State Above: 84% of group scored
above this person who was at 16% percentile based on the normal distribution. o
Whatever is below the % of standing
area under the curve tells the person’s standing in the population. Ron Jones/www.ronjones.org (11-3-01)
|
Get
Fit.
Be Strong. |