http://changingminds.org/explanations/research/des http://changingminds.org/explanations/r esearch/design/types_validity.ht ign/types_validity.ht m
Different types of reliability Type of reliability
Internal consitency
Interrater
When
Testest-re rete tesst
What
Assess a single dimension
Correlate each individual item score with the total score.
Find consistency in the rating of some outcome
The reliability coefficient for your test indicates a poor, Examine the percentage of moderate, or high degree of agreement between raters. agreement between respondents.
Compare several different forms of a test to see if they are equivalent or reliable
Parallel
How
Reli Reliab abil ilit ity y ov over ti time me
All the items on your test assess the same construct.
Correlate the scores from one form of the test with
Two forms of your test are scores from a second form equivalent to one other. of the same test with the same content. Correlate the scores from time 1 with the scores of time 2.
The test gives the same results even if the participants didn't all take it at the same time.
Types of validity In a research project there are several types of validity that may be sought. In summary: •
• • • •
•
Construct Construct:: Constructs accurately represent reality. o Convergent:: Simultaneous measures of same construct correlate. Convergent o Discriminant: Doesn't measure what it shouldn't. Internal : Causal relationships can be determined. Conclusion Conclusion:: Any relationship can be found. External : Conclusions can be generalized. Criterion: Criterion: Correlation with standards. o Predictive: Predicts future values of criterion. o Concurrent: Correlates Correlates with other tests. Face: Looks like it'll work. Face:
Construct validity Construct validity occurs when the theoretical constructs of cause and effect accurately represent the real-world situations they are intended to model. This is related to how well the experiment is operationalized. A good experiment turns the theory (constructs) into actual things you can measure. Sometimes just finding out more about the construct (which itself must be valid) can be helpful. Construct validity is thus an assessment of the quality of an instrument or experimental design. It says 'Does it measure the construct it is supposed to measure'. If you do not have construct validity, you will likely draw incorrect conclusions from the experiment (garbage in, garbage out). Convergent validity
Convergent validity occurs where measures of constructs that are expected to correlate do so. This is similar to concurrent validity (which looks for correlation with other tests). Discrimina Discriminant nt validity
Discriminant validity occurs where constructs that are expected not to relate do not, such that it is possible to discriminate between these constructs. Convergence and discrimination are often demonstrated by correlation of the measures used within constructs. Convergent validity and Discriminant validity together demonstrate construct validity. Nomologicall network Nomologica
Defined by Cronbach and Meehl, this is the set of relationships between constructs and between consequent measures. The relationships between constructs should be reflected in the relationships between measures or observations. Multitrait-Multimethod ultimethod Matrix (MTMM) Multitrait-M
Defined by Campbell and Fiske, this demonstrates construct validity by using multiple methods (eg. survey, observation, test) to measure the same set of 'traits' and showing correlations in a matrix, where blocks and diagonals have special meaning.
Content validity Content validity occurs when the experiment provides adequate coverage of the subject being studied. This includes measuring the right things as well as having an adequate sample. Samples should be both large enough and be taken for appropriate target groups. The perfect question gives a complete measure of all aspects of what is being investigated. However in practice this is seldom likely, for example a simple addition does not test the whole of mathematical ability.
Content validity is related very closely to good experimental design. A high content validity question covers more of what is sought. A trick with all questions is to ensure that all of the target content is covered (preferably uniformly).
Internal validity Internal validity occurs when it can be concluded that there is a causal relationship between the variables being studied. A danger is that changes might be caused by other factors. It is related to the design of the experiment, such as in the use of random assignment of treatments.
Conclusion validity Conclusion validity occurs when you can conclude that there is a relationship of some kind between the two variables being examined. This may be positive or negative correlation.
External validity External validity occurs when the causal relationship discovered can be generalized generalized to other people, times and contexts. Correct sampling will allow generalization and hence give external validity.
Criterion-related validity This examines the ability of the measure to predict a variable that is designated as a criterion. A criterion may well be an externally-defined 'gold standard'. Achieving this level of validity thus makes results more credible. Criterion-related validity is related to external validity. Predictive validity
This measures the extent to which a future level of a variable can be predicted from a current measurement. This includes correlation with measurements made with different instruments. For example, a political poll intends to measure future voting intent. College entry tests should have a high predictive validity with regard to final exam results. Concurrent validity
This measures the relationship between measures made with existing tests. The existing tests is thus the criterion.
For example a measure of creativity should correlate with existing measures of creativity.
Face validity Face validity occurs where something appears to be valid. This of course depends very much on the judgment of the observer. In any case, it is never sufficient and requires more solid validity to enable acceptable conclusions to be drawn. Measures often start out with face validity as the researcher selects those which seem likely prove the point.
Threats Validity as concluded is not always accepted by others and perhaps rightly so. Typical reasons why it may not be accepted include: • • • •
Inappropriate selection of constructs or measures. Inappropriate measures. Insufficient data collected to make valid conclusions. Measurement done in too few contexts. Measurement done with too few measurement variables.
• • • • • •
Too great a selection variation of in data (can't see the wood for the trees). Inadequate target subjects. Complex interaction across constructs. Subjects giving biased answers or trying to guess what they should say. Experimental method not valid. Operation of experiment not rigorous.