Validation is the process of gathering and evaluating evidence about validity.
It is the test developer's responsibility to supply validity evidence in the test manual.
Local validationstudies are absoulutely necessary when the test user plans to alter in some way the format, instructions, language or content of the test.
The three categories (trinitarian view) in validy are: content validity, criterion-related validity and construct validity.
Content validity is a measure of validity based on an evaluation of the subjects, topics, or content covered by the items in the test.
Criterion-related validity is a measure of validity obtained by evaluating the relationship of scores obtained on the test to scores on other tests or measures.
Construct validity is a measure of validity that is arrived at by executing a comprehensive analysis.
In this classic conception of validity, referred to as the trinitarian view, it might be useful to visualize construct validity as being “umbrella validity” because every other variety of validity falls under it.
Ecological validity refers to a judgment regarding how well a test measures what it purports to measure at the time and place that the variable being measured (typically a behavior, cognition, or emotion) is actually emitted
Face validity relates more to what a test appears to measure to the person being tested than to what the test actually measures.
Face validity is a judgment concerning how relevant the test items appear to be.
Judgments about face validity are frequently thought of from the perspective of the testtaker not the test user.
Content validity describes a judgment of how adequately a test samples behavior representative of the universe of behavior that the test was designed to sample.
From the pooled information, there emerges a test blueprint for the structure of evaluation. Including types of information to be covered by the items, number of items and organization of the items.
Criterion-related validity is a judgment of how adequately a test score can be used to infer an individual’s most probable standing on some measure of interest the measure of interest being the criterion.
Two types of criterion-related validity are concurrent validity and predictive validity.
Concurrent validity is an index of the degree to which a test score is related to some criterion measure obtained at the same time.
Predictive validity is an index of the degree to which a test score predicts some criterion measure.
Criterion is a standard against which a test or a test score is evaluated.
Characteristics of a adequate criterion
relevant
valid
uncontaminated
Criterion contamination is the term applied to a criterion measure that has been based, at least in part, on predictor measures.
Concurrent validity is if test scores are obtained at about the same time as the criterion measures are obtained.
Concurrent validity indicate the extent to which test scores may be used to estimate an individual’s present standing on a criterion.
Measures of the relationship between the test scores and a criterion measure obtained at a future time provide an indication of the predictive validity of the test.
Base rate is the extent to which a particular trait, behavior, characteristic, or attribute exists in the population.
Hit rate may be defined as the proportion of people a test accurately identifies as possessing or exhibiting a particular trait, behavior, characteristic, or attribute
Miss rate may be defined as the proportion of people the test fails to identify as having, or not having, a particular characteristic or attribute.
False positive is a miss wherein the test predicted that the testtaker did possess the particular characteristic or attribute being measured when in fact the testtaker did not.
False negative is a miss wherein the test predicted that the testtaker did not possess the particular characteristic or attribute being measured when the testtaker actually did.
Validity coefficient is a correlation coefficient that provides a measure of the relationship between test scores and scores on the criterion measure.
Incremental validity is the degree to which an additional predictor explains something about the criterion measure that is not explained by predictors already in use.
Construct validity is a judgment about the appropriateness of inferences drawn from test scores regarding individual standings on a variable called a construct.
A construct is an informed, scientific idea developed or hypothesized to
describe or explain behavior.
Constructs are unobservable, presupposed traits that a test developer may invoke to describe test behavior or criterion performance.
Homogeneity refers to how uniform a test is in measuring a single concept.
Test bias is a factor inherent in a test that systematically prevents accurate, impartial measurement.
Rating is a numerical or verbal judgment (or both) that places a person or an attribute along a continuum identified by a scale of numerical or word descriptors known as a rating scale.
Rating error is a judgment resulting from the intentional or unintentional misuse of a rating scale.