Psychological Assessment 2

Cards (42)

  • Test
    A tool to measure a particular construct
  • Test development
    1. Test Conceptualization
    2. Test Construction
    3. Test tryout
    4. Item Analysis
    5. Test Revision
  • Test Conceptualization
    • The stage at which test developers conceive the idea of a tool to measure a particular construct
    • The stimulus for developing a test can be anything (e.g. emergence of a social phenomenon)
  • Pilot work
    • Preliminary research surrounding the creation of a prototype of the test
    • Involves the creation, revision, and deletion of test items
  • Test Construction
    1. Scaling
    2. Writing Items
    3. Item Formats
    4. Scoring Items
  • Scaling
    The process of setting rules for assigning numbers in measurement
  • Types of scales
    • Age scale
    • Grade scale
    • Stanine scale
  • Likert scale
    Used to scale attitudes; presents the test taker with five alternative responses on an agree/disagree or approve/disapprove continuum
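A minimal sketch of how Likert responses are commonly turned into numbers, assuming the usual 1-5 coding of the agree/disagree continuum; the response labels, coding, and data are illustrative and not taken from the cards.
```python
# Illustrative only: assumed 1-5 coding of a five-point agree/disagree Likert item.
LIKERT_CODES = {
    "Strongly disagree": 1,
    "Disagree": 2,
    "Neither agree nor disagree": 3,
    "Agree": 4,
    "Strongly agree": 5,
}

def likert_scale_score(responses):
    """Sum the coded responses to the items of one attitude scale."""
    return sum(LIKERT_CODES[r] for r in responses)

# Example: one test taker's answers to a three-item attitude scale (invented data).
print(likert_scale_score(["Agree", "Strongly agree", "Neither agree nor disagree"]))  # 12
```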
  • Item writing
    Considerations: range of content to cover, item formats to employ, number of items to write
  • For a standardized test, the first draft usually contains approximately twice the number of items that the final version will contain
  • Item formats
    • Selected response (multiple choice, matching, true/false)
    • Constructed response (completion, short answer, essay)
  • Scoring models
    • Cumulative model (higher score = higher ability)
    • Class model (placement in a particular class/category)
    • Ipsative scoring (comparison of a test taker's scores on different scales)
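A rough sketch of the cumulative and ipsative models named above; the item responses, scale names, and numbers are invented for illustration.
```python
# Cumulative model: credit each keyed response and add them up;
# a higher total is taken to reflect more of the measured ability or trait.
answers = [1, 0, 1, 1, 0, 1]        # 1 = keyed response, 0 = not (invented data)
cumulative_score = sum(answers)     # 4

# Ipsative scoring: compare one test taker's scores on different scales
# with each other rather than with other people (scale names are hypothetical).
profile = {"dominance": 14, "affiliation": 9, "autonomy": 11}
ranked_scales = sorted(profile, key=profile.get, reverse=True)

print(cumulative_score)   # 4
print(ranked_scales)      # ['dominance', 'autonomy', 'affiliation']
```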
  • Test Tryout
    1. The test is tried out on a sample representative of the population for which it was constructed
    2. Conditions should be as similar as possible to standardized test administration
  • Characteristics of a good test item
    • Valid and reliable
    • Discriminates among test takers (high scorers tend to get the item right; low scorers tend to get it wrong)
  • Item Analysis
    1. Employs statistical procedures to select the best items from a pool of tryout items
    2. Considers the item-difficulty, item-validity, item-reliability, and item-discrimination indexes (see the sketch after these cards)
  • Item difficulty index
    Proportion of total test takers who answered the item correctly
  • Item-reliability index
    Indication of the internal consistency of a test
  • Item-validity index
    Indication of the degree to which a test is measuring what it purports to measure
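A minimal sketch of two of the item-analysis statistics on these cards, using invented tryout data: the item-difficulty index as the proportion answering correctly, and a simple extreme-groups item-discrimination index (proportion correct among high scorers minus proportion correct among low scorers). The item-reliability and item-validity indexes, which additionally involve item-total and item-criterion correlations, are not computed here.
```python
# Invented tryout data: rows = test takers, columns = items (1 = correct, 0 = incorrect).
responses = [
    [1, 1, 0, 1],
    [1, 0, 0, 1],
    [1, 1, 1, 1],
    [0, 0, 0, 1],
    [1, 1, 0, 0],
    [0, 0, 0, 1],
]

def item_difficulty(item):
    """Item-difficulty index p: proportion of test takers answering the item correctly."""
    return sum(row[item] for row in responses) / len(responses)

def item_discrimination(item, fraction=0.5):
    """Simple discrimination index d: p in the upper-scoring group minus p in the
    lower-scoring group, with groups formed from total scores (extreme-groups approach)."""
    ranked = sorted(responses, key=sum, reverse=True)
    n = max(1, int(len(ranked) * fraction))
    upper, lower = ranked[:n], ranked[-n:]
    p_upper = sum(row[item] for row in upper) / n
    p_lower = sum(row[item] for row in lower) / n
    return p_upper - p_lower

for i in range(len(responses[0])):
    print(f"item {i + 1}: p = {item_difficulty(i):.2f}, d = {item_discrimination(i):+.2f}")
```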
  • Test Revision
    1. Eliminate and rewrite items based on item analysis
    2. Balance strengths and weaknesses across items
    3. Administer revised test under standardized conditions
  • The process of developing a test occurs in five stages: Test Conceptualization, Test Construction, Test tryout, Item Analysis, and Test Revision
  • Test Revision
    • Information gathered at the item-analysis stage
    • Some items eliminated, others rewritten
    • Characterize each item's strengths and weaknesses
    • Balance strengths and weaknesses across items
    • Administer the revised test under standardized conditions
    • Consider the test in finished form based on the item analysis
  • If many otherwise good items tend to be somewhat easy, the test developer may purposefully include some more difficult items
  • Having balanced all of these concerns, the test developer comes out of the revision stage with a test of improved quality
  • Forms of Reliability
    • Test-Retest Reliability
    • Parallel Forms Reliability
    • Inter-rater Reliability
    • Split-Half Reliability
  • Reliability
    Consistency of scores obtained by the same person when re-examined with the same test on different occasions, with different sets of equivalent items, or under other variable examining conditions
  • Test-Retest Reliability
    • Comparing scores obtained from two successive measurements of the same individuals and calculating a correlation between the two sets of scores
    • Measures the error associated with administering a test at two different times
    • Only applicable to stable traits
  • Parallel Forms Reliability
    • At least two different versions of the test yield almost the same scores
    • Compares two equivalent forms of a test that measure the same attribute
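Both test-retest and parallel-forms reliability reduce to correlating two sets of scores from the same people; a minimal sketch with invented scores (the second list could equally be scores on an alternate form rather than a retest).
```python
from statistics import correlation  # Pearson r; requires Python 3.10+

# Invented scores for the same five people tested on two occasions
# (or, for parallel-forms reliability, on form A and form B).
scores_time1 = [12, 18, 25, 9, 21]
scores_time2 = [14, 17, 24, 11, 20]

reliability = correlation(scores_time1, scores_time2)
print(f"test-retest reliability r = {reliability:.2f}")
```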
  • Inter-rater Reliability
    Degree of agreement between two observers who simultaneously record measurements of the same behavior
  • Split-Half Reliability
    Obtained by splitting the items on a questionnaire or test in half, computing a separate score for each half, and then calculating the degree of consistency between the two scores for a group of participants
  • The test can be divided according to odd- and even-numbered items (the odd-even system)
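A minimal sketch of the odd-even split described above, assuming dichotomously scored items and invented data. The Spearman-Brown step-up at the end is standard practice for estimating full-length reliability from the half-test correlation, although these cards do not mention it.
```python
from statistics import correlation  # Pearson r; requires Python 3.10+

# Invented item scores: each row is one participant's responses to a 6-item test (1/0).
data = [
    [1, 0, 1, 1, 0, 1],
    [1, 1, 1, 0, 1, 1],
    [0, 0, 1, 0, 0, 1],
    [1, 1, 0, 1, 1, 1],
    [0, 1, 0, 0, 1, 0],
]

# Odd-even system: items 1, 3, 5 form one half; items 2, 4, 6 form the other.
odd_half = [sum(row[0::2]) for row in data]
even_half = [sum(row[1::2]) for row in data]

r_half = correlation(odd_half, even_half)

# Spearman-Brown formula: estimated reliability of the full-length test.
r_full = (2 * r_half) / (1 + r_half)
print(f"half-test r = {r_half:.2f}, Spearman-Brown corrected r = {r_full:.2f}")
```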
  • Validity
    Degree to which the measurement procedure measures the variable that it claims to measure; reflects the strength and usefulness of the measurement
  • Forms of Validity
    • Face Validity
    • Content Validity
    • Criterion Validity
    • Construct Validity
  • Face Validity
    Simplest and least scientific form of validity; demonstrated when a measurement appears, at face value, to measure what it is supposed to measure
  • Content Validity
    • Concerned with the extent to which the test is representative of a defined body of content consisting of topics and processes
    • Not done by statistical analysis but by the inspection of items by a panel of experts
  • Criterion Validity
    Involves the relationship or correlation between test scores and scores on another measure of the same variable (the criterion)
  • Predictive Validity
    Demonstrated when scores obtained from a measure accurately predict behavior (criterion) according to a theory
  • Concurrent Validity
    Established when the scores of a measure (predictor) are correlated with the scores of a different measure (criterion) taken at the same time
  • Construct Validity
    • Requires that the scores obtained from a measurement procedure behave exactly the same as the variable/construct itself
    • Based on many research studies that use the same measurement procedure and grows gradually as each new study contributes more evidence
  • Convergent Validity
    Involves comparing two different methods of measuring the same construct; demonstrated by a strong relationship between the scores obtained from the two methods
  • Divergent Validity
    • Refers to the demonstration of the uniqueness of a test
    • Demonstrated effectively when the test has a low correlation with measures of unrelated constructs
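A minimal sketch of the convergent/divergent logic above: a hypothetical new anxiety scale should correlate strongly with an established measure of the same construct (convergent) and weakly with a measure of an unrelated construct such as vocabulary (divergent). All measure names and scores are invented.
```python
from statistics import correlation  # Pearson r; requires Python 3.10+

# Invented scores for the same eight people on three measures.
new_anxiety_scale = [10, 14, 8, 20, 15, 7, 18, 12]
established_anxiety = [11, 15, 9, 19, 14, 8, 17, 13]   # same construct
vocabulary_test = [31, 25, 29, 27, 33, 26, 28, 30]     # unrelated construct

print(f"convergent r = {correlation(new_anxiety_scale, established_anxiety):.2f}")  # should be high
print(f"divergent  r = {correlation(new_anxiety_scale, vocabulary_test):.2f}")      # should be near zero
```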