Reliability and validity

    Cards (35)

    • What is reliability?
      The consistency of a measure
    • What is external reliability?
      Test-retest method: measure = consistent across time
      Inter-rater reliability: measure = consistent across different individuals using it
    • What happens to the conclusions of a study if the study lacked reliability?
      They can't be trusted -> lack of consistency = lack of objectivity = lack of validity
    • How can external reliability be assessed?
      Test-retest method
      Inter-rater reliability
    • What is the test-retest method?
      Study conducted using measure -> researchers wait appropriate amount of time
      Study repeated with same ppts + same measure -> researchers now have 2 sets of results
      Sets of results run through statistical test measuring correlation -> must have strong positive correlation of 0.8+
    • What is inter-rater reliability?
      Researchers get more than 1 person to independently observe + classify same behaviours of same ppts at same time
      The 2 sets of results are run through a statistical test measuring correlation
      Degree of inter-rater reliability found by comparing calculated value with correlation of critical value to determine likelihood of correlation being by chance
      Must have a strong positive correlation of 0.8+
    • How can external reliability be improved?
      Operationalisation of variables
      Training to ensure inter-rater reliability
      Careful transposition of data
      Use of standardised instructions
    • What is operationalisation of variables?
      Making the method of measuring co-variables/DV objective
      Includes clear behavioural categories when conducting structured observations
    • Why does operationalisation of variables improve the reliability of variables?
      More objective measure = more likely to be interpreted in the same way by researchers + ppts
      Same items/behaviours counted no matter who uses the measure
      = consistent data
    • What is training to ensure inter-rater reliability?
      Providing education to people who will be counting behavioural categories in structured observations
      Observers taught exactly what is meant by each category + given examples so they can recognise them
      Every observer should have same training to ensure they all classify behaviours in same way
    • How does training to ensure inter-rater reliability improve reliability?
      Each observer making judgements on same basis = results more likely to be objective
      More objective judgements = similar classifications when faced with same stimulus
    • What is careful transposition of data?
      Being meticulous when copying over raw results from data collection sheets for analysis = no numbers changed during process
    • How does careful transposition of data improve reliability?
      Ensures that data being compared is actually the data that was collected
      If they aren't, then the data's consistency can't be accurately assessed
    • What are standardised instructions?
      Concise + unchanging set of instructions for ppts to follow during study
      Standardised = no variation between ppts -> all ppts given exact same instructions
    • How do standardised instructions improve reliability?
      Varying instructions between ppts = they could respond differently to measure used in study = behaviour may vary
      Results would not remain consistent across ppts/same ppt at different times
    • What is internal reliability?
      Measure is consistent within itself (all items/questions/etc measure same concept in same way)
    • What happens to the conclusions of a study if the study lacked internal reliability?
      They can't be trusted -> lack of consistency = lack of objectivity = lack of validity
    • What is validity?
      Accurately measuring what it is intended to measure
    • What are the the different types of internal validity?
      Face validity
      Concurrent validity
    • What is face validity?
      Measure subjectively appears to accurately measure what it means to -> no obvious flaws
    • What is concurrent validity?
      Correspondence to a previously established/accepted measure of same thing
    • What happens to the conclusions of a study if the study lacks internal validity?
      They can't be trusted -> inaccurate measure = can't tell us anything about variable(s) being studied
    • How can internal validity be assessed?
      Face validity
      Concurrent validity
    • How is face validity assessed?
      Researcher examines study closely -> no obvious problems = researcher can accept study's validity
      Very weak test of validity!
    • How is concurrent validity assessed?
      Researcher gets ppts to undertake the researchers' measure of behaviour then get previously established measure of same behaviour + same ppts to undertake second measure
      2 sets of results then put through statistical test to test correlation -> degree of concurrent validity determined with critical value (strong positive 0.8+) = validity of measure is high
    • How can internal validity be improved?

      Generally -> tightly controlling study
      Single blind technique -> ensure ppts unaware of study's aim
      Double blind technique -> ensure both ppts AND researchers unaware of study's aim
    • What are the different types of external validity?
      Ecological validity
      Temporal validity
      Population validity
    • What is ecological validity?
      The extent to which findings from a measure/study can be generalised to real-life settings
    • What is temporal validity?

      The extent to which findings from a measure/study can be generalised to periods/eras
    • What is population validity?
      The extent to which findings from a measure/study can be generalised to different individuals beyond the original sample
    • What happens to the conclusions of a study if the study lacks external validity?
      Can't generalise findings outside of specific circumstances in which they were found
      Results + conclusions only apply to that specific lab/time in history/group = conclusions drawn are very limited in explaining behaviour
    • How can ecological validity be assessed?
      Researchers should replicate study + put both sets of results through statistical test measuring correlation -> if critical value = strong positive 0.8+, validity of measure is high
    • How can temporal validity be assessed?
      Researchers should replicate study + put both sets of results through statistical test measuring correlation -> if critical value = strong positive 0.8+, validity of measure is high
    • How can population validity be assessed?
      Researchers should replicate study + put both sets of results through statistical test measuring correlation -> if critical value = strong positive 0.8+, validity of measure is high
    • How can external validity be improved?
      Ecological validity -> researchers should measure behaviour as analogous to real-life experience + lab studies supported with field studies
      Temporal validity -> researchers should measure behaviour as not being context-bound
      Population validity -> researcher should make sample representative of target population using stratified sampling (sub-groups should be represented proportionately)