poor face validity (questions don't look like they measure what they're supposed to measure) - questions should be removed and/or rewritten so they relate more clearly and obviously to the topic being measured
low concurrent validity (scores don't correlate with those on a more established test of the same topic) - questions should be removed, revised and/or rewritten so the test can be checked for concurrent validity again
low internal/external validity - the methods, techniques and designs used in the study should be changed
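
The concurrent validity check mentioned above comes down to correlating scores on the new test with scores on the established one. A minimal sketch in Python, assuming made-up scores for six participants and a cut-off of +0.80 (a commonly quoted rule of thumb, not something stated in these notes); the function names are hypothetical:

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length score lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two score lists of the same length (at least 2)")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-products and sums of squared deviations from the means
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("scores must vary for a correlation to be defined")
    return cov / sqrt(var_x * var_y)


# Assumption: +0.80 is used as the cut-off; the notes give no figure.
THRESHOLD = 0.80


def has_concurrent_validity(new_scores, established_scores, threshold=THRESHOLD):
    """True if the new test correlates strongly enough with the established test."""
    return pearson_r(new_scores, established_scores) >= threshold


# Hypothetical data: each position is one participant's score on both tests.
new_test = [12, 15, 9, 20, 17, 11]
established = [14, 16, 10, 21, 15, 12]

r = pearson_r(new_test, established)
print(f"r = {r:.2f}, concurrent validity: {has_concurrent_validity(new_test, established)}")
```

If the correlation falls below the chosen cut-off, the questions would be removed, revised or rewritten and the comparison run again on fresh scores.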