
Psychological Tests and Measurement: Ch. 1, 2, 4, 5, 6 Flashcards

Chapter 1: The World of Psychological Testing
Chapter 2: Sources of Information About Tests
Chapter 4: Reliability
Chapter 5: Validity
Chapter 6: Test Development and Item Analysis

Achievement Tests: Tests designed to measure knowledge or skills especially as developed through school or job experience.
Attenuation: Lessening or reduction; in testing it refers to reduction in correlation between two variables due to imperfect reliability and/or group homogeneity.
Base Rate: The rate at which some characteristic appears in a population.
Classical Test Theory: The traditional theory about the construction and reliability of tests, incorporating true score theory.
Coefficient Alpha: A measure of the internal consistency of items on a test.
Concurrent Validity: Test validity demonstrated by relationship between a test and some other criterion measured at approximately the same time.
Constructed-Response Items: A test item requiring the examinee to construct an answer rather than select an answer from given alternatives.
Construct Validity: A broad array of methods used to support the proposition that a test is measuring its target construct.
Content Validity: Test validity defined by the match between test content and some well-defined body of material such as a curriculum or set of job skills.
Convergent Validity: Validity evidence showing that performance on a test agrees with other measures of the target construct.
Contrasted Groups Validity: The criterion is group membership; generally the better the differentiation between groups, the more valid the test.
Correction for Attenuation: Correcting the validity coefficient for unreliability in either the test or the criterion or both.
Correlation Coefficient: The numerical expression, ranging from -1.00 to +1.00, of the relationship between two variables.
Construct Irrelevant Variance: Variance in test scores associated with variables other than those we want to measure.
Criterion-Related Validity: Demonstrating test validity by showing the relationship between test scores and some external criterion.
Cutoff Score: A score on a test or criterion indicating passing vs. failing or some other such division.
Dichotomous Response Format: Items scored as "correct-incorrect" or "yes-no" where possible scores for each item are 1 or 0.
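Differential Validity: A difference in how well a test predicts the criterion for two (or more) groups. A test is free of differential validity when it predicts equally well for each group, even if it does not predict the same performance on the criterion for each.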
Discriminant Validity: Validity evidence showing that performance on a test has a relatively low correlation with measures of constructs expected to have a low correlation with the trait of interest.
Domain Sampling: The process of choosing test items that are appropriate to the content domain of the test (also see content validity and domain-referenced tests).
Employment Tests: Content domain consists of the knowledge and skills required by a particular job.
Error Variance (Score): The difference between the true score and observed score, which may be positive or negative. (picture on pg 129)
Face Validity: The appearance that a test measures its intended target, especially unaccompanied by any empirical evidence.
Factor Analysis: A class of statistical methods for identifying dimensions underlying many scores or other indicators of performance.
False Positive: A case that passes the cut-score on a test intended to predict a criterion but does not pass the cut-score on the criterion.
False Negative: A case that passes a cut-score on the criterion but does not pass the cut-score on a test intended to predict the criterion.
Generalizability Theory: A method for studying reliability that allows for examining several sources of unreliable variance simultaneously.
Hit Rate: The proportion of cases that are hits, where a hit is a case that either passes the cut-scores on both criterion and test or fails on both.
Homoscedasticity: Equal degree of scatter at various points along a best-fitting line.
Incremental Validity: The increase in validity achieved by adding a new test or procedure to existing tests or procedures.
Internal Consistency Reliability: The degree to which the items on a test, for the most part, measure the same trait or characteristic, as indicated by the intercorrelations among the items; that is, the items are consistent with one another in what they measure.
Interval Scale: A scale that orders data points in equal intervals but lacks a true zero point.
Item Analysis: Statistical analysis of individual test items, especially to determine their difficulty level and discriminating power.
Kuder-Richardson Reliability Coefficients (KR-20): An internal consistency reliability formula for dichotomously scored items; it yields the average correlation among all possible split-halves for the test.
Likert Response Format: A format for attitude items in which an examinee expresses degree of agreement or disagreement with a statement.
Measurement Error: Divided into two components (systematic and random). Random: caused by any factors that randomly affect measurement of the variable across the sample. Systematic: caused by any factors that systematically affect measurement of the variable across the sample.
Multitrait-Multimethod Matrix: A technique for examining the relationships among several variables each measured in several different ways.
Nominal Scale: A primitive type of scale that simply places objects in separate categories, with no implication of quantitative differences.
Nomological Network: A representation of the constructs of interest in a study, their observable manifestations, and the interrelationships among and between these. In Cronbach and Meehl's view of construct validity, providing evidence that a measure has construct validity requires developing a nomological network for that measure.
Normal Score Distribution (Continuous Probability Distribution): A continuous probability distribution is a function that tells the probability of a number in some context falling between any two real numbers; the normal distribution is the symmetric, bell-shaped case.
Observational Techniques: A social research technique that involves the direct observation of phenomena in their natural setting.
Observed Score: A person's actual score on a test.
Ordinal Scale: A scale that places objects in order, without implying equal distances between points along the scale.
Phi Coefficient (Mean Square Contingency Coefficient): A measure of association for two binary variables introduced by Karl Pearson.
Predictive Validity: Validity demonstrated by showing the extent to which a test can predict performance on some external criterion when the test is administered well in advance.
Psychometric Theories: Theories of intelligence that depend heavily on the use of tests and examination of relationships among the tests.
r² (Coefficient of Determination): The proportion of variance in one variable that is accounted for by the other; it indicates how well data points fit a line or curve. It is used in statistical models whose main purpose is either the prediction of future outcomes or the testing of hypotheses on the basis of other related information.
Ratio Scale: A type of scale that classifies, then orders objects along a scale, with equal intervals and a true zero point.
Reliability: The consistency or dependability of test performance across occasions, scorers, and specific content.
Distractor: An incorrect or non-preferred option in a test item (an "incorrect" option may actually behave as a "correct" option).
Selected-Response Items: Test items in which the examinee selects a response from given alternatives.
Selectivity: The ability of a test to identify individuals with some characteristic.
Self-Report: A type of survey, questionnaire, or poll in which respondents read the question and select a response by themselves without researcher influence (any method which involves asking a participant about their feelings, attitudes, beliefs, etc.).
Sensitivity: The proportion of individuals who have some characteristic that the test correctly identifies as having it.
Shared Family Variance: In studies of heredity and environment, variance attributable to the fact that members of a family presumably have similar environments.
Spearman-Brown Formula: A formula allowing estimation of the effect on reliability of lengthening or shortening a test.
Specific Domain Measures: A test that focuses on just one or a few variables in the non-cognitive domain; contrasted with comprehensive inventories.
Specificity: The ability of a test to not select individuals who do not have some characteristic.
Split-Half Reliability: A measure of reliability based on splitting the test into two halves, then correlating performance on the two halves.
Standard Error of Measurement: An index of the degree of variability in test scores resulting from imperfect reliability.
Table of Specifications: A table that lays out the content a test is meant to measure; matching the content of the test against the table is used to determine the content validity of the test.
Test Bias: Showing that a test measures somewhat different constructs for different groups of examinees, especially for majority and minority groups.
Test-Retest Reliability (Rtt): Reliability determined by correlating performance on a test administered on two different occasions.
True Score: The score a person would theoretically get if all sources of unreliable variance were removed or cancelled out.
Validity: An indication of the extent to which a test measures what it is intended to measure.
Inter-Rater Reliability (Rrr): The consistency of scores when different raters score the same test performances; the rater is a source of error variance.
Intra-Rater Reliability (Rir): The consistency of the same rater's scoring of the same test performances from one time to another.
Cohen's Kappa (Observer Agreement): An index of agreement between observers, corrected for chance agreement; used with people trained to notice what constitutes a certain behavior.
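
Several of the terms above (Spearman-Brown Formula, Standard Error of Measurement, Correction for Attenuation, Coefficient Alpha, Sensitivity, Specificity, Hit Rate) boil down to short formulas. The Python sketch below is one way to work through them; it is a study aid under stated assumptions, not the textbook's own code. The function names, argument names, and the sample numbers are all made up for illustration, and the formulas are the standard classical test theory versions, which may be presented slightly differently in the course text.

```python
from math import sqrt
from statistics import pvariance


def spearman_brown(reliability, length_factor):
    """Estimated reliability after multiplying test length by length_factor."""
    return length_factor * reliability / (1 + (length_factor - 1) * reliability)


def standard_error_of_measurement(sd, reliability):
    """SEM = SD * sqrt(1 - reliability)."""
    return sd * sqrt(1 - reliability)


def correct_for_attenuation(r_xy, r_xx=1.0, r_yy=1.0):
    """Validity coefficient corrected for unreliability in the test (r_xx),
    the criterion (r_yy), or both; leave a reliability at 1.0 to skip it."""
    return r_xy / sqrt(r_xx * r_yy)


def coefficient_alpha(item_scores):
    """Coefficient alpha from a list of examinee rows, each a list of item scores.
    With 0/1 items this gives the same value as KR-20."""
    k = len(item_scores[0])
    item_variances = [pvariance([row[i] for row in item_scores]) for i in range(k)]
    total_variance = pvariance([sum(row) for row in item_scores])
    return (k / (k - 1)) * (1 - sum(item_variances) / total_variance)


def decision_rates(true_pos, false_pos, false_neg, true_neg):
    """Sensitivity, specificity, and hit rate from the four test-vs-criterion cells."""
    total = true_pos + false_pos + false_neg + true_neg
    return {
        "sensitivity": true_pos / (true_pos + false_neg),
        "specificity": true_neg / (true_neg + false_pos),
        "hit_rate": (true_pos + true_neg) / total,
    }


if __name__ == "__main__":
    # Hypothetical numbers, chosen only to exercise each formula.
    print(spearman_brown(0.60, 2))                  # doubling a test with r = .60
    print(standard_error_of_measurement(15, 0.91))  # SD = 15, reliability = .91
    print(correct_for_attenuation(0.40, r_xx=0.64)) # correct for the test only
    print(coefficient_alpha([[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]]))
    print(decision_rates(true_pos=40, false_pos=10, false_neg=20, true_neg=30))
```

Passing a `length_factor` below 1 to `spearman_brown` estimates the effect of shortening a test, matching the "lengthening or shortening" wording in the flashcard.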
