Page Title:
Keywords:
Description:
Skip to main content
NAEP Analysis and Scaling → Initial Activities → Constructed-Response Interrater Reliability → Range of Response Codes, Percent Agreement, and Cohen's Kappa or Intraclass Correlation for the Constructed-Response Items Used in Scaling, Age 13 Mathematics Long-Term Trend Assessment, by Item and Block: 2012
NAEP Technical DocumentationScore range and percent agreement for the constructed-response items used in scaling, age 13 mathematics long-term trend assessment, by item and block: 2012
Block Item Range of response codes Sample size Percent exact agreement
NOTE: Special codes assigned to student responses including blank, off-task, and not-scorable were included in the calculation of the percent exact agreement measure.
SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center for Education Statistics, National Assessment of Educational Progress (NAEP), 2012 Mathematics Long-Term Trend Assessment.
M1 N329201 1-2 1,100 99
N331501 1-2 1,100 99
N333401 1-2 1,100 99
N334201 1-2 1,100 100
N333701 1-2 1,100 100
M2 N301801 1-2 1,000 100
N302401 1-2 1,000 99
N302601 1-2 1,000 100
N302901 1-2 1,000 100
N303201 1-2 1,000 100
M3 N324301 1-2 1,000 100
N322401 1-2 1,000 100
N325001 1-2 1,000 100
N325301 1-2 1,000 99
N325601 1-2 1,000 100
N325701 1-2 1,000 99
N326001 1-2 1,000 99
N326401 1-2 1,000 100
M4 N316601 1-2 1,100 100
N316602 1-2 1,100 100
N316603 1-2 1,100 100
N316701 1-2 1,100 100
N317101 1-2 1,100 100
N317102 1-2 1,100 99
N317103 1-2 1,100 100
N318801 1-2 1,100 99
M5 N296401 1-2 1,000 100
N288101 1-2 1,000 100
N296701 1-2 1,000 99
N296901 1-2 1,000 100
N297001 1-2 1,000 100
M6 N298201 1-2 1,000 100
N289901 1-2 1,000 100
N290101 1-2 1,000 100
N298601 1-2 1,000 99
N299001 1-2 1,000 100

Score range and Cohen's Kappa or intraclass correlation for the constructed-response items used in scaling, age 13 mathematics long-term trend assessment, by item and block: 2012
Block Item Range of response codes Sample size Cohen’s Kappa Intraclass correlation
† Not applicable. The intraclass correlation is not reported for dichotomously scored items; Cohen's Kappa is not reported for polytomously scored items.
NOTE: Cohen's Kappa is a measure of reliability that is appropriate for items that are dichotomously scored. The intraclass correlation coefficient is most appropriate for items with more than two categories. The discrepancy in sample sizes compared to the first table is due to the fact that percent agreement is calculated from the entire sample size, while Cohen's Kappa and intraclass correlation statistics exclude those who omit the item.
SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center for Education Statistics, National Assessment of Educational Progress (NAEP), 2012 Mathematics Long-Term Trend Assessment.
M1 N334201 1-2 1,000 0.98
N333701 1-2 1,000 0.99
N329201 1-2 1,000 0.98
N331501 1-2 1,000 0.99
N333401 1-2 1,000 0.99
M2 N301801 1-2 1,000 0.96
N302401 1-2 1,000 0.99
N302601 1-2 900 0.99
N302901 1-2 1,000 0.99
N303201 1-2 1,000 1.00
M3 N324301 1-2 1,000 0.99
N322401 1-2 1,000 0.99
N325001 1-2 1,000 0.99
N325301 1-2 1,000 0.98
N325601 1-2 1,000 0.99
N325701 1-2 900 0.99
N326001 1-2 900 0.99
N326401 1-2 800 0.99
M4 N316601 1-2 1,100 0.99
N316602 1-2 1,100 0.98
N316603 1-2 1,100 0.98
N316701 1-2 1,100 1.00
N317101 1-2 1,100 1.00
N317102 1-2 1,000 0.99
N317103 1-2 1,000 0.99
N318801 1-2 700 0.98
M5 N296401 1-2 1,000 1.00
N288101 1-2 1,000 0.99
N296701 1-2 1,000 0.98
N296901 1-2 1,000 1.00
N297001 1-2 1,000 0.99
M6 N298201 1-2 1,000 1.00
N289901 1-2 1,000 1.00
N290101 1-2 1,000 1.00
N298601 1-2 1,000 0.98
N299001 1-2 1,000 0.99

Last updated 13 November 2013 (JL)