Skip to main content

Part of the book series: Evaluation in Education and Human Services Series ((EEHS,volume 28))

Abstract

During the past 30 years or so, a new theoretical basis for educational and psychological testing and measurement has emerged. It has been variously referred to as latent trait theory, item characteristic curve theory, and, more recently, item response theory (IRT). Although this new test theory holds considerable promise as a successor to classical test theory, it has been underutilized by test practitioners. One important reason for this underutilization is that many test developers have not had sufficient time to devote to the study of the technical and mathematical intricacies involved in this new test theory and its mathematical models. This chapter is intended as an overview of IRT for individuals with some background in the basic methods of classical test theory. Readers are referred to Hambleton (1989) and Hambleton and Swaminathan (1985) for other overviews of IRT.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Angoff, W. H. (1982). Summary and derivation of equating methods used at ETS. In P. W. Holland & D. R. Rubin (eds.), Testing equating. New York: Academic Press, pp. 55–69.

    Google Scholar 

  • Assessment Systems Corporation. (1989). User’s manual for the MicroCAT Testing System, (3rd ed.) St. Paul, MN: Author.

    Google Scholar 

  • Birnbaum, A. (1968). Test scores, sufficient statistics, and the information structures of tests. In F. M. Lord & M. R. Novick, Statistical theories of mental test scores. Reading, MA: Addison-Wesley.

    Google Scholar 

  • Bock, R. D. (1972). Estimating item parameters and latent ability when responses are scored in two or more nominal categories. Psychometrika 37:29–51.

    Article  Google Scholar 

  • Fisher, R. A. (1922). On the mathematical foundations of theoretical statistics. Philosophical Transactions of the Royal Society of London (A) 222:309–368.

    Article  Google Scholar 

  • Guttman, L. (1944). A basis for scaling qualitative data. American Sociological Review 9:139–150.

    Article  Google Scholar 

  • Hambleton, R. K. (1989). Principles and selected applications of item response theory. In R. Linn (ed.), Educational measurement, 3rd ed. New York: Macmillan, pp. 147–200.

    Google Scholar 

  • Hambleton, R.K., & Swaminathan, H. (1985). Item response theory. Principles and applications. Norwell, MA: Kluwer Academic Publishers.

    Book  Google Scholar 

  • Hambleton, R. K., & Traub, R.E. (1973). Analysis of empirical data using two logistic latent trait models. British Journal of Mathematical and Statistical Psychology 26:195–211.

    Article  Google Scholar 

  • Harris, D. (1989). Comparison of 1-, 2-, and 3-parameter IRT models. Educational Measurement: Issues and Practice 8:35–44.

    Article  Google Scholar 

  • Harwell, M.R., Baker, F. B., & Zwarts, M. (1988). Item parameter estimation via marginal maximum likelihood and an EM algorithm: A didactic. Journal of Educational Statistics 13:243–271.

    Article  Google Scholar 

  • Lazarsfeid, P. F. (1950). The logical and mathematical foundation of latent structure analysis. In S. A. Stouffer, L. Guttman, E. A. Suchman, P. F. Lazarsfeid, S.A. Star, & J.A. Clausen (eds.), Measurement and prediction: Studies in social psychology in World War II, Vol. IV. New York: Wiley, 1966. (Originally published by Princeton University Press, 1950.)

    Google Scholar 

  • Lord, F. M. (1952). A theory of test scores (Psychometric Monograph, No. 7). Psychometric Society.

    Google Scholar 

  • Lord, F. M. (1980). Applications of item response theory to practical testing problems. Hillsdale, NJ: Lawrence Erlbaum Associates.

    Google Scholar 

  • Lord, F. M., & Novick, M. R. (1968). Statistical theories of mental test scores. Reading, MA: Addison-Wesley.

    Google Scholar 

  • McBride, J. R. (1977). A brief overview of adaptive testing. In D. J. Weiss (ed.), Applications of computerized adaptive testing (Research Report 77–1). University of Minnesota, Department of Psychology, Psychometric Methods Program.

    Google Scholar 

  • McDonald, R. P. (1985). Factor analysis and related methods. Hillsdale, NJ: Lawrence Erlbaum Associates.

    Google Scholar 

  • Mislevy, R. J., & Bock, R. D. (1982). BILOG: Maximum likelihood item analysis and test scoring with logistic models for binary items. Chicago: International Educational Services.

    Google Scholar 

  • Mislevy, R.J., & Stocking, M. L. (1989). A consumer’s guide to LOGIST and BILOG. Applied Psychological Measurement 13:57–75.

    Article  Google Scholar 

  • Mosier, C. I. (1940). Psychophysics and mental test theory: Fundamental postulates and elementary theorems. Psychological Review 47:355–366.

    Article  Google Scholar 

  • Rasch, G. (1960/1980). Probabilistic models for some intelligence and attainment tests. Chicago: University of Chicago Press. (Originally published by The Danish Institute for Educational Research, Copenhagen, 1960.)

    Google Scholar 

  • Reckase, M. D. (1983, April). The definition of difficulty and discrimination for multidimensional item response theory models. Paper presented at the meeting of the American Educational Research Association, Montreal, Canada

    Google Scholar 

  • Reckase, M. D. (1985). The difficulty of test items that measure more than one dimension. Applied Psychological Measurement 9:401–422.

    Article  Google Scholar 

  • Samejima, F. (1969). Estimation of latent ability using a response pattern of graded scores (Psychometric Monograph No. 17). Psychometric Society.

    Google Scholar 

  • Samejima, F. (1972). A general model for free-response data (Psychometric Monograph No. 18). Psychometric Society.

    Google Scholar 

  • Samejima, F. (1973). Homogeneous case of the continuous response model. Psychometrika 38:203–219.

    Article  Google Scholar 

  • Swaminathan, H. (1983). Parameter estimation in item response models. In R.K. Hambleton (ed.), Applications of item response theory. Vancouver, BC: Educational Research Institute of British Columbia, pp. 24–44.

    Google Scholar 

  • Vale, C D. (1986). Linking item parameters onto a common scale. Applied Psychological Measurement 10:333–344.

    Article  Google Scholar 

  • van der Linden, W.J. (1986). The changing conception of measurement in education and psychology. Applied Psychological Measurement 10:325–332.

    Article  Google Scholar 

  • van der Linden, W.J., & Boekkooi-Timminga, E. (1989). A maximin model for test design with practical constraints. Psychometrika 54:237–247.

    Article  Google Scholar 

  • Wainer, H. (Ed.). (1990). Computerized adaptive testing: A primer. Hillsdale, NJ: Lawrence Erlbaum Associates.

    Google Scholar 

  • Weiss, D.J. (1985). Adaptive testing by computer. Journal of Consulting and Clinical Psychology 53:774–789.

    Article  Google Scholar 

  • Weiss, D.J., & Kingsbury, G. G. (1984). Application of computerized adaptive testing to educational problems. Journal of Educational Measurement 21:361–375.

    Article  Google Scholar 

  • Wingersky, M.S., Barton, M. A., & Lord, F.M. (1982). LOGIST User’s Guide. Princeton, NJ: Educational Testing Service.

    Google Scholar 

  • Wright, B.D., & Mead, R.J. (1976). BICAL: Calibrating rating scores with the Rasch model (Research Memorandum No. 23). Chicago: Chicago Statistical Laboratory, Department of Education, University of Chicago.

    Google Scholar 

Download references

Authors

Editor information

Ronald K. Hambleton Jac N. Zaal

Rights and permissions

Reprints and permissions

Copyright information

© 1991 Springer Science+Business Media New York

About this chapter

Cite this chapter

Weiss, D.J., Yoes, M.E. (1991). Item Response Theory. In: Hambleton, R.K., Zaal, J.N. (eds) Advances in Educational and Psychological Testing: Theory and Applications. Evaluation in Education and Human Services Series, vol 28. Springer, Dordrecht. https://doi.org/10.1007/978-94-009-2195-5_3

Download citation

  • DOI: https://doi.org/10.1007/978-94-009-2195-5_3

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-94-010-7484-1

  • Online ISBN: 978-94-009-2195-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics