Item Response Theory

@inproceedings{Weiss1991ItemRT,
  title={Item Response Theory},
  author={David J. Weiss and Michael E. Yoes},
  year={1991}
}
During the past 30 years or so, a new theoretical basis for educational and psychological testing and measurement has emerged. It has been variously referred to as latent trait theory, item characteristic curve theory, and, more recently, item response theory (IRT). Although this new test theory holds considerable promise as a successor to classical test theory, it has been underutilized by test practitioners. One important reason for this underutilization is that many test developers have not… 
Item Response Theory
Item response theory (IRT) seeks to model the way in which latent psychological constructs manifest themselves in terms of observable item responses; this information is useful when developing,
Item Response Theory: An Introduction to Latent Trait Models to Test and Item Development
TLDR
A comprehensive overview of the IRT and its procedures as applied to test item development and analysis is provided and some suggestions for test developers and test specialists at all levels to adopt IRT for its identified crucial theoretical and empirical gains over CTT are concluded.
Assessing fit of item response theory models
ASSESSING FIT OF ITEM RESPONSE THEORY MODELS FEBRUARY 2006 YING LU, B.A., BEIJING FOREIGN STUDIES UNIVERSITY M.S., UNIVERSITY OF MASSACHUSETTS AMHERST Ed.D., UNIVERSITY OF MASSACHUSETTS AMHERST
Gaining a Better Understanding of General Mattering Scale: An Application of Classical Test Theory and Item Response Theory
The current study shows applications of both classical test theory (CTT) and item response theory (IRT) to psychology data. The study discusses item level analyses of General Mattering Scale produced
Modeling Item-Level Data With Item Response Theory
ficult questions. Ideally, the two camps should have much cross fertilization, and the lines between them should be blurred. However, complex theories are often con structed before appropriate meth
Measurement Theory in Language Testing: Past Traditions and Current Trends.
A good test is one that has at least three qualities: reliability, or the precision with which a test measures what it is supposed to measure; validity, i.e., if the test really measures what it is
Comparison of Validity and Reliability of Two Tests Developed by Classical Test Theory and Item Response Theory
In constructing the test process after preliminary test try outs two tests have been developed by Classical Test and Item Response Theories. The aim of the research is to compare psychometric
A primer on standardized testing: History, measurement, classical test theory, item response theory, and equating.
TLDR
The history of standardized testing, the frameworks of classical test theory and IRT, and the logic of scaling and equating are presented will aid readers in understanding these concepts of modern testing.
Multidimensional Item Response Theory for Factor Structure Assessment in Educational Psychology Research
This study demonstrates the use of multidimensional item response theory (MIRT) to investigate an instrument’s factor structure. For didactic purposes, MIRT was used to assess the factor structure of
An empirical comparison of item response theory and classical test theory item/person statistics
An Empirical Comparison of Item Response Theory and Classical Test Theory Item/Person Statistics. (August 2004) Troy Gerard Courville, B.S., Louisiana State UniversityShreveport; M.S., Texas A&M
...
...

References

SHOWING 1-10 OF 69 REFERENCES
Statistical Theories of Mental Test Scores.
This is a reprint of the orginal book released in 1968. Our primary goal in this book is to sharpen the skill, sophistication, and in- tuition of the reader in the interpretation of mental test data,
The changing conception of measurement in education and psychology
Since the era of Binet and Spearman, classical test theory and the ideal of the standard test have gone hand in hand, in part because both are based on the same paradigm of experimental control by
A Comparison of Two Procedures for Computing IRT Equating Coefficients
In order to equate tests under Item Response Theory (IRT), one must obtain the slope and intercept coefficients of the appropriate linear transformation. This article compares two methods for
The Difficulty of Test Items That Measure More Than One Ability
Many test items require more than one ability to obtain a correct response. This article proposes a mul tidimensional index of item difficulty that can be used with items of this type. The proposed
APPLICATION OF COMPUTERIZED ADAPTIVE TESTING TO EDUCATIONAL PROBLEMS
TLDR
Evidence from a series of studies comparing conventional and adaptive testing procedures is presented showing that the adaptive procedure results in more accurate mastery classifications than do conventional mastery tests, while using fewer test questions.
Fitting a response model forn dichotomously scored items
A method of estimating the parameters of the normal ogive model for dichotomously scored item-responses by maximum likelihood is demonstrated. Although the procedure requires numerical integration in
ANALYSIS OF EMPIRICAL DATA USING TWO LOGISTIC LATENT TRAIT MODELS
Although Birnbaum's logistic models have been known since 1957, there have been few applications to empirical data reported in the literature. In this study, the one- and two-parameter logistic
A basis for scaling qualitative data.
IN A GREAT deal of research in the social and psychological sciences, interest lies in certain large classes of qualitative observations. For example, research in marriage is concerned with a class
Methodology Review: Item Parameter Estimation Under the One-, Two-, and Three-Parameter Logistic Models
This paper surveys the techniques used in item re sponse theory to estimate the parameters of the item characteristic curves fitted to item response data. The major focus is on the joint maximum
Estimating item parameters and latent ability when responses are scored in two or more nominal categories
A multivariate logistic latent trait model for items scored in two or more nominal categories is proposed. Statistical methods based on the model provide 1) estimation of two item parameters for each
...
...