Guessing and the Rasch Model

@article{Holster2016GuessingAT,
  title={Guessing and the Rasch Model},
  author={Trevor A. Holster and J. Lake},
  journal={Language Assessment Quarterly},
  year={2016},
  volume={13},
  pages={124--141}
}
ABSTRACT Stewart questioned Beglar’s use of Rasch analysis of the Vocabulary Size Test (VST) and advocated the use of 3-parameter logistic item response theory (3PLIRT) on the basis that it models a non-zero lower asymptote for items, often called a “guessing” parameter. In support of this theory, Stewart presented fit statistics derived from Rasch analysis of random numbers displaying good data-model fit. Because data-model fit is a key assumption of the Rasch model, Stewart argued that the… 
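The distinction the abstract draws between the Rasch model and 3PLIRT can be illustrated with a minimal sketch. This is not code from the article: the function names, the default discrimination of 1.0, and the lower asymptote of 0.25 (chosen here to represent a four-option multiple-choice item) are assumptions made for illustration only.

```python
import math


def rasch_probability(ability, difficulty):
    """Rasch (1PL) probability of a correct response.

    The curve approaches 0 as ability falls far below difficulty,
    so the model has no lower asymptote for guessing.
    """
    return 1.0 / (1.0 + math.exp(-(ability - difficulty)))


def three_pl_probability(ability, difficulty, discrimination=1.0, guessing=0.25):
    """3PL probability of a correct response.

    `guessing` is the non-zero lower asymptote (often written c).
    The default of 0.25 is an illustrative assumption, not a value
    taken from the article.
    """
    logistic = 1.0 / (1.0 + math.exp(-discrimination * (ability - difficulty)))
    return guessing + (1.0 - guessing) * logistic


if __name__ == "__main__":
    # Compare the two curves for an item of difficulty 0 logits.
    for ability in (-6.0, -2.0, 0.0, 2.0, 6.0):
        p_rasch = rasch_probability(ability, 0.0)
        p_3pl = three_pl_probability(ability, 0.0)
        print(f"ability={ability:+.1f}  Rasch={p_rasch:.3f}  3PL={p_3pl:.3f}")
```

Under these assumptions, the two curves differ most for low-ability examinees: the Rasch probability tends toward zero, while the 3PL probability levels off at the assumed guessing value.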
A Response to Holster and Lake Regarding Guessing and the Rasch Model
ABSTRACT Stewart questioned vocabulary size estimation methods proposed by Beglar and Nation for the Vocabulary Size Test, further arguing Rasch mean square (MSQ) fit statistics cannot determine the
Effects of Removing Responses With Likely Random Guessing Under Rasch Measurement on a Multiple-Choice Language Proficiency Test
  • C. Lin
  • Psychology
    Language Assessment Quarterly
  • 2018
TLDR
This study investigates the effects of removing responses with likely guessing on item difficulty estimates, person ability measures, and the test information function (i.e., a function of measurement precision for person ability) on an MC language proficiency test. It shows that measurement precision for high-performing examinees increases after accounting for likely random guessing, while measurement precision for low- and medium-performing examinees remains similar with and without likely guessing.
Getting Lucky: How Guessing Threatens the Validity of Performance Classifications
There is always a chance that examinees will answer multiple choice (MC) items correctly by guessing. Design choices in some modern exams have created situations where guessing at random through the
A comprehensive review of Rasch measurement in language assessment: Recommendations and guidelines for research
Over the past decades, the application of Rasch measurement in language assessment has gradually increased. In the present study, we coded 215 papers using Rasch measurement published in 21 applied
The Indonesian version of the Life Orientation Test-Revised (LOT-R): Psychometric properties based on the Rasch model
Abstract This study aimed to validate the Indonesian version of the Life Orientation Test-Revised (LOT-R), a scale used to measure optimism. Despite being adapted in over 20 countries, its validation
Cloze-elide as a classroom reading test
The cloze-elide procedure, in which test-takers must identify redundant words that have been added to a reading text, provides a process-oriented test of reading ability. This research investigated
Female Hedonistic Behavior Questionnaire (FHBQ): Psychometric Properties Based on the Rasch Model
This study aims to develop standard measurement tools from hedonistic behavior using the perspective of the Rasch measurement model. The evaluation of psychometric characteristics conducted in this
APPLICATION OF THE RASCH MODEL ON THE TEST INSTRUMENT TO ANALYZE THE PROBLEM-SOLVING ABILITY OF STUDENTS IN ACID-BASE SOLUTIONS MATERIALS
Abstract: In 21st century education, students are required to
Using rasch measurement to score, evaluate, and improve examinations in an anatomy course
TLDR
The purpose of this research is to analyze a common medical school anatomy examination using both the traditional CTT scoring method and a Rasch measurement scoring method to determine which technique provides more robust findings, and which set of psychometric indicators will be more meaningful and useful for anatomists looking to improve the psychometric quality and functioning of their examinations.
The Application of Rasch in the Validation of Corporate Citizenship Scale
This paper articulates the application of Rasch measurement in corporate citizenship research. With burgeoning expectation for greater corporate responsibility, studies have found that many companies

References

SHOWING 1-10 OF 28 REFERENCES
Do Multiple-Choice Options Inflate Estimates of Vocabulary Size on the VST?
TLDR
This article argues that the VST’s multiple-choice format results in a test of passive recognition of words that does not approximate the experience of readers of authentic English texts, and details drawbacks of the Rasch framework and mean-square fit statistics in detecting the overall contribution of guessing effects to raw test scores.
A Rasch-based validation of the Vocabulary Size Test
The primary purpose of this study was to provide preliminary validity evidence for a 140-item form of the Vocabulary Size Test, which is designed to measure written receptive knowledge of the first
Formula Scoring of Multiple‐Choice Tests (Correction for Guessing)
Formula scoring is a procedure designed to reduce multiple-choice test score irregularities due to guessing. Typically, a formula score is obtained by subtracting a proportion of the number of wrong
To Guess or Not to Guess: A Decision‐Theoretic View of Formula Scoring
Multiple-choice tests are often scored by formulas under which the respondent's expected score for an item is the same whether he or she omits it or guesses at random. Typically, these formulas are
Small n justifies the Rasch model
Comment on 3PL IRT Adjustment for Guessing
Guessing behavior is an issue discussed widely with regard to multiple choice tests. Its primary effect is on number-correct scores for examinees at lower levels of proficiency. This is a systematic
FORMULA SCORING AND NUMBER-RIGHT SCORING
In discussions of formula scoring, the following assertion is sometimes made (for example, see Diamond & Evans, 1973, p. 181): Formula scoring is based on the assumption that examinees either know
The creation and validation of a listening vocabulary levels test
An important gap in the field of second language vocabulary assessment concerns the lack of validated tests measuring aural vocabulary knowledge. The primary purpose of this study is to introduce and
OPTIMAL NUMBER OF CHOICES PER ITEM— A COMPARISON OF FOUR APPROACHES*
Typical multiple choice tests have four or five alternative choices per item. What is the optimal number? Here two approaches given in the literature are compared with two new approaches. From some
Multiple-Choice Models: The Distractors Are also Part of the Item.
This paper describes an item response model for multiple-choice items and illustrates its application in item analysis. The model provides parametric and graphical summaries of the performance of