High-stakes testing in the context of "Test (student assessment)"

Play Trivia Questions online!

or

Skip to study material about High-stakes testing in the context of "Test (student assessment)"

Ad spacer

⭐ Core Definition: High-stakes testing

A high-stakes test is a test with important consequences for the test taker. Passing has important benefits, such as a high school diploma, a scholarship, or a license to practice a profession. Failing has important disadvantages, such as being forced to take remedial classes until the test can be passed, not being allowed to drive a car, or difficulty finding employment.

The use and misuse of high-stakes tests is a controversial topic in public education, especially in the United States and U.K., where they have become especially popular in recent years, used not only to assess school-age students but in attempts to increase teacher accountability.

↓ Menu

>>>PUT SHARE BUTTONS HERE<<<
In this Dossier

High-stakes testing in the context of Item response theory

In psychometrics, item response theory (IRT, also known as latent trait theory, strong true score theory, or modern mental test theory) is a paradigm for the design, analysis, and scoring of tests, questionnaires, and similar instruments measuring abilities, attitudes, or other variables. It is a theory of testing based on the relationship between individuals' performances on a test item and the test takers' levels of performance on an overall measure of the ability that item was designed to measure. Several different statistical models are used to represent both item and test taker characteristics. Unlike simpler alternatives for creating scales and evaluating questionnaire responses, it does not assume that each item is equally difficult. This distinguishes IRT from, for instance, Likert scaling, in which "All items are assumed to be replications of each other or in other words items are considered to be parallel instruments". By contrast, item response theory treats the difficulty of each item (the item characteristic curves, or ICCs) as information to be incorporated in scaling items.

It is based on the application of related mathematical models to testing data. Because it is often regarded as superior to classical test theory, it is the preferred method for developing scales in the United States, especially when optimal decisions are demanded, as in so-called high-stakes tests, e.g., the Graduate Record Examination (GRE) and Graduate Management Admission Test (GMAT).

↑ Return to Menu