Cronbach's alpha
Cronbach's alpha (Cronbach's ) or coefficient alpha (coefficient ), is a reliability coefficient and a measure of the internal consistency of tests and measures.[1][2] It was devised by the American psychometrician Lee Cronbach. Today it enjoys such wide-spread usage that numerous studies warn against using Cronbach's alpha uncritically.[3][4][5][6][7]
History
[edit]In his initial 1951 publication, Lee Cronbach described the coefficient as Coefficient alpha[1] and included an additional derivation.[8] Coefficient alpha had been used implicitly in previous studies,[9][10][11][12] but his interpretation was thought to be more intuitively attractive relative to previous studies and it became quite popular.[13]
- In 1967, Melvin Novick and Charles Lewis proved that it was equal to reliability if the true scores[i] of the compared tests or measures vary by a constant, which is independent of the people measured. In this case, the tests or measurements were said to be "essentially tau-equivalent."[14]
- In 1978, Cronbach asserted that the reason the initial 1951 publication was widely cited was "mostly because [he] put a brand name on a common-place coefficient."[2]: 263 He explained that he had originally planned to name other types of reliability coefficients, such as those used in inter-rater reliability and test-retest reliability, after consecutive Greek letters (i.e., , , etc.), but later changed his mind.
- Later, in 2004, Cronbach and Richard Shavelson encouraged readers to use generalizability theory rather than . Cronbach opposed the use of the name "Cronbach's alpha" and explicitly denied the existence of studies that had published the general formula of KR-20 before Cronbach's 1951 publication of the same name.[7]
Prerequisites for using Cronbach's alpha
[edit]To use Cronbach's alpha as an accurate estimate of reliability, the following conditions must be met:[15][16]
- The "parts" (i.e. items, test parts, etc.) must be essentially tau-equivalent;
- Errors in the measurements are independent.
However, under the definition of CTT, the errors are defined to be independent.
This is often a source of confusion for users who might consider some aspect of the testing process to be an "error" (rater biases, examinee collusion, self-report faking). Anything that increases the covariance among the parts will contribute to greater true score variance. Under such circumstances, alpha is likely to over-estimate the reliability intended by the user.[16][3]
Formula and calculation
[edit]Reliability can be defined as one minus the error score variance divided by the observed score variance:
Cronbach's alpha is best understand as a direct estimate of this definitional formula with error score variance estimated as the sum of the variances of each "part" (e.g., items or testlets):[6]
where:
- represents the number of "parts" (items, test parts, etc.) in the measure;
- the term causes alpha to be an unbiased estimate of reliability when the parts are parallel or essentially tau equivalent;
- the variance associated with each part i; and
- the observed score variance (the variance associated with the total test scores).
The reason that the sum of the individual part variances estimates the error score variance is because
and the variance of a composite is equal to twice the sum of all covariances of the parts plus the sum of the variances of the parts: . Therefore estimates and estimates . It is much easier to compute alpha by summing the part variances (to estimate error score variance) than adding up all the unique part covariances (to estimate true score variance) .
Alternatively, alpha can be calculated through the following formula:[17]
where:
- represents the average variance
- represents the average inter-item covariance (or the average covariance of "parts").
Common misconceptions
[edit]Application of Cronbach's alpha is not always straightforward and can give rise to common misconceptions.
A high value of Cronbach's alpha indicates homogeneity between the items
[edit]Many textbooks refer to as an indicator of homogeneity[18] between items. This misconception stems from the inaccurate explanation of Cronbach (1951)[8] that high values show homogeneity between the items. Homogeneity is a term that is rarely used in modern literature, and related studies interpret the term as referring to uni-dimensionality. Several studies have provided proofs or counterexamples that high values do not indicate uni-dimensionality.[19][20][21][22][23] See counterexamples below.
in the uni-dimensional data above.
in the multidimensional data above.
The above data have , but are multidimensional.
The above data have , but are uni-dimensional.
Uni-dimensionality is a prerequisite for . One should check uni-dimensionality before calculating rather than calculating to check uni-dimensionality.
A high value of Cronbach's alpha indicates internal consistency
[edit]The term "internal consistency" is commonly used in the reliability literature, but its meaning is not clearly defined. The term is sometimes used to refer to a certain kind of reliability (e.g., internal consistency reliability), but it is unclear exactly which reliability coefficients are included here, in addition to . Cronbach (1951)[8] used the term in several senses without an explicit definition. Cortina (1993)[19] showed that is not an indicator of any of these.
Removing items using "alpha if item deleted" always increases reliability
[edit]Most psychometric software will produce a column labeled "alpha if item deleted" which is the coefficient alpha that would be obtained if an item were to be dropped. For good items, this value is lower than the current coefficient alpha for the whole scale. But for some weak or bad items, the "alpha if item deleted" value shows an increase over the current current coefficient alpha for the whole scale.
Removing an item using "alpha if item deleted" may result in 'alpha inflation,' where sample-level reliability is reported to be higher than population-level reliability.[24] It may also reduce population-level reliability.[25] The elimination of less-reliable items should be based not only on a statistical basis but also on a theoretical and logical basis. It is also recommended that the whole sample be divided into two and cross-validated.[24]
Notes
[edit]- ^ The true score is the difference between the score observed during the test or measurement and the error in that observation. See classical test theory for further information.
References
[edit]- ^ a b Cronbach, Lee J. (1951). "Coefficient alpha and the internal structure of tests". Psychometrika. 16 (3). Springer Science and Business Media LLC: 297–334. doi:10.1007/bf02310555. hdl:10983/2196. S2CID 13820448.
- ^ a b Cronbach, L. J. (1978). "Citation Classics" (PDF). Current Contents. 13: 263. Archived (PDF) from the original on 2022-01-20. Retrieved 2021-03-22.
- ^ a b Sijtsma, K. (2009). "On the use, the misuse, and the very limited usefulness of Cronbach's alpha". Psychometrika. 74 (1): 107–120. doi:10.1007/s11336-008-9101-0. PMC 2792363. PMID 20037639.
- ^ Green, S. B.; Yang, Y. (2009). "Commentary on coefficient alpha: A cautionary tale". Psychometrika. 74 (1): 121–135. doi:10.1007/s11336-008-9098-4. S2CID 122718353.
- ^ Revelle, W.; Zinbarg, R. E. (2009). "Coefficients alpha, beta, omega, and the glb: Comments on Sijtsma". Psychometrika. 74 (1): 145–154. doi:10.1007/s11336-008-9102-z. S2CID 5864489.
- ^ a b Raykov, T.; Marcoulides, G. A. (2017). "Thanks coefficient alpha, we still need you!". Educational and Psychological Measurement. 79 (1): 200–210. doi:10.1177/0013164417725127. PMC 6318747. PMID 30636788.
- ^ a b Cronbach, L. J.; Shavelson, R. J. (2004). "My Current Thoughts on Coefficient Alpha and Successor Procedures". Educational and Psychological Measurement. 64 (3): 391–418. doi:10.1177/0013164404266386. S2CID 51846704.
- ^ a b c Cronbach, L.J. (1951). "Coefficient alpha and the internal structure of tests". Psychometrika. 16 (3): 297–334. doi:10.1007/BF02310555. hdl:10983/2196. S2CID 13820448.
- ^ Hoyt, C. (1941). "Test reliability estimated by analysis of variance". Psychometrika. 6 (3): 153–160. doi:10.1007/BF02289270. S2CID 122361318.
- ^ Guttman, L. (1945). "A basis for analyzing test-retest reliability". Psychometrika. 10 (4): 255–282. doi:10.1007/BF02288892. PMID 21007983. S2CID 17220260.
- ^ Jackson, R. W. B.; Ferguson, G. A. (1941). "Studies on the reliability of tests". University of Toronto Department of Educational Research Bulletin. 12: 132.
- ^ Gulliksen, H. (1950). Theory of mental tests. Wiley. doi:10.1037/13240-000.
- ^ Cronbach, Lee (1978). "Citation Classics" (PDF). Current Contents. 13 (8). Archived (PDF) from the original on 2022-10-22. Retrieved 2022-10-21.
- ^ Novick, M. R.; Lewis, C. (1967). "Coefficient alpha and the reliability of composite measurements". Psychometrika. 32 (1): 1–13. doi:10.1007/BF02289400. PMID 5232569. S2CID 186226312.
- ^ Spiliotopoulou, Georgia (2009). "Reliability reconsidered: Cronbach's alpha and paediatric assessment in occupational therapy". Australian Occupational Therapy Journal. 56 (3): 150–155. doi:10.1111/j.1440-1630.2009.00785.x. PMID 20854508. Archived from the original on 2022-10-21. Retrieved 2022-10-21.
- ^ a b Cortina, Jose M. (1993). "What is coefficient alpha? An examination of theory and applications". Journal of Applied Psychology. 78 (1): 98–104. doi:10.1037/0021-9010.78.1.98. ISSN 1939-1854. Archived from the original on 2023-08-13. Retrieved 2022-10-21.
- ^ DATAtab (October 27, 2021). Cronbach's Alpha (Simply explained). YouTube. Event occurs at 4:08. Retrieved 2023-08-01.
- ^ "APA Dictionary of Psychology". dictionary.apa.org. Archived from the original on 2019-07-31. Retrieved 2023-02-20.
- ^ a b Cortina, J. M. (1993). "What is coefficient alpha? An examination of theory and applications". Journal of Applied Psychology. 78 (1): 98–104. doi:10.1037/0021-9010.78.1.98.
- ^ Green, S. B.; Lissitz, R. W.; Mulaik, S. A. (1977). "Limitations of coefficient alpha as an Index of test unidimensionality". Educational and Psychological Measurement. 37 (4): 827–838. doi:10.1177/001316447703700403. S2CID 122986180.
- ^ McDonald, R. P. (1981). "The dimensionality of tests and items". The British Journal of Mathematical and Statistical Psychology. 34 (1): 100–117. doi:10.1111/j.2044-8317.1981.tb00621.x.
- ^ Schmitt, N. (1996). "Uses and abuses of coefficient alpha". Psychological Assessment. 8 (4): 350–3. doi:10.1037/1040-3590.8.4.350.
- ^ Ten Berge, J. M. F.; Sočan, G. (2004). "The greatest lower bound to the reliability of a test and the hypothesis of unidimensionality". Psychometrika. 69 (4): 613–625. doi:10.1007/BF02289858. S2CID 122674001.
- ^ a b Kopalle, P. K.; Lehmann, D. R. (1997). "Alpha inflation? The impact of eliminating scale items on Cronbach's alpha". Organizational Behavior and Human Decision Processes. 70 (3): 189–197. doi:10.1006/obhd.1997.2702.
- ^ Raykov, T. (2007). "Reliability if deleted, not 'alpha if deleted': Evaluation of scale reliability following component deletion". The British Journal of Mathematical and Statistical Psychology. 60 (2): 201–216. doi:10.1348/000711006X115954. PMID 17971267.
External links
[edit]- Cronbach's alpha SPSS tutorial
- The free web interface and R package cocoon allow us to statistically compare two or more dependent or independent Cronbach alpha coefficients.