No access
Research Note
12 December 2022

An Acoustic-Phonetic Approach to Effects of Face Masks on Speech Intelligibility

Publication: Journal of Speech, Language, and Hearing Research
Volume 65, Number 12
Pages 4679-4689

Abstract

Purpose:

This study aimed to examine the effects of wearing a face mask on speech acoustics and intelligibility, using an acoustic-phonetic analysis of speech. In addition, the effects of speakers' behavioral modification while wearing a mask were examined.

Method:

Fourteen female adults were asked to read a set of words and sentences under three conditions: (a) conversational, mask-off; (b) conversational, mask-on; and (c) clear, mask-on. Seventy listeners rated speech intelligibility using two methods: orthographic transcription and visual analog scale (VAS). Acoustic measures for vowels included duration, first (F1) and second (F2) formant frequency, and intensity ratio of F1/F2. For consonants, spectral moment coefficients and consonant–vowel (CV) boundary (intensity ratio between consonant and vowel) were measured.

Results:

Face masks had a negative impact on speech intelligibility as measured by both intelligibility ratings. However, speech intelligibility was recovered in the clear speech condition for VAS but not for transcription scores. Analysis of orthographic transcription showed that listeners tended to frequently confuse consonants (particularly fricatives, affricates, and stops), rather than vowels in the word-initial position. Acoustic data indicated a significant effect of condition on CV intensity ratio only.

Conclusions:

Our data demonstrate a negative effect of face masks on speech intelligibility, mainly affecting consonants. However, intelligibility can be enhanced by speaking clearly, likely driven by prosodic alterations.

Get full access to this article

View all available purchase options and get full access to this article.

References

American Speech-Language-Hearing Association. (2022). Communicating effectively while wearing masks. https://www.asha.org/public/communicating-effectively-while-wearing-masks-and-physical-distancing/
Ansel, B. M., & Kent, R. D. (1992). Acoustic-phonetic contrasts and intelligibility in the dysarthria associated with mixed cerebral palsy. Journal of Speech and Hearing Research, 35(2), 296–308.
Atcherson, S. R., Finley, E. T., McDowell, B. R., & Watson, C. (2020). More speech degradations and considerations in the search for transparent face coverings during the COVID-19 pandemic. American Academy of Audiology.
Atcherson, S. R., McDowell, B. R., & Howard, M. P. (2021). Acoustic effects of non-transparent and transparent face coverings. The Journal of the Acoustical Society of America, 149(4), 2249–2254.
Bates, D., Mächler, M., Bolker, B., & Walker, S. (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67(1), 1–48.
Bell, T. S., Dirks, D. D., & Carterette, E. C. (1989). Interactive factors in consonant confusion patterns. The Journal of the Acoustical Society of America, 85(1), 339–346.
Boersma, W. & Weenink, D. (2019). Praat: Doing phonetics by computer (Version 6.1) [Computer program] . Retrieved April 2021, from http://www.praat.org/
Borrie, S. A., Barrett, T. S., & Yoho, S. E. (2019). Autoscore: An open-source automated tool for scoring listener perception of speech. The Journal of the Acoustical Society of America, 145(1), 392–399.
Bunton, K. (2006). Fundamental frequency as a perceptual cue for vowel identification in speakers with Parkinson's disease. Folia Phoniatrica et Logopaedica, 58(5), 323–339.
Bunton, K., Kent, R. D., Kent, J. F., & Duffy, J. R. (2001). The effects of flattening fundamental frequency contours on sentence intelligibility in speakers with dysarthria. Clinical Linguistics & Phonetics, 15(3), 181–193.
Chin, S. B., Finnegan, K. R., & Chung, B. A. (2001). Relationships among types of speech intelligibility in pediatric users of cochlear implants. Journal of Communication Disorders, 34(3), 187–205.
Dagenais, P. A., Brown, G. R., & Moore, R. E. (2006). Speech rate effects upon intelligibility and acceptability of dysarthric speech. Clinical Linguistics & Phonetics, 20(2–3), 141–148.
Fairbanks, G., Guttman, N., & Miron, M. S. (1957). Effects of time compression upon the comprehension of connected speech. Journal of Speech and Hearing Disorders, 22(1), 10–19.
Fant, G. (1970). Acoustic theory of speech production (No. 2). Walter de Gruyter.
Ferguson, S. H., & Kewley-Port, D. (2002). Vowel intelligibility in clear and conversational speech for normal-hearing and hearing-impaired listeners. The Journal of the Acoustical Society of America, 112(1), 259–271.
Ferguson, S. H., & Kewley-Port, D. (2007). Talker differences in clear and conversational speech: Acoustic characteristics of vowels. Journal of Speech, Language, and Hearing Research, 50(5), 1241–1255.
Forrest, K., Weismer, G., Hodge, M., Dinnsen, D. A., & Elbert, M. (1990). Statistical analysis of word-initial /k/ and /t/ produced by normal and phonologically disordered children. Clinical Linguistics & Phonetics, 4(4), 327–340.
Goldin, A., Weinstein, B., & Shiman, N. (2020). Speech blocked by surgical masks becomes a more important issue in the era of COVID-19. Hearing Review, 27(5), 8–9.
Hillenbrand, J., Getty, L. A., Clark, M. J., & Wheeler, K. (1995). Acoustic characteristics of American English vowels. The Journal of the Acoustical Society of America, 97(5), 3099–3111.
Kent, R. D., & Kim, Y. (2011). The assessment of intelligibility in motor speech disorders. In A. Lowit & R. D. Kent (Eds.), Assessment of motor speech disorders (pp. 21–37). Plural.
Kent, R. D., Weismer, G., Kent, J. F., & Rosenbek, J. C. (1989). Toward phonetic intelligibility testing in dysarthria. Journal of Speech and Hearing Disorders, 54(4), 482–499.
Kim, Y. (2017). Acoustic characteristics of fricatives /s/ and /∫/produced by speakers with Parkinson's disease. Clinical Archives of Communication Disorders, 2(1), 7–14.
Kim, Y., Kent, R. D., & Weismer, G. (2011). An acoustic study of the relationships among neurologic disease, dysarthria type, and severity of dysarthria. Journal of Speech, Language, and Hearing Research, 54(2), 417–429.
Kuznetsova, A., Brockhoff, P. B., & Christensen, R. H. B. (2017). lmerTest Package: Tests in linear mixed effects models. Journal of Statistical Software, 82(13), 1–26.
Lansford, K. L., Borrie, S. A., & Bystricky, L. (2016). Use of crowdsourcing to assess the ecological validity of perceptual-training paradigms in dysarthria. American Journal of Speech-Language Pathology, 25(2), 233–239.
Lenth, R., Singmann, H., Love, J., Buerkner, P., & Herve, M. (2019). Emmeans: Estimated marginal means, aka least-squares means (R package Version 1.1) [Computer software] .
Levy, E. S., Chang, Y. M., Ancelle, J. A., & McAuliffe, M. J. (2017). Acoustic and perceptual consequences of speech cues for children with dysarthria. Journal of Speech, Language, and Hearing Research, 60(6S), 1766–1779.
Lu, Y., & Cooke, M. (2009). The contribution of changes in F0 and spectral tilt to increased intelligibility of speech produced in noise. Speech Communication, 51(12), 1253–1262.
Lüdecke, D., Ben-Shachar, M. S., Patil, I., Waggoner, P., & Makowski, D. (2021). performance: An R package for assessment, comparison and testing of statistical models. Journal of Open Source Software, 6(60).
Magee, M., Lewis, C., Noffs, G., Reece, H., Chan, J. C., Zaga, C. J., Paynter, C., Birchall, O., Rojas Azocar, S., Ediriweera, A., Kenyon, K., Caverléd, M. W., Schultz, B. G., & Vogel, A. P. (2020). Effects of face masks on acoustic analysis and speech perception: Implications for peri-pandemic protocols. The Journal of the Acoustical Society of America, 148(6), 3562–3568.
Mattys, S. L., & Liss, J. M. (2008). On building models of spoken-word recognition: When there is as much to learn from natural “oddities” as artificial normality. Perception & Psychophysics, 70(7), 1235–1242.
McRae, P. A., Tjaden, K., & Schoonings, B. (2002). Acoustic and perceptual consequences of articulatory rate change in Parkinson disease. Journal of Speech, Language, and Hearing Research, 45(1), 35–50.
Mefferd, A. S. (2017). Tongue-and jaw-specific contributions to acoustic vowel contrast changes in the diphthong /ai/ in response to slow, loud, and clear speech. Journal of Speech, Language, and Hearing Research, 60(11), 3144–3158.
Milenkovic, P. (2005). Time-frequency 32 [Computer software] . University of Wisconsin-Madison.
Moon, S. J., & Lindblom, B. (1994). Interaction between duration, context, and speaking style in English stressed vowels. The Journal of the Acoustical Society of America, 96(1), 40–55.
National Institute on Deafness and Other Communication Disorders. (2021). Improving communication when wearing a face covering: Text version. Retrieved March 27, 2022, from https://www.nidcd.nih.gov/about/nidcd-director-message/cloth-face-coverings-and-distancing-pose-communication-challenges-many/improving-communication-when-wearing-face-covering-text-version
Nguyen, D. D., McCabe, P., Thomas, D., Purcell, A., Doble, M., Novakovic, D., Chacon, A., & Madill, C. (2021). Acoustic voice characteristics with and without wearing a facemask. Scientific Reports, 11(1), 5651–5611.
Picheny, M. A., Durlach, N. I., & Braida, L. D. (1986). Speaking clearly for the hard of hearing II: Acoustic characteristics of clear and conversational speech. Journal of Speech and Hearing Research, 29(4), 434–446.
Pollack, I. (1948). Effects of high pass and low pass filtering on the intelligibility of speech in noise. The Journal of the Acoustical Society of America, 20(3), 259–266.
R Core Team. (2020). R: A language and environment for statistical computing. R Foundation for Statistical Computing. https://www.R-project.org/
Rab, S., Javaid, M., Haleem, A., & Vaishya, R. (2020). Face masks are new normal after COVID-19 pandemic. Diabetes & Metabolic Syndrome: Clinical Research & Reviews, 14(6), 1617–1619.
Smiljanić, R., & Bradlow, A. R. (2009). Speaking and hearing clearly: Talker and listener factors in speaking style changes. Language and Linguistics Compass, 3(1), 236–264.
Stevens, K. N. (1980). Acoustic correlates of some phonetic categories. The Journal of the Acoustical Society of America, 68(3), 836–842.
Tjaden, K., & Turner, G. S. (1997). Spectral properties of fricatives in amyotrophic lateral sclerosis. Journal of Speech, Language, and Hearing Research, 40(6), 1358–1372.
Toscano, J. C., & Toscano, C. M. (2021). Effects of face masks on speech recognition in multi-talker babble noise. PLOS ONE, 16(2), Article e0246842.
Uchanski, R. M. (2005). Clear speech. In D. B. Pisoni & R. E. Remez (Eds.), The handbook of speech perception (pp. 207–235). Wiley. https://doi.org/10.1002/9780470757024.ch9
Winn, M. B., Chatterjee, M., & Idsardi, W. J. (2013). Roles of voice onset time and F0 in stop consonant voicing perception: Effects of masking noise and low-pass filtering. Journal of Speech, Language, and Hearing Research, 56(4), 1097–1107.
Wouters, J., & Macon, M. W. (2002). Effects of prosodic factors on spectral dynamics. I. Analysis. The Journal of the Acoustical Society of America, 111(1), 417–427.
Yi, H., Pingsterhaus, A., & Song, W. (2021). Effects of wearing face masks while using different speaking styles in noise on speech intelligibility during the COVID-19 pandemic. Frontiers in Psychology, 12.

Information & Authors

Information

Published In

Journal of Speech, Language, and Hearing Research
Volume 65Number 12December 2022
Pages: 4679-4689
PubMed: 36351244

History

  • Received: May 2, 2022
  • Revised: Jul 24, 2022
  • Accepted: Aug 19, 2022
  • Published online: Nov 9, 2022
  • Published in issue: Dec 12, 2022

Permissions

Request permissions for this article.

Authors

Affiliations

School of Communication Science and Disorders, Florida State University, Tallahassee
Austin Thompson
School of Communication Science and Disorders, Florida State University, Tallahassee

Notes

Disclosure: The authors have declared that no competing financial or nonfinancial interests existed at the time of publication.
Correspondence to Yunjung Kim: [email protected]
Editor-in-Chief: Cara E. Stepp
Editor: Kate Bunton

Metrics & Citations

Metrics

Article Metrics
View all metrics



Citations

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.

For more information or tips please see 'Downloading to a citation manager' in the Help menu.

Citing Literature

  • Face masks and protection against COVID-19 and other viral respiratory infections: Assessment of benefits and harms in children, Paediatric Respiratory Reviews, 10.1016/j.prrv.2024.08.003, 54, (62-69), (2025).
  • The influence of masks on children's ability to produce new words, Language Learning and Development, 10.1080/15475441.2024.2429034, (1-20), (2024).
  • Improving masked communication: the case for transparent masks, Frontiers in Communication, 10.3389/fcomm.2024.1371142, 9, (2024).
  • Does Native Language Matter in Perceptual Ratings of Dysarthria?, Journal of Speech, Language, and Hearing Research, 10.1044/2024_JSLHR-23-00668, 67, 9, (2842-2855), (2024).
  • Children benefit from gestures to understand degraded speech but to a lesser extent than adults, Frontiers in Psychology, 10.3389/fpsyg.2023.1305562, 14, (2024).
  • Child mask mandates for COVID-19: a systematic review, Archives of Disease in Childhood, 10.1136/archdischild-2023-326215, 109, 3, (e1-e7), (2023).
  • Effect of Face Masks and Speech Style on Speech Intelligibility and Listener Effort in Parkinson's Disease, American Journal of Speech-Language Pathology, 10.1044/2023_AJSLP-23-00085, 32, 5, (2282-2296), (2023).
  • Automated Scoring of the Speech Intelligibility Test Using Autoscore, American Journal of Speech-Language Pathology, 10.1044/2024_AJSLP-24-00276, 0, 0, (1-12), (undefined).

View Options

Sign In Options

ASHA member? If so, log in with your ASHA website credentials for full access.

Member Login

View options

PDF

View PDF

Full Text

View Full Text

Figures

Tables

Media

Share

Share

Copy the content Link

Share