A multidisciplinary investigation including a comprehensive cognitive assessment is essential for an accurate diagnosis of PPA (
Butts et al., 2015;
Marshall et al., 2018). In the current memory clinical setting, however, language assessment is often neglected or very limited due to a lack of short and easy-to-administer language tests that assess more than just naming. For example, the Boston Naming Test (
Kaplan et al., 1983) is often used—in many cases even in its abbreviated 15-item Consortium To Establish a Registry for Alzheimer's Disease version (
Mack et al., 1992). This test, however, only examines the presence of anomia (word-retrieval deficits) and is insufficient to differentiate between all PPA subtypes since this difficulty is common to all three variants of PPA (
Gorno-Tempini et al., 2011). If language is assessed more broadly, most commonly tests are used that have been primarily developed for the assessment of aphasia syndromes after stroke and examine the different language domains separately. As a result, different language measures are often combined to distinguish across PPA subtypes. Combining different tests, however, may lead to invalid interpretation due to differences in normative groups and task properties (e.g., word frequency, number of syllables) across tests (
Savage et al., 2013). Moreover, the often-used standard aphasia batteries developed for use with stroke-induced aphasia may lack sensitivity to detect the subtle deficits that are present in early stages of PPA and may result in inappropriate nomenclature (such as Broca's or Wernicke's) based on the stroke-induced aphasia classification (
Henry & Grasso, 2018).
Clark et al. (2020) examined the utility of a standard stroke aphasia battery, the Western Aphasia Battery–Revised (WAB-R), for classifying variants of PPA, and they concluded that “WAB-R classification did not distinguish among PPA classification determined by consensus” (p. 498).
To overcome these problems,
Savage et al. (2013) developed a new language screen, the Sydney Language Battery (SYDBAT). This instrument measures the integrity of four main aspects of language proficiency: naming, single-word comprehension, semantic association, and repetition abilities, using stimuli based on the same set of target words across the four subtests. This approach eliminates the confounds of linguistic differences and varying norms that trouble the current clinical practice when combining multiple measures from different tests. They created a simple diagnostic algorithm based on the performance on the Naming and Semantic Association subtests, in which sv-PPA is diagnosed if naming and semantic association scores are 2
SDs or more below the control mean, lv-PPA is diagnosed if naming is below 22 but semantic association is within 2
SDs of the control mean, and nfv-PPA is diagnosed if naming and semantic association are both within 2
SDs of the control mean. While the SYDBAT has proven successful in correctly classifying 80% of the PPA patients based on this diagnostic algorithm (
Savage et al., 2013), evidence on its validity and reliability is to date limited to the Australian English version.
This study had several aims. First, this study sought to determine the validity and reliability of the Dutch version of the SYDBAT (henceforth referred to as the SYDBAT-NL) and derive a predictive diagnostic algorithm using the distinct task profiles for each PPA subtype with the goal of confirming the findings of
Savage et al. (2013) in an independent sample. Based on previous research (
Savage et al., 2013), each PPA subtype was expected to demonstrate a distinct profile across SYDBAT subtests, with most prominent impairments in naming as well as in verbal and visual comprehension (i.e., Word Comprehension and Semantic Association) in sv-PPA because these tests are semantically based. Conversely, nfv-PPA patients were expected to perform well on the visual and verbal comprehension subtests, but poorer on speech production–related subtests (i.e., Naming and Repetition). Lastly, lv-PPA patients were expected to perform at an intermediate level on all subtests, yet with most evident deficits in naming. Next, this study aimed to examine the SYDBAT in patients diagnosed with MCI or with AD, thus extending previous research by including a non-PPA patient population in which language impairment also is present. This will enable a direct comparison of PPA with MCI and AD. We expected that the SYDBAT would demonstrate a broad spectrum of language deficits in MCI and AD, mainly on semantically based subtests (i.e., Naming, Word Comprehension, and Semantic Association). Since we expect language deficits to be present in both MCI and AD, the assessment of language deficits may not be sufficient to differentially diagnose MCI and AD from PPA. The construct validity was examined through comparison between performance on the SYDBAT-NL subtests and other commonly used language tests. Here, we expected good convergent validity when comparing SYDBAT subtest scores with other established language measures (
Savage et al., 2013), while for divergent validity, we expected that the Naming, Word Comprehension, and Semantic Association tests would correlate due to a shared semantic component. Reliability was measured as internal consistency via Cronbach's alpha. By establishing the SYDBAT as a promising tool that can be used in memory clinics, early detection and differentiation of language impairment in neurodegenerative disease becomes possible, which has great prognostic value and possible implications for treatment (
Leyton et al., 2013;
Tippett et al., 2015).
Discussion
In this study, the diagnostic value of the SYDBAT-NL was assessed in samples of patients with PPA or non-PPA cognitive decline (AD and MCI patients) and cognitively unimpaired controls. As hypothesized, different SYDBAT performance patterns were found across PPA and non-PPA patient groups. While a discriminant function analysis based on SYDBAT subtest scores could predict PPA subtype with 77.8% accuracy, it was more difficult to disentangle PPA from non-PPA patients on SYDBAT scores alone. To assist clinical interpretation, simple rules were set up and translated into a diagnostic decision tree for subtyping PPA, which was capable of diagnosing a large proportion of the cases. This diagnostic tree resulted in very high specificity for the sv-PPA (100%) and nfv-PPA (94%) subtypes, whereas the lv-PPA subtype showed a higher sensitivity (90%) compared to specificity (84%).
To examine impairment profiles per patient group, SYDBAT scores were classified as either clinically impaired or unimpaired, based on a threshold of 1.5
SDs below the cognitively unimpaired control group mean. As expected, none of the nfv-PPA patients showed impaired scores on the Semantic Association or Word Comprehension subtests (
Gorno-Tempini et al., 2011). It is worth noting that a third of the nfv-PPA patients scored within the normal limits on the Naming subtest, providing a clear distinction from the two other PPA variants. Even though nfv-PPA patients typically present with effortful speech and production errors, their performance on naming tasks that require single-word production in response to a target can be relatively preserved (
Graham et al., 2004).
In contrast, sv-PPA patients showed the opposite pattern in which tasks that require semantic knowledge were affected most. All had a clinically impaired Naming score, and 10 out of 13 (77%) also had an impaired score on Semantic Association. Based on the diagnostic rules, these two subtests together correctly predicted 92% of the sv-PPA cases. Only one sv-PPA patient was misclassified as lv-PPA due to mild naming and semantic deficits. This patient may have been identified with tests involving the naming and comprehension of very low frequency nouns (e.g., proper names) not included here.
Most predictive errors were made between nfv-PPA and lv-PPA variants, as three nfv-PPA patients were misclassified as lv-PPA and two lv-PPA patients were misclassified as nfv-PPA. This is in line with previous studies reporting the distinction between these two variants to be the most challenging, as they present with overlapping characteristics and the differentiation of key linguistic features requires considerable expertise (
Croot et al., 2012). At the same time, the distinction between lv-PPA and nfv-PPA is relevant to clinical practice, since the majority of lv-PPA patients have underlying Alzheimer's disease pathology, while nfv-PPA is strongly linked to frontotemporal lobar degeneration tau pathology (
Gorno-Tempini et al., 2011). This stresses the importance of a more extensive neuropsychological investigation, including tasks of episodic memory, emotion processing, and executive functioning (
Eikelboom et al., 2018;
Piguet et al., 2015) and the development of new tests for discriminating between these variants. The current study provides preliminary recommendations regarding the use of these broad neuropsychological tests, which will require further validation in future research. Especially in the early stages of the disease, when global cognition is typically still preserved, more sensitive measures than global cognitive measures such as the MoCA will need to be developed.
We aimed to create a short and easy-to-administer language screener that can be widely used in memory clinics. As such, the SYDBAT measures language at the single-word level and arguably does not capture all aspects of language and communication. Specifically, while no differences were found between lv-PPA and nfv-PPA on word comprehension, a difference is likely to emerge at the sentence level, with difficulties arising from different aspects of the sentences—grammatical complexity in nfv-PPA and length of utterances in lv-PPA. Utilizing this difference in underlying mechanisms to create a sentence comprehension test in which these differences result in different scores will be fruitful in distinguishing nfv-PPA from lv-PPA.
Similarly, a more extensive assessment of repetition deficits may shed light on the differences between lv-PPA and nfv-PPA. Performance on the SYDBAT Repetition subtest showed no difference between lv-PPA and nfv-PPA subtypes, with half of the nfv-PPA patients and less than half of the lv-PPA patients (7/20) showing a clinical impairment on this subtest. While both subtypes thus show repetition errors, the underlying mechanism causing these errors is likely to be different, since the impairments in word repetition in lv-PPA patients are likely to reflect a phonological rather than the articulatory impairment that typifies nfv-PPA (
Leyton, Savage, et al., 2014). Moreover, the repetition impairments in lv-PPA may be more prominent in tasks that measure repetition at the sentence level, as a phonologic short-term memory deficit is thought to be a key cognitive mechanism underlying deficits in lv-PPA. For the SYDBAT Repetition subtest, both articulatory as well as phonological errors are counted as errors (see
Supplemental Material S1 for the SYDBAT scoring instructions). Future studies are necessary to determine whether a qualitative analysis of the SYDBAT Repetition errors in both lv-PPA and nfv-PPA patients or the addition of a sentence repetition test might aid in the differentiation of these variants, especially because previous studies suggest that this could possibly be used as a clinical marker for underlying amyloid burden in PPA (
Leyton, Ballard, et al., 2014).
Lastly, subtest items of the SYDBAT are divided in three blocks of difficulty based on lexical frequency. Previous studies (
Savage et al., 2013) and the current study do not use difficulty based on lexical frequency as a predictor in their analyses due to the goal of creating a short screen that can be easily administered, scored, and interpreted in clinical practice (in which cutoff scores are often preferred). However, it would be interesting for a future study to assess whether item difficulty based on lexical frequency is reflected in patient scores and whether sensitivity and specificity would be improved by including, for example, only the most difficult items of the SYDBAT. In addition, using only the most difficult items of the SYDBAT could be beneficial for improving internal consistency of the SYDBAT subtests, especially for the Word Comprehension subtest.
In addition to the PPA profiles, this study is one of the first to report on the SYDBAT in a non-PPA patient sample. The SYDBAT profiles of the non-PPA patient group (MCI and AD), showing Naming and Semantic Association scores to be lowest with relatively preserved Repetition, are in agreement with previous findings reporting deficits in confrontation naming as well as semantic tests like category fluency, semantic categorization, and lexical decision in this group (
Taler & Phillips, 2008). Even though Naming and Semantic Association scores were significantly lower in the AD group compared to the MCI group, MCI patients performed significantly lower than cognitively unimpaired controls. This confirms increasing evidence that language deficits begin several years before the onset of AD dementia (
Auriacombe et al., 2006) and that differences in language features between AD and MCI reflect predominantly quantitative instead of qualitative differences (
Jokel et al., 2019). Earlier studies (
Alexopoulos et al., 2006) have shown that individuals with multidomain MCI, including language deficits, are more likely to develop AD than those with an isolated memory deficit. Clarification of the nature of language impairment and development of sensitive measures for language impairment in these patient groups, therefore, constitute essential tools for the early detection of AD (
Taler & Phillips, 2008;
Vonk et al., 2020). The application of the SYDBAT could therefore possibly be useful in the diagnostic process of MCI and AD.
The comparison between the SYDBAT profiles of our PPA and non-PPA patients showed a large amount of overlap between groups. MCI patients presented with a similar profile as lv-PPA patients on the semantically based subtests (Naming, Word Comprehension, and Semantic Association). The profile of AD patients greatly overlapped with that of sv-PPA patients, with the exception of Naming, on which sv-PPA patients scored significantly lower than AD patients. As a result, it may be difficult to differentiate MCI, AD, and PPA based on cross-sectional SYDBAT scores alone, which was also confirmed by the discriminative analysis for all groups resulting in a cross-validated classification accuracy of 50.6%. A future study that includes both cognitive and language data in a discriminant analysis may be useful to determine whether this results in a higher classification accuracy. It should be noted, however, that the SYDBAT is intended for use as a first screen in patients visiting memory clinics, in which language is generally not assessed in detail. While the ease of use of the SYDBAT will facilitate its use in clinical practice, SYDBAT scores below cutoff warrant the use of additional, more in depth, approaches to language assessment and the assessment of other cognitive domains in these groups, which may prove more fruitful in the early differential diagnosis. For example, longitudinal assessment using the SYDBAT might be more informative, as language abilities have been shown to decline faster in PPA compared to AD (
Blair et al., 2007). In addition, tests that measure semantic abilities at a more precise level than standardized semantic tasks, for example, by looking at semantic interference (
Vandenberghe et al., 2005), might show more fine-grained distinctions between AD, MCI, and PPA.
An examination of psychometric properties showed good reliability and validity measures for the SYDBAT, yet future studies are needed to confirm these properties in larger samples. Cronbach's alpha indicates that the SYDBAT has high levels of internal consistency, especially for the Naming, Repetition, and Semantic association subtests. Convergent construct validity was supported by moderate-to-large positive correlations between SYDBAT performance and performance on the standardized tests: the BNT, the CAT-NL, and the SAT. Divergent validity analyses showed all measures of Naming, Word Comprehension, and Semantic Association to be positively correlated, which could be explained by the fact that all these subtests contain a semantic component. As expected, however, the Repetition subtest was unrelated to any of the semantically based subtests, confirming the divergent validity to be acceptable. As hypothesized, in terms of PPA classification, the SYDBAT outperforms commonly used language tests that have been primarily developed for the assessment of aphasia syndromes after stroke. This suggests tests that are specifically designed and validated for the PPA population, like the SYDBAT, to be a useful and desirable element in the diagnostic process of PPA in a clinical setting.
The limitations of our study lie in the relatively small sample sizes and the reliance on clinical diagnosis that can limit the validity because of potential misclassification. However, it should be noted that large study samples of PPA patients are relatively rare given the low prevalence of PPA (
Matías-Guiu & García-Ramos, 2013) and the sample size of the current study is similar to that used in previous studies (
Savage et al., 2013). Also, the combination of complementary clinical diagnostic tools such as MRI, cerebrospinal fluid analysis, and neuropsychological assessment, as used in our patient sample, can assure a good diagnostic reliability in the clinical assessment of PPA (
Grand et al., 2011). The risk of misclassification was thus minimized as much as possible in the current study. Finally, the SYDBAT and other neuropsychological tests were administered concurrently for the majority of the patients. One could argue that this bears the risk of circularity. However, the clinical diagnoses were primarily based on the elaborative language assessment using established language tests, available neuroimaging, and/or other biomarkers in a multidisciplinary way.
It should be noted that the SYDBAT is not designed to determine an individual's full linguistic profile. An extensive diagnostic assessment, therefore, is still required to measure and observe all the important characteristics of speech, including agrammatism and apraxia of speech. In addition, since the items consist of nouns only, the comprehension and repetition of single words as measured by the subtests of the SYDBAT might be intact in the early stages of the disease (
Leyton et al., 2013). Therefore, the comprehension of more complex grammatical structures and the repetition of phrases and sentences should be included in a subsequent more extensive neuropsychological or linguistic assessment.
To conclude, the SYDBAT is a promising screening instrument developed to address the paucity of simple and short tools suitable for neuropsychological assessment to help discriminate between the three variants of PPA. A diagnostic decision tree was developed to assist clinicians to determine PPA subtype in a straightforward way. The SYDBAT profiles of AD and MCI patients support previous work that suggests language impairments to be frequent and primarily semantic in nature, yet based on the SYDBAT alone it was difficult to differentiate PPA from MCI and AD. Despite the SYDBAT's limitations, combining it with the outcome of other language and cognitive tests may aid to improve the diagnostic accuracy of PPA and its subtypes in a memory clinic setting.