The Lille apathy rating scale (LARS), a new instrument for detecting and quantifying apathy: validation in Parkinson’s disease
- 1Psychology Department, Charles De Gaulle University, Lille, France
- 2Neurology and Movement Disorders Unit, EA2683, Faculty of Medicine and Lille University Hospital, Lille
- Correspondence to: Dr Pascal Sockeel UFR de Psychologie, BP 149, F-59653 Villeneuve D’Ascq cedex, France;
- Received 13 July 2005
- Accepted 3 January 2006
- Revised 5 December 2005
Background: Apathy is usually defined as reduced interest and participation in various activities. It is a frequent consequence of neurological and psychiatric disorders. Although various scoring methods have been proposed, there is a lack of validated, standardised instruments for detecting apathy and assessing its severity.
Objective: To develop an apathy rating scale using a structured standardised interview capable of distinguishing between the condition’s various features.
Methods: The Lille Apathy Rating Scale (LARS) is based on a structured interview. It includes 33 items, divided into nine domains. Responses are scored on a dichotomous scale. The participants used to validate the scale consisted of 159 patients with probable Parkinson’s disease and 58 healthy control subjects. The Marin Apathy Scale, the Montgomery and Asberg Depression Rating Scale, and the Mattis Dementia Rating Scale were also administered.
Results: Principal component analysis showed that the LARS probed a single construct which forms the root of an oblique factor structure reflecting four dimensions: intellectual curiosity, self awareness, emotion, and action initiation. The main psychometric properties of the LARS (internal consistency, inter-rater and test-retest reliability) were satisfactory. Concurrent validity was evaluated by reference to the Marin scale and to judgements provided by expert clinicians.
Conclusions: Standard validity indices showed that the LARS is sensitive and capable of distinguishing between apathy and depression. As a screening tool, the scale is able to support dichotomous judgements accurately and, when greater measurement sensitivity is required, also determine the severity of apathy within a four category classification.
- AES, apathy evaluation scale
- DRS, dementia rating scale
- LARS, Lille Apathy Rating Scale
- MADRS, Montgomery and Asberg depression rating scale
- NPI, Neuropsychiatric Inventory
- UPDRS, Unified Parkinson’s Disease Rating Scale
Apathy refers to a set of behavioural, emotional, and cognitive features such as reduced interest and participation in the main activities of daily life, a lack of initiative, a trend towards early withdrawal from initiated activities, indifference, and flattening of affect. The concept of apathy lacks specificity and various different definitions have been suggested. For certain investigators, the key feature is lack of motivation, and apathy can thus be defined as “diminished motivation not attributable to diminished level of consciousness, cognitive impairment, or emotional distress”.1 For others, the prime characteristic is lack of initiative: apathy is considered to be “an absence of responsiveness to stimuli as demonstrated by a lack of self initiated action”.2 In addition to these conceptual difficulties, apathy overlaps with a range of other behavioural and psychological factors, including mood and some aspects of personality and cognitive function.3
The apathy evaluation scale (AES) proposed by Marin et al4 is currently the scale most often used to assess apathy. Its reliability and validity for measuring apathy in different pathological conditions have been demonstrated.4 However, the administration instructions and scoring method have suffered from a lack of standardisation. In clinical research and most drug treatment studies, apathy has been assessed with the Neuropsychiatric Inventory (NPI), which includes a specific item for the global evaluation of apathy.5 Recently, Robert et al6 proposed an extension of this assessment by using the Apathy Inventory, which allows separate assessment of emotional blunting, lack of initiative, and lack of interest in addition to the global NPI score. Despite the widely demonstrated validity of these latter scales, they are primarily dependent on the opinion of the caregiver, who may not always be available or reliable.
From a pathophysiological viewpoint, the most common cause of apathy is dysfunction of the frontal lobes, following either a direct lesion of the frontal cortex or damage to regions tightly connected to the latter (such as the basal ganglia). The frontal-subcortical circuits often seem to be involved, and apathy is a common behavioural consequence of basal ganglia disorders.2,3,7 Current estimates of the prevalence of apathy in Parkinson’s disease vary from 16.5% to 42%.3,7,8 It is also frequently observed in other conditions with parkinsonism.9
Our aim was to develop an apathy rating scale using a structured, standardised interview capable of distinguishing between the various features of apathy. We examined its concurrent validity, internal consistency, and inter-rater reliability in a population of healthy subjects and patients with Parkinson’s disease. The latter group was chosen in view of the fact that apathy is commonly observed in this basal ganglia degenerative disease.2
One hundred and fifty nine patients with probable Parkinson’s disease participated in the study (the demographic characteristics are shown in table 1). Parkinson’s disease was defined according to the United Kingdom Parkinson’s disease brain bank criteria.10 The study population was separated into three groups:
47 patients with non-demented Parkinson’s disease early in the course of the disease and well stabilised on antiparkinsonian drug treatment (stable Parkinson’s disease);
73 non-demented patients with fluctuating Parkinson’s disease and severe symptoms (fluctuating Parkinson’s disease);
39 patients with Parkinson’s disease and a diagnosis of dementia according to the fourth Edition of the Diagnostic and Statistical Manual of Mental Disorders (DSM-IV) (Parkinson’s disease with dementia).
Eighteen patients early in the course of the disease were not receiving any treatment at all. Sixty four patients were being treated with levodopa only (mean (SD) (range): levodopa equivalent dosage = 692.8 (360.0) mg/day (100 to 1700)); 10 were receiving a dopamine agonist only (levodopa equivalent dosage = 487.0 (461.7) mg/day (120 to 1500)); and 67 were receiving levodopa in combination with a dopamine agonist (levodopa equivalent dosage = 1038.1 (513.0) mg/day (150 to 2337.5)). Motor disability was evaluated using the motor score on the Unified Parkinson’s Disease Rating Scale (UPDRS-III).11 Although none of the patients were suffering from any neurological disease other than Parkinson’s disease, 24 (15%) had a history of drug induced psychosis. Sixty six patients were receiving treatment for anxiety related, depressive, or psychotic symptoms (23 were on selective serotonin reuptake inhibitors only, nine were on alprazolam or bromazepam only, seven were taking mianserine only, 21 were taking alprazolam or bromazepam plus mianserine (n = 17) or a selective serotonin inhibitor (n = 4), and 10 were being treated with clozapine). Five patients were being treated with acetylcholinesterase inhibitors for their cognitive disorders.
Fifty eight healthy control subjects were chosen to match the patient group as closely as possible with respect to age and educational level (table 1).
None of the controls had a personal history of neurological or psychiatric illness.
All participants gave their informed consent to participation in the study.
Apathy was assessed with the Marin apathy evaluation scale (AES) and our new Lille Apathy Rating Scale (LARS), as described below. The scale and its instructions for use are shown in the appendix which can be viewed on the journal website (http://www.jnnp.com/supplemental).
Severity of depressive symptoms was assessed using the Montgomery and Asberg depression rating scale (MADRS).12 In addition to the global score and in view of the scale’s factorial structure, we also used three subscores for assessing dysphoric apathy, psychic anxiety, and vegetative symptoms.13
Global cognitive efficiency was assessed in terms of the global score (out of 144) on the Mattis Dementia Rating Scale (Mattis DRS).14
All participants underwent assessment of apathy, depression, and cognitive efficiency during a one hour test session.
Scale structure, item development and selection
The LARS was based on the main conceptual principles proposed by Marin et al4 and on our own clinical experience. The scale includes 33 items, divided into nine domains. Eight of these concern the main clinical manifestations of apathy, as described in published reports: reduction in everyday productivity; lack of interest; lack of initiative; extinction of novelty seeking and motivation; blunting of emotional responses; lack of concern; and poor social life. The ninth domain (extinction of self awareness) refers to a particular manifestation of apathy highlighted by Stuss et al,2 who considered self and social awareness as “a metacognitive ability, necessary to mediate information from a personal, social past and current history with projections to the future”. The main impact of apathy on this ability would be a reduction in self criticism and behavioural adjustment to social requirements in one’s own interest.
The items are presented as positively worded questions to which the subject is expected to answer clearly “yes” or “no”, in order to reduce subjective interpretations as much as possible. With the exception of the first three questions (which are coded on a five point Likert-type scale), responses are coded by the clinician on a binary (yes/no) scale, with an additional “NA” (not available) condition for non-classifiable answers or non-applicable items. Finally, the scale was designed in such a way that each of the nine domains can be evaluated through subscales which contribute with equal weighting to the global score. Hence the global score ranges from −36 to +36, with a higher score representing a greater degree of apathy.
The scale’s rating is based on a subject’s own report (during a structured interview) on their thoughts, emotions, and activities over the previous four weeks. In order to evaluate inter-rater reliability, simultaneous ratings on a subset of Parkinson’s disease patients (n = 32) were carried out by two clinicians. Finally, a subset of patients (n = 35) participated in a retest procedure approximately four months later.
The Parkinson’s disease patients were also examined by an independent group of clinicians. These were asked to judge (on the basis of a clinical examination and interview) whether or not the individual was depressed or apathetic or both. When a patient was categorised as apathetic, the clinician had to describe their behaviour using one of the following three adjectives: “mild,” “moderate,” or “severe.” Statistics and data analyses were carried out using Statistica 6.0 software.
First, the 33 items were analysed in terms of potential intercorrelations. As expected, an exploratory examination of the correlation matrix showed relatively low correlations between the scores in items belonging to different domains (lowest coefficient = −0.13, highest coefficient = 0.36, mean (SD) r = 0.10 (0.10)). The correlations between items belonging to the same domain were reasonably high (lowest coefficient = 0.12, highest coefficient = 0.58, mean r = 0.32 (0.17)), whereas the correlations between scores for each item and for the scale as a whole were high (lowest coefficient = 0.52, highest coefficient = 0.81, mean r = 0.65 (0.11)). Hence, these results support our choice of a procedure with dichotomous items which are then collapsed into subscales to improve sensitivity. Finally, and with the exception of self awareness (r = 0.28), the different subscales corresponding to the nine domains were highly correlated to the global LARS score (lowest coefficient = 0.45, highest coefficient = 0.72, mean r = 0.60 (0.08)).
An exploratory principal component analysis was carried out on the nine subscores in order to determine the structure of the data: four factors were identified with eigenvalues close to 1, explaining more than 65% of the total variance. The first factor accounted for 34% of the total variance, confirming the one dimensional nature of the LARS.
A further principal component analysis using oblique rotation generated a satisfactory model for this number of factors (goodness of fit with χ2 (df 6) = 9.69, p>0.13, NS). Table 2 gives the factor loadings and the correlations between the hierarchical structure’s different components.
First, four clusters of variables were found, with either high unique factor loadings or low cross loadings. Next, the correlations between these four primary oblique factors yielded a set of two secondary factors, with the first showing evidence for a main construct representing apathy (factors 1, 3, and 4) and a second one dealing more with self awareness (factor 2). Finally, there was evidence to suggest that the four primary factors might represent distinct dimensions of apathy: (1) intellectual curiosity (lack of interest, novelty seeking and motivation, poor social life), reflecting low interest in novelty as well as a drop in the perceived need for knowledge; (2) self awareness; (3) emotion (blunting of emotional responses, lack of concern); and (4) action initiation (low everyday productivity, lack of initiative).
Internal consistency, test–retest, and inter-rater reliability
Internal consistency was determined both in terms of split half reliability and Cronbach’s standardized α coefficients. The between-items and between-subscales α values were 0.80 and 0.74, respectively. Split half reliability was 0.73 and reached 0.84 after correction with the Spearman Brown “prophecy formula”. When evaluated on a subgroup of 35 patients, the test–retest correlation coefficient was 0.95. Inter-rater reliability was checked using intraclass correlation calculated from the ratios of the different sources of variance. The latter were generated by an analysis of variance (ANOVA) on repeated measures, with the identity of the judge as a dependent variable. This ANOVA did not reveal any differences between scorings (F<1, NS), and the intraclass correlation was high (r = 0.98). According to Nunally,15 these values correspond to high reliability.
Separability between apathy and depression
Parkinson’s disease patients were divided into four groups according to the clinicians’ binary classification of apathy and depression (table 3). The MADRS and LARS scores were compared in a two way multivariate analysis of variance (MANOVA), with the two classifications as categorical predictors. In a second MANOVA, we entered various dependent variables: the MADRS “dysphoric apathy,” “psychic anxiety,” and “vegetative symptoms” scores on the one hand, and LARS subscores from the four factorial analysis clusters (referred as intellectual curiosity (IC), self awareness (SA), emotion (E), and action initiation (AI)) on the other. We hypothesised that the main scores or the subscores, or both, would be differentially sensitive to their corresponding factors—that is, apathy or depression. If the two constructs were independent, we expected not to see an interaction effect.
As shown in table 4, both LARS and MADRS were generally sensitive to both apathy and depression, although the results for different subscales were not homogeneous.
The LARS “IC” factor was reactive to both variables, showing the dependence between these two constructs. Nevertheless, the mean scores observed for the different patient groups, the confidence intervals, and the absence of interactions between the apathy and depression factors all argue in favour of the LARS’s ability to assess apathy independently of depression. Examination of the results for the “dysphoric apathy” factor in the MARDS supported this conclusion, as the correlation between the LARS and the MADRS mainly reflected the correlation between the LARS and the depression scale’s apathy subscore.
Concurrent validity was assessed, first, by the correlation between the AES and LARS scores, and second, by comparison of the frequency distributions between the different cut off scores and expert judgements. We noted a strong correlation between the global scores for the AES and the LARS (r = 0.87). Close examination of links between Marin’s scale and the four LARS factors established that the strongest correlations were found with the IC (r = 0.84) and AI dimensions (r = 0.65). The correlation between AES and the E dimension was r = 0.44 and comparisons between the SA dimension and AES yielded r = 0.15.
Criterion related validity was measured in order to obtain the best accuracy relative to an expert diagnosis. It was calculated from two kinds of cross tabulation tables, referring to either an expert dichotomous patient classification (moderately/severely apathetic v non-apathetic/mildly apathetic) or a four class categorisation of disease severity (non-apathetic, mildly, moderately, or severely apathetic). Cut off scores for the LARS were derived empirically from the distributions of apathetic and control subjects, in order to separate the two sets and to avoid the inclusion of subjects in an inappropriate category (fig 1). An initial cut off score was set at 2.5 SD below the mean score of the control group. Thereafter, and depending on the user’s choice in terms of specificity and sensitivity, three cut off scores were proposed, as shown in table 5.
Comparisons with Marin’s AES were provided for three cut off scores calculated using the same methodology. For a prevalence of 29.56%, the cut off value of −16 seemed to be the best compromise, with a κ of 0.79 and a sensitivity index of 0.89, which are considered to represent excellent agreement.16 Finally, classification into four categories was checked against a corresponding distribution of patients with the cut off values of [−36;−22] for non-apathetic and [−21;−17], [−16;−10], and [−9;+36] for slightly, moderately, or severely apathetic subjects, respectively. Again, these values were adjusted empirically in order to create classes with minimum overlap and to provide as much agreement as possible with the experts. Table 6 shows the corresponding contingency cross tabulation, from which we calculated an accuracy index of 0.81 with a κ of 0.74. These data clearly argue in favour of use of the LARS for optimally distinguishing between different degrees of severity in apathy syndromes.
This new Lille Apathy Rating Scale (LARS) was built on the conceptual basis of Marin’s apathy evaluation scale (AES) but sought to remove various practical difficulties in using the latter. In particular, we have noticed first, a lack of standardisation in the administration and scoring procedures (because of the semistructured nature of the AES); second, variations in interpretation owing to multiple sources of informants; and third, possible sources of error because of fluctuation in the positive/negative orientation of questions. Moreover, we have also noticed a heterogeneous weighting of the items in relation to the main psychological domains. In order to enhance standardisation, improve stability, and reduce subjective interpretations during scoring, we adopted a dichotomous scale. Moreover, the nine domains of interest were equally weighted in the final score, enabling the generation of subject profiles. Question by question examination did not reveal floor or ceiling effects or extremely low or high inter-item correlations that would indicate incongruence or redundancy. This internal consistency was reinforced by relatively high reliability coefficients, suggesting that the scale indeed deals with a single, coherent construct.
In addition to the global assessment of apathy, the LARS also revealed a structure whose factors were interpreted as representing intellectual curiosity, action initiation, emotion, and self awareness. In fact, several studies2,4,6,17,18 have proposed definitions which incorporate distinct components of apathy (behavioural, cognitive, and emotional) and our results agree with and reinforce these suggestions, as these same three realms emerged from our analysis. Nevertheless, our data also generated a fourth factor, representing a reduction in self awareness and impaired behavioural adjustment to social life. This factor seemed to emerge in relation to certain specific characteristics of patients with cognitive decline.
Discriminating between depression and apathy has always been a tricky issue. Nevertheless, some studies have addressed this problem. For instance, Marin et al19 evaluated apathy and depression in patients with either Alzheimer’s disease, stroke, or major depression. Although apathy and depression generally correlated within the groups, absolute scores varied considerably (and independently) between groups. Levy et al20 found the same disease specific relation using the NPI and concluded that the presence of one condition did not predict the presence of the other. Pluck and Brown3 drew very similar conclusions: although the symptoms can dissociate within individual patients, comorbid depression and apathy appeared to have an additive effect on symptoms such as cognitive dysfunction. Our data seem to be compatible with published reports: the correlation between LARS and MADRS scores is clearly explained by the latter’s dysphoric apathy subscale, which covaries with the LARS IC dimension (and, to a lesser extent, with AI). Moreover, the simultaneous presence of these two symptoms indicated the Parkinson’s disease patients with the worst scores on the LARS. In practical terms, distinguishing apathy from depression implies that one should consider both the LARS and the MADRS scores: extremely low scores on both scales indicate patients with little suspicion of either condition; extremely high scores on both scales indicate patients with a high suspicion of both conditions; depressed patients should present a relatively high MADRS score with a relative low LARS score, whereas a relatively low MADRS score and a relatively high LARS score predict the presence of apathy in the absence of depression.
The main goal of this study was to provide a useful tool (meeting the usual psychometric prerequisites) for the assessment of apathy. We demonstrated that the LARS has very satisfactory inter-rater and test–retest reliability. Moreover, the establishment of excellent concurrent and criterion related validity was a major step in proving the value of LARS as a screening test. Furthermore, we proposed several cut off scores with good specificity and sensitivity. The comparison with Marin’s AES criterion related validity was arduous, because we found several proposed AES cut off values.3,21,22 However, by opting for a cut off value corresponding to the mean AES score for the normal control group minus 2.5 SD, our sensitivity and specificity values showed that the LARS provided slightly more reliable validity. Finally, the four-class severity system may provide greater sensitivity by enabling precise measurements in treatment efficacy studies or, more generally, during patient follow up.
In the current version of the LARS, all input information is obtained from the patient. In patients with anosognosia (who may thus underrate their symptoms), this may constitute a limitation. It is useful to obtain information from an informant in such cases, and we are currently working on an informant version of the LARS.
Our results showed that apathy is frequent in Parkinson’s disease and that higher apathy levels are observed in patients with cognitive complications. The observed prevalence of 29% was within the range of previously reported values.3,7,8
The LARS is a reliable and practical instrument for assessing the multiple dimensions of apathetic syndrome. Its psychometric qualities appear to make it particularly suitable for assessing changes in the manifestations of apathy: in the future, the scale could thus constitute an interesting outcome variable for evaluating the efficacy of potential apathy treatments. Further studies will have to demonstrate the ability of the LARS to specify the apathy profile of different patient groups in relation to a given disease aetiology or severity level.