Background and purpose To determine the efficacy of the ACTIVLIM-Stroke questionnaire in detecting changes in functional status of stroke patients.
Method Sixty-eight Benin and Belgian stroke patients participated in this 2-month longitudinal study, involving baseline and follow-up assessments. Outcome measures combined ACTIVLIM-Stroke questionnaire, Barthel Index (BI), 6-minute-walk test (6MWT) and modified Rankin scale (MRS). Responsiveness of ACTIVLIM-Stroke was investigated through different methodological approaches and compared with BI, 6MWT and MRS. Statistical analyses were performed using the paired t tests, effect size (ES) and correlation tests.
Results ACTIVLIM-Stroke detected changes in the whole sample (p<0.001, ES=0.78) and even in a subgroup of patients (p<0.001, ES=0.29) that were classified as stable according to the MRS. Moreover, ACTIVLIM-Stroke permitted the classification of patients into more discriminative groups, including those showing an important improvement (p<0.001, ES=1.87), a slight but non-clinically meaningful improvement (p<0.001, ES=0.38), and no improvement (p=0.1, ES=0.11), demonstrating its high sensitivity to change. Furthermore, there were concordant relationships between ACTIVLIM-Stroke change and any observed changes in BI, 6MWT and MRS scores (r≥0.50, p<0.001), confirming the external responsiveness of ACTVLIM-Stroke.
Conclusions ACTIVLIM-Stroke showed good responsiveness and can detect accurately clinical changes in the functional status of stroke patients. The BI and the 6MWT were also responsive and may provide complementary information while investigating change in functional status. However, in addition to being highly sensitive to change, ACTIVLIM-Stroke presents significant methodological advantages for quantifying functional changes in stroke patients.
Statistics from Altmetric.com
Stroke is prevalent worldwide and is a leading cause of death and disability.1 ,2 The effects of stroke on the victim's quality of life are multifaceted and generally more pronounced in the physical dimension, on cognition, language and mood. As described by the International Classification of Functioning, Disability and Health framework, efficient management of poststroke patients requires global assessment, which accounts for impairments, activity limitations and restriction of participation, while also considering personal and environmental factors.3 This global assessment supposes the measurement of manifest and latent relevant variables, the latter often requiring the use of patient-reported outcome measures. Acquiring accurate information regarding functional recovery after stroke is essential, as it serves as a baseline for healthcare planning and facilitates therapeutic guidelines. Functional recovery after stroke is generally evaluated using scales that are usually incorporated into clinical trials and are routinely used during clinical practice.4
ACTIVLIM-Stroke is a self-reported Rasch-built questionnaire that measures the ability of stroke patients to perform daily living activities. It comprises 20 items and considers various aspects of activity limitations.5 It offers the advantage of cross-cultural validity between African and European contexts. Its validation in these contexts makes ACTIVLIM-Stroke a powerful tool for advancing stroke rehabilitation in multicentric studies. It also shows excellent psychometric properties, including validity, reliability, unidimensionality and invariance. However, its ability to reveal change over time has not yet been determined.
Turner et al6 reported that an instrument measuring health status should be responsive to changes. Consequently, responsiveness, which reflects the ability of a questionnaire to detect changes over time, is an important psychometric quality and an essential criterion for instrument selection when measuring stroke patient function. Therefore, studies that evaluate the responsiveness of ACTIVLIM-Stroke measure are warmly required.
This study investigates the responsiveness of ACTIVLIM-Stroke in comparison with selected well-known outcome measures, such as the Barthel Index (BI) and the 6-minute walk test (6MWT).
We performed a 2-month longitudinal study involving baseline and follow-up assessments. Sixty-eight stroke patients (mean age 58.52±12.33 years old; poststroke duration 6.86±11.63 months; sex 53% female and 47% male) were recruited from the Rehabilitation Departments in Benin and Belgium. Subjects were eligible for inclusion if they had experienced a stroke event at least 2 weeks prior to examination, were undergoing rehabilitation or not, scored 24 or more on the Mini Mental State Examination and freely agreed to participate. Functional status information was collected at baseline and follow-up time points.
The study was approved by the Ethics Committee of the Université catholique de Louvain in Belgium and the local ethics committees of the participating caregiver centres and hospitals in Benin. Each participant signed an informed consent form prior to inclusion in the study.
Functional status was measured using a variety of instruments, combining patient-reported and clinician-reported anchors with observed performance. The primary outcome measure was ACTIVLIM-Stroke. Secondary outcome measures were the BI and the 6MWT. The BI is one of the most widely used generic and ordinal scales that measure severity of disability in daily life. It measures a range of activity domains such as communication, eating, self-care, mobility and transfer through 10 items.7 The 6MWT is a performance test that evaluates walking endurance; it consists in recording the distance (m) a subject walked in 6 min.8 This test is demonstrated to reflect people functional status.9 ,10 Participants performed the 6MWT by walking as quickly as possible on a flat ground square path with a 50–85 m perimeter marked on the floor. They were allowed to stop and rest as necessary. Patients were informed of the time after 2, 4 and 5 min, and the distance walked was recorded to the nearest metre. The higher the score after each outcome measure, the better the functional status of the patient. As such, a positive difference between the second and first scores reflects a positive clinical change.
In addition to the BI and 6MWT, the modified Rankin scale (MRS), which is an assessment of overall disability, was employed to characterise the change in health status of the participants over the 2-month period. This clinician-rated scale categorises the severity of disability based on observation, ranking patients in seven different levels from ‘no symptoms at all’ to ‘dead’. An increasing MRS score indicates worsening status.11
Analyses were conducted using three approaches: global, group and individual. The global approach consisted of statistically comparing patient baseline scores with those reported at the 2-month follow-up. The group approach allowed investigation of responsiveness indices by patient groups classified as improved, stable or deteriorated according to their MRS scores over the study period. Finally, the individual approach evaluated changes in the functional status of each patient by applying the following formula12: m1 and m2 represented the patient's functional status measures, respectively, at baseline (time 1) and follow-up (time 2). SE1 and SE2 correspond to their respective associated SE of measurements at time 1 and time 2. The t score approximately follows a standardised normal distribution such that a patient with a t score above 1.96 showed important improvement, while a patient with a t score below −1.96 was significantly worsening.12 Patients were divided into five classes according to their t score significance limits: important deterioration (t score <−1.96), slight deterioration (−1.96≤ t score <0), no change (t score=0), slight improvement (0< t score ≤1.96) and important improvement (t score >1.96).
Analysis of internal responsiveness to change was performed by comparing the follow-up data with baseline data using a paired t test and effect size (ES) where applicable. The ES is calculated as the ratio between the mean change and the SD of the baseline score.13 Analysis of external responsiveness was evaluated by examining the relationships and concordance between ACTIVLIM-Stroke change and any observed changes in the BI and 6MWT scores. The magnitude of change was interpreted according to Cohen's classification, which describes change as non-significant (ES<0.2), small (0.2≤ES<0.5), moderate (0.5≤ES<0.8) and large (ES≥0.8).14
In terms of internal responsiveness, the mean change in the 68-subject cohort was calculated and the statistical significance was tested based on a paired t test and a signed rank test, respectively, for linear and ordinal-type instruments (table 1). ACTIVLIM-Stroke revealed a significant positive change (p<0.001), indicating that the overall sample experienced a general health improvement between time 1 and time 2. The ES was 0.78, which corresponded to a large clinical change. The clinicians’ ratings of overall disability using the MRS showed that 35 patients (mean poststroke delay of 2.01±2.6 months) improved and that 32 patients (mean poststroke delay of 10.3±7.7 months) were stable, while one deteriorated. As only one patient was rated as deteriorated, the responsiveness analysis by group approach was undertaken with the improved and stable groups (table 2). In the improved group, ACTIVLIM-Stroke confirmed a significant and large positive change (p<0.001; ES=0.95). In the stable group, ACTIVLIM-Stroke also detected significant changes (p<0.001, ES=0.29); however, this change was considered minor given the small ES.
Analyses based on an individual approach (figure 1) revealed that 8 had a t score between −1.96 and 0, 4 had a t score of 0, 25 had a t score between 0 and 1.96, and 31 patients had a t score above 1.96. This distribution showed that in the group of patients qualified as stable by clinicians according to MRS change score, some had experienced small improvement that was not detectable by MRS. As such, patients were classified into three different groups according to the magnitude of their change as follows: (1) importantly improved (n=31), (2) non-clinically meaningfully improved (n=25) and (3) not improved (n=12) (table 3).
Correlations in external responsiveness were observed between scores changes from all instruments. There was a strong positive correlation between ACTIVLIM-Stroke change and BI change (p=0.62, p<0.001), MRS change (p=0.64, p<0.001) and 6MWT change (r=0.50, p<0.001). Moreover, measurements obtained from 6MWT, BI and MRS confirmed the overall improvement detected by ACTIVLIM-Stroke in the global approach analysis (table 1). BI and 6MWT also demonstrated a large positive change in the group of patients classified as improved according to the MRS (table 2). However, in the stable group, only the 6MWT confirmed no significant change (p=0.13; ES=0.11). The BI similarly detected a small significant improvement, similar to ACTIVLIM-Stroke, confirming the higher sensitivity of ACTIVLIM-Stroke compared with MRS (table 2). Furthermore, a strong concordance was observed between individual self-reported changes based on ACTIVLIM-Stroke and all other instruments (table 3). Indeed, in the groups of patients that reported important individual improvements, all instruments showed significant (p<0.001) and clinically meaningful changes with ESs of 1.87 and 1.09, respectively, for ACTIVLIM-Stroke and the 6MWT. In the group that reported a non-clinically meaningful improvement, all instruments were concordant as they revealed a significant change (p≤0.03). Furthermore, ACTIVLIM-Stroke (ES=0.38) and 6MWT (ES=0.36) confirmed that these changes were clinically small. Accordingly, in the group that reported no individual improvement, none of these instruments detected statistical significant change (p≥0.1; ES=0.11). Moreover, the higher the poststroke delay was, the lower the ACTIVLIM-Stroke mean change was (figure 2 and table 3), demonstrating a relationship between functional recovery and poststroke duration.
This study evaluated the ability of the ACTIVLIM-Stroke questionnaire to measure relevant changes in stroke patients in comparison with two widely used outcome measures: the BI and the 6MWT. Results showed that ACTIVLIM-Stroke is very sensitive to change and can accurately measure minor and clinically relevant changes in patients over time.
Our analyses showed that changes in ACTIVLIM-Stroke were associated with changes in BI and 6MWT. Data obtained from ACTIVLIM-Stroke, BI and 6MWT all demonstrated that the functional status of the whole sample significantly improved after 2 months. ACTIVLIM-Stroke seemed to be more responsive than BI or 6MWT as it revealed a higher ES. However, this global approach may mask important information as data were pooled together, allowing no distinction between the patients that made significant improvements and those who did not. As such, despite the overall significant improvement, there were wide variations in the magnitude of change among the 68 stroke patients analysed. To rectify this, patients were classified into improved or stable groups based on the MRS as determined by clinicians. This group approach analyses revealed that patients who were classified as improved showed significant improvement with ACTIVLIM-Stroke, BI and 6MWT. In the stable group, only 6MWT showed no significant changes. ACTIVLIM-Stroke and BI detected significant, yet small changes in this group, suggesting that they may be more sensitive than MRS and 6MWT. Although MRS is a valuable instrument for assessing stroke patient global outcomes,15 yet it demonstrates a lack of sensitivity to small changes.16 This can be explained by the fact that it is a single-item scale that allows only a broad description of a patient's overall status through a seven-level response format (from ‘no symptoms at all’ to ‘dead’). The ACTIVLIM-Stroke and the BI comprised 20 and 10 non-redundant items, respectively. As such, they capture more details than the MRS, permitting more discrimination and consequently more sensitivity to small changes. In addition, the 6MWT, which was designed to reflect functional recovery in chronic disabled people,9 ,10 is a specific test of walking capacity. It measures only part of physical performance (ie, locomotion) and does not evaluate other subdomains of functional status and, therefore, may not be sensitive to small changes occurring in a subdomain other than locomotion. This could justify why 6MWT is less sensitive than the ACTIVLIM-Stroke questionnaire. Similarly, BI exhibits less sensitivity compared with ACTIVLIM-Stroke. This result can be due to the fact that BI measures basic ADL with less detail compared with other commonly used activity of daily living (ADL) scales such as the Functional Independence Measure (FIM) or the Frenchay Index. It is possible that FIM appears to be as sensitive as the ACTIVLIM-Stroke; however, this hypothesis still needs to be investigated.
As shown in previous studies,17 ,18 an individual approach, such as the one taken in the present study, provides essential information regarding responsiveness, given that individuals in a group do not equally experience the mean change of the group. Here, an individual approach analysis allowed the investigation of how each of the 68 patients improved after the 2-month time frame. Based on their individual self-reported activity limitations, patients could then be classified not only as improved or stable, but also in more discriminative groups as having importantly or slightly improved, or showing no improvement. This indicates that, compared with the MRS, ACTIVLIM-Stroke has a greater ability to detect change. It is important to note that the individual approach could only be applied with ACTIVLIM-Stroke, which, like any Rasch-built scale, provides the individual's measures with associated SEs. Indeed, using the approach developed by Wright and Stone,12 the SEs are an essential requirement for investigating statistically significant individual changes.
The degree of agreement between changes measured by different instruments is essential in responsiveness investigations. This study showed that the individual change reported by patients themselves matched the change measured by other instruments, confirming the external responsiveness of ACTIVLIM-Stroke. 6MWT and BI revealed important significant mean changes in the group of patients who reported having importantly improved. They showed only a small mean change in the patient group reporting non-clinically meaningful improvement, while they showed no significant change in the group reporting no improvement over the 2-month period. These results also demonstrated that patient's self-report scores related well to clinician-based scores and observed performances, confirming previous findings.19 ,20
In addition, our selected outcome measures covered a variety of assessment methods. While BI and MRS are clinician-reported anchors, ACTIVLIM-Stroke is a self-reported questionnaire and 6MWT is a performance-based test. Combining all of these outcome measures allowed a multidimensional assessment of change as determined by the patient's own judgement, the clinician's perception and observed performance.
Our results confirmed that ACTIVLIM-Stroke, BI and 6MWT can be used to monitor the functional recovery of patients undergoing rehabilitation. Nonetheless, it appeared that ACTIVLIM-Stroke showed a greater sensitivity to change. BI and 6MWT present some shortcomings compared with ACTIVLIM-Stroke. First, BI has previously been described as less sensitive to change at extremes of disability (floor and ceiling effects).16 ,21 Second, being an ordinal scale, BI restricts statistical investigations. For example, analysis should be limited to non-parametric statistics with no possibility to compute mean changes and ESs. Third, both BI and 6MWT do not provide associated SE to individual measures, making statistical investigations of responsiveness at individual level impossible. More importantly, individual approach of clinical change is essential in clinical trial since it has been demonstrated that ES calculations are potentially misleading indicators of rating scale responsiveness at the group comparison level so that, when comparing various instruments, ES computed at group level should be interpreted carefully.22 Therefore, it is important to analyse change at individual level since changes that have a meaning in groups may not be meaningful in individuals.23 Moreover, an outcome measure that permits decision making regarding the significance of a change on an individual basis is extremely useful in clinical practice, where each patient constitutes his own control over time. The linear characteristics of ACTIVLIM-Stroke and the fact that it provides individual SEs allow thorough investigations of its responsiveness in global, group and individual approaches. This particular advantage makes ACTIVLIM-Stroke a recommendable tool in clinical trial and practice. As it is time efficient, this scale is also a reasonable option when tracking changes in stroke patients’ functional status and may provide assistance when making decisions regarding the efficacy of interventions or treatments.
It has been argued that the use of more sensitive measurement instruments would increase the evidence on the efficacy of rehabilitation treatment.16 ,24 A good health status outcome measure should allow the detection of small changes over time. This property is essential when analysing the efficacy of a treatment or when monitoring the variations in the health status of a cohort of patients. This study confirms that distribution-based methods and anchor-based approaches are complementary in responsiveness investigations.4 ,25 It also shows that global scales with single item are much less sensitive to changes in disability than the activity of daily living scales.16 Our results also highlighted that, in this cohort, acute and subacute stroke patients experienced better improvement than chronic stroke patients (figure 2). This indicated that functional recovery might be more important in the phases immediately after stroke than in the chronic phase. This observation is supported by existing evidence that generally describes functional poststroke recovery as slowing down in chronic phase.26–28
Finally, instruments that are responsive in both severe and mild strokes are required for patients’ clinical follow-up. Indeed, due to ceiling effect, many available scales are useless in clinical trials that enrol patients with mild disease conditions. In this study, we have demonstrated the high sensitivity to change of ACTIVLIM-Stroke in patients with relatively moderate disability as revealed by the baseline scores, confirming its usefulness as outcome measure for clinical practice and trial. Nevertheless, further studies could analyse the responsiveness of ACTIVLIM-Stroke in patients with more severe disability. It would also be useful to investigate how responsive is the ACTIVLIM-Stroke questionnaire in comparison to other commonly used scales such as the Functional Independence Measure and the Frenchay Activity index.
In summary, different statistical indices and methodological approaches were combined to investigate the comparative responsiveness of ACTIVLIM-Stroke, BI and 6MWT. Our results showed that all these instruments are responsive and can be used to measure clinical changes in the functional status of stroke patients. ACTIVLIM-Stroke presents good internal and external responsiveness and can detect even small changes. This scale appears to be more sensitive than BI and 6MWT, and offers significant methodological and statistical advantages for investigating change.
The authors thank the patients who participated in the study.
Contributors CSB, IN, JO and J-LT contributed to the study design and the data analysis. CSB, IN, JO and VH participated in the data collection. CSB performed the statistical analyses and drafted the manuscript. All authors participated in the data interpretation, critically revised the draft of the manuscript for important intellectual content and contributed to the writing. All authors have read and approved the final manuscript.
Competing interests None.
Patient consent Obtained.
Ethics approval Committee of the Université Catholique de Louvain in Belgium and the local ethics committees of the participating caregiver centres and hospitals in Benin.
Provenance and peer review Not commissioned; externally peer reviewed.
Data sharing statement Dataset is available from the corresponding author at firstname.lastname@example.org