- Research article
- Open Access
Does targeting manual therapy and/or exercise improve patient outcomes in nonspecific low back pain? A systematic review
BMC Medicine volume 8, Article number: 22 (2010)
A central element in the current debate about best practice management of non-specific low back pain (NSLBP) is the efficacy of targeted versus generic (non-targeted) treatment. Many clinicians and researchers believe that tailoring treatment to NSLBP subgroups positively impacts on patient outcomes. Despite this, there are no systematic reviews comparing the efficacy of targeted versus non-targeted manual therapy and/or exercise. This systematic review was undertaken in order to determine the efficacy of such targeted treatment in adults with NSLBP.
MEDLINE, EMBASE, Current Contents, AMED and the Cochrane Central Register of Controlled Trials were electronically searched, reference lists were examined and citation tracking performed. Inclusion criteria were randomized controlled trials of targeted manual therapy and/or exercise for NSLPB that used trial designs capable of providing robust information on targeted treatment (treatment effect modification) for the outcomes of activity limitation and pain. Included trials needed to be hypothesis-testing studies published in English, Danish or Norwegian. Method quality was assessed using the criteria recommended by the Cochrane Back Review Group.
Four high-quality randomized controlled trials of targeted manual therapy and/or exercise for NSLBP met the inclusion criteria. One study showed statistically significant effects for short-term outcomes using McKenzie directional preference-based exercise. Research into subgroups requires much larger sample sizes than traditional two-group trials and other included studies showed effects that might be clinically important in size but were not statistically significant with their samples sizes.
The clinical implications of these results are that they provide very cautious evidence supporting the notion that treatment targeted to subgroups of patients with NSLBP may improve patient outcomes. The results of the studies included in this review are too patchy, inconsistent and the samples investigated are too small for any recommendation of any treatment in routine clinical practice to be based on these findings. The research shows that adequately powered controlled trials using designs capable of providing robust information on treatment effect modification are uncommon. Considering how central the notion of targeted treatment is to manual therapy principles, further studies using this research method should be a priority for the clinical and research communities.
'Identifying what treatment works best for whom'  in low back pain has been an on-going aim of clinicians and has been a research priority over the last decade . Central to that aim is the notion that targeting treatment to subgroups of people with low back pain might improve patient outcomes and increase health system efficiency. If this aim were achieved, the impact would be widespread, as back pain affects most people at some point in their lives  and the consequent health care, community and personal costs are considerable .
However, a definitive diagnosis is not possible in 80% of low back pain and is most accurately labelled 'nonspecific low back pain' (NSLBP). Perhaps due to this diagnostic imprecision, there is an ongoing debate about the best treatment for NSLBP  and considerable variability in its management across clinical disciplines. For some clinical disciplines, back pain is a frequent reason for people who seek their assistance , especially for the primary care professions that practise manual therapy. Manual therapy treatment usually involves manual techniques (mobilization, manipulation and traction) and exercise  and these key treatment approaches are recommended in recent international guidelines for the management of low back pain [5, 8, 9].
A central element in the current debate about best practice management of low back pain is the efficacy of targeted versus generic (non-targeted) treatment. Many primary care clinicians (chiropractors, general practitioners, physiotherapists and osteopaths) resist the notion that non-targeted treatment is appropriate. In clinical practice, they observe highly variable patient presentations and most believe that targeting treatment to people with particular patterns of symptoms and signs (treatment effect modifiers) results in better patient outcomes . Treatment decisions are influenced by this belief but there is little agreement about what symptoms and signs are important treatment effect modifiers  and some argue that non-targeted treatment may be equally effective [12, 13]. Research that adequately addresses the validity of these disparate approaches to the care of NSLBP is fundamental in order to break the logjam that currently inhibits consensus regarding best practice.
The design of a controlled trial traditionally applies different treatment to two or more groups randomized from a population sample. An assumption is that these groups are similar at baseline on all variables likely to influence outcome and, therefore, any difference in outcome is due to one treatment being more effective than another. However, if the subgrouping approach is correct and important treatment modifiers do occur in NSLBP, trials using this traditional research design may not recognize the heterogeneity of treatment response in participants. This could result in potential treatment effects being diluted by subgroups of people who are unlikely to respond. Due to differential sampling, this unrecognized heterogeneity may also result in contradictory results from replication studies.
Most controlled trials of manual therapy or exercise for NSLBP have been performed using a 'non-targeted treatment' design. Systematic reviews summarizing the results of these trials conclude that these treatments result in better patient outcomes than if no treatment is given, but no particular treatment is clearly better than any other and all produce modest treatment effects [14–17]. On the other hand, clinicians convinced of the benefits of targeted treatment are supported by preliminary evidence that treatment targeted to empirically-derived subgroups of patients with NSLBP can improve patient outcomes and reduce the costs of care [18–21].
There are no systematic reviews comparing the efficacy of targeted versus non-targeted manual therapy and/or exercise. For a systematic review to summarize this evidence, the included trials must contain a clinical prediction rule that seeks to subgroup NSLBP based on treatment modifiers and match treatment to these subgroups. In theory, treatment efficacy may vary depending on the prediction rule used to identify the target population. Therefore, the clinical prediction rule/treatment combinations being tested should be clearly identified in systematic reviews of targeted treatment.
The potential for spurious findings is present at all phases of subgrouping research. Trials in which post-hoc analysis was performed in order to identify treatment modifiers (hypothesis-setting studies) provide a lower level of evidence than those where the clinical prediction rule is clearly identified before the start of the trial (hypothesis-testing studies) .
There is an important distinction between prognostic factors and treatment effect modifiers. Prognostic factors are symptoms and signs that indicate likely outcomes regardless of treatment. Treatment effect modifiers are symptoms and signs that indicate likely response to a specific treatment. Correct analysis of a clinical prediction rule/treatment effect involves a test of interaction that allows differentiation between prognostic effects and treatment effect modifiers [22, 23]. Systematic reviews of targeted treatment need to report this interaction and describe when it is not possible for it to be identified.
The aim of this systematic review was to determine the efficacy of targeted manual therapy and/or exercise on pain and activity limitation in adults with NSLBP.
Types of studies
Inclusion criteria were randomized controlled trials (RCT) comparing targeted manual therapy and/or exercise interventions to non-targeted interventions in NSLBP. They needed to be hypothesis-testing studies and, due to a lack of translation resources, published in English, Danish or Norwegian.
Some studies of targeted treatment investigate simple clinical prediction rules that predict treatment response to one treatment  and others investigate more complex clinical reasoning (a subgroup system) that predict treatment response to a number of treatments . For simplicity, in this review the term 'clinical prediction rule' refers to both situations.
Many RCTs that aim to investigate targeted treatment use two-group designs that, due to confounding by different treatments in the two groups or by the research design used, do not provide robust information on treatment effect modification. That is, they are not comparing one targeted treatment with the same treatment when untargeted or they compare two different treatments using methods that do not allow one to determine whether subgrouping is important . Some recent RCTs have overcome this confounding effect by using a more precise design in which the two treatment groups are accompanied by a clinical prediction rule covariate . This covariate allows the identification of the treatment effect in the people who were positive on the prediction rule and the treatment effect in the people who were negative on the prediction rule. This RCT design does provide robust information on treatment effect modification  and, in this review, such a study was classified as a 'two-group plus subgroup covariate RCT'.
A second RCT design that is also capable of providing information on the effect of treatment targeted to subgroups is a design that includes more than two treatment groups and tests multiple clinical prediction rules (a subgroup system). In this design, randomization to multiple treatment groups is blind to prediction rule status and the people in each treatment group are a mixture of those who are rule-positive and those who are rule-negative. Post-hoc unblinding of rule status allows the identification of the effect of rule-matched and rule-unmatched treatment. In this review, such a study was classified as a'multi-arm subgroup-system RCT'. A disadvantage of this design is that, usually, only the effect size for the whole subgroup system is reported and, therefore, it is not possible to determine if the treatment modifier effects across the subgroups vary - possibly being important for some treatment subgroups in the system and not for others.
Therefore, an additional inclusion criterion for this review was that RCTs needed to be either a 'two-group plus subgroup covariate RCT' or a 'multi-arm subgroup-system RCT'. Exclusion criteria were: observational studies and uncontrolled studies; studies comparing non-targeted interventions; and studies comparing two targeted interventions.
Types of participants
Participants needed to be experiencing NSLBP, but they could not be pregnant. Arbitrarily, more than 85% of participants needed to be aged 18 years or over. Trials containing people with both low back pain and leg pain were included if at least 85% of the participants had no symptoms or signs of neurocompression (numbness, pins and needles or lower limb muscle weakness) or 'sciatica'. Studies containing participants with specific low back pain (for example, fracture, infection, cancer or inflammatory arthritis) were excluded.
Low back pain was defined as pain occurring below the lower ribs and above the gluteal folds, including the buttocks. Using the Cochrane Back Review Group criteria , the duration of back pain was categorized as acute (less than 6 weeks), sub-acute (6 - 12 weeks) and chronic (greater than 12 weeks). Trials with at least 85% of participants whose duration of pain was in one of these temporal categories were only classified under that category.
Types of intervention
Mobilization, manipulation and traction were classified as 'manual therapy' and were classified as 'exercise' if they included therapeutic exercise . Included studies had to report sufficient data in order to determine the size of the effect attributable to the targeted therapy, including point estimates and measures of variability. The clinical prediction rule used to target the intervention had to have been clearly identified before the trial commenced (hypothesis-testing studies). Trials in which post-hoc analysis was performed in order to identify the responders to the intervention (hypothesis-setting studies) were excluded. The clinical prediction rule had to be concordant with the intervention. For example, a trial of 'McKenzie exercises' which contained no assessment of directional preference would have been excluded, as directional preference is central to the clinical reasoning used to determine McKenzie exercise prescription.
Types of outcome measures
Self-reported pain and activity limitation were the outcome measures. Results were defined as short-term if they were measured less than 3 months after randomization, intermediate-term when between 3 months and 1 year and long-term when greater than 1 year. Where outcomes were measured at multiple time points within these time frames, arbitrarily the outcomes closest to 6 weeks, 6 months and 18 months were used.
Data sources and search strategy
The electronic databases MEDLINE, EMBASE, Current Contents, AMED and The Cochrane Central Register of Controlled Trials were searched from inception to February 2009. Reference and citation tracking of included articles were performed. A sensitive search strategy was used which was based on that recommended by the Cochrane Back Review Group  and is available on request from the first author (PK).
Selection of studies for inclusion and assessment of method quality
In order to clarify any misinterpretation, the inclusion criteria form was independently piloted on five eligible abstracts by the reviewers (HM, DP and PK). The method quality criteria recommended by the Cochrane Back Review Group  were adapted for this review of targeted treatment (see Appendix). All reviewers independently piloted these criteria on three excluded papers. The screening on title and abstract, screening of retrieved articles, method quality assessment and data extraction were independently performed by two reviewers (HM and DP). Any disagreement was resolved by discussion, including a third independent reviewer if required (PK). Assessment of method quality was not blinded to trial authors, institution or journal.
A high quality trial was defined as a trial that obtained, at a minimum, a 'yes' score for randomization, allocation concealment, outcome assessor blinding and also a 'yes' score for any three of the other method quality criteria . As this was a review of targeted manual therapy and/or exercise, 'blinded to the intervention' was defined as inadequate information for the patient or clinician to know whether or not the intervention received was aligned with the specific clinical prediction rule being tested.
Relevant data were independently extracted by two of the reviewers (HM and DP) using a standardized form that included: quality criteria; participant characteristics; trial characteristics; description of interventions; the decision rules used to target the manual therapy or exercise; and point estimates and measures of variability for outcomes. The data extraction form was pilot-tested on three excluded trials. If necessary, the trial authors were contacted for additional information [18, 19, 21, 26].
The group means and standard deviations (SD) were extracted for each comparison on each available outcome measure. Where a SD had to be calculated from a confidence interval, it was calculated using the method described in the Cochrane Handbook (v4.2.5 p117) .
All the included trials measured activity limitation using the Oswestry Disability Index  or the Roland-Morris Disability Questionnaire . These two assessment instruments attempt to measure the same underlying construct of activity limitation and have been shown to display similar responsiveness [30, 31]. All the included trials that measured the effects on pain used a Visual Analogue Scale or Numerical Rating Scale. For the purposes of this systematic review, data derived using either of these activity limitation questionnaires were treated as being comparable and data from either of these measures of pain intensity as being comparable. All pain, activity limitation and patient satisfaction scores were converted to a 0-100 scale.
Calculating the efficacy of targeting treatment
The mean and SD for each group, for each comparison, at each outcome period (if available) were entered into Cochrane Collaboration Revman (v5.0.2) software to calculate mean effect (mean difference) . Where the outcomes of two treatment groups were combined to create a comparison treatment  an n-weighted mean and SD were calculated. The direction of all the reported results was standardized so that effects favouring targeted treatment were positive scores. Meta-analysis was not performed due to the clinical diversity and methodological variability of the included studies.
In studies where individual patient data are available, the appropriate statistical analysis needed to differentiate between treatment modifier effects and prognostic effects in a targeted treatment is a test of interaction, often a form of ANOVA [22, 23, 26]. This is important, as not identifying the prognostic effects in a clinical prediction rule will lead to an over-estimate of targeted treatment efficacy. However, it is rare that authors of systematic reviews have access to individual patient data and, therefore, reviewers of results from targeted treatment studies usually only have access to group summary outcome data. Mark Hancock (personal communication, 2009) recommends that, in circumstances where only study level data are available, the following formula be used to determine the interaction between treatment modifier and treatment allocation:
((people prediction rule-positive and received rule-positive treatment) - (prediction rule-positive and comparison treatment)) - ((prediction rule-negative and rule-positive treatment) - (prediction rule-negative and comparison treatment))
The first (double bracketed) part of this formula determines the treatment effect in people who are positive on the clinical prediction rule and who, therefore, share the same prognostic effect as the prediction rule. The second (double bracketed) part of the formula determines the treatment effect in people who are negative on the clinical prediction rule and who also share the same prognostic effect of the prediction rule. The product of the formula is an estimate of the 'treatment effect modifier size' (treatment effect in rule-positive people minus treatment effect in rule-negative people) when the prognostic effect of the prediction rule has been removed. This process is illustrated in Figure 1. In the current review, this Hancock formula was used whenever these data from a 'two-group plus subgroup covariate RCT' were available [19, 26]. In this review, the term 'subgroup system effect size' is used to describe the additional effect of subgroup-matched treatment compared with non-subgroup-matched treatment when identified in a 'multi-arm subgroup system RCT'.
To aid the interpretability of the results, the total improvement (clinical course) of the targeted group from baseline was displayed diagrammatically as a proportion of its baseline score. Two components of that improvement were displayed: (a) the proportion of that change attributable to the additional effect of targeting treatment using the clinical prediction rule; and (b) the proportion of that change attributable to other reasons (natural history, nonspecific treatment effects and the likely improvement had this group received the comparison treatment).
A flow chart (Figure 2) documents the selection process of trials included in the review. Four studies met the inclusion criteria [18, 19, 21, 26]. The characteristics of the included studies are shown in Table 1. The reasons for the exclusion of other trials retrieved in full text are noted in Table 2.
The quality assessment scores for the included studies are shown in Table 3. The median quality assessment sum score was 8, with a range of 7 to 10. All four RCTs met the criteria of the Cochrane Back Review Group  for high quality studies.
Effects of targeting treatment
The included studies investigated a total of three clinical prediction rules for targeting treatment. These are summarised in Table 4 and were the McKenzie directional preference-based exercise , the Delitto Treatment Based Classification method  and the Flynn manipulation prediction rule .
The mean effects (mean difference) for all the target treatments are shown in Table 5. Of the 10 treatment/outcome combinations that were reported across all the studies, using the statistical methods employed in this review, two (20%) showed mean effects that were statistically significant (P < 0.05) and both were from the same trial . For reference, the means and SD for the groups in each study are listed in Additional file 1.
Improvements in patient outcomes (clinical course) for the target treatment group are displayed diagrammatically in Figures 3 to 4 as a proportion of baseline scores. As the capacity to identify treatment modifier effects varies between two-group plus subgroup covariate RCTs and multi-arm subgroup system RCTs, the results from studies with the same RCT design type are presented together.
McKenzie directional preference-based exercises
A single high quality study investigated McKenzie directional preference-based exercise . It was a multi-arm subgroup system RCT that showed statistically significant improvements in short-term activity and short-term pain limitation due to the matched treatment effect. The size of these effects ranged from 22.8% to 33.8% of baseline scores. As this study only included people with a directional preference, these results are only applicable to people who display a directional preference. However, as it is not clinically congruent to give directional preference exercises to people without a directional preference, this is a reasonable limitation to the generalizability of the study.
The analysis used in this review compared the effect of directional preference exercises with the mean of both comparison groups (the opposite direction exercise group and the non-directional exercise group), but an alternative would have been to use only the opposite direction exercise group as the comparison. As clinicians and patients were not blind to treatment group allocation in this trial, treatment expectation may have inflated the subgroup system effect size in this RCT. Use of only the opposite direction exercise group as the comparison group would have resulted in slightly larger effect sizes (within 3 points on a 0-100 scale). As the outcomes in the opposite direction exercise group may have been biased by a low expectation of treatment outcome, in this review the mean of both comparison groups was chosen as a more conservative estimate of effect.
In summary, a three-way test of interaction was not performed in this study to assess the ability of the prediction rule to identify people who respond to this matched treatment. However, the size of the matched treatment effect was statistically significant for both short-term activity and short-term pain limitation.
Delitto treatment-based classification
A single high quality study investigated the Delitto treatment-based classification method . The results of this multi-arm subgroup system RCT showed matched treatment effects of 12.9% of baseline scores for short-term activity limitation and 8.5% for short-term pain but neither were statistically significant. In summary, a three-way test of interaction in this study indicated that the ability of the prediction rule to identify people who respond to a matched treatment was statistically significant, but a test of the size of that matched treatment effect was not statistically significant.
Flynn manipulation prediction rule
Two high quality studies [19, 26] investigated the Flynn manipulation prediction rule. Both used the two-group plus subgroup covariate RCT design and were analysed in this review using the Hancock formula.
Childs et al.  compared targeted spinal manipulation plus range-of-motion exercises with the control treatment of guidelines-based exercise. The results showed a treatment modifier effect size of 21.1% of baseline scores for short-term activity limitation and 8.5% in intermediate-term activity limitation but neither was statistically significant. In summary, a three-way test of interaction in this study indicated that the ability of the prediction rule to identify people who respond to this targeted treatment was statistically significant, but using the statistical methods in this review, the size of that treatment modifier effect was not statistically significant.
Hancock et al.  compared the results of spinal mobilization with the control treatment of detuned ultrasound. The results showed a treatment modifier effect size of 8.4% of baseline scores for short-term pain and 0.6% in intermediate-term pain but neither was statistically significant. In contrast with Childs et al., these results showed 'no treatment modifier effect' on short-term or immediate-term activity limitation and this may reflect a number of factors. One factor may be that the Childs et al. RCT used the particular spinal manipulation technique that the Flynn rule was designed for, whereas the Hancock et al. RCT was a pragmatic study in which most patients received spinal mobilisation techniques and only 5% received manipulation. Another factor may be the different demographic and cultural settings in which the RCTs occurred. In summary, a three-way test of interaction in the Hancock et al. study indicated that the ability of the prediction rule to identify people who respond to a targeted treatment was not statistically significant and a test of the size of any treatment modifier effect was not statistically significant at any outcome time point.
Four RCTs of manual therapy and/or exercise met the inclusion criteria and all were of high method quality. Using the statistical methods in this review, only one study showed statistically significant effect sizes and these were for short-term outcomes  following McKenzie directional preference-based exercise. However, there are reasons for caution in the interpretation of these results.
Large effects were only observed in the multi-arm subgroup system RCT of Long et al. . Large effects could be defined as being of clinically important size and, in low back pain, clinicians and researchers believe these to be approximately 30% of baseline scores . However, in the Long et al. study, patients and clinicians were not blinded to treatment allocation and as one treatment group received exercise that was concordant with prediction rule status (directional preference) and another group received exercise that was opposite to the rule status, an expectation bias may have inflated the effect size. An alternative definition of clinically important effects could be a treatment modifier effect size that results in the cumulative effect being greater than 30% of baseline scores. For example, a treatment modifier effect size of 20% of baseline scores might be clinically useful if, when added to an existing effect of 15%, resulted in the total amount of change being clinically important.
Similarly, large effects were only observed in the RCT that studied people with chronic low back pain . It is possible that subgroup targeted treatment is more effective in people with persistent back pain than in people with a favourable natural history, but the available data in this review are inadequate to test that hypothesis.
All studies that showed, or showed a trend towards, statistically significant effect sizes, did so only for short-term outcomes. It may be that the effects of targeted manual therapy and/or exercise in NSLBP are transient or it may be that the recurrent nature of NSLBP means that treatments that are effective for an episode of pain may, nonetheless, be unable to influence the rate of recurrence or severity of subsequent episodes.
Although the two-group plus subgroup covariate RCT design potentially produces the most accurate estimate of treatment effect modifier size, it requires large samples to obtain adequate power . Only two studies in this review used this two-group plus subgroup covariate RCT design [19, 26]. Given their sample sizes, the use of the Hancock formula resulted in increased uncertainty (larger confidence intervals) about the effect size and reinforces the need for adequate power to be included in the design of such RCTs . It is possible that, had the sample sizes been larger in some of the included studies, more effects of targeting treatment would have been statistically significant. For example, the Childs et al. study  showed a treatment effect modification size of 21.1% of baseline scores for short-term activity limitation but using the statistical method in this review, this result was not significant (P = 0.10). However, had the sample size in the study been increased to approximately 211 participants, instead of 131, this result would have been statistically significant. One method to reduce the uncertainty in estimates of the effect of targeted treatment in future trials would be to achieve adequate sample sizes through multi-centre and multi-national collaboration.
Tests of interaction, such a three-way ANOVA of treatment group × prediction rule status × time, can determine whether a clinical prediction rule identifies patients who respond to the target treatment but they do not quantify treatment effect modifier size. Treatment effect modifier size is the difference in effect in rule-positive people compared with rule-negative people at a particular outcome time period. Identifying the treatment effect modifier size requires alternative statistical techniques, such as observing the size of the interaction coefficient from linear regression. As it is uncommon for such results to be reported, systematic reviewers have to use alternative strategies, such as the Hancock formula, to compare effect sizes across studies. It is possible that a prediction rule may, at a statistically significant level, identify people who respond to a target treatment but the treatment effect modifier size at a particular time point is not statistically significant or clinically important. In a multi-arm subgroup system RCT, the equivalent test that identifies the matched treatment effect size is a T-test, or a pairwise post-hoc comparison after an ANOVA.
If it is important to investigate whether targeted manual therapy is more useful than non-targeted care, more studies that use this two-group plus subgroup covariate RCT design and tests of interaction should be performed, as they uniquely allow precise identification of treatment effect modifier size for a specific treatment/subgroup combination. Even where statistically significant results of clinically important size are reported in such studies, there remains a need for replication studies, preferably by independent research groups, in order to clarify the stability of findings across samples, care settings and cultures. Considering how central the notion of targeted treatment is to manual therapy principles, it is noteworthy how few high quality, well-designed RCTs have been performed to test hypotheses about the efficacy of targeted treatment.
The strengths of this systematic review are the comprehensiveness of the search, the rigour in the method of analysis and the attempt to address the absence of previously published reviews that quantify the effects of targeted manual therapy and/or exercise on patient outcomes. A limitation of this review is that as only published papers in the English, Danish and Norwegian languages were included, there is potential for the findings to contain cultural and publication bias.
Four high quality studies were included in this review of targeted manual therapy and/or exercise for NSLBP. Statistically significant effects were rare and when present were only for short-term outcomes. The clinical implications of these results are that they provide very cautious evidence supporting the notion that treatment targeted to subgroups of patients with NSLBP may improve patient outcomes but this notion has yet to be adequately researched. The results of the studies included in this review are too patchy, inconsistent and investigated in samples too small for recommendations for targeting treatment in routine clinical practice to be based on these findings. The research implications are that adequately powered RCTs using designs capable of providing robust information on treatment effect modification are infrequent. Considering how central the notion of targeted treatment is to manual therapy principles, further studies using this research method should be a priority for the clinical and research communities.
Criteria list for the method quality assessment
Was the method of randomization adequate? A random (unpredictable) assignment sequence. Examples of adequate methods are computer generated random number table and use of sealed opaque envelopes. Methods of allocation using date of birth, date of admission, hospital numbers, or alternation will not be regarded as appropriate.
Was the treatment allocation concealed? Assignment generated by an independent person not responsible for determining the eligibility of the patients. This person has no information about the persons included in the trial and has no influence on the assignment sequence or on the decision about eligibility of the patient.
Were the groups similar at baseline regarding the most important prognostic indicators? In order to receive a 'yes', groups have to be similar at baseline regarding demographic factors, duration and severity of complaints, and value of main outcome measure(s).
Was the patient blinded to the intervention? The review author determines if enough information about the blinding is given in order to score a 'yes'. A yes is awarded if the participant was blind to the results of the clinical prediction rule, as the comparison in this review is between the effect of manual therapy or exercise when targeted using this clinical prediction rule versus the effect of the same manual therapy or exercise when not targeted.
Was the care provider blinded to the intervention? The review author determines if enough information about the blinding is given in order to score a 'yes'. A yes was awarded if the care provider was blind to whether the manual therapy or exercise was targeted or not, as the comparison in this review is between the effect of treatments when targeted versus the effect of the same treatments when not targeted.
Was the outcome assessor blinded to the intervention? The review author determines if enough information about the blinding is given in order to score a 'yes'. A yes is awarded if the outcome assessor was reported to be blinded to whether the intervention was targeted or not, even if the details of how blinding was maintained is not provided.
Were co-interventions avoided or similar? Co-interventions should either be avoided in the trial design or be similar between the index and control groups. A yes is awarded if the authors collected data on co-interventions and tested for differences between groups or if a comparable proportion of people in each group reported seeking additional treatment.
Was the compliance acceptable in all groups? The review author determines if the compliance to the interventions is acceptable, based on the reported intensity, duration, number and frequency of sessions for both the index intervention and control intervention(s) Acceptable compliance was defined as adherence levels of 80% for short-term follow-ups and 70% for intermediate-term and long-term follow-ups.
Was the drop-out rate described and acceptable? The number of participants who were included in the study but did not complete the observation period or were not included in the analysis must be described and reasons given. If the percentage of withdrawals and drop-outs does not exceed 20% for immediate-term and short-term follow-ups, 30% for intermediate-term and long-term follow-ups and does not lead to obvious bias a 'yes' is scored.
Was the timing of the outcome assessment in all groups similar? Timing of outcome assessment should be identical for all intervention groups and for all important outcome assessments.
Did the analysis include an intention-to-treat analysis? All randomised patients are reported/analysed in the group they were allocated to by randomization for the most important moments of effect measurement (minus missing values) irrespective of noncompliance and co-interventions. When data are missing, acceptable strategies, such as mean substitution, last recorded measurement etc are used in data analysis.
nonspecific low back pain
randomized controlled trials
Pransky G, Cifuentes M: Point of view. Spine. 2009, 34 (12): 1250-10.1097/BRS.0b013e3181a9e84b.
Borkan J, Koes B, Reis S, Cherkin D: A report from the Second International Forum for primary care research on low back pain - Reexamining priorities. Spine. 1998, 23 (18): 1992-1996. 10.1097/00007632-199809150-00016.
Walker BF, Muller R, Grant WD: Low back pain in Australian adults. Prevalence and associated disability. J Manipulative Physiological Therapeutics. 2004, 27 (4): 238-244. 10.1016/j.jmpt.2004.02.002.
Kent PM, Keating JL: The epidemiology of low back pain. Chiropractic Osteopathy. 2005, 13: 13-10.1186/1746-1340-13-13. doi:10.1186/1746-1340-13-13.
Savigny P, Kuntze S, Watson P, Underwood M, Ritchie G, Cotterell M, Hill D, Browne N, Buchanan E, Coffey P, et al: Low back pain: early management of persistent non-specific low back pain. 2009, London: National Collaborating Centre for Primary Care and Royal College of General Practitioners
Foster NE, Thompson KA, Baxter GD, Allen JM: Management of non-specific low back pain by physiotherapists in Britain and Ireland. A descriptive questionnaire of current clinical practice. Spine. 1999, 24 (13): 1332-1342. 10.1097/00007632-199907010-00011.
IFOMT: International Federation of Orthopaedic Manipulative Therapists. 2006, accessed 8 November, 2006., [http://www.ifomt.org/ifomt/about/omtdefinition]
Airaksinen O, Brox JI, Cedraschi C, Hildebrandt J, Klaber-Mofett J, Kovacs F, Mannion AF, Reis S, Staal JB, Ursin H, et al: European guidelines for the management of chronic nonspecific low back pain in primary care. Eur Spine J. 2006, 15 (Suppl 2): S192-298. 10.1007/s00586-006-1072-1.
van Tulder M, Becker A, Bekkering T, Breen A, del Real MT, Hutchinson A, Koes B, Laerum E, Malmivaara A: European guidelines for the management of acute nonspecific low back pain in primary care. Eur Spine J. 2006, 15 (Suppl 2): S169-191. 10.1007/s00586-006-1071-2.
Kent PM, Keating J: Do primary-care clinicians think that non-specific low back pain is one condition?. Spine. 2004, 29 (9): 1022-1031. 10.1097/00007632-200405010-00015.
Kent PM, Keating JL: Classification in non-specific low back pain - what methods do primary care clinicians currently use?. Spine. 2005, 30: 1433-1440. 10.1097/01.brs.0000166523.84016.4b.
Waddell G: The Back Pain Revolution. 1998, Edinburgh: Churchill Livingstone
Zusman M: Structure-oriented beliefs and disability due to back pain. Australian J Physiotherapy. 1998, 44 (1): 13-20.
Ferreira M, Ferreira P, Latimer J, Herbert R, Maher C: Efficacy of spinal manipulative therapy for low back pain of less than three months' duration. J Manipulative Physiological Therapeutics. 2003, 26 (9): 593-601. 10.1016/j.jmpt.2003.08.010.
Ernst E, Harkness E: Spinal manipulation: a systematic review of sham-controlled, double-blind, randomized clinical trials. J Pain Symptom Mgmt. 2001, 22 (4): 879-889. 10.1016/S0885-3924(01)00337-2.
Assendelft WJ, Morton SC, Yu EI, Suttorp MJ, Shekelle PG: Spinal manipulative therapy for low back pain (Cochrane Review). The Cochrane Library. 2004, Chichester: John Wiley & Sons, Ltd
Cherkin DC, Sherman KJ, Deyo RA, Shekelle PG: A review of the evidence for the effectiveness, safety, and cost of acupuncture, massage therapy, and spinal manipulation for back pain. Ann Intern Med. 2003, 138 (11): 898-906.
Brennan GP, Fritz JM, Hunter SJ, Thackeray A, Delitto A, Erhard RE: Identifying subgroups of patients with acute/subacute 'nonspecific' low back pain - results of a randomized clinical trial. Spine. 2006, 31 (6): 623-631. 10.1097/01.brs.0000202807.72292.a8.
Childs JD, Fritz JM, Flynn TW, Irrgang JJ, Johnson KK, Majkowski GR, Delitto A: A clinical prediction rule to identify patients with low back pain most likely to benefit from spinal manipulation: A validation study. Ann Intern Med. 2004, 141 (12): 920-928.
Fritz JM, Delitto A, Erhard RE: Comparison of classification-based physical therapy with therapy based on clinical practice guidelines for patients with acute low back pain a randomized clinical trial. Spine. 2003, 28 (13): 1363-1372. 10.1097/00007632-200307010-00003.
Long A, Donelson R, Fung T: Does it matter which exercise? A randomized control trial of exercise for low back pain. Spine. 2004, 29 (23): 2593-2602. 10.1097/01.brs.0000146464.23007.2a.
Klebanoff MA: Subgroup analysis in obstetrics clinical trials. Am J Obstet Gynecology. 2007, 197: 119-122. 10.1016/j.ajog.2007.02.030.
Hancock M, Herbert R, Maher CG: A guide to interpretation of studies investigating subgroups of responders to physical therapy interventions. Physical Therapy. 2009, 89 (7): 698-704. 10.2522/ptj.20080351.
Brookes ST, Whitely E, Egger M, Smith GD, Mulheran PA, Peters TJ: Subgroup analyses in randomized trials: risks of subgroup-specific analyses: power and sample size for the interaction test. Clin Epidemiol. 2004, 57 (3): 229-236. 10.1016/j.jclinepi.2003.08.009.
van Tulder M, Furlan A, Bombardier C, Bouter L: Updated method guidelines for systematic reviews on the Cochrane Collaboration Back Group. Spine. 2003, 28 (12): 1290-1299. 10.1097/00007632-200306150-00014.
Hancock MJ, Maher CG, Latimer J, Herbert RD, McAuley JH: Independent evaluation of a clinical prediction rule for spinal manipulative therapy: a randomised controlled trial. Eur Spine J. 2008, 17: 936-943. 10.1007/s00586-008-0679-9.
Higgins JPT, Green S, editors: Cochrane Handbook for Systematic Reviews of Interventions 4.2.5. The Cochrane Library. 2005, Chichester, UK: John Wiley & Sons, Ltd, [updated May 2005]., 3
Fairbank JC, Couper J, Davies JB, O'Brien JP: The Oswestry Low Back Pain Disability Questionnaire. Physiotherapy. 1980, 66: 271-273.
Roland M, Morris R: A study of the natural history of back pain. Part I: Development of a reliable and sensitive measure of disability in low- back pain. Spine. 1983, 8 (2): 141-144. 10.1097/00007632-198303000-00004.
Davidson M, Keating JL: A comparison of five low back disability questionnaires: reliability and responsiveness. Physical Therapy. 2002, 82 (1): 8-24.
Lauridsen HH, Hartvigsen J, Manniche C, Korsholm L, Grunnet-Nilsson N: Responsiveness and minimal clinically important difference for pain and disability instruments in low back pain patients. BMC Musculoskeletal Disorders. 2006, 7 (82): doi:10.1186/1471-2474-1187-1182.
Cochrane Collaboration IMS: Accessed 10 June 2009., [http://www.cc-ims.net/revman/download]
McKenzie R, May S: Lumbar Spine, Mechanical Diagnosis and Therapy. 2003, Waikanae: Spinal Publications Ltd, 2
Delitto A, Erhard RE, Bowling RW: A treatment-based classification approach to low back syndrome: identifying and staging patients for conservative treatment. Physical Therapy. 1995, 75 (6): 470-489.
Flynn T, Fritz JW, Whitman M, Wainner RS, Magel J, Rendeiro D, Butler B, Garber M, Allison S: A clinical prediction rule for classifying patients with low back pain who demonstrate short-term improvement with spinal manipulation. Spine. 2002, 27 (24): 2835-2843. 10.1097/00007632-200212150-00021.
Ostelo RW, Deyo RA, Stratford P, Waddell G, Croft P, Von Korff M, Bouter LM, de Vet H: Interpreting change scores for pain and functional status in low back pain towards international consensus regarding minimal important change. Spine. 2008, 33 (1): 90-94. 10.1097/BRS.0b013e31815e3a10.
Browder DA, Childs JD, Cleland JA, Fritz JM: Effectiveness of an extension-oriented treatment approach in a subgroup of subjects with low back pain: A randomized clinical trial. Physical Therapy. 2007, 87 (12): 1608-1618. 10.2522/ptj.20060297.
Cairns MC, Foster NE, Wright C: Randomized controlled trial of specific spinal stabilization exercises and conventional physiotherapy for recurrent low back pain. Spine. 2006, 31 (19): E670-681. 10.1097/01.brs.0000232787.71938.5d.
Celestini M, Marchese A, Serenelli A, Graziani G: A randomized controlled trial on the efficacy of physical exercise in patients braced for instability of the lumbar spine. Europa Medicophysica. 2005, 41 (3): 223-231.
Cherkin DC, Deyo RA, Battie M, Street J, Barlow W: A comparison of physical therapy, chiropractic manipulation, and provision of an educational booklet for the treatment of patients with low back pain. New Eng J Med. 1998, 339 (15): 1021-1029. 10.1056/NEJM199810083391502.
Childs JD, Fritz JM, Piva SR, Erhard RE: Clinical decision making in the identification of patients likely to benefit from spinal manipulation: a traditional versus an evidence-based approach. J Orthopaedic Sports Physical Therapy. 2003, 33 (5): 259-272.
Chiradejnant A, Latimer J, Maher CG, Stepkovitch N: Does the choice of spinal level treated during posteroanterior (PA) mobilisation affect treatment outcome?. Physiotherapy Theory Practice. 2002, 18 (4): 165-174. 10.1080/09593980290058544.
Chiradejnant A, Kanlayanaphotporn R: Effectiveness of a specific mobilisation treatment on a certain type of low back pain... 12th Annual Symposium on Complementary Health Care -- Abstracts: 19th-21st September Exeter, UK1402. Focus on Alternative and Complementary Therapies. 2005, 10 (Supplement 1): 12.
Chiradejnant A, Maher CG, Latimer J, Stepkovitch N: Efficacy of 'therapist-selected' versus 'randomly selected' mobilisation techniques for the treatment of low back pain: A randomised controlled trial. Australian J Physiotherapy. 2003, 49: 233-241.
Clare HA, Adams R, Maher CG: Construct validity of lumbar extension measures in McKenzie's derangement syndrome. Manual Therapy. 2007, 12 (4): 328-334. 10.1016/j.math.2006.07.006.
Descarreaux M, Normand MC, Laurencelle L, Dugas C: Evaluation of a specific home exercise program for low back pain. J Manipulative Physiological Ther. 2002, 25 (8): 497-503. 10.1067/mmt.2002.127078.
Elnaggar IM, Nordin M, Sheikhzadeh A, Parnianpour M, Kahanovitz N: Effects of spinal flexion and extension exercises on low-back pain and spinal mobility in chronic mechanical low-back pain patients. Spine. 1991, 16 (8): 967-972. 10.1097/00007632-199108000-00018.
Erhard RE, Delitto A, Cibulka MT: Relative effectiveness of an extension program and a combined program of manipulation and flexion and extension exercises in patients with acute low back syndrome. Phys Ther. 1994, 74 (12): 1093-1100.
Fritz JM, Whitman JM, Childs JD: Lumbar spine segmental mobility assessment: An examination of validity for determining intervention strategies in patients with low back pain. Arch Physical Med Rehab. 2005, 86 (9): 1745-1752. 10.1016/j.apmr.2005.03.028.
Fritz JM, Lindsay W, Matheson JW, Brennan GP, Hunter SJ, Moffit SD, Swalberg A, Rodriquez B: Is there a subgroup of patients with low back pain likely to benefit from mechanical traction? Results of a randomized clinical trial and subgrouping analysis. Spine. 2007, 32: E793-E800. 10.1097/BRS.0b013e31815d001a.
Geisser ME, Wiggert EA, Haig AJ, Colwell MO: A randomized, controlled trial of manual therapy and specific adjuvant exercise for chronic low back pain. Clin J Pain. 2005, 21 (6): 463-470. 10.1097/01.ajp.0000135237.89834.23.
Gillan MG, Ross JC, McLean IP, Porter RW: The natural history of trunk list, its associated disability and the influence of McKenzie management. Eur Spine J. 1998, 7 (6): 480-483. 10.1007/s005860050111.
Goodsell M, Lee M, Latimer J: Short-term effects of lumbar posteroanterior mobilization in individuals with low-back pain. J Manipulative Physiological Ther. 2000, 23 (5): 332-342.
Greenman PE: Syndromes of the lumbar spine, pelvis, and sacrum. Physical Med Rehab Clinics of North America. 1996, 7 (4): 773-785.
Hough E, Stephenson R, Swift L: A comparison of manual therapy and active rehabilitation in the treatment of non specific low back pain with particular reference to a patient's Linton & Hallden psychological screening score: a pilot study. BMC Musculoskeletal Disorders. 2007, 8: 106-10.1186/1471-2474-8-106.
Konstantinou K, Foster N, Rushton A, Baxter D, Wright C, Breen A: Flexion mobilizations with movement techniques: the immediate effects on range of movement and pain in subjects with low back pain. J Manipulative Physiological Ther. 2007, 30 (3): 178-185. 10.1016/j.jmpt.2007.01.015.
Mayer JM, Ralph L, Look M, Erasala GN, Verna JL, Matheson LN, Mooney V: Treating acute low back pain with continuous low-level heat wrap therapy and/or exercise: a randomized controlled trial. Spine J. 2005, 5 (4): 395-403. 10.1016/j.spinee.2005.03.009.
Miller ER, Schenk RJ, Karnes JL, Rousselle JG: A comparison of the McKenzie approach to a specific spine stabilization program for chronic low back pain. J Manual Manipulative Therapy. 2005, 13 (2): 103-112. 10.1179/106698105790824996.
Monticone M, Barbarino A, Testi C, Arzano S, Moschi A, Negrini S: Symptomatic efficacy of stabilizing treatment versus laser therapy for sub-acute low back pain with positive tests for sacroiliac dysfunction: a randomised clinical controlled trial with 1 year follow-up. Europa Medicophysica. 2004, 40 (4): 263-268.
Mujic SE, Trebinjac S, Sakota S, Avdic D: The effects of McKenzie and Brunkow exercise program on spinal mobility comparative study. Bosnian J Basic Med Sci. 2004, 4 (1): 62-68.
Newton WP: Bed rest, exercises, or ordinary activity for acute low back pain?. J Family Practice. 1995, 41 (1): 96-97.
North American Spine Society Board Do: Spine Patient Outcome Research Trial (SPORT): multi-center randomized clinical trial of surgical and non-surgical approaches to the treatment of low back pain. Spine J. 2003, 3 (6): 417-419.
O'Brien N, Hanlon M, Meldrum D: Randomised, controlled trial comparing physiotherapy and Pilates in the treatment of ordinary low back pain. Physical Therapy Rev. 2006, 11 (3): 224-225.
O'Sullivan PB, Twomey LT, Allison GT: Evaluation of specific stabilizing exercise in the treatment of chronic low back pain with radiologic diagnosis of spondylolysis or spondylolisthesis. Spine. 1997, 22 (24): 2959-2967. 10.1097/00007632-199712150-00020.
Petersen T, Kryger P, Ekdahl C, Olsen S, Jacobsen S: The effect of McKenzie therapy as compared with that of intensive strengthening training for the treatment of patients with subacute or chronic low back pain - A randomized controlled trial. Spine. 2002, 27 (16): 1702-1709. 10.1097/00007632-200208150-00004.
Petersen T, Larsen K, Jacobsen S: One-year follow-up comparison of the effectiveness of McKenzie treatment and strengthening training for patients with chronic low back pain: Outcome and prognostic factors. Spine. 2007, 32 (26): 2948-2956. 10.1097/BRS.0b013e31815cda4a.
Riipinen M, Niemisto L, Lindgren KA, Hurri H: Psychosocial differences as predictors for recovery from chronic low back pain following manipulation, stabilizing exercises and physician consultation or physician consultation alone. J Rehab Med. 2005, 37 (3): 152-158.
Rossignol M, Abenhaim L, Seguin P, Neveu A, Collet JP, Ducruet T, Iro S: Coordination of primary health care for back: a randomized controlled trial. Spine. 2000, 25 (2): 251-258. 10.1097/00007632-200001150-00018.
Schenk RJ, Jozefczyk C, Kopf A: A randomized trial comparing interventions in patients with lumbar posterior derangement. J Manual Manipulative Therapy. 2003, 11 (2): 95-102. 10.1179/106698103790826455.
Skikic E, Trebinjac S, Sakota S, Avdic D: The effects of McKenzie and Brunkow exercise program on spinal mobility comparative study. Bosnian J Basic Med Sci. 2004, 4: 62-68.
Spratt KF, Weinstein JN, Lehmann TR, Woody J, Sayre H: Efficacy of flexion and extension treatments incorporating braces for back pain patients retrodisplacement, spondylolisthesis, or normal sagittal translation. Spine. 1993, 18 (13): 1839-1849. 10.1097/00007632-199310000-00020.
Stankovic R, Johnell O: Conservative treatment of acute low-back pain. A prospective randomized trial: McKenzie method of treatment versus patient education in 'mini back school'. Spine. 1990, 15 (2): 120-123. 10.1097/00007632-199002000-00014.
Sweetman BJ, Heinrich I, Anderson JA: A randomized controlled trial of exercises, short wave diathermy, and traction for low back pain, with evidence of diagnosis-related response to treatment. J Orthopaedic Rheumatology. 1993, 6 (4): 159-166.
Wright A, Lloyd-Davies A, Williams S, Ellis R, Strike P: Individual active treatment combined with group exercise for acute and subacute low back pain. Spine. 2005, 30 (11): 1235-1241. 10.1097/01.brs.0000164266.00150.b6.
The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1741-7015/8/22/prepub
The authors are grateful to Dr Mark Hancock for comments on draft manuscripts and to Professor Jennifer Keating for participation in the conception and design of the study.
The manuscript submitted does not contain information about medical devices or drugs. No benefits in any form have been, or will be, received from a commercial party related directly or indirectly to the subject of this manuscript.
The conception and design of the study and initial draft manuscript were by PK. All authors (PK, HM, DP) were involved in the analysis and interpretation of data, revision of the manuscript and approval of the final manuscript.
Electronic supplementary material
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
About this article
Cite this article
Kent, P., Mjøsund, H.L. & Petersen, D.H. Does targeting manual therapy and/or exercise improve patient outcomes in nonspecific low back pain? A systematic review. BMC Med 8, 22 (2010). https://doi.org/10.1186/1741-7015-8-22
- Baseline Score
- Manual Therapy
- Prediction Rule
- Clinical Prediction Rule
- Randomized Control Trial Design