A large study of massage therapy for low back pain in the July 2011 issue of Annals of Internal Medicine (see Cherkin et al) suggests that massage may be helpful — “massage therapy improved function and decreased pain more than usual care.” This is consistent with the idea that back pain is often caused or complicated by “trigger points” or muscle knots. I like it. Their conclusion fits my pro-massage bias.
But while the research has mostly been reported as a good news story for massage, you may want to curb your enthusiasm. There’s also some genuinely bad news for massage hidden in the fine print. There’s a critical flaw that is acknowledged but not emphasized. And the results also suggest that it doesn’t much matter what kind of massage — a result quite likely to ruffle some feathers, if anyone pays attention to that detail. I will pay attention, of course. I will report in detail on several neglected implications of this research.
Massage studies are generally rare, and the big ones can be counted on a hand. Hundreds of patients were tested by Seattle researchers Dr. Daniel Cherkin, Dr. Richard Deyo and several colleagues, and the size of the experiment alone makes it worth taking a close look at. It also had a number of other rare qualities and obvious improvements over other massage research:
- two different massage techniques were compared to each other and to a neutral (control) group
- all the therapists were experienced, and the same therapists were used for both kinds of massage treatment
- patients stuck with the treatments, and follow-up was good and long term
- results were measured in many different ways
And yet for all that, serious flaws still remained, and the authors acknowledge that it’s “difficult to determine the true magnitude of the benefits of massage observed in this trial.” A lack of blinding was the worst of them. It’s such a significant problem that it’s hard to know if the study can really tell us anything at all, despite its strengths.
A translation of the study: so what happened?
Four hundred patients with “moderately severe” chronic low back pain, without a clear cause, were split up into three groups: for ten weeks, one group got weekly hour-long relaxation massages, another got more advanced therapeutic massage, and patients in a third group were essentially just paid $50 to do nothing in particular. Massage was provided by moderately-trained therapists2 with at least five years experience.
- 60% of massage patients seemed to improve about 30% over 10 weeks
- gains were lost steadily after the last massage
- only trivial differences after six months, and none after a year
- no meaningful difference between types of massage
- patients who were left out also improved — but not as much
After ten weeks of massage — with a market value of about $500–1000 for each patient — about 60% of patients seemed to have about 30% improved function and symptoms from the starting point. However, there was no meaningful difference between the two massage groups — the difference was actually smaller than the range of uncertainty in the data. Patients who were left out entirely also improved — a very important point — but not quite as much.
A 30% improvement sounds good, but people in the “usual care” (no massage) group also improved, about 10%. On a pain scale of 10, massaged patients dropped about 2 points on average, a pain reduction of roughly 30%, while those without any massage dropped only about half a point. So the difference between massage and no massage was pretty small — just barely enough to be considered clinically useful. 30% is a noteworthy improvement from baseline, but far from a cure, not a whole lot better than doing nothing, and low bang for buck for real patients.
Artist’s depiction of an “6” & a “4” on the pain scale, from Allie Brosh’s hilarious re-imagining of the pain scale. Massage patients in this study dropped about two points on the scale.
When the massaging ended, patients slowly but steadily lost their gains, while the never-massaged patients continued to slowly but steadily improve. By six months all scores were pretty much identical, with just a little advantage still remaining for massaged patients — they still had slightly better scores for function.
After a year, there were no differences left between any of the patients, and on average these chronic sufferers still had some back pain. Based on these results, the authors of the study came to the conclusion that
massage therapy may be effective for treatment of chronic back pain, with benefits lasting at least 6 months. No clinically meaningful difference between relaxation and structural massage was observed in terms of relieving disability or symptoms.
I think it’s a bit silly to describe a distinctly temporary 30% improvement in symptoms for 60% of patients as “effective,” and sillier still to say that the benefits lasted “at least” six months when the benefits were so reduced at that point that they barely register.
And then there’s the fact that improvements these patients seemed to enjoy may have been partially due to a data mirage. In fact, it’s almost certain …
Am I getting massaged here or not? The effect of knowing you’re getting a raw deal
This study lacked “blinding”: A key fact was not hidden from the participants, resulting in significant data pollution by their reaction. The patients who didn’t get any massage knew full well that they were participating in a massage study — they signed up for it and got paid — but also knew that they were not actually getting any massage therapy. (And free massage therapy at that.)
Ask yourself: How would you feel about that? Who wouldn’t feel a little disappointed and pessimistic?
Whether it’s effective therapy for back pain or not, people like massage. So these patients knew they had pulled the short straw, and probably expected to do less well — a perfect setup for slanted results. This is not a hidden flaw that I’m exposing: Cherkin and colleagues know it, and acknowledge that this problem could well be “making massage therapy seem more superior than it really is” in this study. This has been (charmingly) called a “frustrebo effect” — consisting of both a true negative placebo and “frustrated,” negative reporting — is a known problem with designing studies of popular treatments.3
Way back in 2000, Dr. Lloyd Oppel wrote an excellent, brief critical analysis of this problem with a much smaller massage-for-back-pain study.4 It’s sad to see a much larger study, a decade later, making exactly the same mistake.
Low back pain is notoriously sensitive to expectations.5 When you’re comparing a group of people who are consciously disappointed in their fate, you can pretty much count on their results seeming and/or actually being less good. Consequently, you could say that this study showed that not getting massaged was kind of a drag, and it can’t actually tell us if massage was actually better than just doing nothing.
Bear in mind that while massaged patients improved about 30% from the baseline, the non-massaged patients also improved — closing the meaningful gap between the two. The important question is not “How much did massage help?” but “How much more did it help than doing nothing?” That difference was very modest to begin with, and it’s a virtual certainty that the difference would have been even less if the un-massaged hadn’t known what they were missing. The frustrebo effect closes the gap.
Blinding is one of the pillars of a good clinical trial: if participants know too much, their hopes and fears and miscellaneous mental messes usually foul up the results. Generally speaking, people are quite alert and they can easily tell when they aren’t getting massaged for free. The only way to keep them happy about this is to make sure that they never knew it was a possibility in the first place. In this study, the patients needed to not know what they weren’t getting.
It’s really too bad that they didn’t not know.
The massage smackdown: relaxation vs. structural
An interesting feature of this experiment was that Cherkin et al compared the effects of garden variety relaxation massage — classic Swedish — with so-called “structural” massage. And what is “structural” massage, you might wonder? Good question!6
According to the description, it’s a dog’s breakfast of several allegedly advanced massage techniques, all revolving around the dubious notion that low back pain is some kind of biomechanical failure or structural imbalance, correctable with just the right kind of pressures and manipulations of soft tissue:
Myofascial techniques are intended to engage and release identified restrictions in myofascial tissues. Neuromuscular techniques are used to resolve soft-tissue abnormalities by mobilizing restricted joints, lengthening constricted muscles and fascia, balancing agonist and antagonist muscles, and reducing hypertonicity.
Gobbledygook! A smorg of trendy and traditional massage jargon, most of it hopelessly vague, some of it mutually exclusive. The whole school of thought that massage should have a “structural” intention is still a major theme in the profession, but it’s debatable at best and debunked nonsense at worst.7 The idea of structuralism was taken to an extreme in this study, and perhaps deliberately, as we see in this final odd point of the description:
[Structural] therapists could recommend a home exercise consisting of psoas stretch to enhance and prolong any benefits of structural massage.
Really? A stretch8 of a single muscle? A muscle with questionable relevance? The same muscle for every patient? This is advanced “structural” massage therapy? It seems more like clinical comedy.9
Including a stretch of this muscle as the only exercise to complement massage is an veeeery interesting choice. On the one hand, it’s ridiculous on the face of it. On the other, it’s also a savvy nod to the kind of clinical reasoning that is actually going on in massage therapy offices everywhere. For better or worse, psoas stretch is a common prescription, and a good representative sample of what supposedly “advanced” massage actually looks like out there in the real world.
And maybe that is a useful thing to test.
I would have been much happier if the experiment had also tested actually advanced massage techniques — as defined by me, since this is my fantasy — instead of a potpourri of vague and trendy ones. But, failing that, why not test techniques that are actually popular, however misguided they may be? Yes indeed, why not? And that is what we got: a test of just the sort of stuff that patients are likely to encounter in massage therapy offices in the wild.
And it didn’t work any better than the Swedish massage …
The results of the smackdown, according to the referees:
A course of relaxation massage, using techniques commonly taught in massage schools and widely used in practice, had effects similar to those of structural massage, a more specialized technique.
Here again Cherkin and his colleagues use language to describe their results that I find just slightly disingenuous and biased in favour of massage, and at odds with the data. “Similar” is not the right description. It was actually “no real difference at all.”
All that pretension. All those assumptions, psoas stretches, and lovely-sounding structural theories. All those expensive technique workshops those therapists went to, and all the extra money they charge real patients for their “expertise” to help pay off their investment in the workshops.10 It all added up to … nothing. They could have done relaxation massage instead and their patients would have been just as well off.
Maybe even better off. They would have spent less, for starters.
There were slight differences in results, but most of them actually favoured relaxation massage. The greatest of them was at the one-year mark: patients who’d gotten advanced massage actually scored a full point lower on the pain scale than those lucky Swedish massaged patients.
These are the kinds of data differences that should not be emphasized, however, because the wiggle room for error is actually larger than the measured difference — like trying to measure centimetres with a ruler that only has inches on it. The technically correct thing to say about the results is just that there was “no statistically significant difference” between the results.11
Still … it’s hard for emotional primate minds to ignore the fact that the data points were actually a little worse for advanced massage. And it certainly does help to drive home the fact that advanced massage was definitely not better. Maybe not actually worse. But clearly not actually better.
I’ve got bad news (in spades)
This study has been widely reported as a good news, “it works” story for massage. And the authors own conclusions sound pretty positive. Not so fast. As is ever thus, it’s complicated.
The results make typical so-called advanced massage really look bad, and they make the popular modality empires and structuralism as a paradigm look ridiculous. The technique gurus push and sell the idea that their methods are dramatically more effective than humble Swedish. If they were even half-right, these “advanced” therapists should have gotten results at least 50% better than their lesser-trained comrades — not just better by a statistically significant margin, but much better, impressively better, decisively better, undeniably better, argument-stopping better, better with bells on …
Instead, it’s like the New York Yankees accepted a challenge from a beer league softball team and couldn’t do better than a tie score.
The gap between the pretension and carefully measured results is a nasty condemnation of a huge chunk of an industry, at least half of all massage the way it is actually being practiced (probably much more).
This study has many weaknesses, and cannot actually tell us if “massage works” — the other bad news — but if nothing else it has certainly produced extremely strong evidence that the major advanced massage modalities do not actually work at all, and are just not worth the money. It’s a certification racket, and massage therapists need to get just as cynical about it as they probably already are about Big Pharma.
And I’ve got good news (just barely)
On the bright side? Relaxation massage is relatively good stuff: cheaper, more accessible, and there’s nothing “just” about it. A good Swedish massage is high art, and not nearly as simplistic as it has been portrayed by therapists who figure they’re too good for Swedish. In particular, relaxation massage places a much higher priority on addressing the human nervous system. I think that guiding principle may well prove in time to be a more advanced method than yarding on people’s tissues with the barbaric intention12 of actually physically changing them.
Not that relaxation massage actually “won” the contest — that would have been interesting — as the benefits of both styles were roughly equal and therefore equally unimpressive.
I cannot actually agree with the authors that their massage recipients got “clinically meaningful improvement” (especially at six months). It is possible that they had clinically meaningful improvement, but it is by no means certain. The problems with the study make it impossible difficult to conclude that any kind of massage actually worked. Indeed, when you get roughly equivalent results from quite different treatments, it tends to suggest that the results weren’t due to anything unique or specific that was being done.
It is also possible that improvements in pain and function are due to nonspecific effects, such as time spent in a relaxing environment, being touched, receiving care from a caring therapist, being given self-care advice, or increased body awareness.
Massage: nonspecific effect nirvana?
“Nonspecific effects” is an important concept here. Nonspecific effects are the many potential effects of being treated and cared for that occur with any treatment, as opposed to effects that only occur when a specific treatment occurs. In particular, non-specific effects tend to be related to the interaction between a patient and a health care provider, because any kind of care — nothing specific — has some therapeutic effects.
In short, people get better (or claim to) when they get some compassionate attention.
A massage appointment is nonspecific effect nirvana. The entire point of a good massage is to provide a great interaction between patient and therapist. A massage patient is at the luxurious centre of attention, being cared for in a way that is arguably the single nicest (nonsexual) experience any human can have. Nonspecific effects are dialed up to 11 in this environment. (Unpleasantly painful massage is a complicated exception.)
That these effects exist and are generally optimized in this situation is hard to deny. It’s more a question of how much of the benefits of massage can be attributed to them. Most? All?
The data produced by this experiment can’t tell us, but if it’s “most or all” then these are just the sorts of results you’d get: a wide gap between satisfied massage patients and disgruntled un-massaged patients, and no difference between relaxation massage and alleged expert techniques.
Drs. Cherkin and Deyo and their colleagues seem to have a case of acute rose-coloured vision. Much like their 1999 study of acupuncture,13 their conclusions are a fair bit more glowing and optimistic than the data seems to support. Describing massage as “effective” for “at least 6 months” sounds like they are talking about the results of some other study with much better and more certain results! At best, even if we could trust this data completely, it showed only modest and temporary benefits to quite a lot of expensive massage therapy. But we truly can’t trust this data: those apparent benefits may been mostly or entirely due to the acknowledge, obvious and unfortunate “frustrebo” of the patients who got nothing. We have to assume that the benefits of massage were not actually as strong as they appeared to be here, which almost certainly reduces it to a clinically trivial level.
About Paul Ingraham
I am a science writer in Vancouver, Canada. I was a Registered Massage Therapist for a decade and the assistant editor of ScienceBasedMedicine.org for several years. I’ve had many injuries as a runner and ultimate player, and I’ve been a chronic pain patient myself since 2015. Full bio. See you on Facebook or Twitter.
If you found this article useful, you may also be interested in some other articles I’ve published:
- Complete Guide to Low Back Pain — An extremely detailed guide to the myths, controversies, and treatment options for low back pain
- The “Impress Me” Test — Most controversial therapies are fighting over scraps of “positive” evidence that damn them with faint praise
- Modality Empires — The trouble with the toxic tradition of ego-driven, trademarked treatment methods in massage therapy, chiropractic, and physiotherapy
- Psoas, So What? — Massage therapy for the psoas major and iliacus (iliopsoas) muscles is not that big a deal
- Does Massage Therapy Work? — A review of the science of massage therapy … such as it is
- Your Back Is Not Out of Alignment — Debunking the obsession with alignment, posture, and other biomechanical bogeymen as major causes of pain
- The Mind Game in Low Back Pain — How back pain is powered by fear and loathing, and greatly helped by rational confidence
- ScienceBasedMedicine.org [Internet]. Novella S. Acupuncture Does Not Work for Back Pain (Part I); 2011 Jul 28 [cited 14 Jan 6]. Dr. Steve Novella: “Study author, Dr. Daniel Cherkin, is quoted as saying: ‘We found that simulated acupuncture, without penetrating the skin, produced as much benefit as needle acupuncture — and that raises some new questions about how acupuncture works.’ This is wrong – these results call into question if acupuncture works.”
- Standards for certification in massage therapy in Washington state are not high, and not low — a reasonable representative sample of the massage industry. In any case, I think that extra education does little to increase the therapeutic effectiveness of massage interventions. I think more education probably makes therapists safer and better at clinical reasoning and assessment, but not much better at implementing techniques. I have had excellent massages from poorly trained therapists on many occasions … and vice versa.
- Power M, Hopayian K. Exposing the evidence gap for complementary and alternative medicine to be integrated into science-based medicine. J R Soc Med. 2011 Apr;104(4):155–61. PubMed #21502214 ❐ PainSci #55256 ❐
- Oppel L. Is massage therapy genuinely effective? CMAJ. 2000 Oct;163(8):953; author reply 953–4. PubMed #11068563 ❐ PainSci #52962 ❐
- For more detail, see another article on PainScience.com, The Mind Game in Low Back Pain: How back pain is powered by fear and loathing, and greatly helped by rational confidence.
- It’s a curious term. Although I guessed the meaning, I’ve never before actually heard the term “structural massage,” and I would know it, if anyone would — I am constantly exposed to massage ideas from readers around the world.
- “Structuralism” is the excessive focus on causes of pain like crookedness and biomechanical problems. It’s an old and inadequate view of how pain works, but it persists because it offers comforting, marketable simplicity that is the mainstay of entire styles of therapy. For more information, see Your Back Is Not Out of Alignment: Debunking the obsession with alignment, posture, and other biomechanical bogeymen as major causes of pain.
- Stretching as therapy is generally much more dubious and controversial than massage itself. See Quite a Stretch.
- The psoas muscle has mystique in the world of massage: difficult and technical to massage, it is virtually worshipped for its alleged keystone-like importance to posture, core stability, and back pain, all based on a chain of reasoning full of rusty old links. See my deconstruction of psoas worship.
- A “modality empire” is a proprietary method (mode) of therapy championed by a single charismatic entrepreneur (the emperor). They sell books and workshops to professionals seeking to buy credibility in the form of increasing “levels” of certification, but the quality of these certifications is completely unregulated and often dubious. There is a great deal of overlap between modality empires and quackery. Many modality empires are simply repackaging old ideas. See Modality Empires: The trouble with the toxic tradition of ego-driven, trademarked treatment methods in massage therapy, chiropractic, and physiotherapy.
- Stated with full jargony glory: “There was no difference in function between the 2 types of massage (adjusted difference, 0.5 point, and the confidence bounds around the adjusted estimate of absolute difference between massage groups excluded values large enough to be considered clinically relevant (2-point difference in RDQ score and 1.5-point difference in symptom bothersomeness score). Similar results were found for symptom bothersomeness.”
Funny drawing of the most unfunny kind of massage. I wish this weren’t as “true” a representation of “structural” massage as it is, but this is actually what a lot of “fascial release” is like. I’ve experienced it repeatedly myself & heard about it from many readers. (Drawing by Claude Serre.)
- Cherkin DC, Sherman KJ, Avins AL, et al. A randomized trial comparing acupuncture, simulated acupuncture, and usual care for chronic low back pain. Arch Intern Med. 2009 May;169(9):858–66. PubMed #19433697 ❐ PainSci #54907 ❐
More than 600 participants were either given standard acupuncture treatments or simulated acupuncture. Although this study has been widely reported as if it was a controlled comparison of acupuncture to “standard medical treatment” for back pain, in fact it is not controlled (or blinded), and does not have the power to prove that acupuncture works for back pain.
The apparent difference between real and fake acupuncture they observed was minor. Nevertheless, the authors are excessively friendly to acupuncture and declare it to be “effective” in their conclusion in spite the obvious poverty of the data. In particular, they gloss over the damning implications of their most important finding: what little effect they think they found had nothing at all to do with needle placement. Acupuncture means nothing if needle placement doesn’t matter. The interpretation of Dr. Steven Novella is much more sensible: “The only reasonable scientific conclusion to draw from this is that acupuncture does not work.” For Dr. Novella’s meticulous and expert analysis, see Acupuncture Does Not Work for Back Pain (Part I).