Skip to main content


We’d like to understand how you use our websites in order to improve them. Register your interest.

Depression sum-scores don’t add up: why analyzing specific depression symptoms is essential


Most measures of depression severity are based on the number of reported symptoms, and threshold scores are often used to classify individuals as healthy or depressed. This method – and research results based on it – are valid if depression is a single condition, and all symptoms are equally good severity indicators. Here, we review a host of studies documenting that specific depressive symptoms like sad mood, insomnia, concentration problems, and suicidal ideation are distinct phenomena that differ from each other in important dimensions such as underlying biology, impact on impairment, and risk factors. Furthermore, specific life events predict increases in particular depression symptoms, and there is evidence for direct causal links among symptoms. We suggest that the pervasive use of sum-scores to estimate depression severity has obfuscated crucial insights and contributed to the lack of progress in key research areas such as identifying biomarkers and more efficacious antidepressants. The analysis of individual symptoms and their causal associations offers a way forward. We offer specific suggestions with practical implications for future research.

Peer Review reports


At present major depression has become a monolith, with the assumption that the diagnosis can be made merely on the number of depressive symptoms present […]. It may be politically important to utter such simplifications to doctors in general medical settings, but it is a convenient fiction.

– Goldberg, 2011, p. 227 [1]

Major depressive disorder (MDD) is one of the most common psychiatric disorders, with an estimated lifetime prevalence rate in the USA of 16.2% [2]. It is the leading cause of disability worldwide, and one of the top three causes of disease burden worldwide [3]. About 60% of individuals meeting criteria for MDD, as defined by the Diagnostic and Statistical Manual of Mental Disorders (DSM-5) [4], report severe or very severe impairment of functioning [2] that highly compromises the capacity for self-care and independent living.

The severity of MDD is routinely estimated by adding up severity scores for many disparate symptoms to create a sum-score, and threshold values for these sum-scores are commonly used to classify individuals as depressed or not depressed. This practice of constructing sum-scores and collapsing individuals with different symptoms into one undifferentiated category is based on the assumption that depression is a single condition, and that all symptoms are interchangeable and equally good indicators. This review shows that this common practice discards much critical information about individual symptoms whose analysis can provide important insights.

Depression heterogeneity

In the DSM-5, MDD is characterized by nine symptoms: 1. depressed mood; 2. markedly diminished interest or pleasure; 3. increase or decrease in either weight or appetite; 4. insomnia or hypersomnia; 5. psychomotor agitation or retardation; 6. fatigue or loss of energy; 7. feelings of worthlessness or inappropriate guilt; 8. diminished ability to think or concentrate, or indecisiveness; and 9. recurrent thoughts of death or recurrent suicidal ideation. To qualify for the diagnosis, an individual must exhibit five or more symptoms, one of which must be either depressed mood or anhedonia. Of note, all symptoms except the first contain sub-symptoms (e.g., diminished interest or pleasure). Moreover, three symptoms – sleep problems, weight/appetite problems, and psychomotor problems – encompass opposite features (insomnia vs. hypersomnia; weight/appetite gain vs. loss; psychomotor retardation vs. agitation). This leads to roughly 1,000 unique combinations of symptoms that all qualify for a diagnosis of MDD, some of which do not share a single symptom [5]. It is not surprising that symptom variability among individuals diagnosed with MDD is well-established [5-7].

Cutoff values based on sum-scores from rating scales such as the Beck Depression Inventory (BDI) [8] or the Hamilton Rating Scale for Depression (HRSD) [9] are routinely used as the main criterion to enroll participants in research studies. While the DSM has a hierarchical structure that features two core symptoms, and while symptoms have to cause significant distress or impairment in important areas of functioning for a diagnosis, these criteria are not accounted for in such scales, further increasing the heterogeneity of depressed samples [5].

The next section reviews evidence underlining the importance of attending to particular depression symptoms. We then describe how the use of sum-scores obfuscates important insights in various domains, and suggest that this may help to explain slow progress in key research areas, such as identifying biomarkers and more efficacious antidepressants. We conclude the review with a list of suggestions that have practical research implications.

Review of symptom-based depression research

Extensive research has described individual depression symptoms; however, the significance of individual symptoms has not been systematically reviewed previously. Here, we describe how attending to specific symptoms has led to insights in research on biomarkers, antidepressant efficacy, depression risk factors, impaired psychological functioning, and causal effects among particular depression symptoms.

Symptom specificity in biomarker research

Despite extraordinary research expenditures and large genome-wide association studies, no pathognomonic biological markers of depression have been identified. This has been a major disappointment. In 1980, the DSM-III [10] preamble predicted that biomarkers associated with most diagnoses would be identified by the time the DSM-IV [11] appeared; 35 years and two DSM versions later, and with the exception of some neurological disorders, not one biological test for mental disorders was ready for inclusion in the criteria sets for the DSM-5, and not a single psychiatric diagnosis can be validated by laboratory or imaging biomarkers [12].

For depression research, results are specifically disappointing. In a recent large genome-wide association study with 34,549 subjects, no single locus reached genome-wide significance [13]. This is consistent with numerous other large genetic studies that have failed to identify any confirmed associations for MDD [14-17]. Studies predicting antidepressant response by common genetic variants have led to similarly disappointing results [18].

The analysis of specific symptoms offers opportunities to investigate biological factors that may be related to specific syndromes. Jang et al. [19] showed that 14 depression symptoms differ from each other in their degree of heritability (h2 range, 0–35%). Somatic symptoms such as loss of appetite and loss of libido, as well as cognitions such as guilt or hopelessness (possibly reflecting heritable personality traits), showed higher heritability coefficients than other symptoms like negative affect or tearfulness. Another study [20] revealed differential associations of symptoms with specific genetic polymorphisms; for example, the symptom ‘middle insomnia’ assessed by the HRSD was correlated with the GGCCGGGC haplotype in the first haplotype block of TPH1. In addition, a recent report of 7,500 twins identified three genetic factors that exhibited pronounced differential associations with specific MDD symptoms [21]; the authors concluded that the “DSM-IV syndrome of MD[D] does not reflect a single dimension of genetic liability” (p. 599). Guintivano and Brown [22] analyzed several independent samples of post-mortem brains and blood samples from living subjects to document that 80% of the variation in one of the most relevant specific symptoms, suicidal behavior, could be explained by how polymorphisms of the gene SKA2 interacted with anxiety and stress.

Moving away from genes and gene expression to hormones, the hypothesis that depression can be caused by inflammation has received considerable attention in recent years [23,24]. However, evidence shows that less than half of the individuals diagnosed with depression exhibit elevated inflammatory markers [25], and elevated levels of cytokines are neither highly sensitive nor specific to MDD [26]. Furthermore, somatic symptoms such as sleep problems, appetite gain, and weight gain seem elevated in the context of inflammation [27-29], suggesting symptom specificity. A recent review acknowledges intragroup variability of MDD as main limitation of the research on inflammation and depression [26], and suggests that future analyses of distinct endophenotypes may move the field forward.

In summary, individual depression symptoms differ in their biological correlates. This underlines the heterogeneous nature of depression, which may in turn explain the lack of progress in validating depression diagnosis with biomarkers. Analyzing associations between symptom sum-scores and genetic markers can only capture the shared genetic variance of all symptoms, which may be low. A symptom-based approach offers opportunities for future research that could provide a potential partial explanation for the “mystery of missing heritability” [30] – the conundrum that specific genetic markers explain only small proportions of the variance even for mental disorders that are highly heritable. Specific markers may correlate better with specific symptoms independent of diagnostic categories – genes do not read the DSM [31]. Studies on symptom-polymorphism associations instead of syndrome-polymorphism associations, similar to the one conducted by Myung et al. [20], may prove insightful.

The impact of antidepressants on specific symptoms

Several large meta-analyses of clinical trials have demonstrated that antidepressants outperform placebos in less than half of the trials, and that clinically relevant improvements can be documented only for a minority of severely depressed patients [32-34]. Part of the difficulty may be that measuring antidepressant efficacy via sum-scores conceals important effects on specific symptoms [35]. Little research has been conducted on the effect of antidepressants on individual depression symptoms compared to the mountain of literature on specific side effects.

Significant side effects for both tricyclic antidepressants and selective serotonin reuptake inhibitors have prevalence rates of up to 27% in clinical trials [36,37], and common side effects include insomnia, hypersomnia, nervousness, anxiety, agitation, tremor, restlessness, fatigue, somnolence, weight gain or weight loss, increased or decreased appetite, hypertension, sexual dysfunction, dry mouth, constipation, blurred vision, and sweating [38,39] (Table 1). Side effects vary across drugs, and some have more benign effects in specific domains. For instance, certain atypical antidepressants have a superior sexual side effect profile [40], and individuals treated with bupropion and nortriptyline show decreased rates of weight gain [41].

Table 1 Depression symptoms and common antidepressant side effects

Curiously, some of the common side effects reported by patients are the very symptoms that are used to measure depression (Table 1). This means that reductions in sum-scores thanks to reduced depression are concealed by increases in sum-scores caused by drug side effects. In addition, the instrument most commonly used in clinical trials is the HRSD which, compared to other depression scales such as the BDI, abounds in somatic symptoms that resemble the side effect profile caused by antidepressant treatment [42].

The presence of particular symptoms has been used to predict treatment response. Sleep problems, for instance, reduce the efficacy of depression treatment [43]; patients with persistent insomnia are more than twice as likely to remain depressed [44], and insomnia can become chronic despite successful resolution of depressive symptoms [45]. Other symptoms also moderate treatment efficacy: anxiety symptoms reduce depression remission rates, successful anxiety treatment prolongs depression remission [46-48], and loss of interest, diminished activity, and inability to make decisions predict poorer antidepressant response [49].

The overlap of antidepressant side effects and depression symptoms provides a compelling reason for analyzing symptoms such as weight problems, sleep problems, or sexual dysfunction separately from sum-scores. A detailed analysis of how different antidepressants influence specific symptoms may improve our ability to determine antidepressant efficacy.

Risk factor heterogeneity

Risk factors identified for depression include previous episodes of depression [50], demographic variables such as age and sex [51,52], and personality traits such as neuroticism [53]. Statistical models use these and other risk factors to predict the presence or absence of depression.

However, risk factors differ for different symptoms as first demonstrated by Lux and Kendler [54], who analyzed the associations of 25 risk factors on 9 different symptoms in a cross-sectional study of 1,015 individuals. The influence of risk factors differed substantially for different symptoms in a pattern the authors found difficult to reconcile with the general practice of summing symptoms. In another large prospective study, risk factors for depression in medical residents showed strong differential impact on changes of depression symptoms over time [55]. Restricting analyses to a sum-score suggested that women are at greater risk to develop depression during residency, but analyzing individual symptoms revealed that male residents were more likely to experience elevated levels of suicidal ideation under stress, whereas female study participants were more prone to develop increases in sleep, appetite, and concentration problems as well as fatigue.

Adverse life events are well-established risk factors for depression [56], and the depression symptoms individuals experience after a life event seem to depend on the nature of the event. In one experimental study, as well as different cross-sectional and longitudinal investigations of college students and adult samples [57-61], specific types of life events were associated with distinct patterns of depressive symptoms. For instance, after a romantic breakup, individuals mainly experienced depressed mood and feelings of guilt, whereas chronic stress was associated with fatigue and hypersomnia [59].

Overall, risk factors differ substantially for different depressive symptoms, and sum-scores obscure such insights. Studying the etiology of specific depression symptoms may enable the development of personalized prevention that focuses on specific problems and symptoms before they transition into a full-fledged depressive episode.

MDD symptoms differentially impact on functioning

Most depressed individuals suffer from severe functional impairment in various domains of living such as home life, workplace, or family [2,62]. Their impairment is often long-lasting and equal to that caused by other chronic medical conditions such as diabetes or congestive heart failure [63,64]. The question of whether individual depression symptoms differentially impair psychosocial functioning is thus of great importance.

In a study of 3,703 depressed outpatients, DSM-5 criterion symptoms varied substantially in their associations with impairment [65]. Sad mood explained 20.9% of the explained variance of impaired functioning, but hypersomnia only contributed 0.9%. Symptoms also differed in their impacts across impairment subdomains. For example, interest loss had high impact on social activities, whereas fatigue most severely impacted home management. The overall findings are consistent with an earlier study documenting differential impact of DSM-III criterion symptoms of depression on functioning [66].

While these results require replication in different samples, they offer further evidence for the value of considering depression symptoms separately. Not all symptoms contribute equally to severity ratings, and two individuals with similar sum-scores may suffer from dramatically different levels of impairment.

Causal associations among symptoms

Measuring depression severity by sum-scores of symptoms ignores a plethora of information pertaining to the intra-individual development of depression, including the power of individual symptoms to cause other symptoms.

Insomnia, for example, leads to psychomotor impairment [67], cognitive impairment [68], fatigue [69], low mood [70], and suicidal ideation or actual suicide [71] – symptoms that closely resemble DSM symptomatic criteria for depression (psychomotor problems; fatigue; diminished ability to think or concentrate, or indecisiveness; suicidal ideation). A meta-analysis of laboratory-based sleep loss studies documented the strength of these effects: sleep-deprived subjects performed 0.87 standard deviations (SD) lower than the control group on psychomotor tasks, 1.55 SD lower on cognitive tasks, and reported mood 3.16 SD lower than the control group. Collapsing over all three measures, performance of sleep-deprived subjects at the 50th percentile in their group was equivalent to subjects at the 9th percentile in the control group [72]. Another recent meta-analysis revealed that psychiatric patients with sleep disturbances are about twice as likely to report suicidal behaviors compared to patients without sleep problems, a finding that generalized across various conditions including MDD, post-traumatic stress disorder (PTSD), and schizophrenia [73].

Hopelessness describes negative expectancies about the future [74]. Although not part of the DSM-5 MDD criteria, it plays a major role in the cognitive triad originally described by Beck [75], performs more strongly than some DSM symptoms in distinguishing depressed from healthy individuals [76], and is assessed in various scales. Numerous studies have confirmed the predictive role of hopelessness for suicidal ideation and suicide [71]. The effects are long-reaching: hopelessness predicted suicidal thoughts, attempts, and actual suicide up to 13 years into the future in a large community sample [77], and was identified as a predictor of suicide among psychiatric patients followed for up to 20 years [78]. The association of hopelessness and suicide generalizes from depressed individuals to patients with other psychiatric conditions [79,80], once more underlining symptom specificity irrespective of a given diagnosis. Hopelessness predicts suicide better than the sum-score from an inventory assessing multiple depressive symptoms [80], and mediates the effect of rumination on suicidal ideation and other depressive symptoms in children and undergraduates [81,82]. In adolescents, rumination predicts the development of subsequent symptoms of depression, bulimia, and substance abuse, while depression and bulimia symptoms in turn predict increases in rumination [82,83]. Symptoms are associated in complex dynamic networks that can form vicious circles which transcend any specific diagnosis, a notion that is also supported by recently developed self-report methods demonstrating complex interactions among symptoms [84,85].

In contrast to longitudinal studies that span months or years, experience sampling methods that allow for the analysis of a large number of timepoints over a comparably short timeframe have consistently revealed short-term associations among depression symptoms (for a review, see [86]). For example, sleep quality predicted affect during the next day in a sample of 621 women, while daytime affect was not related to subsequent night-time sleep quality [70], implying a clear direction of causation. Complementing such group-level analyses with longitudinal idiographic studies is likely to contribute important information. Bringmann et al. [87] documented differences among depressed patients in the way their emotions impacted each other across time; for instance, they found the autoregressive coefficient of rumination to vary substantially across participants – rumination at a given timepoint strongly predicted rumination at the next timepoint for some individuals but not for others. Another study identified heterogeneity in the direction of causation between depression symptoms and physical activity [88]. Overall, a growing chorus of voices advocates the study of inter-individual differences [89-91] which may pave the way towards the development of more personalized treatment approaches. Heterogeneity may also help to resolve controversies about how some symptoms cause others. Sleep deprivation, for instance, has rapid mood-enhancing effects in some depressed patients [92], but other reports suggest that sleep difficulties cause low mood [70].

The notion that symptoms trigger, influence, or maintain other symptoms is widely recognized in clinical practice. A major goal in cognitive therapy is trying to break causal links between different MDD symptoms [75] and approaches like mindfulness-based cognitive therapy suggest that stopping rumination prevents it from causing other depression symptoms [93]. Kim and Ahn [94] demonstrated that causally central depression symptoms (symptoms that trigger many other symptoms) are judged to be more typical symptoms of depression by clinicians, are recalled with greater accuracy than peripheral symptoms, and are more likely to result in an MDD diagnosis. The authors concluded that clinicians think about causal networks of symptoms in ways far more sophisticated than the atheoretical DSM approach of counting symptoms.

Psychometric evidence

Psychometric techniques such as factor analysis (grouping symptoms) and latent class analysis (grouping individuals) are commonly used to address heterogeneity of MDD. In a more detailed discussion of these methods we draw two general conclusions, both of which support the study of individual symptoms [5].

First, extensive efforts to identify specific forms of treatment effective for specific depression subtypes have been disappointing. There has been little agreement about the number and nature of depression subtypes [95-98], and limited success in identifying external validators for subtypes [99-102]. A recent systematic review that compared the results of 34 factor and latent class analyses concluded that they did not provide evidence for valid subtypes of MDD [95], suggesting the analysis of individual symptoms.

Second, most rating scales for depression are multifactorial and do not measure one underlying factor [103-105]. However, individual symptoms are often at least moderately inter-correlated [106], and the first factor – often a general mood factor or higher-order factor – explains substantially more variance than subsequent factors [103,107]. This means that sum-scores certainly carry information about the general psychopathological load of a particular person, but that the approximation may be fairly rough and that summing symptoms may ignore important information [5,108] (for instance, because MDD symptoms are differentially impairing [65] and because sum-scores do not take into account reciprocal interactions of symptoms [108]).

Applying psychometric tools such as item response theory (IRT) and structural equation modeling (SEM) can yield important insights on the level of individual symptoms because they allow the examination of exact relationships between symptoms and underlying dimensions. One example technique that helps to understand such relations is differential item functioning; a prior study testing for this revealed that different MDD risk factors, such as neuroticism or adverse life events, impact on specific depression symptoms, implying that symptoms are ‘biased’ towards certain risk factors [55]. A second practical application is research on residual dependencies. A major assumption of IRT and SEM models is that the underlying latent variables fully explain the correlation of the manifest indicators. This is rarely the case [109], and especially unlikely in the context of MDD, seeing that symptoms influence each other directly [86,110]. Ignoring such residual dependencies unaccounted for by the latent variables, however, can substantially bias inferences [109,111].

Practical research implications

Few would defend the notion that depression is a homogeneous, discrete disease. Nonetheless, research on depression generally assigns individuals with diverse symptoms to the same disease category, and the search for potential causes then proceeds as if depression is a distinct disease entity, similar to measles or tuberculosis. This could help to explain the inability to find biomarkers or other external variables that can validate the diagnosis of depression [112-116].

Wide-spread reliance on sum-scores exacerbates the problem. Because depression symptoms are understood as interchangeable indicators of MDD, they are counted instead of being analyzed [54,109]. As we have shown above, however, symptoms are not equivalent, and sum-scores add apples and oranges. As a result, two individuals with equal sum-scores may have clinical conditions whose severities differ drastically. This does not deny the possibility that a central mechanism may switch on multiple aspects of depression in some depressed individuals; that obviously occurs, for instance, as a result of interferon treatment that can cause anhedonia, concentration problems, fatigue, and sleep problems [117]. The analysis of individual symptoms is nonetheless likely to reveal patterns that are currently neglected.

We conclude with a list of practical symptom-based implications that could advance depression research:

  1. i)

    Analyze each symptom separately

  2. ii)

    Assess non-DSM symptoms

  3. iii)

    Distinguish between sub-symptoms

  4. iv)

    Measure symptoms more objectively

  5. v)

    Assess symptoms across diagnoses

  6. vi)

    Improve reliability of assessment

  7. vii)

    Use multiple scales to assess symptoms

  8. viii)

    Investigate networks of symptom interactions

  9. ix)

    Investigate symptom profiles in clinical trials

Improved measurement of MDD symptoms

The first group of research implications is for the measurement of depression symptoms. After reviewing many depression rating scales, Snaith [42] concluded that “The measurement of ‘depression’ is as confused as the basic construct of the state itself” (p. 296). Below we explain why this is the case, and suggest several important steps that could reduce confusion.

Assessment of important non-DSM symptoms

First, expanding the range of symptoms analyzed may offer new insights. Today’s DSM MDD criterion symptoms were determined largely by clinical consensus instead of empirical evidence – one of the first proposed sets of symptoms goes back to the 1957 report by Cassidy [118], who described clinical features of manic-depressive disorders. The list was reworked later by Feighner [119], without published data to support the changes. Today’s criterion symptoms for MDD closely resemble the ones proposed over 40 years ago, and numerous critical calls for a psychometric (re)evaluation of depression and its symptoms have had little impact (e.g., [54,76,120]). Anxiety and anger are especially interesting symptoms for depression research; both are highly prevalent in depressed patients and associated with worse clinical outcomes [46,121]. In a large clinical trial, over half of the depressed patients reported significant levels of anxiety, and remission of depression was less likely and also took longer in this group [46]. Elevated baseline anxiety levels in treatment studies predict higher depression levels later on [122], and anxiety was identified as a risk symptom for adverse mental health trajectories in a large epidemiological study [123]. Anger is also prevalent among depressed patients, and has been identified as a clinical marker of a more severe, chronic, and complex depression [121]. The recently published Symptoms of Depression Questionnaire includes a variety of non-DSM symptoms, such as anger and anxiety, and may prove an important tool for future research [124].

Distinguishing between sub-symptoms

Making more detailed assessments of compound symptoms offers additional opportunities. Insomnia and hypersomnia are opposites; subsuming them into ‘sleep problems’ hampers progress. A recent meta-analysis revealed that the specific sleep problems of insomnia, parasomnia, and sleep-related breathing disorders, but not hypersomnia were related to suicidal behavior across a broad range of psychiatric conditions such as MDD, PTSD, and schizophrenia. Nightmares could also be included in future depression questionnaires, seeing that individuals suffering from nightmares showed a drastically elevated risk for suicidality [125]. Psychomotor problems pose yet another example, the impact of psychomotor retardation on impairment of psychosocial functioning in the Sequenced Alternatives to Relieve Depression (STAR*D) study was four times greater than the impact of psychomotor agitation [65]. Fatigue and sleepiness also need differentiation. As Ferentinos et al. [69] point out, “insomnia causes fatigue, while sleep apnea and narcolepsy cause mostly daytime sleepiness; fatigue is alleviated by rest, while sleepiness is relieved by sleep […]. Unfortunately, however, fatigue and sleepiness may sometimes be confounded in clinical practice, research, and psychometry” (p. 38).

Precise measurement of symptoms

The assessment of symptoms with higher precision offers further opportunities. More complex constructs, such as sadness, could be assessed with more than one question. Self-report information can be augmented with objective data. Patient reports about sleep quality can be complemented by physiological data on sleep patterns and sleep duration. Diaries can track sleep quality and weight changes, and impaired concentration can be measured using tests such as the d2 Test of Attention [126].

Transdiagnostic assessment of symptoms

Many symptoms are present in multiple disorders. Mental disorders, such as MDD, PTSD, or generalized anxiety disorder, are highly comorbid [127] in part because they share defining symptoms such as sleep problems. Anxiety is prevalent among many psychiatric conditions. Fatigue is a diagnostic criterion for several DSM disorders, but it also arises from many other medical conditions in ways that can artificially increase depression rates in such populations [128]. These symptoms may thus not be particularly useful for determining the presence of depression. However, the transdiagnostic study of common psychopathological symptoms – e.g., the similarities and differences of fatigue across different conditions – may offer substantial insights.

This idea also has implications for semi-structured interviews, such as the Structural Clinical Interview for DSM Disorders (SCID). In contrast to most scales, these instruments offer the opportunity to assess a large amount of symptoms from different diagnoses. However, it is currently impossible to utilize data gathered via semi-structured interviews for symptom-based research due to the skip questions. Skip questions are a heuristic to save time both for the interviewer and the interviewee: if an individual reports none of the core symptoms necessary for a diagnosis (such as anhedonia and sad mood for MDD), all other symptoms are skipped. While this speeds assessments, it loses vast amounts of information about specific symptoms. Researchers employing the SCID and similar instruments who query study participants about all symptoms even in the absence of core symptoms will generate important new findings.

Reliability of symptom measurement

One of the main challenges for symptom-based research is reliably measuring symptoms. Common rating scales were often not designed or validated for using symptom-level information. Instead, the assessment of symptoms was meant as measurement for an underlying disease [109]. This is an advantage of sum-scores: they include a number of at least moderately correlated symptoms, and are thus less susceptible to this measurement problem.

A possible solution to increase the reliability of symptom assessment for self-report questionnaires or clinical interviews is to follow the general psychometric practice of assessing variables of interest with more than one item. A good example is the Inventory of Depression and Anxiety Symptoms that uses multiple questions per symptom domain. For instance, suicidal tendencies are measured via 6 different items [129], allowing for a more reliable measurement. If this became standard practice, it would likely reduce measurement error on the symptom level.

Use of multiple depression scales

Finally, for studies that must rely on symptom sum-scores, different depression instruments should be utilized simultaneously, and conclusions should be considered robust only if they generalize across different scales. Despite their aim to measure the same underlying construct, there are marked differences between different instruments for measuring depression. For instance, scales differ in how they classify depressed patients into severity groups, so the scale chosen for a particular study can bias who qualifies for enrollment, and who achieves remission [130]. Instruments also include a variety of different symptoms, and their sum-scores are often only moderately correlated, suggesting that results may often be idiosyncratic to the particular scale used in a study [42,103,104,131]. In a review of 280 different depression scales, Santor et al. [131] concluded that most research is based on just a few scales, such as the HRSD and BDI, so much of what we know about depression depends on the quality of these scales. This is bad news, considering the low psychometric quality of the HRSD and BDI (poor inter-rater reliability, poor re-test reliability, poor content validity, and poor psychometric performance of certain items) [104,105]. While some changes were made to the DSM criteria in the last decades, most rating scales used today are at least 20 years old (in the case of the HRSD, half a century) and do not reflect these changes; most do not even include all nine DSM-5 criterion symptoms [103].

Network models

While the more traditional SEM and IRT models assume that all depression symptoms share a common cause and are locally independent (i.e., uncorrelated beyond the common cause; see [109]), a growing number of studies have shown that symptoms can trigger other symptoms. A recently developed framework – the network approach to psychopathology – allows the study of such dynamic interactions. Network models estimate the relationships among symptoms within or across time [106,109,110], and offer a new perspective on why symptoms cluster. While latent variable models explain symptom covariation by a latent factor that is viewed as the common cause of all symptoms, network models suggest that syndromes are constituted by the connections among symptoms. This perspective encourages consideration of how vicious circles of symptoms can fuel each other, an alternative to the schema in which all symptoms arise from a single brain disorder.

Reporting of symptom profiles

We anticipate fundamental advances from researchers who report and analyze information about specific symptoms. For instance, inconsistent reports about the efficacy of antidepressants may result from samples with different symptom patterns that may respond differently to different agents. A meta-analysis to test this hypothesis requires data on individual symptoms that is not available in the Food and Drug Administration database of depression studies.

A recent study by Uher et al. [132] suggests the available opportunities. The authors found that individuals with high baseline levels of systemic inflammation exhibited increased depression recovery under nortriptyline, while low inflammation levels were associated with superior depression improvement under escitalopram, supporting earlier work on the topic [133]. These results are especially interesting considering that inflammation levels are particularly elevated among depressed individuals with somatic symptoms [28], specifically appetite and weight gain [27]. If patients with high and low baseline inflammation levels exhibit different symptoms, it should be possible to select study participants who will respond to a particular drug. Finding biological markers for specific depressive symptoms will open new research vistas.


Depression symptoms are commonly added up to create sum-scores that are assumed to reflect the severity of a uniform underlying depressive disorder. This schema discards data about specific symptoms, treating all as equivalent and interchangeable indicators of MDD. It also fosters asking simplistic questions such as ‘what causes depression?’ or ‘what treatment is best for depression?’ Analyzing specific symptoms and their causal associations is an initial step towards personalized treatment of depression that recognizes the heterogeneity of MDD. This is certainly more complicated than the study of sum-scores, but well worth the effort. As John Tukey [134] pointed out, “Clarity in the large comes from clarity in the medium scale; clarity in the medium scale comes from clarity in the small. Clarity always comes with difficulty” (p. 88).



Beck Depression Inventory


Diagnostic and Statistical Manual of Mental Disorders


Hamilton Rating Scale for Depression


Item response theory


Major depressive disorder


Post-traumatic stress disorder


Sequenced Alternatives to Relieve Depression


Structural Clinical Interview for DSM Disorders


Structural equation modeling


  1. 1.

    Goldberg D. The heterogeneity of “major depression”. World Psychiatry. 2011;10:226–8.

  2. 2.

    Kessler RC, Berglund P, Demler O, Jin R, Koretz D, Merikangas KR, et al. The epidemiology of major depressive disorder: results from the National Comorbidity Survey Replication (NCS-R). JAMA. 2003;289:3095–105.

  3. 3.

    Lopez AD, Mathers CD, Ezzati M, Jamison DT, Murray CJL. Global burden of disease and risk factors. Washington, DC: World Bank; 2006.

  4. 4.

    American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders. 5th ed. Washington, DC: APA; 2013.

  5. 5.

    Fried EI, Nesse RM. Depression is not a consistent syndrome: an investigation of unique symptom patterns in the STAR*D study. J Affect Disord. 2015;172:96–102.

  6. 6.

    Zimmerman M, Ellison W, Young D, Chelminski I, Dalrymple K. How many different ways do patients meet the diagnostic criteria for major depressive disorder? Compr Psych. 2015;56:29–34.

  7. 7.

    Olbert CM, Gala GJ, Tupler LA. Quantifying heterogeneity attributable to polythetic diagnostic criteria: theoretical framework and empirical application. J Abnorm Psychol. 2014;123:452–62.

  8. 8.

    Beck A, Steer RA, Garbin MG. Psychometric properties of the Beck Depression Inventory: 25 years of evaluation. Clin Psychol Rev. 1988;8:77–100.

  9. 9.

    Hamilton M. A rating scale for depression. J Neurol Neurosurg Psychiatry. 1960;23:56–62.

  10. 10.

    American Psychiatric Association. Diagnostic and statistical manual of mental disorders. 3rd ed. Washington, DC: APA; 1980.

  11. 11.

    American Psychiatric Association. The diagnostic and statistical manual of mental disorders. 4th ed. Washington, DC: APA; 1994.

  12. 12.

    Kapur S, Phillips AG, Insel TR. Why has it taken so long for biological psychiatry to develop clinical tests and what to do about it? Mol Psychiatry. 2012;17:1174–9.

  13. 13.

    Hek K, Demirkan A, Lahti J, Terracciano A. A genome-wide association study of depressive symptoms. Biol Psychiatry. 2013;73:667–78.

  14. 14.

    Lewis CM, Ng MY, Butler AW, Cohen-Woods S, Uher R, Pirlo K, et al. Genome-wide association study of major recurrent depression in the UK population. Am J Psychiatry. 2010;167:949–57.

  15. 15.

    Wray NR, Pergadia ML, Blackwood DHR, Penninx BWJH, Gordon SD, Nyholt DR, et al. Genome-wide association study of major depressive disorder: new results, meta-analysis, and lessons learned. Mol Psychiatry. 2012;17:36–48.

  16. 16.

    Shi J, Potash JB, Knowles JA, Weissman MM, Coryell W, Scheftner WA, et al. Genome-wide association study of recurrent early-onset major depressive disorder. Mol Psychiatry. 2011;16:193–201.

  17. 17.

    Daly J, Ripke S, Lewis CM, Lin D, Wray NR, Neale B, et al. A mega-analysis of genome-wide association studies for major depressive disorder. Mol Psychiatry. 2013;18:497–511.

  18. 18.

    Tansey KE, Guipponi M, Perroud N, Bondolfi G, Domenici E, Evans D, et al. Genetic predictors of response to serotonergic and noradrenergic antidepressants in major depressive disorder: a genome-wide analysis of individual-level data and a meta-analysis. PLoS Med. 2012;9:e1001326.

  19. 19.

    Jang KL, Livesley WJ, Taylor S, Stein MB, Moon EC. Heritability of individual depressive symptoms. J Affect Disord. 2004;80:125–33.

  20. 20.

    Myung W, Song J, Lim S-W, Won H-H, Kim S, Lee Y, et al. Genetic association study of individual symptoms in depression. Psychiatry Res. 2012;198:400–6.

  21. 21.

    Kendler KS, Aggen SH, Neale MC. Evidence for multiple genetic factors underlying DSM-IV criteria for major depression. Am J Psychiatry. 2013;70:599–607.

  22. 22.

    Guintivano J, Brown T. Identification and replication of a combined epigenetic and genetic biomarker predicting suicide and suicidal behaviors. Am J Psychiatry. 2014;171:1287–96.

  23. 23.

    Valkanova V, Ebmeier KP, Allan CL. CRP, IL-6 and depression: a systematic review and meta-analysis of longitudinal studies. J Affect Disord. 2013;150:736–44.

  24. 24.

    Wium-Andersen MK, Orsted DD, Nielsen SF, Nordestgaard BG. Elevated C-reactive protein levels, psychological distress, and depression in 73,131 individuals. JAMA Psychiatry. 2013;70:176–84.

  25. 25.

    Raison CL, Miller AH. Is depression an inflammatory disorder? Curr Psychiatry Rep. 2011;13:467–75.

  26. 26.

    Young JJ, Bruno D, Pomara N. A review of the relationship between proinflammatory cytokines and major depressive disorder. J Affect Disord. 2014;169C:15–20.

  27. 27.

    Lamers F, Vogelzangs N, Merikangas KR, de Jonge P, Beekman ATF, Penninx BWJH. Evidence for a differential role of HPA-axis function, inflammation and metabolic syndrome in melancholic versus atypical depression. Mol Psychiatry. 2013;18:692–9.

  28. 28.

    Duivis HE, Vogelzangs N, Kupper N, de Jonge P, Penninx BWJH. Differential association of somatic and cognitive symptoms of depression and anxiety with inflammation: findings from the Netherlands Study of Depression and Anxiety (NESDA). Psychoneuroendocrinology. 2013;38:1573–85.

  29. 29.

    Motivala SJ, Sarfatti A, Olmos L, Irwin MR. Inflammatory markers and sleep disturbance in major depression. Psychosom Med. 2005;67:187–94.

  30. 30.

    Zuk O, Hechter E, Sunyaev SR, Lander ES. The mystery of missing heritability: genetic interactions create phantom heritability. Proc Natl Acad Sci U S A. 2012;109:1193–8.

  31. 31.

    Stefanis N. Genes do not read DSM-IV: implications for psychosis classification. Ann Gen Psychiatry. 2008;7:S68.

  32. 32.

    Pigott HE, Leventhal AM, Alter GS, Boren JJ. Efficacy and effectiveness of antidepressants: current status of research. Psychother Psychosom. 2010;79:267–79.

  33. 33.

    Kirsch I, Deacon BJ, Huedo-Medina TB, Scoboria A, Moore TJ, Johnson BT. Initial severity and antidepressant benefits: a meta-analysis of data submitted to the Food and Drug Administration. PLoS Med. 2008;5:e45.

  34. 34.

    Khan A, Khan S, Brown WA. Are placebo controls necessary to test new antidepressants and anxiolytics? Int J Neuropsychopharmacol. 2002;5:193–7.

  35. 35.

    Bech P. Rating scales in depression: limitations and pitfalls. Dialogues Clin Neurosci. 2006;8:207–15.

  36. 36.

    Trindade E, Menon D, Topfer L, Coloma C. Adverse effects associated with selective serotonin reuptake inhibitors and tricyclic antidepressants: a meta-analysis. Can Med Assoc J. 1998;159:1245–52.

  37. 37.

    Bet PM, Hugtenburg JG, Penninx BWJH, Hoogendijk WJG. Side effects of antidepressants during long-term use in a naturalistic setting. Eur Neuropsychopharmacol. 2013;23:1443–51.

  38. 38.

    Baldwin DS. Essential considerations when choosing a modern antidepressant. Int J Psychiatry Clin Pract. 2003;7:3–8.

  39. 39.

    Rosse R, Fanous A, Gaskins B, Deutsch S. Side effects in the modern psychopharmacology of depression. Prim Psych. 2007:14.

  40. 40.

    Serretti A, Chiesa A. Treatment-emergent sexual dysfunction related to antidepressants: a meta-analysis. J Clin Psychopharmacol. 2009;29:259–66.

  41. 41.

    Blumenthal SR, Castro VM, Clements CC, Rosenfield HR, Murphy SN, Fava M, et al. An electronic health records study of long-term weight gain following antidepressant use. JAMA Psychiatry. 2014;71:889–96.

  42. 42.

    Snaith P. What do depression rating scales measure? Br J Psychiatry. 1993;163:293–8.

  43. 43.

    Dew MA, Reynolds CF, Houck PR, Hall M, Buysse DJ, Frank E, et al. Temporal profiles of the course of depression during treatment. Predictors of pathways toward recovery in the elderly. Arch Gen Psychiatry. 1997;54:1016–24.

  44. 44.

    Pigeon WR, Hegel M, Unützer J, Fan M-Y, Sateia MJ, Lyness JM, et al. Is insomnia a perpetuating factor for late-life depression in the IMPACT cohort? Sleep. 2008;31:481–8.

  45. 45.

    Thase MME, Rush AJ, Manber R, Kornstein SG, Klein DN, Markowitz JC, et al. Differential effects of nefazodone and cognitive behavioral analysis system of psychotherapy on insomnia associated with chronic forms of major depression. J Clin Psychiatry. 2002;63:493–500.

  46. 46.

    Fava M, Rush AJ, Alpert JE, Balasubramani GK, Wisniewski SR, Carmin CN, et al. Difference in treatment outcome in outpatients with anxious versus nonanxious depression: a STAR*D report. Am J Psychiatry. 2008;165:342–51.

  47. 47.

    Shippee ND, Rosen BH, Angstman KB, Fuentes ME, DeJesus RS, Bruce SM, et al. Baseline screening tools as indicators for symptom outcomes and health services utilization in a collaborative care model for depression in primary care: a practice-based observational study. Gen Hosp Psychiatry. 2014;36:563–9.

  48. 48.

    Wu Z, Chen J, Yuan C, Hong W, Peng D, Zhang C, et al. Difference in remission in a Chinese population with anxious versus nonanxious treatment-resistant depression: a report of OPERATION study. J Affect Disord. 2013;150:834–9.

  49. 49.

    Uher R, Perlis RH, Henigsberg N, Zobel A, Rietschel M, Mors O, et al. Depression symptom dimensions as predictors of antidepressant treatment outcome: replicable evidence for interest-activity symptoms. Psychol Med. 2012;42:967–80.

  50. 50.

    Colman I, Naicker K, Zeng Y, Ataullahjan A, Senthilselvan A, Patten SB. Predictors of long-term prognosis of depression. Can Med Assoc J. 2011;183:1955–6.

  51. 51.

    Piccinelli M. Gender differences in depression: critical review. Br J Psychiatry. 2000;177:486–92.

  52. 52.

    Kendler KS, Myers J, Prescott CA. Sex differences in the relationship between social support and risk for major depression: a longitudinal study of opposite-sex twin pairs. Am J Psychiatry. 2005;162:250–6.

  53. 53.

    Kendler KS, Kuhn J, Prescott CA. The interrelationship of neuroticism, sex, and stressful life events in the prediction of episodes of major depression. Am J Psychiatry. 2004;161:631–6.

  54. 54.

    Lux V, Kendler KS. Deconstructing major depression: a validation study of the DSM-IV symptomatic criteria. Psychol Med. 2010;40:1679–90.

  55. 55.

    Fried EI, Nesse RM, Zivin K, Guille C, Sen S. Depression is more than the sum score of its parts: individual DSM symptoms have different risk factors. Psychol Med. 2014;44:2067–76.

  56. 56.

    Hammen C. Stress and depression. Annu Rev Clin Psychol. 2005;1:293–319.

  57. 57.

    Keller MC, Nesse RM. Is low mood an adaptation? Evidence for subtypes with symptoms that match precipitants. J Affect Disord. 2005;86:27–35.

  58. 58.

    Keller MC, Nesse RM. The evolutionary significance of depressive symptoms: different adverse situations lead to different depressive symptom patterns. J Pers Soc Psychol. 2006;91:316–30.

  59. 59.

    Keller MC, Neale MC, Kendler KS. Association of different adverse life events with distinct patterns of depressive symptoms. Am J Psychiatry. 2007;164:1521–9.

  60. 60.

    Cramer AOJ, Borsboom D, Aggen SH, Kendler KS. The pathoplasticity of dysphoric episodes: differential impact of stressful life events on the pattern of depressive symptom inter-correlations. Psychol Med. 2013;42:957–65.

  61. 61.

    Fried EI, Nesse RM, Guille C, Sen S. The differential influence of life stress on individual symptoms of depression. Acta Psychiatr Scand. 2015. Ahead of print.

  62. 62.

    Hirschfeld RM, Dunner DL, Keitner G, Klein DN, Koran LM, Kornstein SG, et al. Does psychosocial functioning improve independent of depressive symptoms? A comparison of nefazodone, psychotherapy, and their combination. Biol Psychiatry. 2002;51:123–33.

  63. 63.

    Mathers CD, Loncar D. Projections of global mortality and burden of disease from 2002 to 2030. PLoS Med. 2006;3:e442.

  64. 64.

    Murray CJL, Lopez A. Global burden of disease: a comprehensive assessment of mortality and disability from diseases, injuries, and risk factors in 1990 and projected to 2020. Cambridge, MA: Harvard School of Public Health; 1996.

  65. 65.

    Fried EI, Nesse RM. The impact of individual depressive symptoms on impairment of psychosocial functioning. PLoS One. 2014;9:e90311.

  66. 66.

    Tweed DL. Depression-related impairment: estimating concurrent and lingering effects. Psychol Med. 1993;23:373–86.

  67. 67.

    Fairclough SH, Graham R. Impairment of driving performance caused by sleep deprivation or alcohol: a comparative study. Hum Factors. 1999;41:118–28.

  68. 68.

    Durmer J, Dinges D. Neurocognitive consequences of sleep deprivation. Semin Neurol. 2005;25:117–29.

  69. 69.

    Ferentinos P, Kontaxakis V, Havaki-Kontaxaki B, Paparrigopoulos T, Dikeos D, Ktonas P, et al. Sleep disturbances in relation to fatigue in major depression. J Psychosom Res. 2009;66:37–42.

  70. 70.

    De Wild-Hartmann JA, Wichers MC, van Bemmel AL, Derom C, Thiery E, Jacobs N, et al. Day-to-day associations between subjective sleep and affect in regard to future depression in a female population-based sample. Br J Psychiatry. 2013;202:407–12.

  71. 71.

    Fawcett J, Scheftner WA, Fogg L, Clark DC, Young MA, Hedeker D, et al. Time-related predictors of suicide in major affective disorder. Am J Psychiatry. 1990;147:1189–94.

  72. 72.

    Pilcher JJ, Huffcutt AI. Effects of sleep deprivation on performance: a meta-analysis. Sleep. 1996;19:318–26.

  73. 73.

    Malik S, Kanwar A, Sim LA, Prokop LJ, Wang Z, Benkhadra K, et al. The association between sleep disturbances and suicidal behaviors in patients with psychiatric diagnoses: a systematic review and meta-analysis. Syst Rev. 2014;3:18.

  74. 74.

    Abramson LY, Metalsky GI, Alloy LB. Hopelessness depression: a theory-based subtype of depression. Psychol Rev. 1989;96:358–72.

  75. 75.

    Beck A, Rush AJ, Shaw FS, Emery G. Cognitive therapy of depression. New York: Guilford Press; 1979.

  76. 76.

    McGlinchey JB, Zimmerman M, Young D, Chelminski I. Diagnosing major depressive disorder VIII: are some symptoms better than others? J Nerv Ment Dis. 2006;194:785–90.

  77. 77.

    Kuo W-H, Gallo JJ, Eaton WW. Hopelessness, depression, substance disorder, and suicidality – a 13-year community-based study. Soc Psychiatry Psychiatr Epidemiol. 2004;39:497–501.

  78. 78.

    Brown GK, Beck A, Steer RA, Grisham JR. Risk factors for suicide in psychiatric outpatients: a 20-year prospective study. J Consult Clin Psychol. 2000;68:371–7.

  79. 79.

    Klonsky ED, Kotov R, Bakst S, Rabinowitz J, Bromet EJ. Hopelessness as a predictor of attempted suicide among first admission patients with psychosis: a 10-year cohort study. Suicide Life Threat Behav. 2012;42:1–10.

  80. 80.

    Beck A, Brown G, Berchick R. Relationship between hopelessness and ultimate suicide: a replication with psychiatric outpatients. Am J Psychiatry. 1990;147:190–5.

  81. 81.

    Smith JM, Alloy LB, Abramson LY. Cognitive vulnerability to depression, rumination, hopelessness, and suicidal ideation: multiple pathways to self-injurious thinking. Suicide Life Threat Behav. 2006;36:443–54.

  82. 82.

    Abela JRZ, Brozina K, Haigh EP. An examination of the response styles theory of depression in third- and seventh-grade children: a short-term longitudinal study. J Abnorm Child Psychol. 2002;30:515–27.

  83. 83.

    Nolen-Hoeksema S, Stice E, Wade E, Bohon C. Reciprocal relations between rumination and bulimic, substance abuse, and depressive symptoms in female adolescents. J Abnorm Psychol. 2007;116:198–207.

  84. 84.

    Frewen PA, Allen SL, Lanius RA, Neufeld RWJ. Perceived causal relations: novel methodology for assessing client attributions about causal associations between variables including symptoms and functional impairment. Assessment. 2012;19:480–93.

  85. 85.

    Frewen PA, Schmittmann VD, Bringmann LF, Borsboom D. Perceived causal relations between anxiety, posttraumatic stress and depression: extension to moderation, mediation, and network analysis. Eur J Psychotraumatol. 2013;4:20656.

  86. 86.

    Wichers MC. The dynamic nature of depression: a new micro-level perspective of mental disorder that meets current challenges. Psychol Med. 2013;616:1–12.

  87. 87.

    Bringmann LF, Vissers N, Wichers MC, Geschwind N, Kuppens P, Peeters F, et al. A network approach to psychopathology: new insights into clinical longitudinal data. PLoS One. 2013;8:e60188.

  88. 88.

    Rosmalen JGM, Wenting AMG, Roest AM, de Jonge P, Bos EH. Revealing causal heterogeneity using time series analysis of ambulatory assessments: application to the association between depression and physical activity after myocardial infarction. Psychosom Med. 2012;74:377–86.

  89. 89.

    Hofmann SG. Toward a cognitive-behavioral classification system for mental disorders. Behav Ther. 2014;45:576–87.

  90. 90.

    Molenaar P. A manifesto on psychology as idiographic science: bringing the person back into scientific psychology, this time forever. Measurement. 2004;2014:37–41.

  91. 91.

    Hamaker EL. Why researchers should think “within-person”: a paradigmatic rationale. In: Mehl M, Conner T, editors. Handbook of methods for studying daily life. New York, NY: Guilford Publications; 2012. p. 43–61.

  92. 92.

    Peterson MJ, Benca RM. Sleep in mood disorders. Sleep Med Clin. 2008;3:231–49.

  93. 93.

    Ma SH, Teasdale JD. Mindfulness-based cognitive therapy for depression: replication and exploration of differential relapse prevention effects. J Consult Clin Psychol. 2004;72:31–40.

  94. 94.

    Kim NS, Ahn W. Clinical psychologists’ theory-based representations of mental disorders predict their diagnostic reasoning and memory. J Exp Psychol Gen. 2002;131:451–76.

  95. 95.

    Van Loo HM, de Jonge P, Romeijn J-W, Kessler RC, Schoevers RA. Data-driven subtypes of major depressive disorder: a systematic review. BMC Med. 2012;10:156.

  96. 96.

    Baumeister H, Hutter N, Bengel J, Härter M. Quality of life in medically ill persons with comorbid mental disorders: a systematic review and meta-analysis. Psychother Psychosom. 2011;80:275–86.

  97. 97.

    Lichtenberg P, Belmaker RH. Subtyping major depressive disorder. Psychother Psychosom. 2010;79:131–5.

  98. 98.

    Bech P. Struggle for subtypes in primary and secondary depression and their mode-specific treatment or healing. Psychother Psychosom. 2010;79:331–8.

  99. 99.

    Melartin T, Leskelä U, Rytsälä H, Sokero P, Lestelä-Mielonen P, Isometsä E. Co-morbidity and stability of melancholic features in DSM-IV major depressive disorder. Psychol Med. 2004;34:1443.

  100. 100.

    Pae CU, Tharwani H, Marks DM, Masand PS, Patkar AA. Atypical depression: a comprehensive review. CNS Drugs. 2009;23:1023–37.

  101. 101.

    Davidson JRT. A history of the concept of atypical depression. J Clin Psychiatry. 2007;68:10–5.

  102. 102.

    Young MA, Keller MB, Lavori PW, Scheftner WA, Fawcett JA, Endicott J, et al. Lack of stability of the RDC endogenous subtype in consecutive episodes of major depression. J Affect Disord. 1987;12:139–43.

  103. 103.

    Shafer AB. Meta-analysis of the factor structures of four depression questionnaires: Beck, CES-D, Hamilton, and Zung. J Clin Psychol. 2006;62:123–46.

  104. 104.

    Gullion CM, Rush AJ. Toward a generalizable model of symptoms in major depressive disorder. Biol Psychiatry. 1998;44:959–72.

  105. 105.

    Bagby RM, Ryder AG, Schuller DR, Marshall MB. Reviews and overviews – the Hamilton Depression Rating Scale: has the gold standard become a lead weight? Am J Psyc. 2004;161:2163–77.

  106. 106.

    Cramer AOJ, Waldorp LJ, van der Maas HLJ, Borsboom D. Comorbidity: a network perspective. Behav Brain Sci. 2010;33:137–50. Discussion 150–93.

  107. 107.

    Uher R, Farmer A, Maier W, Rietschel M, Hauser J, Marusic A, et al. Measuring depression: comparison and integration of three scales in the GENDEP study. Psychol Med. 2008;38:289–300.

  108. 108.

    Faravelli C. Assessment of psychopathology. Psychother Psychosom. 2004;73:139–41.

  109. 109.

    Schmittmann VD, Cramer AOJ, Waldorp LJ, Epskamp S, Kievit RA, Borsboom D. Deconstructing the construct: a network perspective on psychological phenomena. New Ideas Psychol. 2013;31:43–53.

  110. 110.

    Borsboom D, Cramer AOJ. Network analysis: an integrative approach to the structure of psychopathology. Annu Rev Clin Psychol. 2013;9:91–121.

  111. 111.

    Braeken J. Modeling residual dependencies in latent variable models with copulas. KU Leuven: Leuven; 2008.

  112. 112.

    Lilienfeld SO. DSM-5: centripetal scientific and centrifugal. Clin Psychol Sci Pract. 2014;21:269–79.

  113. 113.

    Parker G. Beyond major depression. Psychol Med. 2005;35:467–74.

  114. 114.

    Costello C. The advantages of the symptom approach to depression. In: Costello C, editor. Symptoms of depression. New York: John Wiley and Sons; 1993. p. 1–21.

  115. 115.

    Hasler G, Drevets WC, Manji HK, Charney DS. Discovering endophenotypes for major depression. Neuropsychopharmacology. 2004;29:1765–81.

  116. 116.

    Persons JB. The advantages of studying psychological phenomena rather than psychiatric diagnoses. Am Psychol. 1986;41:1252–60.

  117. 117.

    Musselman D, Lawson D, Gumnick J, Manatunga A, Penna S, Goodkin R, et al. Paroxetine for the prevention of depression induced by high-dose interferon alfa. New Eng J Med. 2001;344:961–6.

  118. 118.

    Cassidy WL, Planagan NB, Spellman M, Cohen ME. Clinical observations in manic-depressive disease; a quantitative study of one hundred manic-depressive patients and fifty medically sick controls. J Am Med Assoc. 1957;164:1535–46.

  119. 119.

    Feighner JP, Robins E, Guze SB, Woodruff RA, Winokur G, Munoz R. Diagnostic criteria for use in psychiatric research. Arch Gen Psychiatry. 1972;26:57–63.

  120. 120.

    Andrews G, Slade T, Sunderland M, Anderson T. Issues for DSM-V: simplifying DSM-IV to enhance utility: the case of major depressive disorder. Am J Psychiatry. 2007;164:1784–5.

  121. 121.

    Judd LL, Schettler PJ, Coryell W, Akiskal HS, Fiedorowicz JG. Overt irritability/anger in unipolar major depressive episodes: past and current characteristics and implications for long-term course. JAMA Psychiatry. 2013;70:1171–80.

  122. 122.

    Gollan JK, Fava M, Kurian B, Wisniewski SR, Rush AJ, Daly E, et al. What are the clinical implications of new onset or worsening anxiety during the first two weeks of SSRI treatment for depression? Depress Anxiety. 2012;29:94–101.

  123. 123.

    Van Loo HM, Cai T, Gruber MJ, Li J, de Jonge P, Petukhova M, et al. Major depressive disorder subtypes to predict long-term course. Depress Anxiety. 2014;13:1–13.

  124. 124.

    Pedrelli P, Blais MA, Alpert JE, Shelton RC, Walker RSW, Fava M. Reliability and validity of the Symptoms of Depression Questionnaire (SDQ). CNS Spectr. 2014;19:535–46.

  125. 125.

    Sjöström N, Waern M, Hetta J. Nightmares and sleep disturbances in relation to suicidality in suicide attempters. Sleep. 2007;30:91–5.

  126. 126.

    Brickenkamp R, Zillmer E. The d2 Test of Attention. Seattle: Hogrefe & Huber Publishers; 1998.

  127. 127.

    Kessler RC, Chiu WT, Demler O, Merikangas KR, Walters EE. Prevalence, severity, and comorbidity of 12-month DSM-IV disorders in the National Comorbidity Survey Replication. Arch Gen Psychiatry. 2005;62:617–27.

  128. 128.

    Zimmerman M, Chelminski I, McGlinchey JB, Young D. Diagnosing major depressive disorder X: can the utility of the DSM-IV symptom criteria be improved? J Nerv Ment Dis. 2006;194:893–7.

  129. 129.

    Watson D, O’Hara MW, Simms LJ, Kotov R, Chmielewski M, McDade-Montez EA, et al. Development and validation of the Inventory of Depression and Anxiety Symptoms (IDAS). Psychol Assess. 2007;19:253–68.

  130. 130.

    Zimmerman M, Martinez JH, Friedman M, Boerescu D, Attiullah N, Toba C. How can we use depression severity to guide treatment selection when measures of depression categorize patients differently? J Clin Psychiatry. 2012;73:1287–91.

  131. 131.

    Santor DA, Gregus M, Welch A. Eight decades of measurement in depression. Measurement. 2009;4:135–55.

  132. 132.

    Uher R, Tansey KE, Dew T, Maier W, Mors O, Hauser J, et al. An inflammatory biomarker as a differential predictor of outcome of depression treatment with escitalopram and nortriptyline. Am J Psychiatry. 2014;1:1–9.

  133. 133.

    Vogelzangs N, Duivis HE, Beekman ATF, Kluft C, Neuteboom J, Hoogendijk W, et al. Association of depressive disorders, depression characteristics and antidepressant medication with inflammation. Transl Psychiatry. 2012;2:e79.

  134. 134.

    Tukey JW. Analyzing data: sanctification or detective work? Am Psychol. 1969;24:83–91.

Download references


EIF was supported in part by the Cluster of Excellence ‘Languages of Emotion’ (EXC302), the Research Foundation Flanders (G.0806.13), the Belgian Federal Science Policy within the framework of the Interuniversity Attraction Poles program (IAP/P7/06), and the grant GOA/15/003 from University of Leuven.

Author information



Corresponding author

Correspondence to Eiko I Fried.

Additional information

Competing interests

The authors have no competing interests to report.

Author’s contributions

EIF initiated the paper and reviewed the literature, EIF and RMN helped in drafting the paper. EIF and RMN have seen and approved the final version.

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Fried, E.I., Nesse, R.M. Depression sum-scores don’t add up: why analyzing specific depression symptoms is essential. BMC Med 13, 72 (2015).

Download citation


  • Depression symptoms
  • Diagnostic and Statistical Manual of Mental Disorders
  • Heterogeneity
  • Major depressive disorder
  • Nosology