Prenatal and Postpartum Depression in Fathers and Its Association With Maternal Depression
A Meta-analysis
- James F. Paulson, PhD;
- Sharnail D. Bazemore, MS
-
Corresponding Author: James F. Paulson, PhD, Department of Pediatrics, Eastern Virginia Medical School, E. V. Williams Hall, 855 W Brambleton Ave, Norfolk, VA 23510 (paulsojf@evms.edu).
Abstract
Context It is well established that maternal prenatal and postpartum depression is prevalent and has negative personal, family, and child developmental outcomes. Paternal depression during this period may have similar characteristics, but data are based on an emerging and currently inconsistent literature.
Objective To describe point estimates and variability in rates of paternal prenatal and postpartum depression over time and its association with maternal depression.
Data Sources Studies that documented depression in fathers between the first trimester and the first postpartum year were identified through MEDLINE, PsycINFO, EMBASE, Google Scholar, dissertation abstracts, and reference lists for the period between January 1980 and October 2009.
Study Selection Studies that reported identified cases within the selected time frame were included, yielding a total of 43 studies involving 28 004 participants after duplicate reports and data were excluded.
Data Extraction Information on rates of paternal and maternal depression, as well as reported paternal-maternal depressive correlations, was extracted independently by 2 raters. Effect sizes were calculated using logits, which were back-transformed and reported as proportions. Random-effects models of event rates were used because of significant heterogeneity. Moderator analyses included timing, measurement method, and study location. Study quality ratings were calculated and used for sensitivity analysis. Publication bias was evaluated with funnel plots and the Egger method.
Data Synthesis Substantial heterogeneity was observed among rates of paternal depression, with a meta-estimate of 10.4% (95% confidence interval [CI], 8.5%-12.7%). Higher rates of depression were reported during the 3- to 6-month postpartum period (25.6%; 95% CI, 17.3%-36.1%). The correlation between paternal and maternal depression was positive and moderate in size (r = 0.308; 95% CI, 0.228-0.384). No evidence of significant publication bias was detected.
Conclusions Prenatal and postpartum depression was evident in about 10% of men in the reviewed studies and was relatively higher in the 3- to 6-month postpartum period. Paternal depression also showed a moderate positive correlation with maternal depression.
- KEYWORDS:
- CLINICIAN'S CORNER
- DEPRESSION
- DEPRESSION, POSTPARTUM
- DEPRESSIVE DISORDER, MAJOR
- FATHERS
- MEN'S HEALTH
- META-ANALYSIS
- MOTHERS
- PARENTS
- PREGNANCY
- WOMEN'S HEALTH
The prevalence, risk factors, and effects of depression among new fathers are poorly understood. Although a large body of research on maternal depression documents incidence rates between 10% and 30% and negative family and child developmental outcomes,1,2,3 paternal prenatal and postpartum depression has received little attention from researchers and clinicians.4 The emerging literature on paternal depression suggests that, like their maternal counterparts, fathers are at increased risk of depression in the postpartum5 and gestational periods.6,7,8 Moreover, several studies have now documented negative child outcomes associated with paternal prenatal and postpartum depression.9,10
Although recent literature has addressed this phenomenon, studies in paternal prenatal and postpartum depression are troubled by inconsistent methods, clinical heterogeneity, and prevalence estimates that vary considerably.5,7,11,12,13,14,15,16,17,18,19,20 To date, only 2 reviews on prenatal and postpartum depression in fathers have been published, but neither sought to quantitatively synthesize or resolve the discrepancies across studies, methods, or other issues.5,11 We conducted the present meta-analysis of depression in expecting and new fathers to (1) estimate paternal depression between the first trimester and 1 year postpartum; (2) describe differences across time within this period; (3) examine the association between paternal and maternal depression; (4) estimate the prevalence of maternal prenatal and postpartum depression identified in paternal depression studies; and (5) identify how published rates of paternal depression were affected by methodological factors such as measurement method, study location, and sample risk status.
METHODS
Search Strategy
We used 3 methods to identify studies for this meta-analysis. First, we used the reference lists of the most relevant reviews.5,11 Next, we searched MEDLINE, PsycINFO, Dissertation Abstracts International, EMBASE, and Google Scholar using the search terms depression, paternal, father, postnatal, postpartum, prenatal, antenatal, and perinatal. Finally, we used the “ancestry approach,”21 which involves consulting the reference lists of retrieved articles to find earlier relevant studies. Because of the emergent nature of this body of observational research literature, an inclusive approach to study selection was used.22,23 Therefore, we included all relevant and accessible journal articles, dissertations, and book chapters that were produced between January 1980 and October 2009 that assessed paternal depression during pregnancy, the first postpartum year, or both.
Study Selection
Studies that reported an estimated number of depression cases among identified fathers were included. This resulted in the exclusion of several studies that reported mean scores for symptom severity because the exact number of cases could not be clearly determined. Several articles9,10,24,25,26,27 used data from common databases and were excluded to avoid duplication of data. Several studies measured depression on multiple occasions. In these cases, 1 depression measure per time period was selected based on these priorities: (1) structured interviews; (2) measures with demonstrated generality in men (eg, Beck Depression Inventory28) vs adaptations of maternal measures (eg, Edinburgh Postnatal Depression Scale29); and (3) measures with greater specificity for depression (eg, Beck Depression Inventory28 vs General Health Questionnaire30). We excluded studies that selected fathers based on established maternal mental health problems because this could bias meta-analytic estimates. Also, because the identified studies of teen fathers were characterized by significant economic and social stressors, only studies of fathers aged 18 years or older were included.
Data Abstraction and Quality Assessment
The 2 authors used a standardized coding manual (available from the authors on request) to extract the following data from articles: author names, publication year, sample size, period of assessment, study sample risk (0 or 1; high risk coded when a study denoted this clearly, including medically assisted pregnancies and infants with feeding, sleeping, or crying problems), location, sample size, response rate, number of fathers identified as depressed, number of mothers identified as depressed (when assessed), and correlation between maternal and paternal depressive symptoms. The coding manual was developed a priori and modified after use in several studies. Coding was done independently then aggregated, with disagreements resolved through discussion and consensus. Although quality assessment can be reliably conducted in meta-analyses of experimental studies, its use in observational research is controversial, with no clear consensus on rating methods or their appropriate use in analysis. As such, we used a simple objective rating system (based on the meta-analysis of similar data by Bennett et al2) that coded studies on a scale of 0 to 10, assigning 2 points each for sampling method (systematic or probability vs convenience or not reported), presence of clearly stated inclusion criteria, racial/ethnic diversity (≥20% minority), educational diversity (≤80% at 1 educational level), and response rate (reported at ≥60%). Studies that did not report these methodological issues received lower scores. Because evidence on the validity of quality ratings in observational research is lacking, we adopted the approach of Stroup et al23 of broadly including studies and using sensitivity analysis to determine incremental effects of lower-quality studies.
Effect Size and Statistical Analysis
Primary Outcome. The primary outcome was the point prevalence rate of paternal depression, defined as the number of cases divided by the total number of study participants. We coded these into both simple proportional effect sizes (by dividing the number of cases by the sample size) and logit units, as a direct transformation of these proportions. In this context, the logit transformation was used to form an unbounded (in contrast to the 0-to-1 bounded nature of proportions) estimate to facilitate moderator analysis.31 After analysis, logit units were back-transformed to proportions for the purposes of reporting.
Secondary Outcomes. Secondary outcomes included rates of depression in female partners, which we coded as raw proportions and logit units, and standardized zero-order correlations between paternal and maternal depressive symptoms (when measured with a continuous or ordinal scale).
All major analyses were conducted with Comprehensive Meta-Analysis, version 2.0.32 In general, random-effects models are argued to better address heterogeneity between studies and study populations, allowing for greater flexibility in parsing effect size variability. Moreover, they are less influenced by extreme variations in sample size.22 Because studies in this meta-analysis are characterized by heterogeneity and highly variable sample sizes, random-effects models were used. Heterogeneity among study point estimates was assessed with the Q statistic, with magnitude of heterogeneity being evaluated with the I2 index.31 When reported, all confidence intervals (CIs) reflect a 95% criterion.
We examined the following determinants of primary and secondary outcomes: period of measurement, risk status of the sample (eg, infant problems, medically assisted fertilization), and case identification method (interview vs rating scale). Study location was also coded because previous work has identified geographic variations in postpartum depression.33 Because the timing of paternal depression vis-à-vis childbirth is of great interest in this study, period of measurement was coded into blocks that included (1) first trimester to 6 months' gestation, (2) greater than 6 months' gestation to birth, (3) immediately postbirth to 3 months postpartum, (4) greater than 3 months postpartum to 6 months postpartum, and (5) greater than 6 months postpartum to 12 months postpartum.
Publication bias was assessed by visually inspecting funnel plots and applying the regression intercept of Egger et al.34 In addition, we used the fail-safe procedure of Orwin,35 which is based on effect sizes that would be considered practically insignificant rather than the traditional null-effect reference. This generated a number of unpublished studies with effects at the estimated population base rate for adult male depression36 that would be needed to move estimates to a nonsignificant difference from base rates.
To assess the robustness of the results, we performed sensitivity analyses by sequentially removing each study and rerunning the analysis. We also conducted a separate analysis excluding studies with quality ratings in the lowest third to determine if potential methodological weaknesses influenced meta-analytic estimates.
RESULTS
Included Studies
Of the initial 256 identified studies, most (n = 163) were excluded because they were not applicable to the present meta-analysis (eg, articles on other topics, depression not assessed in fathers, reviews or summaries, beyond postpartum period, infant death, teen parents). Of those that were reviewed in full text, 30 were excluded because the proportion of depressive cases was not reported, 16 reported on a sample already included in the present study, and 4 were not retrievable (Figure 1). After a thorough review, 43 studies met the inclusion criteria for this meta-analysis6,7,8,13,14,15,16,17,18,19,20,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68 (Table). Of these studies, 23 reported rates of paternal depression at 2 or more time points and 20 reported a single observation. Because the inclusion of multiple effect sizes from a single sample would compromise the independence assumption of meta-analysis,22 primary analyses used the earliest reported estimate, as this generally reflects a larger preattrition sample size.
Table. Characteristics of Studies Included in Meta-analysis
The Table provides details on the characteristics of the 43 studies. Studies originated in 16 countries, with the United States contributing the most (n = 17 studies). Most studies (n = 40) used a self-report rating scale as the primary case definition method, with the remainder (n = 3) using a structured or semistructured interview. Three studies enrolled men from higher-risk samples. Two studies used population-based sampling procedures embedded within larger birth cohort studies, but most studies (n = 30) recruited from maternity or postpartum units, the remainder coming from parenting/prenatal classes and other health services. Thirty studies reported response rates greater than 70%. In addition to reporting paternal depression, 35 studies reported rates of partners' maternal depression and 14 reported the correlation between maternal and paternal depressive symptoms. Sample sizes varied widely across studies (N = 23-10 975), with a median of 130 participants (first quartile = 80; third quartile = 307). In all, using initial sample sizes across the 43 studies, a total of 28 004 participants are represented in this meta-analysis.
Tests for Heterogeneity
According to the criteria set by Higgins and Thompson,69 the heterogeneity in published rates of paternal depression was statistically significant and large in magnitude (Q = 825.081; P < .001; I2 = 94.910; τ2 = 0.470). Maternal depression also demonstrated significant heterogeneity across studies (Q = 1394.968; P < .001; I2 = 97.563; τ2 = 0.792), but the evidence for heterogeneity among correlations between maternal and paternal depressive symptoms was equivocal (Q = 89.906; P < .001; I2 = 85.540; τ2 = 0.019).
Primary Outcomes
The overall random-effects estimate of paternal depression was 10.4% (95% CI, 8.5%-12.7%) (Figure 2). Although no significant differences in depression rates were observed between higher- and lower-risk samples (lower risk, 10.1%; 95% CI, 8.2%-12.4%; higher risk, 15.6%; 95% CI, 5.6%-36.5%; Q = 0.721; P = .40), moderator analyses revealed 3 significant factors. First, there was considerable variability between different time periods vis-à-vis birth (Q = 20.256; P < .001), with the 3- to 6-month postpartum period showing the highest rate (25.6%; 95% CI, 17.3%-36.1%) and the first 3 postpartum months showing the lowest rate (19 studies; 7.7%; 95% CI, 5.3%-11.1%). Second, national origin of the study accounted for variability in depression rates of fathers (Q = 7.108; P = .008), with the US studies reporting an average rate of 14.1% (95% CI, 10.9%-18.0%) and international studies reporting an average rate of 8.2% (95% CI, 5.9%-11.1%). Finally, interview-based case definition methods were associated with lower overall prevalence estimates (rating scale, 11.0%; 95% CI, 8.9%-13.5%; interview, 4.9%; 95% CI, 3.6%-6.7%; Q = 18.236; P < .001). Because paternal age and family size were inconsistently reported, conclusions could not be drawn regarding the moderator effects of either.
Overall effects were calculated through random-effects model estimates, with separate calculations for overall effects within each period and across all periods. Effect sizes were calculated via a logit transformation of rates (number of reported cases divided by the sample size), which were back-transformed to proportions after estimates and standard errors were computed. Studies are stratified by period of assessment. For studies that assessed depression at multiple time points, only the earliest estimate is reported. Data marker size corresponds to study sample size.
Maternal depression had a meta-analytic point estimate of 23.8% (95% CI, 18.7%-29.7%). Time period was a significant determinant of maternal depression (Q = 22.156; P < .001), with higher rates reported during the 3- to 6-month postpartum period (41.6%). Measurement method (rating scale, 25.5%; 95% CI, 20.0%-31.9%; interview, 9.8%; 95% CI, 5.9%-15.8%) was also a significant predictor of maternal depression rate (Q = 12.773; P < .001). Study location (United States, 29.6; 95% CI, 19.3%-42.5%; international, 19.7%; 95% CI, 15.0%-25.4%) demonstrated a trend toward higher rates in the United States (Q = 2.599; P = .107).
The overall random-effects estimate of maternal-paternal depressive symptom correlation was significantly larger than 0 and moderate in magnitude (r = 0.308; 95% CI, 0.228-0.384).
Tests for Publication Bias
Visual inspection of funnel plots (available from the authors) revealed no obvious evidence of publication bias. Quantitative evaluation of publication bias, as measured by the Egger intercept, was nonsignificant (P = .15). Finally, the Orwin fail-safe procedure, using a base rate of 3%,36 determined that 1444 unpublished studies at or below this level would be needed to bring the overall meta-analytic estimate of prenatal and postpartum depression to a nonsignificant difference from the base rate.
Sensitivity Analyses
Robustness of meta-analytic findings was examined by sequentially removing each study and reanalyzing the remaining data set (producing a new analysis for each study removed). No study affected the meta-analytic estimate more than 0.5%. Removing studies with quality ratings in the lowest 33% decreased the meta-analytic estimate of paternal depression by only 0.6% (from 10.4% to 9.8%). The pattern of differences across time periods, measurement methods, and study locations remained essentially unchanged in direction and magnitude.
COMMENT
In this meta-analysis of paternal prenatal and postpartum depression and its correlation with maternal depression, wide variation was observed in reported rates of depression for fathers and mothers. The overall meta-analytic rate of paternal depression between the first trimester and 1 year postpartum was 10.4%. Since recent national data on base rates of depression in men place the 12-month prevalence at 4.8%,70 this suggests that paternal prenatal and postpartum depression represents a significant public health concern. It must be noted that considerable variability was observed in reported rates of paternal depression. Although timing of measurement, study location, and measurement method were significant predictors, they accounted for only a small amount of overall heterogeneity. In terms of timing, fathers experienced the highest rates of depression 3 to 6 months postpartum, although the small number of studies measuring paternal depression during this period suggests cautious interpretation. Differences were also observed across study locations, with higher rates of prenatal and postpartum depression reported in the United States (14.1% vs 8.2% internationally). Questionnaire methods of case identification produced somewhat higher rates than did interview methods, although this should be interpreted cautiously because of the small number of studies that used interviews. Surprisingly, sample risk status was not a determinant of depression rates.
Maternal depression demonstrated considerable heterogeneity. This varied by time period, with a peak rate of 41.6% in the 3- to 6-month postpartum period, and by measurement method (higher rates with rating scales). Our random-effects estimate is somewhat larger than that of some reports,1,2 with the variability in rates being clearly observable.
In both men and women, the potential causes of unexplained heterogeneity are varied. Although interview vs self-report questionnaire methods were compared, there were too many different questionnaires and interviews to conduct an instrument-by-instrument moderator analysis. Sample location and undetermined sample characteristics may also contribute to heterogeneity. One possible source of heterogeneity is the liberal inclusion of cases that can be classified as minor depression, a category that includes individuals with depressive symptoms and impairment who do not meet strict criteria, either by severity, number, or duration of symptoms, for major depressive disorder.71 Few of the studies reported herein explicitly described minor depression, but the use of rating scale cutoff scores suggests that strict diagnostic criteria for major depressive disorder were not used to identify cases. This interpretation is also consistent with the lower rates of identification observed in studies using structured interviews, a method that is typically more conservative in making a clinical diagnosis.
Another area of focus for this meta-analysis was the correlation between maternal and paternal depressive symptoms. Examining this effect in the context of paternal prenatal and postpartum depression is important, as paternal depression has been examined almost exclusively in the context of fathers paired with index mothers or children. Moreover, an extant meta-analysis of maternal postpartum depression suggests that marital satisfaction, a close correlate of depression, is among the strongest predictors of maternal depression.72 In our meta-analysis, 12 of the 14 correlations were significant (meta-analytic estimate, 0.308), a moderate association by most standards. Although other authors have suggested that maternal depression may play some causal role in paternal depression,5,11,40 none of the studies included in this meta-analysis suggest direction of causal influence. Studies that speak to direction of effect are of great interest, particularly for their implications for screening and prevention, but strong evidence of this is not yet available.
Our study has several limitations. First, because studies used variable methods of measuring and reporting depression in different time periods, time frame–specific prevalence cannot be clearly established, limiting interpretation to the rate of depression observed at that point in time. Also, since point estimates are drawn from a pool of heterogeneous studies, many of which did not use strong population-based sampling methods, there is a potential of bias in our results from studies' methodological weaknesses. These may not have been adequately accounted for by our simplified method of quality rating. The method of identifying depressed cases was highly variable across studies, thereby limiting the specificity of our primary outcome. However, this variability in case identification accurately reflects inconsistencies in both applied and basic research into prenatal and postpartum depression.5 Removing relatively weaker studies in sensitivity analysis left effects essentially unchanged. We did not find substantial evidence of publication bias in this area, and fail-safe analysis suggested that our findings are robust to unpublished null findings.
With these limitations in mind, this meta-analysis allows us to draw several conclusions regarding paternal prenatal and postpartum depression. First, a significant number of expecting and new fathers experience depression during this period. Second, expecting and new fathers in the United States experience depression at marginally higher rates than do fathers internationally, a finding that bears further investigation vis-à-vis varying social norms and postpartum work practices cross-nationally. Third, there is a moderate correlation between depression in fathers and mothers. There are many implications of these findings. The observation that expecting and new fathers disproportionately experience depression suggests that more efforts should be made to improve screening and referral, particularly in light of the mounting evidence that early paternal depression may have substantial emotional, behavioral, and developmental effects on children.10,55 The correlation between paternal and maternal depression also suggests a screening rubric73—depression in one parent should prompt clinical attention to the other. Likewise, prevention and intervention efforts for depression in parents might be focused on the couple and family rather than the individual.
Future research in this area should focus on parents together to examine the onset and joint course of depression in new parents. This may increase our capacity for early identification of parental depression, add leverage for prevention and treatment, and increase the understanding of how parental depression conveys risk to infants and young children.
Author Contributions: Dr Paulson had full access to all of the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.
Study concept and design: Paulson.
Acquisition of data: Paulson, Bazemore.
Analysis and interpretation of data: Paulson, Bazemore.
Drafting of the manuscript: Paulson, Bazemore.
Critical revision of the manuscript for important intellectual content: Paulson, Bazemore.
Statistical analysis: Paulson, Bazemore.
Administrative, technical, or material support: Paulson, Bazemore.
Study supervision: Paulson.
Financial Disclosures: None reported.










