Consensus Statement
JAMA. 2000;283(15):2008-2012. doi: 10.1001/jama.283.15.2008

Meta-analysis of Observational Studies in Epidemiology

A Proposal for Reporting

  1. Donna F. Stroup, PhD, MSc;
  2. Jesse A. Berlin, ScD;
  3. Sally C. Morton, PhD;
  4. Ingram Olkin, PhD;
  5. G. David Williamson, PhD;
  6. Drummond Rennie, MD;
  7. David Moher, MSc;
  8. Betsy J. Becker, PhD;
  9. Theresa Ann Sipe, PhD;
  10. Stephen B. Thacker, MD, MSc;
  11. for the Meta-analysis Of Observational Studies in Epidemiology (MOOSE) Group
  1. Author Affiliations: Centers for Disease Control and Prevention, Atlanta, Ga (Drs Stroup, Williamson, and Thacker); University of Pennsylvania School of Medicine, Philadelphia (Dr Berlin); RAND Corporation, Santa Monica (Dr Morton), University of California, San Francisco (Dr Rennie), Stanford University, Stanford (Dr Olkin), Calif; JAMA, Chicago, Ill (Dr Rennie); Thomas C. Chalmers Centre for Systematic Reviews, Children's Hospital of Eastern Ontario Research Institute, Ottawa (Mr Moher); Michigan State University, East Lansing (Dr Becker); and Georgia State University, Atlanta (Dr Sipe).

Abstract

Objective  Because of the pressure for timely, informed decisions in public health and clinical practice and the explosion of information in the scientific literature, research results must be synthesized. Meta-analyses are increasingly used to address this problem, and they often evaluate observational studies. A workshop was held in Atlanta, Ga, in April 1997, to examine the reporting of meta-analyses of observational studies and to make recommendations to aid authors, reviewers, editors, and readers.

Participants  Twenty-seven participants were selected by a steering committee, based on expertise in clinical practice, trials, statistics, epidemiology, social sciences, and biomedical editing. Deliberations of the workshop were open to other interested scientists. Funding for this activity was provided by the Centers for Disease Control and Prevention.

Evidence  We conducted a systematic review of the published literature on the conduct and reporting of meta-analyses in observational studies using MEDLINE, Educational Research Information Center (ERIC), PsycLIT, and the Current Index to Statistics. We also examined reference lists of the 32 studies retrieved and contacted experts in the field. Participants were assigned to small-group discussions on the subjects of bias, searching and abstracting, heterogeneity, study categorization, and statistical methods.

Consensus Process  From the material presented at the workshop, the authors developed a checklist summarizing recommendations for reporting meta-analyses of observational studies. The checklist and supporting evidence were circulated to all conference attendees and additional experts. All suggestions for revisions were addressed.

Conclusions  The proposed checklist contains specifications for reporting of meta-analyses of observational studies in epidemiology, including background, search strategy, methods, results, discussion, and conclusion. Use of the checklist should improve the usefulness of meta-analyses for authors, reviewers, editors, readers, and decision makers. An evaluation plan is suggested and research areas are explored.

Because of pressure for timely and informed decisions in public health and medicine and the explosion of information in the scientific literature, research results must be synthesized to answer urgent questions.1-4 Principles of evidence-based methods to assess the effectiveness of health care interventions and set policy are cited increasingly.5 Meta-analysis, a systematic approach to identifying, appraising, synthesizing, and (if appropriate) combining the results of relevant studies to arrive at conclusions about a body of research, has been applied with increasing frequency to randomized controlled trials (RCTs), which are considered to provide the strongest evidence regarding an intervention.6-7

However, in many situations randomized controlled designs are not feasible, and only data from observational studies are available.8 Here, we define an observational study as an etiologic or effectiveness study using data from an existing database, a cross-sectional study, a case series, a case-control design, a design with historical controls, or a cohort design.9 Observational designs may lack the experimental element of a random allocation to an intervention and rely on studies of association between changes or differences in 1 characteristic (eg, an exposure or intervention) and changes or differences in an outcome of interest. These designs have long been used in the evaluation of educational programs10 and exposures that might cause disease or injury.11 Studies of risk factors generally cannot be randomized because they relate to inherent human characteristics or practices, and exposing subjects to harmful risk factors is unethical.12 At times, clinical data may be summarized in order to design a randomized comparison.13 Observational data may also be needed to assess the effectiveness of an intervention in a community as opposed to the special setting of a controlled trial.14 Thus, a clear understanding of the advantages and limitations of statistical syntheses of observational data is needed.15

Although meta-analysis restricted to RCTs is usually preferred to meta-analyses of observational studies,16-18 the number of published meta-analyses concerning observational studies in health has increased substantially during the past 4 decades (678 in 1955-1992, 525 in 1992-1995, and more than 400 in 1996 alone).19

While guidelines for meta-analyses have been proposed, many are written from the meta-analyst's (author's) rather than from the reviewer's, editor's, or reader's perspective20 and restrict attention to reporting of meta-analyses of RCTs.21-22 Meta-analyses of observational studies present particular challenges because of inherent biases and differences in study designs23; yet, they may provide a tool for helping to understand and quantify sources of variability in results across studies.24

We describe here the results of a workshop held in Atlanta, Ga, in April 1997, to examine concerns regarding the reporting of Meta-analysis Of Observational Studies in Epidemiology (MOOSE). This article summarizes deliberations of 27 participants (the MOOSE group) of evidence leading to recommendations regarding the reporting of meta-analyses. Meta-analysis of individual-level data from different studies, sometimes called "pooled analysis" or "meta-analysis of individual patient data,"25-26 has unique challenges that we will not address here. We propose a checklist of items for reporting that builds on similar activities for RCTs22 and is intended for use by authors, reviewers, editors, and readers of meta-analyses of observational studies.

METHODS

We conducted a systematic review of the published literature on the conduct and reporting of meta-analyses in observational studies. Databases searched included MEDLINE, Educational Resources Information Center, PsycLIT (http://www.wesleyan.edu/libr), and the Current Index to Statistics. In addition, we examined reference lists and contacted experts in the field. We used the 32 articles retrieved to generate the conference agenda and set topics of bias, searching and abstracting, heterogeneity, study categorization, and statistical methods. We invited experts in meta-analysis from the fields of clinical practice, trials, statistics, epidemiology, social sciences, and biomedical editing.

The workshop included an overview of the quality of reporting of meta-analyses in education and the social sciences. Plenary talks were given on the topics set by the conference agenda. For each of 2 sessions, workshop participants were assigned to 1 of 5 small discussion groups, organized around the topic areas. For each group, 1 of the authors served as facilitator, and a recorder summarized points of discussion for issues to be presented to all participants. Time was provided for the 2 recorders and 2 facilitators for each topic to meet and prepare plenary presentations given to the entire group. We proposed a checklist for meta-analyses of observational studies based on the deliberation of the independent groups. Finally, we circulated the checklist for comment to all conference attendees and representatives of several constituencies who would use the checklist.

RESULTS

The checklist resulting from workgroup deliberations is organized around recommendations for reporting background, search strategy, methods, results, discussion, and conclusions (Table 1).

Table. A Proposed Reporting Checklist for Authors, Editors, and Reviewers of Meta-analyses of Observational Studies

Background

Reporting of the background should include the definition of the problem under study, statement of hypothesis, description of the study outcome(s) considered, type of exposure or intervention used, type of study design used, and complete description of the study population. When combining observational studies, heterogeneity of populations (eg, US vs international studies), design (eg, case-control vs cohort studies), and outcome (eg, different studies yielding different relative risks that cannot be accounted for by sampling variation) is expected.8

Search

Reporting of the search strategy should include qualifications of the searchers, specification of databases used, search strategy and index terms, use of any special features (eg, "explosion"), search software used, use of hand searching and contact with authors, use of materials in languages other than English, use of unpublished material, and exclusion criteria used. Published research shows that use of electronic databases may find only half of all relevant studies, and contacting authors may be useful,27 although this result may not be true for all topic areas.28

For example, a meta-analysis of depression in elderly medical inpatients29 used 2 databases for the search. In addition, bibliographies of retrieved papers were searched. However, the authors did not report their search strategy in enough detail to allow replication. An example of a thorough "reject log" can be found in the report of a meta-analysis of electrical and magnetic field exposure and leukemia.30 Examples of a table characterizing studies included can be found in Franceschi et al31 and Saag et al.32 Complete specification of search strategy is not uniform; a review of 103 published meta-analyses in education showed that search procedures were described inadequately in the majority of the articles.10

Methods

Items in this checklist section are concerned with the appropriateness of any quantitative summary of the data; degree to which coding of data from the articles was specified and objective; assessment of confounding, study quality, and heterogeneity; use of statistical methods; and display of results. Empirical evidence shows that reporting of procedures for classification and coding and quality assessment is often incomplete: fewer than half of the meta-analyses reported details of classifying and coding the primary study data, and only 22% assessed quality of the primary studies.10

We recognize that the use of quality scoring in meta-analyses of observational studies is controversial, as it is for RCTs,16, 33 because scores constructed in an ad hoc fashion may lack demonstrated validity, and results may not be associated with quality.34 Nevertheless, some particular aspects of study quality have been shown to be associated with effect: eg, adequate concealment of allocation in randomized trials.35 Thus, key components of design, rather than aggregate scores themselves, may be important. For example, in a study of blinding (masking) of readers participating in meta-analyses, masking essentially made no difference in the summary odds ratios across the 5 meta-analyses.36 We recommend the reporting of quality scoring if it has been done and also recommend subgroup or sensitivity analysis rather than using quality scores as weights in the analysis.37-38

While some control over heterogeneity of design may be accomplished through the use of exclusion rules, we recommend using broad inclusion criteria for studies, and then performing analyses relating design features to outcome.8 In cases when heterogeneity of outcomes is particularly problematic, a single summary measure may well be inappropriate.39 Analyses that stratify by study feature or regression analysis with design features as predictors can be useful in assessing whether study outcomes indeed vary systematically with these features.40

Investigating heterogeneity was a key feature of a meta-analysis of observational studies of asbestos exposure and risk of gastrointestinal cancer.41 The authors of the meta-analysis hypothesized that studies allowing for a latent period between the initiation of exposure and any increases in risk should show, on average, appropriately higher standardized mortality ratios than studies that ignored latency. In other words, the apparent effect of exposure would be attenuated by including the latent period in the calculation of time at risk (the "denominator"), since exposure-related deaths (the "numerator") would, by definition, not occur during that latent period (Figure 1).

Figure. Effect of Latent Period on Heterogeneity

In fact, the data suggested that studies allowing for latent periods found on average somewhat higher standardized mortality ratios than studies ignoring latency. This example shows that sources of bias and heterogeneity can be hypothesized prior to analysis and subsequently confirmed by the analysis.

Results

Recommendations for reporting of results include graphical summaries of study estimates and any combined estimate, a table listing descriptive information for each study, results of sensitivity testing and any subgroup analysis, and an indication of statistical uncertainty of findings.

Discussion

The discussion should include issues related to bias, including publication bias, confounding, and quality. Bias can occur in the original studies (resulting from flaws in the study design that tend to distort the magnitude or direction of associations in the data) or from the way in which studies are selected for inclusion.42 Publication bias, the selective publication of studies based on the magnitude (usually larger) and direction of their findings, represents a particular threat to the validity of meta-analysis of observational studies.43-45 Thorough specifications of quality assessment can contribute to understanding some of the variations in the observational studies themselves. Methods should be used to aid in the detection of publication bias, eg, fail-safe procedures or funnel plots.46

Schlesselman47 comments on such biases in assessing the possible association between endometrial cancer and oral contraceptives. This meta-analysis combined both cohort and case-control studies and used a sensitivity analysis to illustrate the influence of specific studies, such as those published in English.

Conclusion

Due to these biases in observational studies, the conclusion of the report should contain consideration of alternative explanations for observed results and appropriate generalizations of the conclusion. A carefully conducted meta-analysis can reveal areas warranting further research. Finally, since funding source has been shown to be an important source of heterogeneity,48 the sponsoring organization should be disclosed and any effect on analysis should be examined.

COMMENT

Taking stock of what is known in any field involves reviewing the existing literature, summarizing it in appropriate ways, and exploring the implications of heterogeneity of population and study for heterogeneity of study results. Meta-analysis provides a systematic way of performing this research synthesis, while indicating when more research is necessary.

The application of formal meta-analytic methods to observational studies has been controversial.42 One reason for this has been that potential biases in the original studies, relative to the biases in RCTs, make the calculation of a single summary estimate of effect of exposure potentially misleading. Similarly, the extreme diversity of study designs and populations in epidemiology makes the interpretation of simple summaries problematic, at best. In addition, methodologic issues related specifically to meta-analysis, such as publication bias, could have particular impact when combining results of observational studies.44, 47

Despite these challenges, meta-analyses of observational studies continue to be one of the few methods for assessing efficacy and effectiveness and are being published in increasing numbers. Our goal is to improve the reporting of these meta-analyses so that readers can understand what was done in a given analysis, who did it, and why it was done. If bias is a problem, we suggest that an informative approach is to use broad inclusion criteria for studies and then to perform analyses (when the data permit) relating suspected sources of bias and variability to study findings.

Methodologic and interpretational concerns make the clear and thorough reporting of meta-analyses of observational studies absolutely essential. Our workshop was convened to address the problem of increasing diversity and variability that exist in reporting meta-analyses of observational studies. In constructing the checklist, we have attempted, where possible, to provide references to literature justifying the inclusion of particular items.

Assessment of the usefulness of recommendations for reporting is dependent on a well-designed and effectively conducted evaluation. The workshop participants proposed a 3-pronged approach to determine usefulness and implementation of these recommendations.

First, further comments should be incorporated into revisions of the checklist, to ensure its usefulness to journal reviewers and editors. The US Food and Drug Administration (FDA) receives and reviews petitions and applications for approval of regulated products and/or their labeling. The FDA's Center for Food Safety and Applied Nutrition is now receiving applications that use results of meta-analyses in support of the requested action. The revised checklist should be tested during the review of an application. One might randomly assign FDA reviewers who encounter systematic reviews of observational studies to use the checklist or not. Since the requirements for reporting for regulatory purposes might not completely coincide with those in the checklist and since sample size (the number of formal systematic reviews received by the FDA) might be small, this evaluation should document any potential incompatibility between requirements for regulatory reporting and the checklist.

Second, we will work with the Cochrane Collaboration to promote the use of these recommendations by Cochrane collaborative review groups.49 Members of the Cochrane Collaboration are involved routinely in performing systematic reviews. Some are now incorporating nonrandomized studies out of necessity. A trial of use of the checklist could be compared with the FDA experience.

Third, an evaluation of the checklist by authors, reviewers, readers, and editors could compare objective measures of the quality of articles written with and without the formal use of the guidelines. A challenge to the use of quality measures would be arriving at a valid measure of quality. A more important end point for trials in journals is process measures. Questions of interest include whether the use of the checklist makes preparation and evaluation of manuscripts easier or is otherwise helpful. Again, defining the constructs of interest present crucial challenges to this research.

Less formal evaluations, based on comments from users in any of the above groups, would certainly be helpful, as well. One would need to be concerned about contamination of the control groups when evaluating the checklist, as journals, for example, might adopt the checklist even in the absence of evidence of its efficacy from randomized trials.

In conclusion, the conference participants noted that meta-analyses are themselves observational studies, even when applied to RCTs.50 If a role for meta-analyses of observational studies in setting policy is to be achieved,51 standards of reporting must be maintained to allow proper evaluation of the quality and completeness of meta-analyses.

Acknowledgments

The Meta-analysis Of Observational Studies in Epidemiology (MOOSE) Group: Centers for Disease Control and Prevention, Atlanta, Ga: Gary Jeng, PhD, Rob Lyerla, PhD, Thomas Peterman, MD, Donna F. Stroup, PhD, MSc, Stephen B. Thacker, MD, MSc, and G. David Williamson, PhD; University of Pennsylvania School of Medicine, Philadelphia: Jesse A. Berlin, ScD; RAND Corporation, Santa Monica, Calif: Sally C. Morton, PhD; Stanford University, Stanford, Calif: Ingram Olkin, PhD; University of California, San Francisco, and JAMA, Chicago, Ill: Drummond Rennie, MD; Thomas C. Chalmers Centre for Systematic Reviews, Children's Hospital of Eastern Ontario Research Institute, Ottawa: David Moher, MSc; Michigan State University, East Lansing: Betsy J. Becker, PhD; Georgia State University, Atlanta: Theresa Ann Sipe, PhD; Centre for Statistics in Medicine, Oxford, England: Douglas Altman, PhD; Memorial Sloan-Kettering Cancer Center, New York, NY: Colin Begg, PhD; Hamilton Wentworth Regional Public Health Department, Hamilton, Ontario: Larry Chambers, PhD; Harvard Medical School and Brigham and Women's Hospital, Boston, Mass: Graham Colditz, PhD; University of Maryland, Baltimore: Kay Dickersin, PhD; AT&T Labs, Murray Hill, NJ: William DuMouchel, PhD; University of Colorado, Denver: Karen Kafadar, PhD; Cleveland, Ohio: Tom Lang, MA; Food and Drug Administration, Washington, DC: Lynn Larsen, PhD; University of Minnesota, Minneapolis: Thomas A. Louis, PhD; University of British Columbia, Vancouver: Parminder Raina, PhD; University of Pittsburgh, Pittsburgh, Pa: Allan Sampson, PhD; Family Health International, Research Triangle Park, NC: Ken Schulz, PhD, MSc; and Institute of Medicine, Washington, DC: Mike Stoto, PhD.

Acknowledgment: We thank Susan Eastwood, MA, University of California, San Francisco; Christine Friedenreich, PhD, Alberta Cancer Board and University of Calgary, Calgary; Sander Greenland, DrPH, Cstat, MA, MS, University of California, Los Angeles; Richard Horton, MBBCh, MD, Lancet, London, England; Diana Petitti, MD, MPH, Southern California Kaiser Permanente Medical Care Program, Pasadena; Duncan Saunders, MBBCh, PhD, University of Alberta, Edmonton; and Stephen D. Walter, PhD, McMaster University, Hamilton, Ontario, for their advice and substantive comments on an earlier draft of this article. In addition, we thank Barbara McDonnell, CDC, for managing complex logistics of this project.

Corresponding Author and Reprints: Donna F. Stroup, PhD, MSc, Centers for Disease Control and Prevention, 1600 Clifton Rd NE, Mail Stop C08, Atlanta, GA 30333 (e-mail: dfs2{at}cdc.gov).

References

  1. 1.
  2. 2.
  3. 3.
  4. 4.
  5. 5.
  6. 6.
  7. 7.
  8. 8.
  9. 9.
  10. 10.
  11. 11.
  12. 12.
  13. 13.
  14. 14.
  15. 15.
  16. 16.
  17. 17.
  18. 18.
  19. 19.
  20. 20.
  21. 21.
  22. 22.
  23. 23.
  24. 24.
  25. 25.
  26. 26.
  27. 27.
  28. 28.
  29. 29.
  30. 30.
  31. 31.
  32. 32.
  33. 33.
  34. 34.
  35. 35.
  36. 36.
  37. 37.
  38. 38.
  39. 39.
  40. 40.
  41. 41.
  42. 42.
  43. 43.
  44. 44.
  45. 45.
  46. 46.
  47. 47.
  48. 48.
  49. 49.
  50. 50.
  51. 51.

Related article

« Previous | Next Article »Table of Contents

More in JAMA & Archives Journals