You are seeing this message because your Web browser does not support basic Web standards. Find out more about why this message is appearing and what you can do to make your experience on this site better.


ABOUT JAMA
Advanced Search

Welcome   | My Account | E-mail Alerts | Access Rights | Sign In


  Vol. 285 No. 11, March 21, 2001 TABLE OF CONTENTS
  JAMA
  •  Online Features
  Toward Optimal Laboratory Use
 This Article
 •Abstract
 •PDF
 •Send to a friend
 • Save in My Folder
 •Save to citation manager
 •Permissions
 Citing Articles
 •Citation map
 •Citing articles on HighWire
 •Citing articles on Web of Science (291)
 •Contact me when this article is cited
 Related Content
 •Related letters
 •Related article
 •Similar articles in JAMA
 Topic Collections
 •Oncology
 •Cervical Cancer
 •Pathology & Laboratory Medicine
 •Statistics and Research Methods
 •Alert me on articles by topic
 Social Bookmarking
  Add to CiteULike Add to Connotea Add to Del.icio.us Add to Digg Add to Reddit Add to Technorati Add to Twitter What's this?

Interobserver Reproducibility of Cervical Cytologic and Histologic Interpretations

Realistic Estimates From the ASCUS-LSIL Triage Study

Mark H. Stoler, MD; Mark Schiffman, MD,MPH; for the Atypical Squamous Cells of Undetermined Significance–Low-grade Squamous Intraepithelial Lesion Triage Study (ALTS) Group

JAMA. 2001;285:1500-1505.

ABSTRACT

Context  Despite a critical presumption of reliability, standards of interpathologist agreement have not been well defined for interpretation of cervical pathology specimens.

Objective  To determine the reproducibility of cytologic, colposcopic histologic, and loop electrosurgical excision procedure (LEEP) histologic cervical specimen interpretations among multiple well-trained observers.

Design and Setting  The Atypical Squamous Cells of Undetermined Significance–Low-grade Squamous Intraepithelial Lesion (ASCUS-LSIL) Triage Study (ALTS), an ongoing US multicenter clinical trial.

Subjects  From women enrolled in ALTS during 1996-1998, 4948 monolayer cytologic slides, 2237 colposcopic biopsies, and 535 LEEP specimens were interpreted by 7 clinical center and 4 Pathology Quality Control Group (QC) pathologists.

Main Outcome Measures  {kappa} Values calculated for comparison of the original clinical center interpretation and the first QC reviewer's masked interpretation of specimens.

Results  For all 3 specimen types, the clinical center pathologists rendered significantly more severe interpretations than did reviewing QC pathologists. The reproducibility of monolayer cytologic interpretations was moderate ({kappa} = 0.46; 95% confidence interval [CI], 0.44-0.48) and equivalent to the reproducibility of punch biopsy histopathologic interpretations ({kappa} = 0.46; 95% CI, 0.43-0.49) and LEEP histopathologic interpretations ({kappa} = 0.49; 95% CI, 0.44-0.55). The lack of reproducibility of histopathology was most evident for less severe interpretations.

Conclusions  Interpretive variability is substantial for all types of cervical specimens. Histopathology of cervical biopsies is not more reproducible than monolayer cytology, and even the interpretation of LEEP results is variable. Given the degree of irreproducibility that exists among well-trained pathologists, realistic performance expectations should guide use of their interpretations.



INTRODUCTION
 Jump to Section
 •Top
 •Introduction
 •Methods
 •Results
 •Comment
 •Author information
 •References

The interpretive reproducibility of cervical cytology and histopathology is critical to cervical cancer prevention programs. There is a societal presumption of high reproducibility of cytologic screening. In the medical community, histopathologic interpretations are generally considered the reference standard upon which treatment of cervical disease is based. No test or interpretation is perfect, and both society and the medical profession may have excessively high expectations. In fact, realistic standards of interpathologist agreement for cytology and histology have not been well defined by rigorous studies of large series of specimens.

Cytology screening interpretations define which women require focused clinical attention. Organized screening programs based on periodic conventional Papanicolaou (Pap) smears are successful in greatly reducing cervical cancer deaths.1 In recent years, however, cervical cytologic screening has come under attack because of a growing awareness of the test's imperfections, including irreproducibility and false negativity.2-12 Problems with specimen collection and preparation may be partly ameliorated with monolayer preparations.1, 13 Ultimately, cervical cytologic screening is entirely predicated on the combined judgment of the cytotechnologist and pathologist. Although clinicians vary in their management of women with abnormal cytology, different diagnostic interpretations of any given cytologic specimen may lead to radically different patient management.

Clinicians often evaluate abnormal cytologic findings using colposcopy, with guided biopsies of visually abnormal areas. Colposcopy itself is not well standardized14-15 and the reproducibility of biopsy interpretation might actually be as variable and problematic as cytologic interpretation.6, 16-27 Previous studies on the reproducibility of cervical preneoplasia interpretation have been limited in size and for the most part statistically inadequate.

Many clinicians now treat significant intraepithelial neoplasia, either proven or suspected, using the loop electrosurgical excision procedure (LEEP) to remove the cervical transformation zone. This procedure produces a large histology specimen that is processed similarly to a cone biopsy, oriented as a clock face in 12 sections. The resultant pathology report further defines the grade of neoplasia and guides the patient's subsequent management. Despite the widespread use of LEEP for the treatment of substantial cervical neoplasia, the reproducibility of LEEP histopathology has not been rigorously evaluated.

We evaluated the reproducibility of cytology, biopsy histopathology, and LEEP histopathology among multiple, well-trained observers in the context of an ongoing multicenter clinical trial.


METHODS
 Jump to Section
 •Top
 •Introduction
 •Methods
 •Results
 •Comment
 •Author information
 •References

Population

The Atypical Squamous Cells of Undetermined Significance–Low-grade Squamous Intraepithelial Lesion (ASCUS-LSIL) Triage Study (ALTS) is a multicenter randomized clinical trial with 3 study arms designed to evaluate the management of mildly abnormal cytology findings by 3 alternative methods: immediate colposcopy, conservative cytologic follow-up, or triage by human papillomavirus (HPV) DNA testing.28 At enrollment into ALTS during 1996-1998, women referred for ASCUS or LSIL conventional Pap smears had a repeat cytologic interpretation on monolayer cytology (ThinPrep, Cytyc, Boxborough, Mass). Women triaged to colposcopy as required by the study protocol underwent biopsy if lesions were visible upon application of acetic acid. Histologically confirmed cervical intraepithelial neoplasia (CIN) grades 2 to 3 was treated by LEEP. The few cases of prevalent, invasive carcinoma were treated more extensively as appropriate. A full description of the study is available elsewhere.28

During enrollment, the ALTS clinical centers interpreted 4948 monolayer cytology slides, 2237 biopsies (taking only the most severe result for each woman, as described below), and 535 LEEP specimens that were independently reviewed by the Pathology Quality Control Group (QC). There were 1 to 2 staff pathologists per clinical center (7 in all), who worked independently. No conferences were held regarding cases. The initial QC review was randomly assigned to 1 of the 4 QC pathologists. The QC review was masked to the clinical center interpretation and all other test results. The present analysis is based on the comparison of clinical center to first QC interpretations. When the first QC reviewer disagreed with the original clinical center interpretation, additional reviews were performed.28 While the QC algorithms and panel interpretations were used for ALTS to define disease end points, the patients were managed by the original clinical center interpretations unless CIN3 or cancer was suspected, in which case the final QC opinion was unmasked.

For cytologic specimens, cytotechnologists' screening marks were not removed during rescreening at the QC center at Johns Hopkins University. Quality control histology reviews were performed on all the original slides interpreted at the clinical centers. No recuts or substitute slides were used. Interpretations were coded using the Bethesda System squamous intraepithelial lesion (SIL) categories. Histologic and LEEP interpretations were categorized by severity of overall interpretation for a case rather than by individual block, analogous to actual clinical management. Thus, no woman contributed more than 1 interpretation to the data tables for a given specimen type. Analyses were repeated to look for trends in subgroups. These included dividing the data by each individual QC pathologist, dividing the data by each of 4 clinical centers, and analyzing the data over time to see if interpretative reproducibility varied over the period of enrollment. Finally, the results of HPV testing were briefly considered in association with the cytology and histology interpretations.28

Statistical Analysis

Reactive, reparative, and inflammatory changes were grouped as negative for this analysis. The very few invasive cancer interpretations were included in the high-grade intraepithelial group. For histology, the results were combined into cytology-like (Bethesda System SIL) groupings although the CIN terminology was retained (eg, CIN2 or 3 is analogous to high-grade squamous intraepithelial lesion [HSIL]), to permit comparisons across equal-sized data tables with cytology.

{kappa} Values were calculated to test for reproducibility while taking chance agreement into account.29-30 We composed 4 x 4 diagnostic tables, as well as more condensed 2 x 2 diagnostic tables, using each possible binary cutpoint (ie, negative vs >=ASCUS, <=ASCUS vs >=LSIL, and <=LSIL vs >=HSIL). Both unweighted and weighted {kappa} values were considered. Weighted values, with weights inversely proportional to the number of categories of distance between 2 ratings, are sensitive to severe disagreements as opposed to 1-category disagreements. Specifically, the weights were 1.00 for data cells on the diagonal (ie, exact agreement), 0.67 for cells adjacent to the diagonal, 0.33 for cells 2 units from the diagonal, and 0 for cells 3 units from the diagonal. Weighted {kappa} values are higher than unweighted values when disagreements are common but tend to be close to the diagonal. As a rough guide, a {kappa} value of less than 0 indicates poor agreement, 0 to 0.2 represents slight agreement, 0.2 to 0.4 is fair agreement, 0.4 to 0.6 indicates moderate agreement, 0.6 to 0.8 shows substantial agreement, and 0.8 to 1.0 is almost perfect agreement.

However, {kappa} values were compared cautiously, for 2 reasons. First, the interpretation of the {kappa} statistic is affected by large differences in disease prevalence.31 When disease prevalence is very high or very low (rather than intermediate), the {kappa} values are decreased relative to the percentage of agreement, which does not take chance agreement into account. The presentation notes when this might affect interpretation. In general, as measured by the {kappa} statistic, the rates of agreement observed in this ALTS referral population would tend to be lower (relative to percentage of agreement) in a screening population in which disease is rare.31 Second, {kappa} statistics vary by the number of diagnostic categories. Hence, {kappa} statistics were computed for 4 x 4 tables and 2 x 2 tables; they cannot be directly compared. For the 4 x 4 tables, the symmetry {chi}2 statistic was used to compare the severity of clinical center vs QC interpretations. Analogously, for the 2 x 2 tables, the McNemar statistic was used.


RESULTS
 Jump to Section
 •Top
 •Introduction
 •Methods
 •Results
 •Comment
 •Author information
 •References

The primary comparison data for each specimen type are listed in Table 1 for monolayer cytology, cervical biopsies, and LEEP specimens, respectively. For each of the 3 data sets, the shaded diagonal represents the proportion of concordant specimens. The boxed data cells indicate the most discordant comparisons. There was only moderate interobserver reproducibility, regardless of specimen type. The {kappa} statistics are compared in Table 2. The modest increase in {kappa} values based on weighting suggests that most disagreements were relatively close. There was significant asymmetry in each class of comparison. This suggested a systematic pattern of disagreement between the clinical center and the QC pathologists, with the QC pathologists tending to give less severe interpretations for all 3 types of specimens.


View this table:
[in this window]
[in a new window]
Table 1. Interpretations: Original vs First Quality Control Group Reviewer*



View this table:
[in this window]
[in a new window]
Table 2. Summary Data for Original vs First Quality Control Group Diagnosis*


Not surprisingly, the greatest source of disagreement in monolayer cytology results involved ASCUS interpretations (Table 1). Of 1473 original interpretations of ASCUS, the QC reviewer concurred in only 43.0%, rendering less severe readings for most of the rest. Another significant source of variation included HSILs in which concordance was only 47.1%, with 27.0% and 22.6% of the remainder interpreted as LSIL or ASCUS by the QC reviewers respectively.

Histologic interpretative reproducibility on biopsies was no better overall than cytologic reproducibility. However, histologic variability derived largely from disagreements about grade CIN1 (including koilocytotic atypia). An interpretation of CIN1 by the clinical center was corroborated by the QC group in only 42.6% of 887 biopsies. Virtually an equal proportion of originally diagnosed CIN1 biopsies (41.0%) were intepreted as negative by the pathology QC group.

An equivocal histologic interpretation (ie, a histologic equivalent of ASCUS) was rarely used, although ALTS is a study in which originally equivocal cytologic interpretations predominate. Most of these problematic cases were due to sample limitations (eg, quality of staining, crush or thermal artifact) that caused difficulty in making subtle distinctions between SIL and normal/reactive. Clinical center pathologists rendered an equivocal histologic interpretation in only 8.2% of 2237 biopsies; similarly, the QC pathologists used an equivocal categorization for biopsy interpretation in only 3.5%. The extremes of interpretation, ie, biopsies categorized as negative or high grade (>=CIN2), demonstrated good concordance in 90.8% and 76.9% of cases diagnosed by the clinical centers, respectively.

Histologic reproducibility based on LEEP was not better than for other specimen types. Of note, LEEP specimens were much more likely than either cytology or biopsies to represent grades of CIN2 or higher. Despite smaller and skewed numbers of specimens, it was observed that the interpretation of CIN1 was still poorly reproduced, with only 43.8% of original interpretations being corroborated by the QC group.

More condensed 2 x 2 diagnostic tables were composed using each possible binary cutpoint (ie, negative vs >=ASCUS, <=ASCUS vs >=LSIL, and <=LSIL vs >=HSIL). The {kappa} statistics for these are shown in Table 3. Cytologic interpretations equaling or exceeding HSIL were uncommon enough to merit caution over comparisons of this {kappa} statistic to others in the table. Cytologic interpretations of LSIL were more reproducible than histologic interpretations of comparable severity. High-grade colpobiopsy and LEEP biopsy interpretations showed substantial agreement.


View this table:
[in this window]
[in a new window]
Table 3. Summary of Data for Original vs First Quality Control Group Diagnoses, When Divided Into "Disease" vs "Non-Disease" at Different Binary Cutpoints*


Subgroup analyses by individual QC pathologist or by individual clinical center did not significantly alter any of the results or reveal any significant trends over time.


COMMENT
 Jump to Section
 •Top
 •Introduction
 •Methods
 •Results
 •Comment
 •Author information
 •References

Compared to prior studies, the data available from the ALTS trial are significant for the size of the data set and for the ability to compare common cytologic and histopathologic findings directly. This analysis shows that the interobserver reproducibility of cytologic and histologic interpretations is similar and only moderate. Finer distinctions may be difficult when broader ones are only moderately successful. Nonetheless, an additional study is being pursued to evaluate whether CIN2 can be reproducibly separated from CIN3 within the high-grade group, particularly for histology, where the Bethesda SIL terminology is less accepted. This issue is important for case definition in studies such as the ALTS trial. Obviously, the distinction also has management implications for the minority of clinicians who believe that CIN2 is a reproducible category, not a true cancer precursor, and can be managed differently than CIN3.

For monolayer cytology, the greatest source of variability between the clinical centers and the QC pathologists was the interpretation of ASCUS, which the QC group interpreted as negative in 38.6% of cases. These cytologic smears had a 37% HPV positivity rate much like the HPV positivity rate of 31% among the concordant negative cases (data not shown). Thus, if HPV testing is used as an independent adjudicator of this process, QC-revised interpretations seem likely to be accurate. The other major source of interpretative variability for cytology was the fraction of HSIL clinical center interpretations that were reviewed as either LSIL or ASCUS by QC. These represent different problematic sets of cases. Transitions of HSIL to LSIL undoubtedly reflect the difficulty referenced in the literature of trying to separate mild from moderate dysplasia. On the other hand, transitions from HSIL to ASCUS represent the current controversy surrounding small atypical cells of immature metaplastic type and whether these hard-to-interpret cells represent an entity distinct from HSIL.10, 32

On biopsy, the {kappa} values were remarkably similar in magnitude to the cytology data. However, the overwhelming source of interpretative variability was the marked tendency of the QC group to review clinical center CIN1 biopsy interpretations as negative. This reflects problems in implementation of criteria for recognizing HPV cytopathic effect in tissue. Although CIN1 is a frequently overcalled interpretation in cervical pathology practice, most of the CIN1 cases reviewed by QC as negative were HPV DNA-positive on the correlated monolayer cytology, suggesting that these disagreements may have been excessive (data not shown). The data for LEEP were similar in this trend. Further direct HPV testing of these samples after microdissection may help clarify the accuracy of the revised interpretations, compared to correlations using HPV tests derived from the temporally related monolayer cytology specimens.

Neither the clinical center nor the QC group was free from important error. The final QC interpretations considered both clinical center and first QC interpretations, as well as additional QC reviews in case of discrepancy. Notably, there was no effect of using the final interpretation on the clinical center agreement rates for LEEP, there was an intermediate improvement on biopsy, and there was the most improvement on cytology (data not shown). However, significant variability was still present and the reasons for these trends are not known.

Two points mitigate the appearance of mediocre reproducibility. First, it is possible that the ALTS population provided slightly heightened diagnostic challenges, compared with a typical pathology case load. All of the women were referred for a mild cytologic abnormality. Women with easily reproducible, completely negative cytology results or with obviously high-grade results were underrepresented. Secondly, the reproducibility of high-grade colpobiopsies and LEEP biopsies was substantial, which has important treatment implications. Histologic confirmation of low-grade lesions is more suspect, suggesting that the management of many women is subject to chance. Interestingly, the cytologic diagnosis of LSIL appeared to be more reproducible than the histologic diagnosis. We speculate that these differences are based on the reliability of the criteria applied to individual cells in excellent cytologic preparations compared with the rigor with which these same criteria are applied in histologic sections.

Caveats aside, the data reported in this study probably underestimate the level of variability between groups of pathologists nationally. Most of the pathologists in the trial are academic gynecologic pathologists with a research interest in cervical cancer precursor interpretation and management. In contrast, in many clinical practices, Pap smears are often read in large commercial laboratories whereas biopsies are read locally by community hospital pathologists. Cytohistologic correlation opportunities are decreasing with this unfortunate economic reorganization of cytopathology practice. This problem can potentially be addressed in the future by using the ALTS data set to clarify criteria, revise classification systems, and implement educational tools to help improve interpretative reproducibility within the pathology community. In future works we will focus on whether ASCUS is a useful interpretative entity, what constitutes a CIN1 histologic pattern, and clarification of the variations of HSIL including atypical immature squamous metaplastic cells.

Finally, the need for reproducible interpretations is self-evident. Beyond clinical needs, today's medicolegal environment requires adequate documentation of what is and is not diagnostically possible. Unrealistic expectations of accuracy, reproducibility, and truth determination fuel many of these malpractice actions. It is possible to achieve moderate to substantial reproducibility in a highly refined environment. However, substantial does not equal perfect and this should provide some basis for understanding and defense in cases based on differences of expert opinion. Indeed, if experts were required to present data on their personal levels of intraobserver and interobserver reproducibility (ideally developed in a standardized objective manner with independent HPV adjudication), then the testimony of many so-called experts would be easier to evaluate. In this regard, the results of ALTS could stand as a benchmark for the current state of realistic interpretive reproducibility.


AUTHOR INFORMATION
 Jump to Section
 •Top
 •Introduction
 •Methods
 •Results
 •Comment
 •Author information
 •References

Author Contributions: Study concept and design, acquisition of data, analysis and interpretation of data, drafting of the manuscript, critical revision of the manuscript for important intellectual content, study supervision: Stoler, Schiffman.

Statistical expertise, obtained funding, administrative, technical, or material support: Schiffman.

Affiliations of the ALTS Group: National Cancer Institute, Bethesda, Md: D. Solomon, M. Schiffman, R. Tarone; Clinical Centers: University of Alabama, Birmingham: E. E. Partridge, L. Kilgore, S. Hester; University of Oklahoma, Oklahoma City: J. L. Walker, G. A. Johnson, A.Yadack; Magee-Womens Hospital of the University of Pittsburgh Medical Center Health System, Pittsburgh, Pa: R. S. Guido, K. McIntyre-Seltman, R. P. Edwards, J. Gruss; University of Washington, Seattle: N. B. Kiviat, L. Koutsky, C. Mao, J. M. Haug; Colposcopy Quality Control Group: D. Ferris, Medical College of Georgia, Augusta; J. T. Cox, University of California at Santa Barbara; L. Burke, Beth Israel Deaconess Medical Center Hospital, Boston, Mass; HPV Quality Control Group: C. M. Wheeler and C. Peyton-Goodall, University of New Mexico Health Sciences Center, Albuquerque; M. M. Manos, Kaiser Permanente, Oakland, Calif; Pathology Quality Control Group: R. J. Kurman, D. L. Rosenthal, and M. E. Sherman, Johns Hopkins Hospital, Baltimore, Md; M. H. Stoler, University of Virginia Health Science Center, Charlottesville; Cost Utility Analysis Group: D. M. Harper, Dartmouth Hitchcock Medical Center, Lebanon, NH; Westat, Coordinating Unit, Rockville, Md: J. Rosenthal, M. Dunn, J. Quarantillo, D. Robinson; Information Management Services, Silver Spring, Md: L. Saxon.

Funding/Support: This work was supported by Public Health Service grants CN55153, CN55154, CN55155, CN55156, CN55157, CN15518, CN55159, and CN55105 from the National Cancer Institute. The following companies have provided support in the form of equipment or supplies at no cost or reduced cost: Cytyc Corporation, Boxborough, Mass; DenVu, Tucson, Ariz; Digene, Gaithersburg, Md; National Testing Laboratories, Fenton, Mo; and TriPath Imaging Inc, Elon, NC.

Corresponding Author and Reprints: Mark H. Stoler, MD, University of Virginia Health System, Division of Surgical Pathology and Cytopathology, Box 800214, Charlottesville, VA 22908 (e-mail: mhs2e{at}virginia.edu).

Author Affiliations: University of Virginia Health System, Charlottesville (Dr Stoler); and National Cancer Institute, Bethesda, Md (Dr Schiffman).


REFERENCES
 Jump to Section
 •Top
 •Introduction
 •Methods
 •Results
 •Comment
 •Author information
 •References

1. Stoler MH. Advances in cervical screening technology. Mod Pathol. 2000;13:275-284. ISI
2. Confortini M, Biggeri A, Cariaggi MP, et al. Intralaboratory reproducibility in cervical cytology: results of the application of a 100-slide set. Acta Cytol. 1993;37:49-54. ISI | PUBMED
3. Evans DMD, Shelley G, Cleary B, Baldwin Y. Observer variation and quality control of cytodiagnosis. J Clin Pathol. 1974;27:945-950. FREE FULL TEXT
4. Horn PL, Lowell DM, LiVolsi VA, Boyle CA. Reproducibility of the cytologic diagnosis of human papillomavirus infection. Acta Cytol. 1985;29:692-694. PUBMED
5. Klinkhamer PJ, Vooijs GP, de Haan AF. Intraobserver and interobserver variability in the diagnosis of epithelial abnormalities in cervical smears. Acta Cytol. 1988;32:794-800. ISI | PUBMED
6. Lambourne A, Lederer H. Effects of observer variation in population screening for cervical carcinoma. J Clin Pathol. 1973;26:564-569. FREE FULL TEXT
7. Raab SS, Snider TE, Potts SA, et al. Atypical glandular cells of undetermined significance: diagnostic accuracy and interobserver variability using select cytologic criteria. Am J Clin Pathol. 1997;107:299-307. PUBMED
8. Raab SS, Geisinger KR, Silverman JF, Thomas PA, Stanley MW. Interobserver variability of a Papanicolaou smear diagnosis of atypical glandular cells of undetermined significance. Am J Clin Pathol. 1998;110:653-659. PUBMED
9. Sherman ME, Schiffman MH, Lorincz AT, et al. Toward objective quality assurance in cervical cytopathology: correlation of cytopathologic diagnoses with detection of high-risk human papillomavirus types. Am J Clin Pathol. 1994;102:182-187. ISI | PUBMED
10. Stoler MH. Does every little cell count? don't "ASCUS." Cancer. 1999;87:45-47. PUBMED
11. Young NA, Naryshkin S, Atkinson BF, et al. Interobserver variability of cervical smears with squamous-cell abnormalities: a Philadelphia study. Diagn Cytopathol. 1994;11:352-357. ISI | PUBMED
12. Yobs AR, Plott AE, Hicklin MD, et al. Retrospective evaluation of gynecologic cytodiagnosis, II: interlaboratory reproducibility as shown in rescreening large consecutive samples of reported cases. Acta Cytol. 1987;31:900-910. ISI | PUBMED
13. Crum CP, Genest DR, Krane JF, et al. Subclassifying atypical squamous cells in Thin-Prep cervical cytology correlates with detection of high-risk human papillomavirus DNA. Am J Clin Pathol. 1999;112:384-390. ISI | PUBMED
14. Ferris DG, Cox JT, Burke L, et al. Colposcopy quality control: establishing colposcopy criterion standards for the ALTS trial using cervigrams. J Lower Genital Tract Dis. 1998;2:195-203.
15. Sellors JW, Nieminen P, Vesterinen E, Paavonen J. Observer variability in the scoring of colpophotographs. Obstet Gynecol. 1990;76:1006-1008. ISI | PUBMED
16. Cocker J, Fox H, Langley FA. Consistency in the histological diagnosis of epithelial abnormalities of the cervix uteri. J Clin Pathol. 1968;21:67-70. PUBMED
17. Creagh T, Bridger JE, Kupek E, Fish DE, Martin-Bates E, Wilkins MJ. Pathologist variation in reporting cervical borderline epithelial abnormalities and cervical intraepithelial neoplasia. J Clin Pathol. 1995;48:59-60. FREE FULL TEXT
18. de Vet HC, Knipschild PG, Schouten HJ, et al. Interobserver variation in histopathological grading of cervical dysplasia. J Clin Epidemiol. 1990;43:1395-1398. FULL TEXT | ISI | PUBMED
19. de Vet HC, Knipschild PG, Schouten HJ, et al. Sources of interobserver variation in histopathological grading of cervical dysplasia. J Clin Epidemiol. 1992;45:785-790. FULL TEXT | ISI | PUBMED
20. de Vet HC, Koudstaal J, Kwee WS, Willebrand D, Arends JW. Efforts to improve interobserver agreement in histopathological grading. J Clin Epidemiol. 1995;48:869-873. FULL TEXT | PUBMED
21. Genest DR, Stein L, Cibas E, Sheets E, Zitz JC, Crum CP. A binary (Bethesda) system for classifying cervical cancer precursors: criteria, reproducibility, and viral correlates. Hum Pathol. 1993;24:730-736. FULL TEXT | ISI | PUBMED
22. Ismail SM, Colclough AB, Dinnen JS, et al. Observer variation in histopathological diagnosis and grading of cervical intraepithelial neoplasia. BMJ. 1989;298:707-710. FREE FULL TEXT
23. McCluggage WG, Bharucha H, Caughley LM, et al. Interobserver variation in the reporting of cervical colposcopic biopsy specimens: comparison of grading systems. J Clin Pathol. 1996;49:833-835. FREE FULL TEXT
24. Ringstead J, Amtrup C, Baunsgaard P, et al. Reliability of histo-pathological diagnosis of squamous epithelial changes of the uterine cervix. Acta Pathol Microbiol Scand. 1978;86:273-278.
25. Robertson AJ, Anderson JM, Beck JS, et al. Observer variation in histopathological reporting of cervical biopsy specimens. J Clin Pathol. 1989;42:231-238. FREE FULL TEXT
26. Siegler EE. Microdiagnosis of carcinoma in situ of the uterine cervix: a comparative study of pathologists' diagnoses. Cancer. 1956;9:463-469. PUBMED
27. Seybolt JF, Johnson WD. Cervical cytodiagnostic problems: a survey. Am J Obstet Gynecol. 1971;109:1089-1103. PUBMED
28. Schiffman MH, Adrianza ME for the ALTS Group. The ASCUS-LSIL Triage Study (ALTS): design, methods, and characteristics of trial participants. Acta Cytol. 2000;44:726-742. ISI | PUBMED
29. Fleiss JL. Statistical Methods for Rates and Proportions. 2nd ed. New York, NY: John Wiley & Sons; 1981:218-235.
30. Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. 1977;33:159-174. FULL TEXT | ISI | PUBMED
31. Feinstein AR, Cicchetti DV. High agreement but low kappa, I: the problems of two paradoxes. J Clin Epidemiol. 1990;43:543-549. FULL TEXT | ISI | PUBMED
32. Sherman ME, Tabbara SO, Scott DR, et al. "ASCUS, rule out HSIL": cytologic features, histologic correlates, and human papillomavirus detection. Mod Pathol. 1999;12:335-342. ISI | PUBMED

Toward Optimal Laboratory Use Section Editor: David H. Mark, MD, MPH, Contributing Editor.



Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati   Add to Twitter Twitter     What's this?

RELATED LETTERS

Interobserver Agreement About Cervical Cytologic and Histologic Diagnosis
, , and
JAMA. ;285():2855-2856.
FULL TEXT  

Benefits and Costs of Using HPV Testing to Screen for Cervical Cancer
, , , , , , , , and
JAMA. ;287():2372-2381.
FULL TEXT  

RELATED ARTICLE

Implications of Low Diagnostic Reproducibility of Cervical Cytologic and Histologic Diagnoses

JAMA. ;285():1506-1508.
FULL TEXT  


THIS ARTICLE HAS BEEN CITED BY OTHER ARTICLES

Evaluation of an ELISA for p16INK4a as a Screening Test for Cervical Cancer
Balasubramanian et al.
Cancer Epidemiol. Biomarkers Prev. 2009;18:3008-3017.
ABSTRACT | FULL TEXT  

Screening and Prevention Methods for Cervical Cancer
Schiffman and Solomon
JAMA 2009;302:1809-1810.
FULL TEXT  

Human Papillomavirus Testing for Cervical Cancer Screening: Results From a 6-Year Prospective Study in Rural China
Shi et al.
Am J Epidemiol 2009;170:708-716.
ABSTRACT | FULL TEXT  

A Randomized Controlled Trial of Standard vs Endobronchial Ultrasonography-Guided Transbronchial Needle Aspiration in Patients With Suspected Sarcoidosis
Tremblay et al.
Chest 2009;136:340-346.
ABSTRACT | FULL TEXT  

The Reproducibility of CIN Diagnoses Among Different Pathologists: Data From Histology Reviews From a Multicenter Randomized Study
Palma et al.
Am J Clin Pathol 2009;132:125-132.
ABSTRACT | FULL TEXT  

Laboratory Medicine Quality Indicators: A Review of the Literature
Shahangian and Snyder
Am J Clin Pathol 2009;131:418-431.
ABSTRACT | FULL TEXT  

Human Papillomavirus Cofactors by Disease Progression and Human Papillomavirus Types in the Study to Understand Cervical Cancer Early Endpoints and Determinants
Wang et al.
Cancer Epidemiol. Biomarkers Prev. 2009;18:113-120.
ABSTRACT | FULL TEXT  

Expression Pattern and Subcellular Localization of Human Papillomavirus Minor Capsid Protein L2
Lin et al.
Am. J. Pathol. 2009;174:136-143.
ABSTRACT | FULL TEXT  

Long term predictive values of cytology and human papillomavirus testing in cervical cancer screening: joint European cohort study
Dillner et al.
BMJ 2008;337:a1754-a1754.
ABSTRACT | FULL TEXT  

Descriptive Evidence That Risk Profiles for Cervical Intraepithelial Neoplasia 1, 2, and 3 Are Unique
Belinson et al.
Cancer Epidemiol. Biomarkers Prev. 2008;17:2350-2355.
ABSTRACT | FULL TEXT  

Persistent Human Papillomavirus Infection and Cervical Neoplasia: A Systematic Review and Meta-Analysis
Koshiol et al.
Am J Epidemiol 2008;168:123-137.
ABSTRACT | FULL TEXT  

A Comparison of Linear Array and Hybrid Capture 2 for Detection of Carcinogenic Human Papillomavirus and Cervical Precancer in ASCUS-LSIL Triage Study
Gravitt et al.
Cancer Epidemiol. Biomarkers Prev. 2008;17:1248-1254.
ABSTRACT | FULL TEXT  

High-risk human papillomavirus is present in cytologically false-negative smears: an analysis of "normal" smears preceding CIN2/3
Bulk et al.
J. Clin. Pathol. 2008;61:385-389.
ABSTRACT | FULL TEXT  

Reproducibility and reliability of the ankle--brachial index as assessed by vascular experts, family physicians and nurses
Holland-Letz et al.
Vasc Med 2007;12:105-112.
ABSTRACT  

Our approach to squamous intraepithelial lesions of the uterine cervix
Kalof and Cooper
J. Clin. Pathol. 2007;60:449-455.
FULL TEXT  

Mouthwash as a Low-Cost and Safe Specimen Transport Medium for Human Papillomavirus DNA Testing of Cervicovaginal Specimens
Castle et al.
Cancer Epidemiol. Biomarkers Prev. 2007;16:840-843.
ABSTRACT | FULL TEXT  

American Cancer Society Guideline for Human Papillomavirus (HPV) Vaccine Use to Prevent Cervical Cancer and Its Precursors
Saslow et al.
CA Cancer J Clin 2007;57:7-28.
ABSTRACT | FULL TEXT  

Dynamic behavioural interpretation of cervical intraepithelial neoplasia with molecular biomarkers
Baak et al.
J. Clin. Pathol. 2006;59:1017-1028.
ABSTRACT | FULL TEXT  

Smokers at higher risk for undetected antibody for oncogenic human papillomavirus type 16 infection.
Wiley et al.
Cancer Epidemiol. Biomarkers Prev. 2006;15:915-920.
ABSTRACT | FULL TEXT  

Incident High-Grade Squamous Intraepithelial Lesions in Senegalese Women With and Without Human Immunodeficiency Virus Type 1 (HIV-1) and HIV-2
Hawes et al.
JNCI J Natl Cancer Inst 2006;98:100-109.
ABSTRACT | FULL TEXT  

A 21-Year-Old Woman With Atypical Squamous Cells of Undetermined Significance
Sawaya
JAMA 2005;294:2210-2218.
FULL TEXT  

The Elevated 10-Year Risk of Cervical Precancer and Cancer in Women With Human Papillomavirus (HPV) Type 16 or 18 and the Possible Utility of Type-Specific HPV Testing in Clinical Practice
Khan et al.
JNCI J Natl Cancer Inst 2005;97:1072-1079.
ABSTRACT | FULL TEXT  

Spontaneous Regression of High-Grade Cervical Dysplasia: Effects of Human Papillomavirus Type and HLA Phenotype
Trimble et al.
Clin. Cancer Res. 2005;11:4717-4723.
ABSTRACT | FULL TEXT  

Diagnostic accuracy of self collected vaginal specimens for human papillomavirus compared to clinician collected human papillomavirus specimens: a meta-analysis
Ogilvie et al.
Sex. Transm. Infect. 2005;81:207-212.
ABSTRACT | FULL TEXT  

Smoking Is a Risk Factor for Cervical Intraepithelial Neoplasia Grade 3 among Oncogenic Human Papillomavirus DNA-Positive Women with Equivocal or Mildly Abnormal Cytology
McIntyre-Seltman et al.
Cancer Epidemiol. Biomarkers Prev. 2005;14:1165-1170.
ABSTRACT | FULL TEXT  

Comparison of Human Papillomavirus Messenger RNA and DNA Detection: A Cross-sectional Study of 4,136 Women >30 Years of Age with a 2-Year Follow-up of High-Grade Squamous Intraepithelial Lesion
Molden et al.
Cancer Epidemiol. Biomarkers Prev. 2005;14:367-372.
ABSTRACT | FULL TEXT  

Patient-clinician agreement on signs and symptoms of 'strep throat': a MetroNet study
Xu et al.
Fam Pract 2004;21:599-604.
ABSTRACT | FULL TEXT  

Validation of p16INK4a as a Marker of Oncogenic Human Papillomavirus Infection in Cervical Biopsies from a Population-Based Cohort in Costa Rica
Wang et al.
Cancer Epidemiol. Biomarkers Prev. 2004;13:1355-1360.
ABSTRACT | FULL TEXT  

Highly Active Antiretroviral Therapy and Cervical Squamous Intraepithelial Lesions in Human Immunodeficiency Virus-Positive Women
Ahdieh-Grant et al.
JNCI J Natl Cancer Inst 2004;96:1070-1076.
ABSTRACT | FULL TEXT  

Predictive Significance of the Alterations of p16INK4A, p14ARF, p53, and Proliferating Cell Nuclear Antigen Expression in the Progression of Cervical Cancer
Wang et al.
Clin. Cancer Res. 2004;10:2407-2414.
ABSTRACT | FULL TEXT  

Have We Resolved How To Triage Equivocal Cervical Cytology?
Solomon and Schiffman
JNCI J Natl Cancer Inst 2004;96:250-251.
FULL TEXT  

Virologic Versus Cytologic Triage of Women With Equivocal Pap Smears: A Meta-analysis of the Accuracy To Detect High-Grade Intraepithelial Neoplasia
Arbyn et al.
JNCI J Natl Cancer Inst 2004;96:280-293.
ABSTRACT | FULL TEXT  

Risk of Cervical Cancer Associated with Extending the Interval between Cervical-Cancer Screenings
Sawaya et al.
NEJM 2003;349:1501-1509.
ABSTRACT | FULL TEXT  

Determinants of Human Papillomavirus Load among Women with Histological Cervical Intraepithelial Neoplasia 3: Dominant Impact of Surrounding Low-Grade Lesions
Sherman et al.
Cancer Epidemiol. Biomarkers Prev. 2003;12:1038-1044.
ABSTRACT | FULL TEXT  

Chapter 11: Future Directions in Cervical Pathology
Sherman
J Natl Cancer Inst Monogr 2003;2003:72-79.
ABSTRACT | FULL TEXT  

Chapter 13: Primary Screening of Cervical Cancer With Human Papillomavirus Tests
Franco
J Natl Cancer Inst Monogr 2003;2003:89-96.
ABSTRACT | FULL TEXT  

Chapter 14: Role of Triage Testing in Cervical Cancer Screening
Solomon
J Natl Cancer Inst Monogr 2003;2003:97-101.
ABSTRACT | FULL TEXT  

Chapter 18: Statistical Issues in the Design and Analysis of Studies of Human Papillomavirus and Cervical Neoplasia
Wacholder
J Natl Cancer Inst Monogr 2003;2003:125-130.
ABSTRACT | FULL TEXT  

Histopathologic Extent of Cervical Intraepithelial Neoplasia 3 Lesions in the Atypical Squamous Cells of Undetermined Significance Low-grade Squamous Intraepithelial Lesion Triage Study: Implications for Subject Safety and Lead-time Bias
Sherman et al.
Cancer Epidemiol. Biomarkers Prev. 2003;12:372-379.
ABSTRACT | FULL TEXT  

Baseline Cytology, Human Papillomavirus Testing, and Risk for Cervical Neoplasia: A 10-Year Cohort Analysis
Sherman et al.
JNCI J Natl Cancer Inst 2003;95:46-52.
ABSTRACT | FULL TEXT  

Evaluation of Human Papillomavirus Testing in Primary Screening for Cervical Abnormalities: Comparison of Sensitivity, Specificity, and Frequency of Referral
Kulasingam et al.
JAMA 2002;288:1749-1757.
ABSTRACT | FULL TEXT  

Costs and Benefits of Different Strategies to Screen for Cervical Cancer in Less-Developed Countries
Mandelblatt et al.
JNCI J Natl Cancer Inst 2002;94:1469-1483.
ABSTRACT | FULL TEXT  

Benefits and Costs of Using HPV Testing to Screen for Cervical Cancer
Mandelblatt et al.
JAMA 2002;287:2372-2381.
ABSTRACT | FULL TEXT  

2001 Consensus Guidelines for the Management of Women With Cervical Cytological Abnormalities
Wright et al.
JAMA 2002;287:2120-2129.
ABSTRACT | FULL TEXT  

New Bethesda Terminology and Evidence-Based Management Guidelines for Cervical Cytology Findings
Stoler
JAMA 2002;287:2140-2141.
FULL TEXT  

Results of Human Papillomavirus DNA Testing with the Hybrid Capture 2 Assay Are Reproducible
Castle et al.
J. Clin. Microbiol. 2002;40:1088-1090.
ABSTRACT | FULL TEXT  

Interobserver Agreement About Cervical Cytologic and Histologic Diagnosis
Mulkey et al.
JAMA 2001;285:2855-2856.
FULL TEXT  

Implications of Low Diagnostic Reproducibility of Cervical Cytologic and Histologic Diagnoses
Selvaggi
JAMA 2001;285:1506-1508.
FULL TEXT  

Women's Health--Filling the Gaps
DeAngelis and Winker
JAMA 2001;285:1508-1509.
FULL TEXT  





HOME | CURRENT ISSUE | PAST ISSUES | TOPIC COLLECTIONS | CME | SUBMIT | SUBSCRIBE | HELP
CONDITIONS OF USE | PRIVACY POLICY | CONTACT US | SITE MAP
 
© 2001 American Medical Association. All Rights Reserved.