Research Article: The Bethesda System For Reporting Thyroid Cytopathology: A Cytohistological Study
Research Article: The Bethesda System For Reporting Thyroid Cytopathology: A Cytohistological Study
Research Article
The Bethesda System for Reporting Thyroid Cytopathology: A
Cytohistological Study
          Bakiarathana Anand ,1 Anita Ramdas,1 Marie Moses Ambroise,1 and Nirmal P. Kumar2
          1
              Department of Pathology, Pondicherry Institute of Medical Sciences, Ganapathichettikulam, Kalapet, Puducherry 605014, India
          2
              Department of General Surgery, Pondicherry Institute of Medical Sciences, Ganapathichettikulam, Kalapet,
              Puducherry 605014, India
          Copyright © 2020 Bakiarathana Anand et al. This is an open access article distributed under the Creative Commons Attribution
          License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is
          properly cited.
          Introduction. The Bethesda System for Reporting Thyroid Cytopathology (TBSRTC) is a significant step to standardize the
          reporting of thyroid fine needle aspiration (FNA). It has high predictive value, reproducibility, and improved clinical significance.
          Aim. The study was aimed to evaluate the diagnostic utility and reproducibility of “TBSRTC” at our institute. Methods and
          Material. The study included 646 thyroid FNAs which were reviewed by three pathologists and classified according to TBSRTC.
          Cytohistological correlation was done for 100 cases with surgical follow-up and the sensitivity, specificity, positive predictive
          value, negative predictive value, diagnostic accuracy, and risk of malignancy (ROM) were calculated. The interobserver variation
          among three pathologists was also assessed. Results. The distribution of cases in various TBSRTC categories is as follows:
          I—nondiagnostic 13.8%, II—benign 75.9%, III—atypia of undetermined significance (AUS)/follicular lesion of undetermined
          significance (FLUS) 1.2%, IV—follicular neoplasm (FN)/suspicious for follicular neoplasm (SFN) 3.7%, V—suspicious for
          malignancy (SM) 2.6%, and VI—malignant 2.8%. The sensitivity, specificity, positive predictive value, negative predictive value,
          and diagnostic accuracy are 72.4%, 94.3%, 84%, 89.2%, and 87.9%, respectively. The ROM of various TBSRTC categories were
          II—8.5%; III—66.7%; IV—63.6%; and V and VI—100%. Cohen’s Weighted Kappa score was 0.99 which indicates almost perfect
          agreement among the three pathologists. Conclusions. Our study substantiates greater reproducibility among pathologists using
          TBSRTC to arrive at a precise diagnosis with an added advantage of predicting the risk of malignancy which enables the clinician
          to plan for follow-up or surgery and also the extent of surgery.
    In our study, the nondiagnostic yield was 13.8% which                  Mondal et al. reported a lower percentage (1%) of AUS/
was high when compared to TBSRTC consensus. Sampling                   FLUS cases which was a result of performing ultrasound
error and technical quality due to the above-mentioned                 guided FNA in small and heterogeneous nodules with
reasons and strict adherence to the adequacy criteria explain          suspicious features on palpation and radiological evaluation,
the high rate of ND smears.                                            so that the aspirate can be obtained from the exact site of
    Mondal et al. and Nandedkar et al. found high incidence            lesion which is a routine practice even at our institute [6].
of category II lesions since the patients directly visit a tertiary        The actual risk of malignancy of category III is difficult to
care center for primary diagnosis without any referral which           determine, since confirmatory diagnosis is only available in a
was also the case in our study [6, 9].                                 subset of patients selected for surgery who have suspicious
    The incidence of benign lesions in our study was 75.9%             clinical or USG features. The patients are also subjected to
when compared to studies done in USA ranging from 64% to               selection bias which overestimates the prevalence of ma-
66% which can be attributed to the regional variation in the           lignancy [15].
incidence of thyroid disorders and where majority of pa-                   The risk of malignancy of AUS/FLUS cases was 69% in a
tients come only on a referral basis and hence are not exactly         study done by Park et al. which was higher when compared
representative of the general population [10, 11].                     to our study and TBSRTC guidelines. This was because
    The implied risk of malignancy for category II is 0% to            patients with high index of clinical suspicion for malignancy
3% with the recommended management being clinical                      undergo surgery without a repeat FNA. Patients tend to be
follow-up of patients [2]. Although surgery is not recom-              more concerned about false positive results than false
mended for category II lesions, the patients in our study were         negative results, which might have pressurized cytopa-
operated mainly for cosmetic purpose and pressure                      thologists to underdiagnose cases to avoid making false
symptoms.                                                              positive diagnosis [16].
    The indeterminate category, AUS/FLUS, has led to                       Our study was held in a teaching hospital, where FNAs
confusion due to inconsistent usage amongst pathologists of            were performed by different persons with varied level of
various institutions. This category should be used as a last           experience during their training period. This factor could
resort in reporting with the expectation of 7% or less cases to        have resulted in hemodilution and artefactual changes
receive this diagnosis as proposed by TBSRTC. Layfield et al.           during smear preparation which might have contributed to a
reported a variation of 2.5–28.6% among individual pathol-             higher ROM in category III (Figure 1). Repeat FNAs of such
ogists and 3.3–14.9% among three academic institutions [12].           cases along with clinicoradiological correlation could have
    There were less number of cases (1.2%) diagnosed under             decreased the proportion of cases reported in this category as
the category AUS/FLUS in our study which was due to rigid              well as the ROM.
adherence to the diagnostic criteria and the pathologists                  Based on cytology it is difficult to distinguish follicular
endeavor to avoid ambiguity and keep the use of AUS/FLUS               carcinoma from follicular adenoma [2, 12] (Figure 2). Melo-
to a minimum which was in similarity to a study by Nan-                Uribe et al. correlated the results of thyroid FNA reported
dedkar et al. which had 0.8% of cases in category III out of           using the TBSRTC with histopathology, from three different
606 FNA’s [9]. Jo et al. and Yassa et al. have reported 3.4%           hospitals in Columbia. There was significant variation in the
and 4% lesions as AUS/FLUS, respectively [13, 14].                     malignancy risk of category IV which measured 56.3% in
                                                                                                                                                                         4
                                      Table 2: Cytohistological correlation with assessment of risk of malignancy and risk of neoplasm.
                                     Cases that                     Histopathology diagnosis
                                                                                                               Risk of       Risk of malignancy    Risk of malignancy
                    No. of cases     underwent
Bethesda category                                         Benign            Benign                            neoplasm       including papillary   excluding papillary
                    (total � 646)      surgery                                           Malignant lesion
                                                       nonneoplastic       neoplastic                           (%)          microcarcinoma (%)    microcarcinoma (%)
                                    (total � 100)
I—non diagnostic    89 (13.8%)            1         Colloid nodule (1)         0                0                 0                       0                0
                                                                                           Follicular
                                                    Nodular goitre (42)
                                                                                        carcinoma (1)
                                                       Adenomatoid                    Papillary carcinoma
                                                      hyperplasia (10)                         (2)
                                                                           Follicular
II—benign           490 (75.9%)          71                                                 Papillary            14.1                 8.5                  5.6
                                                    Colloid nodule (5)    adenoma (4)
                                                                                      microcarcinoma (2)
                                                       Lymphocytic/
                                                                                           Hurthle cell
                                                         Hashimoto
                                                                                          carcinoma (1)
                                                       thyroiditis (4)
                                                                                            Follicular
                                                                           Follicular    carcinoma (1)
III—AUS/FLUS          8 (1.2%)           3                                                                       100                 66.7                 66.7
                                                                          adenoma (1) Papillary carcinoma
                                                                                                (1)
                                                                           Follicular       Follicular
                                                                          adenoma (1)    carcinoma (4)
                                                                                       Papillary carcinoma
IV—FN/SFN            24 (3.7%)           11         Nodular goitre (2)                                           81.8                63.6                 63.6
                                                                          Hurthle cell          (2)
                                                                          adenoma (1)       Medullary
                                                                                         carcinoma (1)
                                                                                       Papillary carcinoma
                                                                                                (5)
V—suspicious for                                                                             Papillary
                     17 (2.6%)           7                                                                       100                 100                  85.7
malignancy                                                                             microcarcinoma (1)
                                                                                            Medullary
                                                                                         carcinoma (1)
                                                                                       Papillary carcinoma
VI—malignant         18 (2.8%)           7                                                                       100                 100                  100
                                                                                                (7)
                                                                                                                                                                         Journal of Thyroid Research
Journal of Thyroid Research                                                                                                              5
          Table 3: Determination of diagnostic values.                  (Table 2). This was due to repeat FNA of cases with high
                                     HPE             HPE
                                                                        index of clinical and ultrasound features suspicious for
Test                                                           Total    malignancy.
                                   malignant        benign
                                                                            The RON of category II was similar to the study done by
FNA Bethesda categories IV,
                                       21              4        25      Wu et al. (Table 4) [20]. This was due to false negative
V, VI
FNA Bethesda categories II,                                             reporting of 2 papillary microcarcinoma, 1 Hurthle cell
                                        8             66        74      carcinoma, and 1 follicular carcinoma as benign. Two cases
III
Total                                  29             70        99      of conventional papillary carcinomas were misdiagnosed as
                                                                        benign due to sampling error (Table 2). Follicular carcinoma
                                                                        and Hurthle cell carcinoma are difficult to diagnose on FNA
                                                                        and need to be confirmed by histopathology. Papillary
                                                                        microcarcinoma is a lesion that measures 1 cm or less which
                                                                        can be easily missed on FNA unless the aspirator hits the
                                                                        target.
                                                                            Our study was able to accurately predict the RON of
                                                                        categories III, V, and VI when compared to the study done
                                                                        by Wu et al. which could be attributed to the routine practice
                                                                        of correlating cytology with clinical, biochemical, and ra-
                                                                        diological features at our institute (Table 4) [20].
                                                                            The FN/SFN category had RON of 81.8% which was high
                                                                        compared to the study by Wu et al. This was due to clas-
                                                                        sification of two cases of nodular goitre as category IV lesion
                                                                        (Table 2). Another possible reason could be the variation in
                                                                        sample size and less number of cases with surgical follow-up
                                                                        in our study (Table 4) [20].
                                                                            Mehra and Verma in their study found that the method
Figure 1: Atypia of undetermined significance (Bethesda category
                                                                        of statistical analysis can alter the results of diagnostic values.
III). Smear shows clotting artefact with crowding of follicular cells
hindering the interpretation (MGG stain ×400).
                                                                        If suspicious lesions are considered positive, the sensitivity
                                                                        increases while the specificity decreases. If suspicious lesions
                                                                        are excluded, then the sensitivity decreases and the false
oncology centers and 23.5% in nononcology centers which                 negative rates increase. In their study diagnostic values were
was attributed to the selection bias of the patients requiring          calculated by either excluding FN/SFN or including it with
surgery [17].                                                           either benign or malignant diagnosis to highlight the effect
    The high ROM in categories III and IV in our study when             on diagnostic values [21].
compared to other studies may be due to the following                       Shi et al. suggested that eliminating the diagnosis of
reasons. Firstly, it is due to the heterogeneity of the inde-           category III substantially decreases the sensitivity of thyroid
terminate categories III and IV which are subject to variation          FNAs (the sensitivity for detecting PTC dropped from 100%
in interpretation across institutions [3]. Secondly, it is be-          to 27%) and increases both false positive and false negative
cause of variations in number of patients with surgical                 rates. The authors concluded that AUS/FLUS category
follow-up and also the selection bias of patients requiring             should not be eliminated but recommended using it min-
surgery.                                                                imally [22].
    Our study had 2.4% cases suspicious for papillary thy-                  The findings from our study indicate that the calculation
roid carcinoma (PTC) which was similar to the lower range               of sensitivity, specificity, positive predictive value, negative
of rate of suspicious for PTC in the following study [15]. The          predictive value, and diagnostic accuracy of thyroid FNAs
ROM of category V in a study by Williams et al. was less                according to the Bethesda system are less reliable because of
when compared to our study which may be due to variation                the arbitrary nature of cases classified under categories III
in cohort characteristics and underdiagnoses of lesions                 (AUS/FLUS) and IV (FN/SFN) (Table 3).
leading to hemithyroidectomy rather than total thyroidec-                   The main purpose of TBSRTC was to eliminate the
tomy [18].                                                              ambiguity and to follow uniformity in the reporting of
    The ROM in a study by Partyka et al. was in good                    thyroid FNAs thereby enabling ease of communication
correlation with our study in categories V and VI which was             among pathologists and clinician and to plan appropriate
100% each after inclusion of papillary microcarcinoma [19]              treatment for the patients [2]. Table 5 shows comparison of
(Figure 3). Our study was able to accurately predict the ROM            interobserver reproducibility of our study with that of other
for suspicious for malignancy and malignant nodules due to              studies [23–25].
the practice of correlating cytologic features with clinical,               Our study differed from a study done by Padmanabhan
biochemical, and USG findings while reporting (Table 2).                 et al. which assessed the interobserver reproducibility in
    The risk of neoplasm (RON) gives an overall estimate of             reporting AUS/FLUS category among seven cytopatholo-
predicting both benign and malignant lesions. Our study                 gists which revealed fair agreement (Fleiss kappa score 0.23)
had nil risk of neoplasm in the nondiagnostic category                  and recommended review of AUS/FLUS cases for more
6                                                                                                        Journal of Thyroid Research
(a) (b)
(c)
Figure 2: Follicular neoplasm/suspicious for follicular neoplasm (Bethesda category IV). (a) Highly cellular smear with cells arranged
predominantly in microfollicular pattern (MGG ×100). Histopathology of the same showed follicular carcinoma with capsular invasion (b)
and vascular invasion (c) (H&E ×100).
                                 (a)                                                               (b)
                                                        Figure 3: Continued.
Journal of Thyroid Research                                                                                                              7
(c) (d)
Figure 3: Suspicious for papillary carcinoma (Bethesda category V). (a) One of the follicular cells show nuclear groove (arrow) (H&E ×400).
(b) Intranuclear cytoplasmic inclusion (arrow) seen in occasional follicular cell (H&E ×400). (c) Smear shows focal papillaroid structure
(H&E ×400). (d) Histopathology of the same showed papillary microcarcinoma (H&E ×100).
                     Table 4: Comparison of risk of neoplasm of our study with another study by Wu et al. [20].
Bethesda category     Risk of neoplasm of our study (%) (n � 100/646)       Risk of neoplasm in a study by Wu et al. (%) (n � 221/1382)
I—nondiagnostic                             0                                                            24
II—benign                                  14.1                                                          14
III—AUS/FLUS                               100                                                           44
IV—FN/SFN                                  81.8                                                          67
V—SFM                                      100                                                           77
VI—malignant                               100                                                          100
Post Graduate Institute, Puducherry, for his technical                   [15] A. S. Ho, E. E. Sarti, K. S. Jain et al., “Malignancy rate in
support in the preparation of the manuscript.                                 thyroid nodules classified as bethesda category III (AUS/
                                                                              FLUS),” Thyroid, vol. 24, no. 5, pp. 832–839, 2014.
                                                                         [16] J. H. Park, S. O. Yoon, E. J. Son, H. M. Kim, J. H. Nahm, and
References                                                                    S. Hong, “Incidence and malignancy rates of diagnoses in the
                                                                              bethesda system for reporting thyroid aspiration cytology: an
 [1] N. Afroze, N. Kayani, and S. H. Hasan, “Role of fine needle               institutional experience,” Korean Journal of Pathology, vol. 48,
     aspiration cytology in the diagnosis of palpable thyroid le-             no. 2, pp. 133–139, 2014.
     sions,” The Indian Journal of Pathology and Microbiology,           [17] M. Melo-Uribe, Á. Sanabria, A. Romero-Rojas et al., “The
     vol. 45, no. 3, pp. 241–246, 2002.                                       Bethesda system for reporting thyroid cytopathology in
 [2] E. S. Cibas and S. Z. Ali, “The bethesda system for reporting            Colombia: correlation with histopathological diagnoses in
     thyroid cytopathology,” American Journal of Clinical Pa-                 oncology and non-oncology institutions,” Journal of Cytology,
     thology, vol. 132, no. 11, pp. 658–665, 2009.                            vol. 32, no. 1, p. 12, 2015.
 [3] M. Bongiovanni, A. Spitale, W. C. Faquin, L. Mazzucchelli,          [18] B. A. Williams, M. J. Bullock, J. R. Trites, S. M. Taylor, and
     and Z. W. Baloch, “The bethesda system for reporting thyroid             R. D. Hart, “Rates of thyroid malignancy by FNA diagnostic
     cytopathology: a meta-analysis,” Acta Cytologica, vol. 56,               category,” Journal of Otolaryngology—Head & Neck Surgery,
     no. 4, pp. 333–339, 2012.                                                vol. 42, no. 1, p. 61, 2013.
 [4] P. Gunes, S. Canberk, M. Onenerk et al., “A different per-           [19] K. L. Partyka, E. C. Huang, H. M. Cramer, S. Chen, and
     spective on evaluating the malignancy rate of the non-diag-              H. H. Wu, “Histologic and clinical follow-up of thyroid fine-
     nostic category of the bethesda system for reporting thyroid             needle aspirates in pediatric patients,” Cancer Cytopathology,
     cytopathology: a single institute experience and review of the           vol. 124, no. 7, pp. 467–471, 2016.
     literature,” PLoS One, vol. 11, no. 9, Article ID e0162745, 2016.   [20] H. H.-J. Wu, C. Rose, and T. M. Elsheikh, “The bethesda
 [5] A. Guo, Y. Kaminoh, T. Forward, F. L. Schwartz, and                      system for reporting thyroid cytopathology: an experience of
     S. Jenkinson, “Fine needle aspiration of thyroid nodules using           1,382 cases in a community practice setting with the impli-
     the bethesda system for reporting thyroid cytopathology: an              cation for risk of neoplasm and risk of malignancy,” Diag-
     institutional experience in a rural setting,” International              nostic Cytopathology, vol. 40, no. 5, pp. 399–403, 2012.
     Journal of Endocrinology, vol. 2017, Article ID 9601735,            [21] P. Mehra and A. K. Verma, “Thyroid cytopathology reporting
     6 pages, 2017.                                                           by the bethesda system: a two-year prospective study in an
 [6] S. Mondal, S. Sinha, B. Basak, D. Roy, and S. Sinha, “The                academic institution,” Pathology Research International,
     Bethesda system for reporting thyroid fine needle aspirates: a            vol. 2015, Article ID 240505, 11 pages, 2015.
     cytologic study with histologic follow-up,” Journal of Cytol-       [22] Y. Shi, X. Ding, M. Klein et al., “Thyroid fine-needle aspiration
     ogy, vol. 30, no. 2, pp. 94–99, 2013.                                    with atypia of undetermined significance,” Cancer Cytopa-
 [7] T. L. Hall, L. J. Layfield, A. Philippe, and D. L. Rosenthal,             thology, vol. 117, no. 5, pp. 298–304, 2009.
     “Sources of diagnostic error in fine needle aspiration of the        [23] P. Awasthi, G. Goel, U. Khurana, D. Joshi, K. Majumdar, and
     thyroid,” Cancer, vol. 63, no. 4, pp. 718–725, 1989.                     N. Kapoor, “Reproducibility of “the bethesda system for
 [8] J. I. Hamburger, M. Husain, R. Nishiyama, C. Nunez, and                  reporting thyroid cytopathology”: a retrospective analysis of
     D. Solomon, “Increasing the accuracy of fine-needle biopsy                107 patients,” Journal of Cytology, vol. 35, no. 1, pp. 33–36,
     for thyroid nodules,” Archives of Pathology & Laboratory             2018.
     Medicine, vol. 113, no. 9, pp. 1035–1041, 1989.                     [24] P. Pathak, R. Srivastava, N. Singh, V. K. Arora, and A. Bhatia,
 [9] S. S. Nandedkar, M. Dixit, K. Malukani, A. V. Varma, and                 “Implementation of the bethesda system for reporting thyroid
     S. Gambhir, “Evaluation of thyroid lesions by fine-needle                 cytopathology: interobserver concordance and reclassification
     aspiration cytology according to bethesda system and its                 of previously inconclusive aspirates,” Diagnostic Cytopa-
     histopathological correlation,” International Journal of Ap-             thology, vol. 42, no. 11, pp. 944–949, 2014.
     plied and Basic Medical Research, vol. 8, no. 2, pp. 76–82, 2018.   [25] V. Padmanabhan, C. B. Marshall, G. Akdas Barkan et al.,
[10] R. Nayar and M. Ivanovic, “The indeterminate thyroid fine-                “Reproducibility of atypia of undetermined significance/fol-
     needle aspiration,” Cancer Cytopathology, vol. 117, no. 3,               licular lesion of undetermined significance category using the
     pp. 195–202, 2009.                                                       bethesda system for reporting thyroid cytology when
[11] J. Yang, V. Schnadig, R. Logrono, and P. G. Wasserman,                   reviewing slides from different institutions: a study of in-
     “Fine-needle aspiration of thyroid nodules: a study of 4703              terobserver variability,” Diagnostic Cytopathology, vol. 45,
     patients with histologic and clinical correlations,” Cancer,             no. 5, pp. 399–405, 2017.
     vol. 111, no. 5, pp. 306–315, 2007.
[12] L. J. Layfield, M. J. Morton, H. M. Cramer, and S. Hirschowitz,
     “Implications of the proposed thyroid fine-needle aspiration
     category of “follicular lesion of undetermined significance”: a
     five-year multi-institutional analysis,” Diagnostic Cytopa-
     thology, vol. 37, no. 10, pp. 710–714, 2009.
[13] V. Y. Jo, E. B. Stelow, S. M. Dustin, and K. Z. Hanley,
     “Malignancy risk for fine-needle aspiration of thyroid lesions
     according to the bethesda system for reporting thyroid
     cytopathology,” American Journal of Clinical Pathology,
     vol. 134, no. 3, pp. 450–456, 2010.
[14] L. Yassa, E. S. Cibas, C. B. Benson et al., “Long-term as-
     sessment of a multidisciplinary approach to thyroid nodule
     diagnostic evaluation,” Cancer, vol. 111, no. 6, pp. 508–516,
     2007.