DOI: 10.1055/s-0041-1740271 - volume 43 - Janeiro 2022
Carla Peixoto, Maite Castro, Isabel Carriles, Maria de Arriba, Victoria Lapresa, Juan Luis Alcazar
Congenital uterine malformations were described for the first time in 1800s and, since then, several classification systems have been developed for describing different types of uterine and cervical/vaginal anomalies,1 whose incidence has been reported as of between 0.2 and 0.4% in the general population and of between 3 and 13% in infertile patients.2,3,4,5,6,7
Classically, septate uterus has been associated with poor reproductive and obstetric outcomes, and surgical metroplasty is advocated in these cases, with the aim of improving these outcomes.8,9,10,11 Notwithstanding, evidence that this surgery is beneficial is rather arguable.12 Differently from septate uterus, arcuate/normal and bicornuate uteri do not require surgery.8,9,10,11 However, from the beginning, there was some difficulty in the classification of uterine malformations, mainly due to the discrepancy between the diagnostic criteria and the diagnostic techniques used.13 To overcome these limitations, three-dimensional (3D) ultrasound has been proposed as the gold standard technique to classify uterine malformations, as it seems to be better to evaluate the level of distortion of the uterine fundus, and also to reduce the interobserver variability.14,15
The European Society of Human Reproduction and Embryology/European Society for Gynaecological Endoscopy (ESHRE/ESGE) and the American Society for Reproductive Medicine (ASRM) have both published their recommendations on how to classify uterine anomalies, using the coronal plane of the uterus. The ESHRE/ESGE classification suggests using an indentation-to-wall-thickness (I:WT) ratio > 50% for diagnosing a septate uterus and an external fundal indentation > 50% to diagnose a bicornuate uterus.13,16 The ASRM classification considers a uterus as septate when there is both an indentation depth > 15mm and an indentation angle < 90°; a normal/arcuate uterus when there is both an indentation depth < 10mm and an indentation angle > 90°; and a bicornuate uterus when the external fundal indentation is > 10 mm. According to this classification, some cases could not be classified as septate or not-septate (falling in the so-called gray zone).8 Although both classifications have very objective criteria, they do not coincide, which means that a high percentage of uteri classified as septate by the ESHRE/ESGE classification are classified as arcuate/normal by the ASRM classification.17,18,19,20 More recently, a group of experts (Congenital Uterine Malformations by Experts [CUME]) proposed new criteria for diagnosing a septate uterus: indentation depth > 10 mm, indentation angle < 140°, and I:WT > 110%.18
The main objective of the present study was to assess the interobserver agreement of nonexpert sonographers in classifying septate uteri using the ESHRE/ESGE, ASRM, and CUME classifications in each case. Secondly, we also aimed to compare the agreement for each examiner for diagnosing septate uterus between the three different classifications (ESHRE/ESGE, ASRM, and CUME).
The present study was a single-center retrospective analysis of patients with suspicion of congenital uterine malformation who underwent transvaginal ultrasound at the Department of Obstetrics and Gynecology of the Clínica Universidad de Navarra, Pamplona, Spain. Due to the study design and to the anonymization of the 3D volumes, formal approval by the Institutional Review Board from the Clínica Universidad de Navarra was waived. However, all women had given oral informed consent to acquire and use their 3D datasets for the present research. The present study was performed at the Clínica Universidad de Navarra between September and October 2018.
The inclusion criterion was: women with suspected uterine malformation in infertility setting who underwent 3D uterine evaluation. The exclusion criteria were: diagnosis of bicornuate or didelfis uterus or poor-quality 3D volume.
An expert examiner (Alcazar J. L.) randomly selected cases from the hospital database. Two nonexpert examiners (Peixoto C. and Castro M) evaluated a single 3D volume of the uterus of each woman. All 3D datasets had been acquired by one expert examiner (Alcazar J. L.) using either a Voluson 730 Expert or Voluson E8 machines (GE Healthcare, Chicago, IL, USA).
The nonexpert examiners had basic training on ultrasound in gynecology, with no special focus on uterine malformations, but both were undergoing a training program for ultrasound assessment of congenital uterine anomalies. Before the study, the nonexpert examiners took a short (2 hours) theoretical training session focused on the ESHRE/ESGE, ASRM and CUME classifications. Additionally, they read the original papers in which the criteria to classify uterine malformations were described.8,13,18 They were also trained to use the 4D View Ultrasound software (GE Healthcare, Chicago, IL, USA).
The two observers manipulated the uterine 3D volumes, blinded to each other. After obtaining the coronal plane and using the Volume Contrast Imaging (VCI) function according to the CUME recommendations,18 they performed the following measurements: indentation depth, indentation angle, uterine fundal wall thickness, external fundal indentation, and I:WT ratio. Each observer had to assign a diagnosis (normal/arcuate, septate) in each case, according to the three classification systems (ESHRE/ESGE, ASRM, and CUME) (►Fig. 1).
Fig. 1
A case of a septate uterus according to the ESHRE/ESGE classification (I:WT = 52%) (A), but normal/arcuate according to the ASRM (indentation: 0.63 cm, angle: 130°) (B) and to the CUME (indentation: 0.63 cm, angle: 130°, I:WT: 52%) classifications (C).
Arbitrarily, to avoid cases from falling in the grey zone, we decided the following: for the ASRM classification, in case that only one criterion was present, the case was considered as normal. For the CUME classification, the uterus was considered as septate if at least two criteria were present.
The examiners were also instructed not to discuss their impressions among themselves or with the expert after the assessment. We did not set a maximum time for performing evaluations of the 3D volumes.
The interobserver agreement between the two nonexpert examiners regarding the ESHRE/ESGE, ASRM, and CUME classifications was assessed using the Cohen weighted kappa index (k) with 95% confidence intervals (CIs) and percentage of agreement.21
We also assessed the interobserver agreement for the two nonexpert examiners regarding the three classifications (ASRM versus ESHRE/ESGE, ASRM versus CUME, and ESHRE/ESGE versus CUME).
The kappa value was interpreted regarding the reporting of the reliability/strength of agreement as follows: poor < 0.20; fair = 0.21 to 0.40; moderate = 0.41 to 0.60; good = 0.61 to 0.80; and very good = 0.81 to 1.00.22
Statistical calculations were done using GraphPad software (GraphPad Software, Inc., San Diego, CA, USA). Sample size calculation was not performed.
Forty-seven 3D volumes of women were included in the present study. This number was chosen arbitrarily. The interobserver agreement between the two nonexpert examiners for classifying uterine malformations is shown in ►Tables 12 and 3. Overall, it was good for the ESHRE/ ESGE (k = 0.74; 95%CI: 0.55–0.92) classification (►Table 1) and very good for the ASRM and CUME classifications (k = 0.96; 95%CI: 0.88–1.00; and k = 0.91; 95%CI: 0.79– 1.00, respectively) (►Tables 2 and 3).
| ESHRE/ESGE | ||||||
|---|---|---|---|---|---|---|
| Examiner 1 | ||||||
| Examiner 2 | Normal/arcuate | Septate | Bicornuate | Weighted Kappa (95% CI) | Agreement (%) | |
| Normal/arcuate | 16 | 3 | – | 0.74 | 86% | |
| Septate | 3 | 25 | – | (0.55–0.92) | ||
| Bicornuate | – | 1 | 2 | |||
Abbreviations: CI, confidence interval; ESHRE, European Society of Human Reproduction and Embryology; ESGE, European Society for Gynaecological Endoscopy.
Table 1
Interobserver agreement for nonexpert examiners for classifying uterine congenital anomalies using the ESHRE/ESGE classification
| ASRM | ||||||
|---|---|---|---|---|---|---|
| Examiner 1 | ||||||
| Examiner 2 | Normal/arcuate | Septate | Bicornuate | Weighted Kappa (95% CI) | Agreement (%) | |
| Normal/arcuate | 33 | – | – | 0.96 | 98% | |
| Septate | – | 15 | – | (0.88–1.00) | ||
| Bicornuate | – | 1 | 1 | |||
Abbreviations: CI, confidence interval; ASRM, American Society for Reproductive Medicine.
Table 2
Interobserver agreement for nonexpert examiners for classifying uterine congenital anomalies using the ASRM classification
| CUME | |||||
|---|---|---|---|---|---|
| Examiner 1 | |||||
| Examiner 2 | Normal/arcuate | Septate | Kappa (95%CI) | Agreement (%) | |
| Normal/arcuate | 27 | 1 | 0.91 | 96% | |
| Septate | 1 | 18 | (0.79–1.00) | ||
Abbreviations: CI, confidence interval; CUME, Congenital Uterine Malformations by Experts.
Table 3
Interobserver agreement for nonexpert examiners for classifying uterine congenital anomalies using the CUME classification
The agreement between the different classifications systems is shown in ►Tables 4,5,6,7,8,9. When comparing the agreement for classifying uterine anomalies between the ESHRE/ESGE and ASRM classifications, we observed that it was moderate for both examiners (►Tables 4 and 5). We also observed that 14 cases were classified as septate by the ESHRE/ESGE classification and as normal/arcuate by the ASRM classification by both examiners. For both examiners, 9 of these cases were classified as normal/arcuate (64.3%) and 5 as septate (35.7%) when using the CUME classification.
| Examiner 1 | ||||||
|---|---|---|---|---|---|---|
| ASRM | ||||||
| ESHRE/ESGE | Normal/arcuate | Septate | Bicornuate | Weighted Kappa (95% CI) | Agreement (%) | |
| Normal/arcuate | 19 | – | – | 0.48 | 70% | |
| Septate | 14 | 14 | – | (0.28–0.68) | ||
| Bicornuate | – | 1 | 2 | |||
Abbreviations: ASRM, American Society for Reproductive Medicine; CI, confidence interval; ESHRE, European Society of Human Reproduction and Embryology; ESGE, European Society for Gynaecological Endoscopy.
Table 4
Intraobserver agreement for examiner 1 when using the ASRM and the ESHRE/ESGE classifications
| Examiner 2 | ||||||
|---|---|---|---|---|---|---|
| ASRM | ||||||
| ESHRE/ESGE | Normal/arcuate | Septate | Bicornuate | Weighted Kappa (95% CI) | Agreement (%) | |
| Normal/arcuate | 19 | – | – | 0.47 | 70% | |
| Septate | 14 | 15 | – | (0.27–0.67) | ||
| Bicornuate | – | 1 | 1 | |||
Abbreviations: ASRM, American Society for Reproductive Medicine; CI, confidence interval; ESHRE, European Society of Human Reproduction and Embryology; ESGE, European Society for Gynaecological Endoscopy.
Table 5
Intraobserver agreement for examiner 2 when using the ASRM and the ESHRE/ESGE classifications
| Examiner 1 | |||||
|---|---|---|---|---|---|
| ESHRE/ESGE | |||||
| CUME | Normal/arcuate | Septate | Kappa (95%CI) | Agreement (%) | |
| Normal/arcuate | 18 | 10 | 0.59 | 79% | |
| Septate | – | 19 | (0.39–0.80) | ||
Abbreviations: CI, confidence interval; CUME, Congenital Uterine Malformations by Experts; ESHRE, European Society of Human Reproduction and Embryology; ESGE, European Society for Gynaecological Endoscopy.
Table 6
Intraobserver agreement for examiner 1 when using the CUME and the ESHRE/ESGE classifications
| Examiner 2 | |||||
|---|---|---|---|---|---|
| ESHRE/ESGE | |||||
| CUME | Normal/arcuate | Septate | Kappa (95%CI) | Agreement (%) | |
| Normal/arcuate | 19 | 9 | 0.63 | 81% | |
| Septate | – | 19 | (0.43–0.83) | ||
Abbreviations: CI, confidence interval; CUME, Congenital Uterine Malformations by Experts; ESHRE, European Society of Human Reproduction and Embryology; ESGE, European Society for Gynaecological Endoscopy.
Table 7
Intraobserver agreement for examiner 2 when using the CUME and the ESHRE/ESGE classifications
| Examiner 1 | |||||
|---|---|---|---|---|---|
| ASRM | |||||
| CUME | Normal/arcuate | Septate | Kappa (95%CI) | Agreement (%) | |
| Normal/arcuate | 28 | – | 0.77 | 89% | |
| Septate | 5 | 14 | (0.58–0.96) | ||
Abbreviations: ASRM, American Society for Reproductive Medicine; CI, confidence interval. CUME, Congenital Uterine Malformations by Experts.
Table 8
Intraobserver agreement for examiner 1 when using the CUME and the ASRM classifications
| Examiner 2 | |||||
|---|---|---|---|---|---|
| ASRM | |||||
| CUME | Normal/arcuate | Septate | Kappa (95%CI) | Agreement (%) | |
| Normal/arcuate | 28 | – | 0.77 | 89% | |
| Septate | 5 | 14 | (0.58–0.96) | ||
Abbreviations: ASRM, American Society for Reproductive Medicine; CI, confidence interval; CUME, Congenital Uterine Malformations by Experts.
Table 9
Intraobserver agreement for examiner 2 when using the CUME and the ASRM classifications
The agreement between the ESHRE/ESGE and CUME classifications was moderate for examiner 1 and good for examiner 2 (►Tables 6 and 7). Finally, the agreement between the ASRM and CUME classifications was good for both examiners (►Tables 8 and 9).
As far as we know, this is the first study to assess the interobserver agreement of the three existing classification systems to describe normal, arcuate, and septate uterus. We have shown that the evaluation of 3D volumes of uteri is reproducible among nonexpert examiners.
The agreement between observers is higher when using the ASRM and CUME classifications. Actually, our data confirm the results previously reported by Ludwin et al.,20 who showed that the ASRM classification was better than the ESHRE/ESGE classification for diagnosing septate uterus. In addition, we have also shown that the criteria used by the new classification system (CUME), despite being apparently more complex, are highly reproducible among examiners (k = 0.91). This is an important finding, given that this new classification has not yet been validated after its publication.
In our study, the agreement between the ESHRE/ESGE and ASRM criteria was moderate. This finding is in line with those of previous studies18,20 and raises concern regarding the use of the ESHRE/ESGE classification, since its use could lead to an overdiagnosis of septate uterus and to a potential increase of surgical corrections.17 This is relevant since recent evidence suggests no benefit in obstetrical outcomes with surgery.23 Our data also support the results published by the CUME group,18 given that we demonstrated that, in comparison with CUME criteria, the ESHRE/ESGE classification overestimates the number of septate uteri. Overall, the agreement between the CUME and the ESHRE/ESGE and the ASRM classifications was good, but it was slightly better between the CUME and ASRM classifications than between the CUME and ESHRE/ESGE classifications.
An interesting question is related to the fact that if the ESHRE/ESGE classification would use the I:I + WT ratio, instead of the I:WT ratio, the rate of septate uterus would be similar to CUME classification.
The strengths of the present study are its design and the use of an optimal diagnostic method (3D ultrasonography) for diagnosing uterine anomalies.6,24 The participation of nonexpert examiners could be seen as a potential strength, since it allows the evaluation of the reproducibility of the different classifications in “everyday practice.”
However, certainly, our study design can be also considered as a limitation, since the sources of variability regarding the real-time ultrasound and 3D volume acquisition were not taken to account, since the two observers have used previously acquired 3D datasets, which may overestimate the reproducibility of the measurements.
As stated above, our study has limitations. One limitation of the present study is that the examiners had to manipulate the 3D volumes by rotation in all 3 orthogonal planes. This manipulation has an inherent variability between observers,18,20 as the same uterus might provide different images depending on the angle at which the coronal plane is obtained. Other possible limitations of the present study are the small number of cases analyzed and the high quality of 3D volumes, which may have contributed to a lower number of “discrepant” cases. One final limitation that must be mentioned is that we arbitrarily decided to assume that there were no gray-zone cases, since the uteri were classified as septate only when both criteria of the ASRM classification or at least two criteria of the CUME classification were present. It is clear that this point could bias the results, since, somehow, we forced providing a diagnosis in all cases, which is not true in the case of the ASRM classification.
Despite these limitations, we consider that our findings may have clinical relevance and should prompt further studies to determine which classification should be used.
In general, the three classifications have good (ESHRE/ESGE) or very good (ASRM, CUME) interobserver agreement, which makes them all good methods to classify congenital uterine anomalies. However, agreement between the ASRM and the CUME classifications was higher than that for the ESHRE-ESGE and the ASRM and for the ESHRE/ESGE and the CUME classifications.