Abstract

Objectives. MRI is the standard imaging method in evaluating treatment response of breast cancer after neoadjuvant therapy (NAT), while identification of pathologic complete response (pCR) remains challenging. Texture analysis (TA) on post-NAT dynamic contrast-enhanced (DCE) MRI was explored to assess the existence of pCR in mass-like cancer. Materials and Methods. A primary cohort of 112 consecutive patients (40 pCR and 72 non-pCR) with mass-like breast cancers who received preoperative NAT were retrospectively enrolled. On post-NAT MRI, volumes of the residual-enhanced areas and TA first-order features (19 for each sequence) of the corresponding areas were achieved for both early- and late-phase DCE using an in-house radiomics software. Groups were divided according to the operational pathology. Receiver operating characteristic curves and binary logistic regression analysis were used to select features and achieve a predicting formula. Overall diagnostic abilities were compared between TA and radiologists’ subjective judgments. Validation was performed on a time-independent cohort of 39 consecutive patients. Results. TA features with high consistency (Cronbach’s alpha >0.9) between 2 observers showed significant differences between pCR and non-pCR groups. Logistic regression using features selected by ROC curves generated a synthesized formula containing 3 variables (volume of residual enhancement, entropy, and robust mean absolute deviation from early-phase) to yield AUC = 0.81, higher than that of using radiologists’ subjective judgment (AUC = 0.72), and entropy was an independent risk factor (). Accuracy and sensitivity for identifying pCR were 83.93% and 70.00%. AUC of the validation cohort was 0.80. Conclusions. TA may help to improve the diagnostic ability of post-NAT MRI in identifying pCR in mass-like breast cancer. Entropy, as a first-order feature to depict residual tumor heterogeneity, is an important factor.

1. Introduction

Neoadjuvant therapy (NAT) is an essential procedure in breast cancer treatment. Whether a pathologic complete response (pCR) can be achieved represents a prognostic factor that is related to tumor recurrence and survival [1]. Proper evaluation of residual tumor tissue after NAT can help clinicians optimize NAT while avoiding unnecessary therapy. Magnetic resonance imaging (MRI) is considered the best approach with the highest accuracy to evaluate such residual tumors [24].

However, imaging appearances of residual tumors after treatment are different from those at pretreatment, thus making them difficult to evaluate. The only criterion currently used on MRI is enhancement [5, 6]. Only when no enhancement is observed at the corresponding primary tumor site can a conclusion of radiological complete response (iCR) be drawn. Nevertheless, both false positives and false negatives occur. According to a meta-analysis based on 25 studies [7], the specificity of MRI in predicting pCR was as high as 90.7%, but the sensitivity was only 63.1%. Hence, such criterion lacks accuracy [8].

Intratumoral heterogeneity is associated with prognosis [9, 10]. A study showed a positive relationship of tumor heterogeneity with recurrence-free survival in breast cancer patients [11]. Nowadays, some methods to evaluate tumor heterogeneity on radiologic images are widely available. Texture analysis (TA) is a tool that has recently gained much attention in the scientific community. In TA, the characteristics of the spatial distribution of pixels and their gray-scale intensities within an image are defined and it has been used in breast tumor studies [12].

Although pretreatment MRI is used in many studies to predict the possibility of achieving pCR after NAT, post-NAT MRI is more useful in clinical practice for determining the presence of residual tumor tissue and for providing critical information for surgical planning. In this study, TA features were extracted from both early and late phases of dynamic contrast-enhanced (DCE) breast MRI after NAT and were used to evaluate the ability of TA to predict whether pCR can be achieved.

2. Materials and Methods

2.1. Patients

Informed patient consent was waived by the local institutional review board owing to its retrospective design. Between January 2015 and March 2016, 241 consecutive patients with biopsy-proven breast cancers and received treatment at our hospital were filtered by the following criteria. Inclusion: i. treatment was NAT followed by surgery; ii. completion of both pretreatment and preoperative dynamic contrast-enhanced MR imaging at our institution. Exclusion: i. time interval between final MRI examination and date of surgery was longer than 2 weeks (n = 18); ii. the lesion was reported on pretreatment MRI as a nonmass enhancement (n = 64); and iii. poor MR image quality or incomplete images retrieved from PACS (n = 8). Totally, 151 patients were enrolled and divided into two independent sets on the date of pretreatment MRI: 112 patients with earlier examination time were used as the training set, and the remaining 39 patients were used as the validation set.

2.2. MRI Protocols

1.5 T scanner (360; GE Medical Systems) with a dedicated bilateral 4-channel phased-array breast coil was used. Patients were in the prone position. Sequences: (a) fast spin-echo T2-weighted sequences and (b) 3D dynamic contrast-enhanced sequence (VIBRANT) with the following parameters: Sagittal, TR/TE, 5.3 msec/2.6 msec; TI, 12.0 msec; flip angle, 120°; FOV, 20–22 cm; section thickness/gap, 2 mm/0 mm; matrix size with dimensions of 256 × 256; NEX, 0.75; acquisition time for single phase, 45–55 sec with 10–15 sec interval between phases; and total phases acquired, 1 precontrast plus 5 postcontrast series, starting at the same time as contrast injection. A 0.1 mmol/kg bolus of the gadolinium contrast agent was injected into the arm using high-pressure injectors at a rate of 2.0 ml/s followed by a 10 ml saline flush.

2.3. MRI Assessment

Images were retrieved from the local picture archiving and communication system. Pretreatment and post-NAT images were placed side-by-side to locate the exact tumor beds. In cases with multiple lesions, the largest lesion was selected for indexing. On post-NAT early-phase subtracted images (i.e., the subtraction images of the second DCE phase from the precontrast phase), regions of interest (ROIs) were manually placed slice-by-slice to cover the areas with suspicious tumor bed enhancement (Figure 1). For those images with no suspicious enhancement, radiologists were requested to put very small ROIs (four to eight pixels) on the corresponding areas. Late-phase subtractive images were the subtraction images of the last phase from the precontrast phase.

ROIs for the first 30 cases (sorted by MRI date) were drawn, respectively, by a junior resident (Z.B., with 3 years of experience in MRI) and a senior attending (C.K., with more than 10 years of experience in breast imaging) to test interobserver consistency. All other ROIs were initially drawn by Z.B. and then reviewed by C.K. Subjective judgment of complete disappearing of enhancement was recorded as iCR by these two radiologists, discussed together and reached in consensus. Both radiologists were blinded to clinical information during the data collecting period.

Images and ROIs were all transferred into an in-house radiomics software modified on the 3D-slicer platform. 19 conventional TA first-order features were calculated by using the formulas provided by Aerts et al. [13]. Two sets of features were derived from early- and late-subtracted images, respectively, including mean, median, minimal, maximal, 10 percentile, 25 percentile, 75 percentile, 90 percentile, range, interquartile range, variance, skewness, kurtosis, uniformity, energy, entropy, mean absolute deviation, robust mean absolute deviation, and root mean squared. The first 10 features were signal intensity- (SI-) related values, so were calculated as ratios to the mean value on precontrast sequences using same ROIs (rSI = SI/SIpre) when doing comparison between groups and logistic regression analysis. Volumes of the residual-enhancing areas on post-NAT MR imaging was recorded separately as post-NAT-enhancing volume. Thus, 39 features in total were recorded. Lesions defined as complete response on MRI were classified as iCR and others as non-iCR.

2.4. Histopathologic Data

All patients underwent either lumpectomy or mastectomy. Final histopathologic results of surgical specimens were reviewed to determine the existence of residual tumor as residual invasive cancer (non-pCR) and no residual invasive cancer cells (pCR), which was defined as either no cancer cells or ductal carcinoma in situ (DCIS). All were referred to local diseases regardless of lymph node status.

2.5. Statistical Analysis

SPSS version 22.0 and MedCalc version 15.0 was used. The Cronbach test was used to test the interobserver agreement. After the Kolmogorov–Smirnov test for data distribution, comparisons between pCR and non-pCR groups were made by an independent-sample Student’s t-test for normally distributed data and the Wilcoxon rank-sum test for others. Receiver operating characteristic (ROC) and binary logistic regression analyses were used to select features and to generate the formula. Four-fold tables were drawn to calculate diagnostic ability. DeLong’s test was used to compare area under the curve (AUC). Cronbach’s alpha >0.9 and AUC >0.8 were used as the levels to select features. Bonferroni correction was used for multiple comparisons of radiomics features between pCR and non-pCR groups, suggested statistical significance. was considered to indicate statistically significant differences for other tests.

3. Results

Clinical information of the 112 patients in the training set and 39 patients in the validation set is presented in Table 1. The mean interval between pre-NAT MRI and surgery was 10.2 days (range, 1–14 days).

3.1. Interobserver Consistency Test for ROIs

Post-NAT-enhancing volume and 13 TA features showed high agreement (Cronbach’s alpha >0.9; see Table 2) and were used in the following analysis.

3.2. Differences between Groups and Feature Selection Using ROC Curves

Twenty-one out of 39 variables showed significant differences between the two groups (; see Table 3). Among them, eight showed AUC >0.8 in ROC curves.

3.3. Logistic Regression Analysis for Selecting Features

The above eight variables were entered into binary logistic regression. Entropy from the early phase (, Exp (B) = 4.922), post-NAT-enhancing volume (, Exp (B) = 1.001), and robust mean absolute deviation from the early phase (, Exp (B) = 0.980) were the final three variables left. Entropy was an independent risk factor.

Accordingly, the following formula was synthesized: Y = entropy × 1.594 + post-NAT volume × 0.001 − Robust mean absolute deviation × 0.020–3.8. Lesions can be classified as imaging complete response (iCR) when Y < 0 (see Figure 1). AUC of this cohort for diagnosing pCR was 0.81 (95% CI 0.72, 0.88), with an accuracy, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) of 83.93%, 70.00%, 91.67%, 82.35%, and 84.62%, respectively.

3.4. Diagnostic Ability Compared with Radiologists

On post-NAT MRI, 27 cases (21 pCR and 6 non-pCR) were identified as iCR using the no enhancement criterion by two radiologists (Table 4), achieving AUC = 0.72 (95% CI 0.63, 0.80), with the accuracy, sensitivity, specificity, PPV, and NPV of 77.68%, 52.50%, 91.67%, 77.78%, and 77.65%. Comparing the diagnostic abilities of using the above TA formula with radiologists’ subjective judgment for identifying pCR status after NAT, the AUC was improved significantly (). The accuracy () and sensitivity () were also higher, while the specificity, PPV, and NPV remain at similar levels ().

4. Formula Tested with the Validation Set

In the validation set of 39 cases, using the above TA formula, four pCR cases were wrongly diagnosed as iCR and two non-pCR cases as non-iCR. All other cases were diagnosed correctly. AUC was 0.80 (95% CI 0.62, 0.97).

5. Discussion

In assessing post-NAT pCR in breast cancer, reasons of misdiagnosis by MRI vary [6, 1416]. In our cohort, more cases were seen with residual enhancement, but no invasive cancer on pathology. In Ko’s study [17], mild enhancement was also considered as nonresidual and a diagnostic accuracy rate as high as 89% was reached. However, judgment of mild enhancement is quite subjective. Therefore, we aimed to investigate quantitative values that might aid in the identification of the pCR status.

For those cases falsely diagnosed as iCR, the residual tumor was generally small [6, 18, 19]. Volumes of suspicious areas could be acquired preoperatively by MRI prior to pathologic measurements. In our study, this post-NAT-enhancing volume showed a great difference between the pCR and non-pCR groups and could hence be used as a factor in the final formula.

TA provides parameters to quantify cancer heterogeneity. Uniformity and entropy are the most common ones; high entropy and low uniformity represent high heterogeneity. Few studies on the application of TA in breast cancer have been reported. Uniformity and entropy from T2WI and entropy from contrast-enhanced T1WI have been shown to be associated with recurrence-free survival [11], and entropy may differentiate malignant from benign lesions [20]. In our study, entropy was much lower in the pCR group and was the only independent risk factor in logistic regression, and therefore, has the highest weight in the final formula. Uniformity, however, was excluded because of low consistency between observers.

Residual enhancement on MRI is currently the only accepted criterion for determining residual cancer after NAT, but insufficient evidence has shown the optimal choice of phases in DCE imaging. One study confirmed that early-phase enhancement is still superior in predicting posttreatment residual lesions [21]. Therefore, we defined ROIs on early-phase images. Although all features from early and late phases were included in the following analysis, the two retained in the formula were both from the early phase. This may be explained by the fact that the potential correlation among features singled out the strongest ones. We believe that the early phase in DCE imaging plays a more important role in predicting residual tumor than the late phase.

Using the formula with a combination of factors that reflect heterogeneity, the AUC (by ROC analysis) for pCR diagnosis was 0.81 in the training group and 0.80 in the validation group, as compared to 0.72 by radiologists and with improved sensitivity from 52.50% to 70.00%. TA features did not provide useful information for those non-pCR lesions without obvious enhancement, so the specificity was the same for both methods. We did not find any differences in features from basic histograms, for example, percentiles of SI, kurtosis, and skewness, which also reflect distribution heterogeneity. This was likely due to the combined use of all first-order TA features, which consequently reduced the effects of the basic histograms.

Certain limitations of our study exist. This is a single-institute study, with a limited case number. Further validation will be needed. However, compared to those studies that include numerous features and complicated combinations to achieve 100% accuracy and specificity [22], the final features used in our study are much simpler in clinical use, so we expect high reproducibility of our results. Another point to mention is that our results are only applicable to mass-like lesions because nonmass-enhancement lesions on MRI are generally diffuse and scattered and are more likely to suffer from ROI drawing discrepancies.

In conclusion, TA may help to improve the diagnostic ability of MRI in identifying post-NAT pCR in breast cancer, in which entropy, a first-order feature to depict residual tumor heterogeneity, is an important factor. Compared to the judgment by radiologists, the AUC was improved using TA features with higher sensitivity.

Abbreviations

AUC:Area under the curve
CI:Confidence interval
DCE:Dynamic contrast enhanced
DCIS:Ductal carcinoma in situ
iCR:Radiological complete response
MRI:Magnetic resonance imaging
NAT:Neoadjuvant therapy
NPV:Negative predictive value
PPV:Positive predictive value
pCR:Pathologic complete response
ROC:Receiver operating characteristic
ROI:Region of interest
SI:Signal intensity
TA:Texture analysis.

Data Availability

Data are available from the corresponding author on reasonable request.

Disclosure

The funding body had no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

The authors thank Hui Liu, Ph.D. for integrating and modifying the radiomics software. Also, the authors thank LetPub (http://www.letpub.com) for its linguistic assistance during the preparation of this manuscript. This study was funded by the Beijing Municipal Administration of Hospitals Clinical Medicine Development of Special Funding Support (no. ZYLX201803), Beijing million Talents Project (no. 2017A13), and National Key Research and Development Plan (2017YFC1309101 and 2017YFC1309104).