Abstract
Purpose
The objectives of this study were to investigate the predictive value of tumour measurements on 2-deoxy-2-[(18)F]fluoro-D-glucose ((18)F-FDG) positron emission tomography (PET) pretreatment scan regarding therapy response in oesophageal cancer and to evaluate the impact of tumour delineation strategies.Methods
Fifty patients with oesophageal cancer treated with concomitant radiochemotherapy between 2004 and 2008 were retrospectively considered and classified as complete, partial or non-responders (including stable and progressive disease) according to Response Evaluation Criteria in Solid Tumors (RECIST). The classification of partial and complete responders was confirmed by biopsy. Tumours were delineated on the (18)F-FDG pretreatment scan using an adaptive threshold and the automatic fuzzy locally adaptive Bayesian (FLAB) methodologies. Several parameters were then extracted: maximum and peak standardized uptake value (SUV), tumour longitudinal length (TL) and volume (TV), SUV(mean), and total lesion glycolysis (TLG = TV × SUV(mean)). The correlation between each parameter and response was investigated using Kruskal-Wallis tests, and receiver-operating characteristic methodology was used to assess performance of the parameters to differentiate patients.Results
Whereas commonly used parameters such as SUV measurements were not significant predictive factors of the response, parameters related to tumour functional spatial extent (TL, TV, TLG) allowed significant differentiation of all three groups of patients, independently of the delineation strategy, and could identify complete and non-responders with sensitivity above 75% and specificity above 85%. A systematic although not statistically significant trend was observed regarding the hierarchy of the delineation methodologies and the parameters considered, with slightly higher predictive value obtained with FLAB over adaptive thresholding, and TLG over TV and TL.Conclusion
TLG is a promising predictive factor of concomitant radiochemotherapy response with statistically higher predictive value than SUV measurements in advanced oesophageal cancer.Free full text
Baseline 18F-FDG PET image-derived parameters for therapy response prediction in oesophageal cancer
Abstract
Background
The objectives of this study were to investigate the predictive value of tumour measurements on 18F-FDG PET pretreatment scan regarding therapy response in oesophageal cancer and to evaluate the impact of tumour delineation strategies.
Methods
50 patients with oesophageal cancer treated with concomitant radio-chemotherapy between 2004 and 2008 were retrospectively considered and classified as complete, partial or non responders (including stable and progressive disease) according to RECIST. The classification of partial and complete responders was confirmed by biopsy. Tumours were delineated on the 18F-FDG pretreatment scan using an adaptive threshold and the automatic Fuzzy Locally Adaptive Bayesian (FLAB) methodologies. Several parameters were then extracted: maximum and peak SUV, tumour longitudinal length (TL) and volume (TV), mean SUV, and Total Lesion Glycolysis (TLG=TV×mean SUV). The correlation between each parameter and response was investigated using Kruskal-Wallis tests and receiver operating characteristic methodology was used to assess performance of the parameters to differentiate patients.
Results
Whereas commonly-used parameters such as SUV measurements were not significant predictive factors of the response, parameters related to tumour functional spatial extent (TL, TV, TLG) allowed significant differentiation of all three groups of patients, independently of the delineation strategy, and could identify complete and non responders with sensitivity above 75% and specificity above 85%. A systematic although not statistically significant trend was observed regarding the hierarchy of the delineation methodologies and the considered parameters, with slightly higher predictive value obtained with FLAB over adaptive thresholding, and TLG over TV and TL.
Conclusions
TLG is a promising predictive factor of concomitant radio-chemotherapy response with statistically higher predictive value than SUV measurements in advanced oesophageal cancer.
1. Introduction
Oesophageal cancer is the third most common malignancy of the digestive tract and a leading cause of cancer mortality worldwide with an estimated 5-year survival of 15% [1]. Despite the progress made to better understand this disease, its incidence is steadily increasing and there is a growing concern regarding its effective management [2]. The best chance for cure remains surgical resection. However, many patients have already an advanced disease (locally advanced oesophageal carcinoma: LAEC) at diagnosis and may benefit in terms of survival from neoadjuvant therapy prior to surgery [3]. The maximum benefit is for those patients who achieve a complete pathological response with no residual cancer cells in the primary tumour or lymph nodes [4]. A complete response occurs only in 15–30% of cases and is associated with an increased overall survival [5]. On the other hand, patients who do not respond to therapy may be unnecessarily affected by toxicity of an inefficient therapy [6]. Therefore, the development of a diagnostic test offering non invasive response to therapy prediction early in the course of treatment is of a great interest, allowing potential personalization of patient management as for un-operable tumours, chemotherapy and/or radiation therapy remains the only option. Such an assessment becomes more critical when one considers new targeted drugs that could be tested with higher efficiency if applied early [7]. For oesophageal cancer several histological markers such as the tumour suppressor factor gene p53, the proliferative marker Ki67, and the epidermal growth factor receptor, have been evaluated for the prediction of the therapeutic response prior to neoadjuvant therapy. None of these markers or a combination of them can currently predict response with sufficient accuracy [8–9]. Positron Emission Tomography (PET) imaging with 2-(18F)fluoro-2-deoxy-D-glucose (18F-FDG) allows the visualization of the enhanced glucose metabolism in viable oesophageal cancer cells and may be of interest within this context. 18F-FDG PET is already well established for staging of oesophageal cancer with a better sensitivity and specificity than the combined use of CT and endoscopic ultrasonography (EUS) to detect distant metastases [10]. PET has also been shown to be promising in assessing response to therapy [11]. Several studies have shown that the reduction of the tumour’s metabolic activity as measured by the standard uptake value (SUV) from the baseline to the end of therapy uptake is predictive of a better outcome with however a large variability in the sensitivity and specificity [12]. In addition, a correlation between clinical outcome and a metabolic response observed as early as within the first 2 weeks of treatment has been demonstrated [13]. These findings suggest that tumour activity concentration differences measured on serial 18F-FDG PET scans could possibly be used to individualize treatment. However, it could be more cost-effective and beneficial to the patient to be able to predict therapy response from a single baseline PET scan acquired before the initiation of the treatment. The current study was therefore carried out to investigate the potential value of baseline 18F-FDG PET image derived parameters for the prediction of response to combined radio-chemotherapy in oesophageal cancer. A secondary objective was to investigate the potential influence of the method used to delineate the tumour on the prediction results.
2. Material and methods
2.1. Patients
50 consecutive patients with newly diagnosed oesophageal cancer treated with exclusive concomitant radio-chemotherapy between 2004 and 2008 were included in this study. As part of the routine procedure for the initial staging in oesophageal cancer, each patient was referred for an 18F-FDG PET study before treatment. It included three courses of 5-fluorouracil/cisplatin and a median radiation dose of 60Gy given in 180cGy daily fractions delivered once daily, 5 days a week for 6–7 weeks. The characteristics of the patients are given in table 1. Most of them (45 out of 50) were male, aged 65±9 years at the time of diagnosis. 74% of the tumours, most of which were squamous cell carcinoma (72%), originated from the middle and lower oesophagus. Response to therapy was evaluated 1 month after the completion of the concomitant radio-chemotherapy using conventional thoraco-abdominal CT and endoscopy. Patients were classified as non responders (NR) including stable and progressive disease, partial responders (PR) or complete responders (CR). Response evaluation was based on CT evolution between pre-treatment and post-treatment scans using RECIST (Response Evaluation Criteria in Solid Tumours) [14]. Patients also underwent fibroscopy in case of partial or complete response. Complete response was confirmed by the absence of visible disease in the high endoscopy and no viable tumor on biopsy. Partial CT response was confirmed by macroscopic residual (>10% viable) on biopsy. No discordance was observed between pathological, when available, and CT evaluation.
Table I
Parameter | Number of patients (%) |
---|---|
Gender | |
Male | 45(90) |
Female | 5(10) |
Age | |
Range | 45–84 |
Median | 69 |
Site | |
Upper esophagus | 13(26) |
Middle esophagus | 20(40) |
Lower esophagus | 17(34) |
Histology type | |
Adenocarcinoma | 14(28) |
Squamous cell carcinoma | 36(72) |
Histology differentiation | |
Well differentiated | 14(28) |
Moderately differentiated | 12(24) |
Poorly differentiated | 5(10) |
Unknown | 19(38) |
TNM Stage | |
T1 | 7(14) |
T2 | 8(16) |
T3 | 24(48) |
T4 | 11(22) |
N0 | 20(40) |
N1 | 30(60) |
M0 | 34(68) |
M1 | 16(32) |
AJCC Stage | |
I | 4(8) |
IIA | 8(16) |
IIB | 6(12) |
III | 16(32) |
IVA | 16(32) |
The current analysis was carried out after an approval by the institutional ethics review board.
2.2. 18F-FDG PET acquisitions
All 18F-FDG PET studies were carried out prior to the initiation of treatment. Patients were instructed to fast for at least 6h before the 18F-FDG administration (5MBq/kg). Static emission images were acquired from head to thigh (2min per bed position) beginning 60min after injection on a Philips GEMINI PET/CT system (Philips Medical Systems, Cleveland, OH USA). Images were reconstructed using the RAMLA 3D algorithm and CT based attenuation correction. Optimized reconstruction parameters were used for the RAMLA 3D based on the standard optimized clinical protocol (2 iterations, relaxation parameter of 0.05, 5mm 3D Gaussian post-filtering, 4x4x4mm3 voxels grid sampling). The PET images were corrected for attenuation using CT based attenuation correction.
2.3. PET image analysis
All considered parameters were extracted from the baseline PET images only. For each patient, the primary tumour was identified on the baseline pre-treatment PET images by a nuclear physician. Three different SUV measurements and three parameters related to the tumour functional dimensions, namely the tumor volume (TV), tumour longitudinal length (TL) and total lesion glycolysis (TLG) [15] were extracted for each primary lesion. SUV measurements considered were SUVmax, SUVpeak defined as the mean of SUVmax and its 26 neighbors (roughly similar to a 1cm ROI), and mean SUV within the delineated tumour (SUVmean). Whereas SUVmax and SUVpeak are clearly independent of the tumour delineation strategy used, TL, TV, SUVmean and the derived TLG values might depend on the delineation process. To study the impact of this step, we considered two different approaches; namely the automatic Fuzzy Locally Adaptive Bayesian (FLAB) algorithm [16–17] and an adaptive threshold algorithm [18] optimized for the GEMINI PET/CT scanner. Although the first approach is fully automatic, adaptive thresholding requires a manually defined background region of interest (ROI). Therefore two experienced nuclear medicine physicians were considered in the background ROI definition, leading to two series of results denoted as TA1 and TA2. TL was determined in longitudinal direction by multiplying the number of slices in the delineated tumour volume by the PET image slice thickness (4mm). TV was defined as the sum of all voxels contained in the delineated volumes multiplied by the image voxel’s volume (64mm3). Finally, TLG was determined by multiplying the SUVmean and associated TV.
2.4. Statistical analysis
The relation between response to therapy and each parameter distribution was studied using the Kruskal-Wallis test [19] as recommended for small, not normally distributed samples.
Receiver operating characteristic (ROC) methodology [20] was used to assess the performance of each parameter to differentiate patients. Two classification tasks were considered: differentiating CR patients from PR and NR, or NR patients from CR and PR. Evaluation was performed in terms of the area under the curve (AUC) as well as specificity and sensitivity.
The significance of the following factors was tested: age, gender, T, N, and M classifications, AJCC (American Joint Committee on Cancer) stage, histology types, SUVmax, SUVpeak, TL, TV, SUVmean, and TLG. All tests were two-sided and p values <0.05 were considered statistically significant.
3. Results
The range of values for the different image derived indices as well as the mean and standard deviation for the patient population considered are given in table II. All primary lesions were detected by 18F-FDG PET exhibiting a rather high uptake with a SUVmax of 9.7±3.9. SUVpeak and SUVmean measurements were comparatively lower (8.0±3.3 and 5.8±2.4 respectively).
Table II
Definition | Notation | Mean ± SD | ||
---|---|---|---|---|
Highest SUV | SUVmax | 9.7±3.9 | ||
Mean of SUVmax and its 26 neighbors | SUVpeak | 8.0±3.3 | ||
Mean SUV within tumor defined by | Adaptive threshold | User 1 | SUVmean(TA1) | 6.4±2.5 |
User 2 | SUVmean(TA2) | 6.0±2.6 | ||
FLAB | SUVmean(FLAB) | 5.5±2.3 | ||
Tumor length (cm) defined by | Adaptive threshold | User 1 | TL(TA1) | 5.8±2.9 |
User 2 | TL(TA2) | 5.5±2.8 | ||
FLAB | TL(FLAB) | 6.0±2.8 | ||
Tumor volume (cm3) defined by | Adaptive threshold | User 1 | TV(TA1) | 27.2±25.6 |
User 2 | TV(TA2) | 34.8±30.7 | ||
FLAB | TV(FLAB) | 39.4±34.9 | ||
SUVmean(TA1)×TV(TA1)(g) | TLG(TA1) | 175.6±178.9 | ||
SUVmean(TA2) ×TV(TA2)(g) | TLG(TA2) | 206.9±203.4 | ||
SUVmean(FLAB) ×TV(FLAB)(g) | TLG(FLAB) | 207.3±192.0 |
Correlation between image derived indices and between methodologies
TV and TL measurements were moderately correlated (r=0.77, 0.68 and 0.60 for FLAB, TA1 and TA2 respectively, p<0.0001). On the other hand, no significant correlation was found between TV and any of the SUV measurements (r<0.2, p>0.1), irrespective of the delineation approach used. High correlations were observed between the TV (r>0.89), TL (r>0.90) or TLG (r>0.93) measurements obtained with the two delineation strategies (p<0.0001). Even higher correlation coefficients (r>0.97, p<0.0001) were observed for the SUVmean measurements derived using the two different tumour segmentation approaches (FLAB and adaptive thresholding). Despite these correlations, certain large differences were observed for few patients between the delineation results of the two segmentation algorithms considered, examples of which are illustrated in figure 1.
Response to therapy analysis
Out of the 50 patients included in the study 25 were classified as PR, while there were 12 CR and 13 NR. Results concerning the predictive value of all considered parameters are summarized in tables III and andIVIV containing the results of the Kruskal-Wallis tests and that of the ROC analysis (considering the AUC, specificity and sensitivity regarding the classification tasks) respectively.
Table III
Kruskal-Wallis tests | ||||||
---|---|---|---|---|---|---|
Parameter | Test statistic | P | Response differentiation? (p<0.05) | |||
CR(n=12) vs. NR(n=13) | CR(n=12) vs. PR(n=25) | PR(n=25) vs. NR(n=13) | ||||
Age | 0.4 | 0.83 | no | no | no | |
Gender | 4.0 | 0.14 | no | no | no | |
T | 4.9 | 0.09 | no | no | no | |
N | 2.7 | 0.26 | no | no | no | |
M | 3.6 | 0.17 | no | no | no | |
AJCC stage | 5.9 | 0.052 | yes | no | yes | |
Histology type | 2.3 | 0.32 | no | no | no | |
SUVmax | 2.5 | 0.29 | no | no | no | |
SUVpeak | 3.9 | 0.14 | no | no | no | |
SUVmean | TA1 | 3.3 | 0.197 | no | no | no |
TA2 | 3.2 | 0.199 | no | no | no | |
FLAB | 2.6 | 0.270 | no | no | no | |
TL | TA1 | 14.5 | 0.0007 | yes | yes | yes |
TA2 | 12.4 | 0.0020 | yes | yes | yes | |
FLAB | 15.6 | 0.0004 | yes | yes | yes | |
TV | TA1 | 13.9 | 0.0010 | yes | yes | yes |
TA2 | 12.9 | 0.0016 | yes | yes | yes | |
FLAB | 16.2 | 0.0003 | yes | yes | yes | |
TLG | TA1 | 14.6 | 0.0007 | yes | yes | yes |
TA2 | 14.6 | 0.0007 | yes | yes | yes | |
FLAB | 21.1 | <0.0001 | yes | yes | yes |
Table IV
Parameter | ROC analysis | ||||||||
---|---|---|---|---|---|---|---|---|---|
NR&PR(n=38) vs. CR(n=12) | NR(n=13) vs. PR&CR(n=37) | ||||||||
AUC | 95%CI | Se(%) | Sp(%) | AUC | 95%CI | Se(%) | Sp(%) | ||
Age | 0.51 | [0.32–0.70] | 83.3 | 31.6 | 0.55 | [0.35–0.75] | 86.5 | 36.5 | |
Gender | 0.61 | [0.46–0.75] | 27.3 | 94.3 | 0.51 | [0.41–0.62] | 90.9 | 11.4 | |
T | 0.70 | [0.47–0.93] | 60.0 | 89.3 | 0.64 | [0.49–0.78] | 100.0 | 33.3 | |
N | 0.64 | [0.46–0.83] | 60.0 | 68.8 | 0.55 | [0.38–0.73] | 70.0 | 40.6 | |
M | 0.56 | [0.38–0.73] | 70.0 | 41.2 | 0.70 | [0.53–0.87] | 70.0 | 70.6 | |
AJCC stage | 0.63 | [0.43–0.84] | 54.6 | 73.5 | 0.72 | [0.57–0.88] | 87.5 | 46.2 | |
Histology type | 0.51 | [0.35–0.66] | 72.7 | 28.2 | 0.60 | [0.46–0.75] | 42.9 | 77.8 | |
SUVmax | 0.65 | [0.45–0.85] | 33.3 | 94.7 | 0.54 | [0.34–0.73] | 30.8 | 89.2 | |
SUVpeak | 0.69 | [0.49–0.89] | 75.0 | 63.2 | 0.54 | [0.35–0.73] | 30.8 | 86.5 | |
SUVmean | TA1 | 0.67 | [0.47–0.87] | 50.0 | 84.2 | 0.54 | [0.35–0.74] | 89.2 | 38.0 |
TA2 | 0.67 | [0.45–0.88] | 50.0 | 94.7 | 0.51 | [0.32–0.70] | 100.0 | 16.2 | |
FLAB | 0.65 | [0.43–0.87] | 58.3 | 84.2 | 0.51 | [0.32–0.70] | 100.0 | 13.5 | |
TL | TA1 | 0.81 | [0.65–0.97] | 83.3 | 79.0 | 0.78 | [0.63–0.93] | 59.5 | 92.3 |
TA2 | 0.79 | [0.63–0.96] | 83.3 | 73.3 | 0.75 | [0.61–0.90] | 75.7 | 69.2 | |
FLAB | 0.79 | [0.64–0.94] | 83.3 | 65.8 | 0.82 | [0.70–0.94] | 59.5 | 92.3 | |
TV | TA1 | 0.79 | [0.65–0.89] | 75.0 | 81.6 | 0.79 | [0.65–0.93] | 78.4 | 69.2 |
TA2 | 0.74 | [0.59–0.85] | 83.3 | 57.9 | 0.81 | [0.67–0.95] | 94.6 | 53.9 | |
FLAB | 0.78 | [0.64–0.88] | 75.0 | 79.0 | 0.84 | [0.72–0.96] | 75.7 | 76.9 | |
TLG | TA1 | 0.81 | [0.62–1.00] | 66.7 | 92.1 | 0.78 | [0.65–0.92] | 92.3 | 56.8 |
TA2 | 0.80 | [0.61–0.99] | 75.0 | 86.8 | 0.80 | [0.67–0.93] | 69.2 | 81.1 | |
FLAB | 0.85 | [0.73–0.98] | 75.0 | 92.1 | 0.86 | [0.75–0.98] | 84.6 | 75.7 |
Age, gender, or T, N, M classifications did not allow significant prediction of the response to treatment. The AJCC stage was not significantly (p>0.05) associated with the type of response, despite the fact that all NR were at least stage IIB and could be statistically differentiated from both PR and CR (p<0.05). However, AJCC stage could not differentiate PR from CR (p>0.05). Finally, there was no statistical correlation between histology type and response (p=0.3).
Figure 2 shows a graphical comparison of the Kruskal-Wallis results considering the predictive value of the different SUV parameters considered. Initial SUVmax (fig 2.A) was not predictive of response to therapy (p=0.29) although CR tended to have smaller SUVmax (8.1±4.1) than PR and NR (10.2±3.7 and 10.2±3.9 respectively). Similarly, SUVpeak (fig 2.B) was not predictive of response to therapy with a mean value of 6.5±3.5 in CR, whereas both PR and NR were characterized by similar higher SUVpeak values (8.5±3.1 and 8.4±3.3 respectively) (p=0.14). None of the SUVmean measurements, whatever delineation strategy was used, could significantly predict response to therapy (p>0.19).
On the contrary, all parameters related to tumour spatial extent (TL, TV and TLG) measurements allowed significant (p<0.002) differentiation of the three response groups, irrespective of the segmentation methodology (see figure 3.A–C). For instance, TV as measured by FLAB was 20±25, 32±24 and 72±40 cm3 for CR, PR and NR patients respectively. The parameter that allowed the best differentiation between the three patient groups was TLG measured by FLAB (K-W test p<0.0001, see figure 3.C), with a TLG of 74±75g, 179±143g and 385±226g for CR, PR and NR patients respectively. Figure 4 shows examples of one CR, one PR and one NR patient with corresponding TLG values.
The ROC analysis results confirmed the limited predictive value of most SUV measurements for the accurate classification of either CR vs. PR and NR, or NR vs. PR and CR (AUC<0.70 and <0.56 respectively). Differences between ROC analysis associated with SUV measurements and those associated with TL, TV and TLG was significant (p<0.05) for both tasks (see examples in figure 5). Better predictive performances were obtained with TL, TV and TLG measurements with significantly higher AUC (from 0.74 to 0.86) for both tasks (p<0.05). For instance, using FLAB a TLG <58g allowed identifying complete responders with a sensitivity of 75% and a specificity of 92%, and a TLG >196g identified NRs with a sensitivity of 76% and a specificity of 85%. However in terms of predictive performance no significant differences were obtained between TL, TV and TLG measurements for both tasks. In terms of an observed trend, better results were obtained for TLG over TV and TL whatever tumour delineation approach was used (tables III and andIV).IV). In addition there was a systematic although not statistically significant trend of better performance for those parameters when obtained with FLAB compared to the use of the adaptive threshold, as demonstrated by higher AUC and smaller confidence intervals, as well as higher sensitivity and specificity for both classification tasks (table IV).
The analysis with respect to histology type (adenocarcinoma vs. squamous cell carcinoma) led to similar results with what was observed when considering the entire population. Within the same context no statistically significant differences were observed between the two patient groups in the hierarchy of parameters and results derived using the different functional tumour volume delineation methods.
The predictive value of TLG, combining TV and SUVmean into one single parameter, was higher than the one of tumour volume, despite the non-significant value of SUVmean alone. Considering together TV and SUVmean, one is able to differentiate different treatment response patient groups (see figure 6). On the one hand, TLG increased the differentiation between CR and NR, as all NR had either a TV above 50cm3 (8/13) or a SUVmean above 5 (8/13), while 10 out of 12 CR had either a small TV (<15cm3) (9/12) or SUVmean (<5) (7/12), and half of them (6/12) had both. On the other hand, PR had either a higher SUVmean than CR for volumes below 25cm3 (6.5±2.7 vs. 4.5±2.4), or lower SUVmean than NR for TV of 25–50cm3 (5.8±1.8 vs. 7.1±0.9). Therefore the use of TLG increased the differentiation between PR and CR, as well as between PR and NR for volumes below 15cm3 and between 25 and 50cm3 respectively.
4. Discussion
Assessment of response to therapy early during treatment plays an important role in patient management as well as in drug development and new criteria including PET have been suggested for this task [21–22]. However, being able to predict response to therapy before the initiation of the treatment would be even more powerful for patient management. In this context, either patient or tumour characteristics could be considered. In our study we focused on functional imaging and different image derived parameters related to tumour uptake using PET. The results of our study demonstrate that tumour volume based parameters derived from baseline FDG PET images in oesophageal cancer are good predictors of response to therapy, with high TL, TV and TLG being associated with poor response to combined radio-chemotherapy. On the contrary, more commonly used parameters such as tumour SUVs were not predictors of response to therapy considering only the baseline FDG PET images. These results further demonstrate the value of tumour volume based PET image derived parameters, since we have previously demonstrated a superior prognostic value of baseline functional TL, TV and TLG over SUV measurements for overall survival on a similar group of oesophageal cancer patients [23].
FDG PET has been previously used for the prediction of response to therapy or prognosis in a variety of malignancies [24]. Considering the predictive value of baseline FDG uptake for therapy response in oesophageal cancer, only few data showing conflicting results are available [12]. Levine et al. and Rizk et al. reported a high initial SUVmax being associated with good response [25–26], whereas Makino et al. and Kato et al. found the opposite [27–28]. These conflicting results can be potentially attributed to differences in patient populations, tumour histology types, as well as treatment, but could also suggest that SUV measurements are unreliable in this context. Although similarly to the results of Kato et al. and Makino et al., our results suggest that lower values of SUVmax are associated with a complete response, this trend was not significant. In addition, SUVmean or SUVpeak, considered more robust to potential noise bias associated with SUVmax, were also not significant predictors of response to therapy in our study.
One of the demonstrated independent predictors of long term survival in oesophageal cancer is longitudinal tumour extension established by pathological examination [29]. It has been previously demonstrated that TL measured on CT images leads to a weak correlation with the pathological TL, associated with a large overestimation [30]. Some authors proposed the estimation of metabolic TL as a surrogate of pathological TL using various thresholds of 18F-FDG PET uptake [31] however conflicting results concerning the predictive value of metabolic TL for response to neoadjuvant radio-chemotherapy have been observed [32–33]. One may argue that TL does not reflect the entire volume of the tumour and could therefore be only considered as a limited surrogate measure of tumour functional spatial extent. This assumption is partly supported by our data, in which only a moderate correlation (r between 0.6 and 0.77) was found between TV and TL, suggesting that TV may bring additional information compared to TL in assessing overall tumour burden. In our study both TV and TL were found to be significant predictive factors of response to therapy, irrespective of the functional volume delineation strategy, with only a small and non significant improvement of the predictive value of TV over TL.
TV and TLG measured on PET are 3D measurements incorporating metabolically active tumour volume not available from CT data [34]. It has already been demonstrated that a decrease of the TV and TLG can predict response to therapy [35–36]. These studies however have explored differences in indices derived from serial PET images. The value of such indices obtained on the baseline scan only within the context of therapy response prediction in oesophageal cancer has not previously been explored. Because these parameters reflect metabolic information in the entire tumour, they may be more accurate for tumour characterization than a single voxel measure and this may explain why TV and TLG were good predictors of therapy response as demonstrated in our study. Our results are consistent with recent studies in pleural mesothelioma and lymphoma patients that have demonstrated the potential of such indices extracted from baseline 18F-FDG PET scan to predict response to therapy [37–38].
Despite a great potential value, such indices have been only of limited use to date, which can be explained by the limited accuracy, robustness and reproducibility of the available tumour delineation tools [39–40]. In oesophageal cancer only the prognostic value of TV has been studied [23, 41], while there is limited data on the value of TLG [23]. In our study TLG allowed identifying complete responders and non responders with moderate sensitivity (75% and 76% respectively) and high specificity (92% and 85% respectively). Prospective studies with a larger patient population using a predictive model built upon our results should now be carried out to demonstrate the ability of the parameters to discriminate responders from non responders on a patient by patient basis.
In our study, TNM stage and AJCC classification were not good predictors of therapy response. This could be explained by our suboptimal staging procedure. Since we considered only patients referred for exclusive radio-chemotherapy, no patient underwent surgery, and therefore no pathological data was available. Staging was routinely performed using endoscopic ultrasonography and CT which are known to have limited staging performances [10].
Our present study has limitations. Firstly, we considered a group of only 50 patients with predominantly squamous cell carcinomas since it is the most common histological type of oesophageal cancer in European countries. An analysis based on the tumour histology type considering our patient population did not reveal statistically significant differences, although due to the small number of patients with adenocarcinomas, these results would obviously need to be confirmed. Secondly, our study was inherently limited by its retrospective design and as such some selection bias might be present. However, the treatment regime was homogeneous throughout the recruited patients since all were treated in a single institution. In addition, within this patient population no particular selection criteria were applied. Thirdly, the impact of partial volume effects in the measured SUVs was not assessed in this study. The lack of partial volume correction might have played a role in the reduced predictive value of some of the SUV measurements, although it is unlikely because of the large tumour volumes considered in this work (40±35cm3). Lastly, we did consider only primary tumours since the measurements used are simpler to perform in routine clinical practice compared to measurement of overall tumour burden including primary and metastatic lesions. However, given the respective size of metastatic lesions and primary tumours, adding metastatic lesions to the overall TLG would not significantly alter the resulting values and associated conclusions.
5. Conclusion
Our results demonstrated that 18F-FDG baseline image derived parameters related to the metabolic tumour spatial extent (TL, TV and TLG) are good predictors of response to therapy in oesophageal cancer with sensitivity above 75% and specificity above 85%. Commonly used SUV measurements (max, peak, mean) on pre-treatment FDG PET image did not allow statistically significant differentiation of the different response patient groups.
Acknowledgments
This work was partly funded by ANR (French National Research Agency) under the contract ANR-08-ETEC-005-01.
Footnotes
Conflict of interest statement:
The authors declare that they have no conflict of interest
References
Full text links
Read article at publisher's site: https://doi.org/10.1007/s00259-011-1834-9
Read article for free, from open access legal sources, via Unpaywall: https://europepmc.org/articles/pmc3375481
HAL Open Archive
http://www.hal.inserm.fr/inserm-00595534
Citations & impact
Impact metrics
Citations of article over time
Smart citations by scite.ai
Explore citation contexts and check if this article has been
supported or disputed.
https://scite.ai/reports/10.1007/s00259-011-1834-9
Article citations
Role of Textural Analysis of Pretreatment 18F Fluorodeoxyglucose Positron Emission Tomography/Computed Tomography in Response Prediction in Esophageal Carcinoma Patients.
Indian J Nucl Med, 38(3):255-263, 01 Jul 2023
Cited by: 0 articles | PMID: 38046976 | PMCID: PMC10693362
Clinical Utility of 18F-2-Fluoro-deoxy-d-glucose PET Imaging in Locally Advanced Esophageal/Gastroesophageal Junction Adenocarcinoma.
Diagnostics (Basel), 13(11):1884, 28 May 2023
Cited by: 0 articles | PMID: 37296735 | PMCID: PMC10252409
Review Free full text in Europe PMC
Performance of radiomics-based artificial intelligence systems in the diagnosis and prediction of treatment response and survival in esophageal cancer: a systematic review and meta-analysis of diagnostic accuracy.
Dis Esophagus, 36(6):doad034, 01 May 2023
Cited by: 1 article | PMID: 37236811 | PMCID: PMC10789236
Review Free full text in Europe PMC
Predictive value of clinical and 18F-FDG-PET/CT derived imaging parameters in patients undergoing neoadjuvant chemoradiation for esophageal squamous cell carcinoma.
Sci Rep, 12(1):7148, 03 May 2022
Cited by: 2 articles | PMID: 35504955 | PMCID: PMC9065158
Pretreatment 18F-FDG PET/CT-Derived Parameters in Predicting Clinical Outcomes of Locally Advanced Upper Third Esophageal Squamous Cell Carcinoma After Definitive Chemoradiation Therapy.
Nucl Med Mol Imaging, 56(4):181-187, 10 May 2022
Cited by: 0 articles | PMID: 35846416 | PMCID: PMC9276877
Go to all (43) article citations
Similar Articles
To arrive at the top five similar articles we use a word-weighted algorithm to compare words from the Title and Abstract of each citation.
Prognostic value of 18F-FDG PET image-based parameters in oesophageal cancer and impact of tumour delineation methodology.
Eur J Nucl Med Mol Imaging, 38(7):1191-1202, 02 Mar 2011
Cited by: 86 articles | PMID: 21365252
The predictive value of treatment response using FDG PET performed on day 21 of chemoradiotherapy in patients with oesophageal squamous cell carcinoma. A prospective, multicentre study (RTEP3).
Eur J Nucl Med Mol Imaging, 40(9):1345-1355, 29 May 2013
Cited by: 30 articles | PMID: 23715903
Impact of partial-volume effect correction on the predictive and prognostic value of baseline 18F-FDG PET images in esophageal cancer.
J Nucl Med, 53(1):12-20, 01 Jan 2012
Cited by: 29 articles | PMID: 22213819 | PMCID: PMC3477069
Intratumor heterogeneity characterized by textural features on baseline 18F-FDG PET images predicts response to concomitant radiochemotherapy in esophageal cancer.
J Nucl Med, 52(3):369-378, 14 Feb 2011
Cited by: 421 articles | PMID: 21321270 | PMCID: PMC3789272
The application of functional imaging techniques to personalise chemoradiotherapy in upper gastrointestinal malignancies.
Clin Oncol (R Coll Radiol), 26(9):581-596, 04 Jul 2014
Cited by: 8 articles | PMID: 24998430 | PMCID: PMC4150923
Review Free full text in Europe PMC