Abstract
Objective: Improving the prognostic stratification of unresectable hepatocellular carcinoma (HCC) patients is critically needed. Since patients’ survival is closely linked to the severity of the underlying liver disease, and insulin-like growth factor-1 (IGF-1) is produced predominantly in the liver, we hypothesized that IGF-1 may correlate with patients’ survival and hence improve the prognostic ability of the Cancer of the Liver Italian Program (CLIP) score. Methods: Baseline plasma IGF-1 and clinicopathologic parameters were available from 288 patients. Multivariate Cox regression models, Kaplan-Meier curves, and the log-rank test were applied. Recursive partitioning was used to determine the optimal cut point for IGF-1 using training/validation samples. Prognostic ability of the I-CLIP (I = IGF) was compared to CLIP using C-index. Results: IGF-1 significantly correlated with the clinicopathologic features. With an optimal IGF-1 cut point of 26 ng/ml, the overall survival of patients with IGF-1 >26 was 17.7 months (95% CI 13.6–22.8), and with IGF-1 ≤26 was 5.8 months (95% CI 4.0–12.5), p < 0.0001. The concordance probabilities for CLIP and I-CLIP were 0.7037 and 0.7096, respectively (p < 0.0001). Conclusions: Our preliminary results indicate that I-CLIP significantly improved prognostic stratification of patients with advanced HCC. However, independent validation of our study is warranted.
Introduction
Hepatocellular carcinoma (HCC) develops as a consequence of underlying chronic liver disease (CLD), most commonly cirrhosis due to viral hepatitis. Therefore, the majority of patients with HCC suffer from two co-existing competing causes of morbidity and mortality, the severity of their CLD and the HCC tumor stage. This syndrome of the two-disease state directly affects survival, which in turn affects patients’ prognostic stratification and treatment decisions in clinical trials and in clinical practice. Therefore, most of the HCC staging systems include a combination of variables related to the tumor stage and the status of the CLD. However, several challenges face the clinical utility of these staging systems. First, their performance is highly variable because they were developed from different patient populations, surgical and nonsurgical, and they depend on many factors including patient demographics. Secondly, they lack the integration of new biologic biomarkers that correlate with the tumor parameters and with the severity of the underlying CLD. In this context, non-invasive means of obtaining those markers is essential, given the challenges in obtaining tissue samples in patients with cirrhosis and coagulation disorders that put HCC patients at risk for complications following needle biopsies.
The Cancer of the Liver Italian Program (CLIP) score [1] and the Barcelona Clinic Liver Cancer (BCLC) staging system [2] are among the most commonly used HCC prognostic systems to stratify patients on clinical trials in the Western world. The BCLC staging system [2] is endorsed by the American Association for the Study of Liver Diseases (AASLD) and the European Association for the Study of Liver Diseases (EASL) clinical practice guidelines [3,4]. Notably, our most recent publication compared the prognostic accuracy of the CLIP score to that of the BCLC staging in our patient population using the C-index, and found that the concordance probabilities for BCLC and CLIP were 0.65 and 0.70, respectively [5]. Using U-statistics, the difference was significant and the p value was 0.007. Thus, these results indicate that the CLIP scoring system better predicted survival than BCLC staging in our patient population.
The CLIP score is a prognostic system that assigns points for the Child-Pugh score, tumor morphology (solitary, ≤50% of the liver, massive), serum α-fetoprotein (AFP), and presence or absence of portal vein thrombus [1]. Notably, many studies validated the use of CLIP in HCC patients [6,7,8,9,10,11,12,13,14,15]. Furthermore, recent studies, including one of our patient population [5], evaluated the predictive accuracy of different HCC staging systems and showed that the CLIP score was more predictive than BCLC in the studied patient populations, which included a significant number of patients with advanced HCC [5,16,17]. However, the two systems are conceptually different and cannot be directly compared. The BCLC staging system was originally designed as a treatment allocation algorithm and subsequently evolved as one of the standard systems to stage HCC patients, whereas the CLIP score is a prognostic system that was designed to stratify patients according to their expected survival. In addition, several studies indicated that about 80% of the CLIP patient population is classified as a CLIP score of 0–3 [5,9,10]; therefore, there is an unmet need to improve the accuracy of CLIP score stratification.
Insulin-like growth factor-1 (IGF-1) is a hormone that is predominantly synthesized in the liver [18]. Therefore, low circulating IGF-1 levels have been found to be associated with CLD, such as steatosis, chronic hepatitis C, cirrhosis, non-alcoholic steatohepatitis, and HCC [19,20,21,22,23,24,25]. Collectively, these data suggest that circulating levels of IGF-1 may reflect the synthetic function of the liver and correlate with patients’ advanced CLD status, which directly affects the survival of patients with HCC independent of their tumor stage.
However, evaluating the role of IGF-1 in predicting survival of patients with HCC and in refining their prognostic stratification has not been studied yet. Therefore, we hypothesized that baseline plasma levels of IGF-1 will correlate with the clinicopathologic features and survival of patients with HCC and hence may improve the predictive ability of the CLIP score for HCC.
Patients and Methods
Study Population
After obtaining MD Anderson’s Institutional Review Board approval and patients’ informed consent, we enrolled new patients with pathologically proven HCC. We excluded patients with concurrent presence or history of other malignancies, including other types of primary liver cancer (such as bile duct cancers or fibrolamellar HCC). Patients’ medical records were reviewed for baseline clinical, laboratory, pathologic, staging, and radiologic features. We recruited 394 eligible patients; baseline plasma samples were available for 288 only who returned for blood samples collection. There were no significant differences between our study population and patients who were missed, either in their age, race, and gender, hepatitis status, diabetes history, alcohol consumption, smoking, Child-Pugh score, cirrhosis; pathological cellular differentiation; serum albumin, or CLIP scoring. However, patients without blood samples had a tendency to have multinodular tumor, higher AFP levels, portal vein thrombosis, and higher incidence of tumor size involving <50% of the liver. Overall survival time was calculated to reflect the time between diagnosis and last follow-up visit or death.
Baseline Plasma IGF-1
Baseline venous blood samples (3–5 ml of the whole blood) were collected, anticoagulated by ethylenediaminetetraacetic acid (EDTA) and centrifuged at 4°C and 3,000 r.p.m. for 15 min. The plasma samples were then removed, aliquoted, and snap frozen at –20°C. IGF-1 was tested using ELISA (Quantikine Human IGF-1 Immunoassay ELISA Kit; R&D Systems, Minneapolis, Minn., USA). The plasma biomarker level was determined from a standard curve generated for each of the sample sets assayed after duplicate measurements were made.
Statistical Analysis
We used Cox regression to assess factors associated with overall survival, and Wilcoxon rank sum test to study the correlation between baseline IGF-1 and various clinical characteristics and staging systems. To identify an optimal IGF-1 cut point, recursive partitioning methodology [26] was used to split the data randomly into training (2/3) and validation (test) (1/3) sets. We used the training set to find the optimal cut point maximizing the survival difference between the low and high IGF-1 groups, and then validated the cut point by fitting a Cox regression model to the dichotomized IGF-1 value on the test data. We repeated this methodology using 20 different random splits. Next, we fitted the multivariable Cox regression models including IGF-1, dichotomized at the optimal cut point, and the variables within the CLIP system, to evaluate whether IGF-1 was an independent prognostic factor after adjusting for the CLIP variables. Finally, the median survival was computed in each I-CLIP (I = IGF) group (0, 1, 2, 3, 4, and 5+) and the groups were compared using log-rank tests, to assess the performance of the I-CLIP in comparison to the CLIP. The sign test was used to assess whether the IGF-1-low groups tended to have shorter median survival within the CLIP and I-CLIP groups than IGF-1-high groups. The prognostic ability of the CLIP and I-CLIP was compared using a C-index test, with secondary analyses done to compute a partial C-index analysis, which computed the C-index using only patients whose class status changed between CLIP and I-CLIP.
Results
Patient Characteristics
For the entire cohort of 288 patients, the median overall survival was 13.8 months, (95% CI 11.7–17.3). Baseline patients’ characteristics are shown in table 1. The Cox regression models indicated that vascular invasion, tumor differentiation and nodularity, high serum level of AFP, bilirubin, alanine transaminase (ALT), aspartate transaminase (AST), presence of cirrhosis, a Child-Pugh score of B or C, and extrahepatic metastasis were all significant predictors of shorter overall survival.
Prognostic Factors of Survival in Our Patient Population
Most recently, we reported that the discriminative ability of the CLIP score was better than that of the BCLC staging system in our patient population [5]. In that study, we applied a multivariable Cox regression model to our data including the factors contained in the CLIP score to validate the CLIP scoring system in our patient population. Only 285 patients were included because AFP values were missing for 3 patients. The study hazard ratios (HRs) were very close to the HRs reported in the original CLIP paper [1]. The CLIP score stratified our patients very effectively into different prognostic categories (p < 0.0001).
Baseline Plasma Levels of IGF-1 as an Independent Prognostic Factor
Table 2 describes the correlations between plasma IGF-1 level and patient characteristics by the Wilcoxon rank-sum test. Univariate Cox regression analyses showed that baseline plasma IGF-1 level was most significantly associated with Child-Pugh score, bilirubin, AST levels, tumor size and nodularity, and vascular invasion; however, the strongest association was with AST level (p < 0.0001; table 3).
Identifying the Optimal Cut Points of Plasma IGF-1 in Our Patient Population
We applied the recursive partitioning test to randomly selected training/test sets to find the optimal single cut point for baseline IGF-1 in terms of predicting survival. We found that 8 of the 20 sets found roughly the same optimal cut points of 26 ng/ml. Our results suggest that patients with low IGF-1 (<26 ng/ml) had shorter median overall survival. When ‘low IGF-1’ was considered in a univariate Cox regression model fit to the entire data set, this effect was highly significant (p < 0.0001; HR 2.06; 95% CI 1.50–2.81). The median overall survival time estimate for patients with IGF-1 >26 was 17.7 months (95% CI 13.6–22.8), and for IGF-1 ≤26 was 5.8 months (95% CI 4.0–12.5; p < 0.0001; see Kaplan-Meier estimates; fig. 1).
Constructing the I-CLIP Scoring System
Our current multivariate analysis, using the dichotomized value of IGF-1, indicated that IGF-1 was a significant independent prognostic factor for overall survival, even after adjusting for CLIP parameters (p = 0.01; HR 1.53; 95% CI 1.09–2.13; table 4). We also observed that IGF-1 incorporation in the CLIP score did not alter the HRs for the CLIP parameters.
Baseline Plasma IGF-1 Separates High- and Low-Risk Groups within Each CLIP Score
Patients within each CLIP score group were split according to whether they had low/high IGF-1. Within each CLIP or I-CLIP score, the estimated median overall survival for IGF-1-high patients was higher than the median overall survival for IGF-1-low patients. We observed strong trends that were not statistically significant in the scoring system groups (table 5). However, this was not surprising, given the extremely low power for detecting such differences given the small number of IGF-1-low subjects (≤15) within all groups. However, our overall sign test that compared low and high IGF-1 groups and assessed the prognostic information of IGF-1 was significant (p = 0.031). Therefore, our preliminary results indicated that the estimated median overall survival was greater in IGF-1-high patients within all groups, which contain independent patients, indicating that those patients within all scores tended to have better prognosis than IGF-1-low patients.
Our preliminary scoring system, named I-CLIP, integrates a dichotomized IGF-1 level into the CLIP score variables; score between 0 and 7 for each patient (table 6). Specifically, the I-CLIP scoring system was computed in the following way: CLIP score + IGF-1 score (1 if IGF ≤26, and 0 if IGF >26). Based on the C-index analysis that compared the prognostic ability of the two systems, the concordance probabilities for CLIP and I-CLIP were 0.7037 and 0.7096, respectively. U-statistics indicated that the differences in C-index were statistically significant (p < 0.0001). Computing a ‘partial’ C-index using only subjects whose group membership changed between CLIP and I-CLIP (0.547 for CLIP, 0.617 for I-CLIP), we observed that the IGF-1-low subjects were better classified when moved into the next higher risk group by I-CLIP. The calculated C-index for the V-CLIP (V = VEGF) score was 0.7041, and the U-statistics p value for I-CLIP versus V-CLIP was 0.056. Figure 2 contains the Kaplan-Meier plots for the separate CLIP groups, and figure 3 contains the Kaplan-Meier plots for the separate I-CLIP groups. CLIP 4 and 5 were joined as one group (4+), and I-CLIP 4, 5 and 6 were joined as one group (4+).
Constructing an IV-CLIP Scoring System
Our current multivariate analysis, using the dichotomized values of both biomarkers, indicated that IGF-1 and the VEGF were significant independent prognostic factors for overall survival, even after adjusting for CLIP parameters (for IGF-1: p = 0.01, HR 1.53, 95% CI 1.09–2.13, and for VEGF: p = 0.01, HR 1.57, 95% CI 1.12–2.22; table 7). We also observed that IGF-1 and VEGF incorporation into the CLIP score did not alter the HRs for the CLIP factors. However, the C-index analysis for IV-CLIP was 0.7092, almost identical to that of the I-CLIP (0.7096).
Discussion
One of the major challenges in HCC management is prediction of survival and prognosis, given the marked heterogeneity noted in patients with HCC because of the two-disease state of CLD and HCC that independently affects their survival and prognosis. Our most recent study [12] introduced the V-CLIP score, integrating baseline plasma VEGF, as a marker that was shown in several studies to correlate with the tumor staging and invasiveness [27], into the CLIP score, and it showed significant improvement in prediction of survival. However, since accurate assessment of the status of the CLD is also essential to determine both short- and long-term prognosis and to make therapy decisions in patients with HCC, there is an unmet need for non-invasive liver reserve biomarkers to assess the severity of the CLD. Our current study aimed at evaluating the utility of plasma IGF-1 in this setting since it is synthesized mainly by the liver and significantly decreases in CLD and cirrhosis [19,20,21,22]. Therefore, our current score, I-CLIP, takes into account assessing a baseline plasma biomarker that is shown in our patient population to correlate with the CLD condition, using non-invasive molecular assays. Our combined clinical and laboratory index using plasma IGF-1 and parameters of the CLIP score is advantageous since it is easily calculated and could be replicated independently. Notably, the CLIP score validation in our patient population is consistent with prior studies of Western HCC patient populations. Notably, a substantial number of our patients had advanced BCLC stages B–D (239 patients, 82.9%). Therefore, the superiority of CLIP over BCLC in our advanced (or unresectable) HCC patient population, the standard clinical trials population, is also consistent with prior studies of patients with advanced HCC, since BCLC staging of this patient population is challenged. However, evaluating the incorporation of IGF-1 and other biologically pertinent biomarkers into BCLC staging of more diverse patient populations, in addition to evaluating other commonly used HCC staging systems, may prove useful in the assessment of HCC prognosis and hence in guiding the treatment decisions and patients’ stratification in clinical trials. While none of these staging systems is yet approaching routine clinical use, rapid growth in plasma biomarkers assay technology may provide new avenues for non-invasive ways of assessing the severity of CLD and the biology of HCC tumors, and may prove beneficial after incorporation into the HCC staging systems to refine patients’ stratification.
Our study has some limitations. First, our study cutoff point was based on the recursive portioning method that identified the best cutoff point which correlated with the survival of our patient population; therefore, the IGF-1 cutoff point may differ in other patient populations. However, choosing an optimal cutoff point for plasma biomarkers in cancer research remains challenging in general, given the possible daily variations in the circulating levels and the potential effects of patients’ genetics, nutritional status, gender, and age. Nevertheless, our results clearly indicate that the IGF-1 cutoff point we identified was independent of and complementary to our patients’ clinicopathologic prognostic factors. Moreover, additional independent studies in other patient populations are warranted, not only to confirm our concept, but also to possibly help to identify an optimal range to be used in this setting. Additionally, we utilized the commercially available ELISA kits to measure plasma IGF-1 and VEGF. Although the methodology is reproducible in expert research facilities, the measurement should be done in a Clinical Laboratory Improvement Amendments (CLIA)-certified laboratory before using it in clinic to guide therapy decisions. Furthermore, while 394 HCC patients signed consent forms to participate in the study, baseline plasma samples were available for 288 patients only. The main reason for missing blood samples collection was mainly related to insufficient time to obtain blood samples during the initial assessment in clinic. However, our analysis indicated that there were no major differences between both groups as aforementioned. Finally, our findings are based on a single institution experience and will need to be validated prospectively in other centers and patient populations. Furthermore, measuring IGF-1 carrier binding protein, IGFBP-3 and serum protease activity, the main determinants of the circulating level of IGF-1, may shed more light on the IGF-1 pathway in CLD and HCC, and will also confirm the utility of plasma IGF-1 in assessing the severity of the liver condition in CLD and HCC. Interestingly, recent studies of circulating IGF-1 in cancer patients [28,29,30,31] found that high IGF-1 level was associated with a higher risk of cancer. However, the goal of our current study was to evaluate the level of IGF-1, as a marker of the underlying synthetic function of the liver. Finally, the majority of our patient population had advanced unresectable disease. However, this population represents the classic patient population of the HCC clinical trials, who critically need new approaches to improve their prognostic stratification, which is crucial in comparing and interpreting trial results and also in decision making in clinical practice.
In conclusion, searching for new tools of assessment of the hepatic reserve in patients with HCC is essential to improving their prognostic stratification. This approach will directly affect therapy decisions, treatment outcome, and refine advanced HCC patients’ stratification in clinical trials. Our study results indicate that lower baseline plasma level of IGF-1 correlated with advanced clinicopathologic parameters of patients with HCC. The integration of IGF-1 into the CLIP score (I-CLIP) significantly improved prognostic stratification of patients with advanced HCC. However, independent validation of our study is warranted.
Acknowledgements
The authors of the article would like to thank Dr. Karen Muller for editing the manuscript and Dr. John Heymach for helpful comments on the study. The study was supported by National Institutes of Health (NIH) RO3 grant ES11481, CA106458-01 (to M.H.), and by philanthropic funds to the Department of Gastrointestinal Medical Oncology.
Disclosure Statement
All authors have no commercial associations (e.g. consultancies, stock ownership, equity interests, patent-licensing arrangements, etc.) that might pose a conflict of interest in connection with the submitted article.