Our study aimed to compare the predictive performance of different hepatocellular carcinoma (HCC) prediction models in chronic hepatitis B patients receiving entecavir or tenofovir, including discrimination, calibration, negative predictive value (NPV) in low-risk, and proportion of low-risk.
Methods
We conducted a systematic literature research in PubMed, EMbase, the Cochrane Library, and Web of Science before January 13, 2022. The predictive performance was assessed by area under receiver operating characteristic curve (AUROC), calibration index, negative predictive value, and the proportion in low-risk. Subgroup and meta-regression analyses of discrimination and calibration were conducted. Sensitivity analysis was conducted to validate the stability of the results.
Results
We identified ten prediction models in 23 studies. The pooled 3-, 5-, and 10-year AUROC varied from 0.72 to 0.84, 0.74 to 0.83, and 0.76 to 0.86, respectively. REAL-B, AASL-HCC, and HCC-RESCUE achieved the best discrimination. HCC-RESCUE, PAGE-B, and mPAGE-B overestimated HCC development, whereas mREACH-B, AASL-HCC, REAL-B, CAMD, CAGE-B, SAGE-B, and aMAP underestimated it. All models were able to identify people with a low risk of HCC accurately. HCC-RESCUE and aMAP recognized over half of the population as low-risk. Subgroup analysis and sensitivity analysis showed similar results.
Conclusion
Considering the predictive performance of all four aspects, we suggest that HCC-RESCUE was the best model to utilize in clinical practice, especially in primary care and low-income areas. To confirm our findings, further validation studies with the above four components were required.
Xiaolan Xu and Lushun Jiang contributed equally to this work.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Background
Chronic hepatitis B virus (HBV) infection is associated with life-threatening liver conditions like hepatocellular carcinoma (HCC) [1]. The early detection of HCC and stratified care of distinct risk populations were critical to minimize the harm of liver complications. Patients could be classified into different risk levels of developing HCC over time with HCC risk prediction models. There were models developed in untreated patients like REACH-B (Risk Estimation for HCC in Chronic Hepatitis B)and NGM-HCC (Nomograms for Risk of Hepatocellular Carcinoma) [2, 3], in mixed patients like CU-HCC (Chinese University-HCC), LSM-HCC (Liver Stiffness Measurement based-HCC), GAG-HCC (Guide with Age, Gender, HBV DNA, Core promoter mutations and Cirrhosis), and aMAP (the Age-Male-ALBI-Platelets Score), and models developed in treated patients [4‐6].
Given that antiviral therapy was commonly employed in the present society, models developed in treated patients may have a greater advantage in the accuracy of predictions. The mREACH-B (modified REACH-B), PAGE-B (Platelet, Age, Gender and HBV), mPAGE-B (modified PAGE-B), HCC-RESCUE (HCC-Risk Estimating Score in CHB patients Under Entecavir), CAMD (the Cirrhosis, Age, Male sex, and Diabetes Mellitus Score), AASL-HCC (Age, Albumin, Sex, Liver Cirrhosis-HCC Scoring System), CAGE-B (Cirrhosis and Age Score), SAGE-B (Stiffness and Age Score), and REAL-B (Real-world Effectiveness from the Asia Pacific Rim Liver Consortium for HBV) were initially developed in patients treated with different antiviral drugs [7‐15]. It was important to combine information from all derivation and external validation studies for the same model to assess the prediction performance across diverse populations. Furthermore, for some low-risk patients, needless HCC screening and surveillance may result in potential physical, financial, and psychological harms [16]. According to the guideline, HCC surveillance was cost-effective if the annual risk of HCC was ≥ 0.2% per year [17]. As a result, the fraction of low-risk population highlighted by models, as well as the ability to exclude individuals who are unlikely to develop HCC, should be considered.
Anzeige
Presently, entecavir and tenofovir were the first-line medications suggested in the guidelines for antiviral treatment [1, 18, 19]. Thus, our study systematically assessed the prediction performance of the above models in patients treated with entecavir and tenofovir in a meta-analysis.
Methods
The systematic review and meta-analysis was reported according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) and was registered on PROSPERO (ID: CRD42022303167).
Search strategy
We searched literatures published before January 13, 2022 in PubMed, EMbase, the Cochrane Library, and Web of Science. There were no limits on language or publication dates. Keywords of HCC, CHB, prediction models, et al. were used (Content 1 in Additional file 1). We also looked up references in relevant reviews and original publications to see if there were any studies we had overlooked.
Selection criteria
HCC prediction models built in treated CHB patients were selected in our study, including mREACH-B, PAGE-B, mPAGE-B, HCC-RESCUE, CAMD, AASL-HCC, CAGE-B, SAGE-B, and REAL-B. Even though aMAP was created in a mixed population, we included it in our study due to the large sample size in the derivation study. Both derivation studies and validation studies were retrieved. Exclusion criteria were as follows: (1) reviews or meta-analyses, (2) conference abstracts, (3) letters, editorials, and case reports, (4) full-text not in English, (5) update models without external validation, (6) validation cohort including untreated patients or patients treated with other oral antiviral medications, (7) insufficient data for analysis. The study focused on 3-, 5-, and 10-year HCC prediction performance. XX and LJ independently examined titles and abstracts, and studies that met the inclusion criteria were retrieved for full-text evaluation. Two independent investigators were also responsible for data extraction (XX and LJ). Any discrepancies were resolved by a third investigator (YZ).
Anzeige
Data extraction
The following data were extracted from these studies: author, publication year, study type, region, race, setting, recruitment period, sample size, follow-up duration, HCC cases, study interval, type of antiviral treatment received, baseline demographic and medical history (age, sex, proportion of cirrhosis, alcohol abuse, and diabetes), baseline laboratory results (hepatitis B e antigen [HBeAg] status, HBV DNA quantitative, alanine aminotransferase [ALT], platelets, albumin, total bilirubin, alpha-fetoprotein, liver stiffness measurement, and prediction score), area under the receiver operator characteristic curve (AUROC) with 95% confidence interval (CI), observed (O) events and expected (E) events, and negative predictive value (NPV) with 95% CI in the low-risk group. For external validation of different existing models or different cohorts, information was extracted separately.
According to Prediction model Risk of Bias Assessment Tool (PROBAST), which was organized into the following 4 domains: participants, predictors, outcome, and analysis, the quality of the original studies was evaluated.
Statistical analysis
Both derivation and external validation studies were included in meta-analysis. In the meta-analysis, effect measures included AUROC with 95% CI, total O:E ratio with standard error, NPV with 95% CI, and proportion in low-risk group with 95% CI. The predicted occurrences were estimated by multiplying the cumulative HCC incidence by the number of patients in each risk group. Variance of O:E ratio was calculated according to the equation recommended by Debray et al. [20]. When generating 95% CI for average performance, the random-effects model was used. I2 statistics were used to measure between-study heterogeneity. I2 statistic > 50% was regarded as moderate heterogeneity and I2 statistics > 75% was considered as severe heterogeneity. Subgroup analysis was conducted by cirrhotic status (cirrhotic/non-cirrhotic) and race (Caucasian/Asian) to explore the heterogeneity. Meta-regression analysis adjusted by the Hartung-Knapp method was conducted. Besides, sensitivity analysis by omitting anyone research was conducted to study the impact of individual studies on the average performance. Publication bias was analyzed by funnel plots for models that included ten or more studies [21]. All analyses were performed using Review Manager version 5.4 (Cochrane, London, United Kingdom) and StataMP software version 15 (StataCorp LLC, Texas, USA).
Results
We identified 23 publications for the systematic review and meta-analysis after screening 4374 studies from four databases, which included 153,445 CHB patients and 5133 HCC cases (Fig. 1). External validation was not performed in the original research for three model derivation investigations (mREACH-B, PAGE-B, CAGE-B, and SAGE-B). External validation studies or derivation and external validation studies of six models made up the remaining research (HCC-RESCUE, CAMD, mPAGE-B, AASL-HCC, REAL-B, and aMAP). PAGE-B and mPAGE-B were the most commonly externally validated in 19 and 14 studies, respectively, whereas REAL-B and mREACH-B were only validated in one study, respectively. Other models were also frequently validated as follows: CAMD (n = 6), HCC-RESCUE (n = 5), AASL-HCC (n = 4,), CAGE-B and SAGE-B (n = 3), and aMAP (n = 2).
×
Characteristics of the included studies
The participants were recruited retrospectively using hospital medical records, whereas Hsu et al. and Yip et al. used an insurance database and the Clinical Data Analysis and Reporting System to perform their studies [11, 22]. Different from other studies, Gui et al. compared model performance in cirrhotic patients [23], and Kim et al. studied veterans in United States [24]. Most models were developed in Asian populations, except for PAGE-B, CAGE-B, and SAGE-B, which were derived from Caucasian populations. Except for REAL-B, which was developed in individuals whose treatment regimen included other oral antiviral medicines, most models were developed in patients treated with entecavir or tenofovir. And aMAP was developed in mixed patients with a treatment proportion of 78%. The number of parameters in the models ranged from three to seven. Age and sex were nearly included in all models and other parameters included albumin, total bilirubin, platelets, cirrhosis, liver stiffness measurement, ALT, HBeAg status, diabetes, alcohol abuse, and alpha-fetoprotein (Table 1). The REAL-B and aMAP derivation cohorts were not included in the meta-analysis because their participants did not match the inclusion criteria.
Table 1
Summary of hepatitis B virus-hepatocellular carcinoma prediction models in the derivation studies
The italic indicates a continuous variable and the other indicates a categorical variable. HCC, hepatocellular carcinoma; ALT, alanine aminotransferase; HBeAg, hepatitis B e antigen; LSM, liver stiffness measurement; mREACH-B, Modified Risk Estimation for Hepatocellular Carcinoma in Chronic Hepatitis B; PAGE-B, Platelet, Age, Gender and HBV; mPAGE-B, modified Platelet, Age, Gender and HBV; HCC-RESCUE, HCC-Risk Estimating Score in CHB patients Under Entecavir; CAMD, the Cirrhosis, Age, Male sex, and Diabetes Mellitus Score; AASL-HCC, Age, Albumin, Aex, Liver Cirrhosis-HCC scoring system; aMAP: the Age-Male-ALBI-Platelets Score; CAGE-B, Cirrhosis and Age Score; SAGE-B, Stiffness and Age Score; REAL-B, Real-world Effectiveness from the Asia Pacific Rim Liver Consortium for HBV.
Risk of bias and applicability assessment
The details of the risk of bias and applicability were depicted in Table S1-2 and Figure S1-2. According to PROBAST, the predictors and outcome had a low risk of bias, but the participants and analysis had a high risk of bias in 17.4% and 52.1% of studies, respectively. In terms of analysis, model calibration was not performed in eight studies (34.8%) and four studies (17.4%) had a small number of HCC cases. Except for 17.4% of research, which had a high risk of participants, most models had a low risk of applicability.
Anzeige
×
Meta-analysis
The characteristics of included studies in meta-analysis were shown in Table 2. The pooled 3-, 5-, and 10-year AUROC varied from 0.72 to 0.84 (aMAP: 0.72, 95% CI 0.70–0.75; REAL-B: 0.84, 95% CI 0.78–0.90), 0.74 to 0.83 (mREACH-B: 0.74, 95% CI 0.71–0.76; AASL-HCC: 0.83, 95% CI 0.79–0.86), and 0.76 to 0.86 (SAGE-B: 0.76, 95% CI 0.70–0.83; mPAGE-B: 0.86, 95% CI 0.83–0.89), respectively (Fig. 2, Figure S3, Table S3). When predicting 3-year HCC incidence, REAL-B, AASL-HCC, and HCC-RESCUE models had better discrimination with an AUROC > 0.80, while mREACH-B, PAGE-B and aMAP showed an AUROC < 0.75. When predicting 5-year HCC incidence, AASL-HCC, REAL-B, HCC-RESCUE, and aMAP models performed better with an AUROC ≥ 0.80, followed by mPAGE-B, CAMD, and PAGE-B with an AUROC > 0.75, while mREACH-B showed an AUROC < 0.75. When predicting 10-year HCC incidence, with an AUROC ≥ 0.80, mPAGE-B, HCC-RESCUE, CAMD, and AASL-HCC outperformed CAGE-B, PAGE-B, and SAGE-B (> 0.75).
Table 2
Characteristics of derivation and external validation cohorts included in the meta-analysis
HCC, hepatocellular carcinoma; ALT, alanine aminotransferase; HBeAg, hepatitis B e antigen; LSM, liver stiffness measurement; ETV, entecavir; TDF, tenofovir alafenamide; TAF, tenofovir disoproxil fumarate; mREACH-B, Modified Risk Estimation for Hepatocellular Carcinoma in Chronic Hepatitis B; PAGE-B, Platelet, Age, Gender and HBV; mPAGE-B, modified Platelet, Age, Gender and HBV; HCC-RESCUE, HCC-Risk Estimating Score in CHB patients Under Entecavir; CAMD, the Cirrhosis, Age, Male sex, and Diabetes Mellitus Score; AASL-HCC, Age, Albumin, Aex, Liver Cirrhosis-HCC scoring system; aMAP: the Age-Male-ALBI-Platelets Score; CAGE-B, Cirrhosis and Age Score; SAGE-B, Stiffness and Age Score; REAL-B, Real-world Effectiveness from the Asia Pacific Rim Liver Consortium for HBV; CDARS, Clinical Data Analysis and Reporting System.
The pooled 5- and 10-year total O:E ratio ranged from 0.25 to 1.83 (mPAGE-B: 0.25, 95% CI 0.18–0.31; CAMD: 1.83, 95% CI 1.31–2.35) and 1.99 to 2.10 (SAGE-B: 1.99, 95% CI 0.99–2.99; CAGE-B: 2.10, 95%CI 1.02–3.17), respectively (Fig. 2, Figure S4). The pooled 3-year total O:E ratio of CAMD was 0.77 (95% CI 0.51–1.04). HCC-RESCUE, PAGE-B, and mPAGE-B exhibited an overestimation of HCC development, while AASL-HCC, aMAP, CAMD, CAGE-B, and SAGE-B exhibited an underestimation of HCC development.
Anzeige
The pooled 3-, 5-, and 10-year NPVs ranged from 98.3 to 100% (aMAP: 98.3%, 95% CI 96.3-100.3%; REAL-B: 100%, 95% CI 99.5-100.5%), 99.58–100% (aMAP: 99.6%, 95% CI 99.2–100.0%; AASL-HCC: 100%, 95% CI 99.5-100.5%; HCC-RESCUE: 100%, 95% CI 99.6-100.5%; REAL-B: 100%, 95%CI 99.5-100.5%), and 99.6–100% (PAGE-B: 99.6%, 95% CI 98.4-100.8%; CAGE-B: 100%, 95% CI 99.4-100.7%; SAGE-B: 100%, 95%CI 99.4-100.6%), respectively (Fig. 2, Table S4). All models had a high NPV over 99.5% except for aMAP. The proportion of low-risk population ranged from 14.4 to 53.0% (CAGE-B: 14.4%, 95% CI 12.9–16.0%; aMAP: 53.0%, 95% CI 28.5–77.6%) (Table 3). More than half of the population was identified as low-risk by HCC-RESCUE and aMAP.
Table 3
The proportion of low-risk population classified by the models in meta-analysis
Model
Sample size
Low-risk proportion, %
95% CI
I2
P
PAGE-B
45,241 (N = 10)
20.1
16.3, 23.9
98.4%
< 0.001
mPAGE-B
38,997 (N = 5)
26.6
20.2, 33.0
99.0%
< 0.001
HCC-RESCUE
3818 (N = 2)
52.4
50.8, 54.0
0.0%
-
CAMD
50,197 (N = 5)
36.8
30.4, 43.3
99.5%
< 0.001
AASL-HCC
7072 (N = 4)
21.8
12.1, 31.5
99.0%
< 0.001
aMAP
4005 (N = 3)
53.0
28.5, 77.6
99.7%
< 0.001
CAGE-B
1951 (N = 1)
14.4
12.9, 16.0
-
-
SAGE-B
1951 (N = 1)
15.8
14.2, 17.5
-
-
REAL-B
1858 (N = 1)
19.1
17.3, 21.0
-
-
CI, confidence interval; PAGE-B, Platelet, Age, Gender and HBV; mPAGE-B, modified Platelet, Age, Gender and HBV; HCC-RESCUE, HCC-Risk Estimating Score in CHB patients Under Entecavir; CAMD, the Cirrhosis, Age, Male sex, and Diabetes Mellitus Score; AASL-HCC, Age, Albumin, Aex, Liver Cirrhosis-HCC scoring system; aMAP: the Age-Male-ALBI-Platelets Score; CAGE-B, Cirrhosis and Age Score; SAGE-B, Stiffness and Age Score; REAL-B, Real-world Effectiveness from the Asia Pacific Rim Liver Consortium for HBV
Subgroup analysis and meta-regression
The results of subgroup analysis for discrimination and calibration were presented in Table S5. Only three researches compared the performance of PAGE-B, mPAGE-B, and aMAP in cirrhotic and non-cirrhotic individuals. The discrimination performance was generally better in non-cirrhotic patients than cirrhotic patients. PAGE-B, mPAGE-B, HCC-RESCUE, CAMD, and aMAP exhibited greater AUROC in Caucasians, whereas AASL-HCC, CAGE-B, and SAGE-B had comparable discrimination performance in Asians and Caucasians. Several articles reported the model calibration performance, but the difference in cirrhotic and non-cirrhotic population was not reported. The calibration of the subgroup analysis by race was same as that in meta-analysis. And the underestimation of CAMD seems to be more pronounced in Asians than in Caucasians (O:E ratio 2.38 vs. 1.55). We did a meta-regression analysis for model discrimination and calibration performance and found the heterogeneity could not be explained by race (Figure S5).
Publication bias and sensitivity analysis
The funnel plots for the PAGE-B and mPAGE-B model on 5-year discrimination performance were symmetric visually (Fig. 3). Funnel Plots for other models were not analyzed because the number of included studies was small. We mainly discussed the 5- and 10-year predictive performance of model discrimination and calibration, NPV in low-risk, and proportion of low-risk. External validation investigations of REAL-B and mREACH-B were insufficient for sensitivity analysis. After excluding any one research, the pooled 5- or 10-year AUROC of PAGE-B, mPAGE-B, HCC-RESCUE, CAMD, AASL-HCC, CAGE-B, SAGE-B, and aMAP did not change considerably, as shown in Figure S6-7. Sensitivity analysis of calibration was shown in Figure S8-9, and variations in 5-year O:E ratio prediction of CAMD were evident in studies by Hsu and Kim [11, 25]. In Yip et al’s 5-year NPV estimate [22], there was a clear variance in PAGE-B and mPAGE-B (Figure S10). The proportion of low-risk patients detected by AASL-HCC, aMAP, CAMD, PAGE-B, and mPAGE-B did not change significantly in the sensitivity analysis (Figure S11).
Anzeige
×
Pair-wise comparison between HCC-RESCUE and other models
We further explored the meta-values of HCC-RESCUE and other models within the same investigations. Only 4 studies have compared the predictive performance of HCC-RESCUE with PAGE-B, mPAGE-B, CAMD, or AASL-HCC. As depicted in Fig. 4, the 5-year AUROC were 0.81 (95% CI 0.77–0.86), 0.80 (95% CI 0.73–0.86), 0.81 (95% CI 0.75–0.87) for HCC-RESCUE, PAGE-B, and mPAGE-B, respectively. The discrimination was also similar between HCC-RESCUE/CAMD (0.81 vs. 0.81) and HCC-RESCUE/AASL-HCC (0.81 vs. 0.83). The proportion of low-risk patients detected by HCC-RESCUE was significantly higher than that by PAGE-B or mPAGE-B (52.4% vs. 23.3% vs. 30%, Table S6).
×
Discussion
We conducted a systematic review and meta-analysis of ten HCC prediction models in CHB patients receiving entecavir or tenofovir and compared their predictive performance of discrimination, calibration, NPV in low-risk, and proportion of low-risk. Overall, all models were able to generate satisfactory discrimination with an AUROC > 0.70. In terms of discrimination, calibration, and the capacity to stratify low-risk populations, HCC-RESCUE performed admirably. Different from the previous researches, we also studied the proportions of low-risk in each model and the accuracy of excluding HCC development in low-risk population.
The models included three to seven parameters including age, sex, albumin, total bilirubin, platelets, cirrhosis, liver stiffness measurement, ALT, HBeAg status, diabetes, alcohol abuse, or alpha-fetoprotein. None of the models incorporated viral-related parameters, except for mREACH-B, which included HBeAg status. However, HBV DNA level or HBeAg status were key determinants in models drawn from untreated population or mixed population (CU-HCC, GAG-HCC, LSM-HCC, NGM-HCC) [3‐6]. The difference could be explained by that the antiviral therapy was effective in suppressing virus activity. During the 12-month treatment with entecavir or tenofovir, HBV DNA was undetectable in 80% and 69% of patients, respectively, according to a randomized controlled experiment [26]. HBeAg seroconversion rate was over 40% in patients who received 5-year tenofovir or entecavir treatment [27]. According to Papatheodoridis et al., Caucasian patients receiving long-term entecavir or tenofovir had an 8-year survival rate comparable to the general population if HCC had not developed [28]. Thus, viral-related factors might have little effect in predicting long-term HCC development in antiviral-treated patients. We considered these models developed in treated patients were more suitable to predict HCC incidence in the antiviral era.
According to PROBAST, the participant and analysis were the main sources of the bias. Kim et al. compared performance of different models in veterans, which could lead to a significant risk in participant selection [29]. Gui et al. verified the model performance in patients with CHB-related cirrhosis without considering those who did not develop cirrhosis [23]. While Yip and Güzelbulut et al. included decompensated cirrhosis in their validation cohorts, and we figured that model parameters would be unstable in decompensated cirrhosis, thus increasing the models’ inaccuracy [22, 30]. The bias in the analysis was mostly caused by the limited sample size, which resulted in an unreasonable number of HCC instances, as well as the fact that model performance was evaluated inappropriately due to a lack of calibration evaluation.
Overall, REAL-B, AASL-HCC, and HCC-RESCUE models had the best discrimination performance with an AUROC > 0.8. Interestingly, age, sex, and cirrhosis were all included in the above models. HCC was found to be six times more common in cirrhotic patients than in people without cirrhosis, and males were more likely to acquire HCC than females [31]. Also, the risk of developing HCC increased with age. This may indicate that age, sex, and cirrhosis-based models were more accurate in predicting HCC incidence in treated individuals. Similarly, the CAMD model, which included age, sex, diabetes, and cirrhosis, also performed well. Our findings were consistent with a prior meta-analysis that indicated REAL-B and CAMD had the best discrimination performance, in which HCC-RESCUE was not been investigated and AASL-HCC was been validated in one study [32]. Subgroup analysis showed discrimination of aMAP, PAGE-B, and mPAGE-B was better in non-cirrhotic than cirrhotic patients, which has been reported by Wu et al. [32]. However, discrimination evaluation according to cirrhotic status was not available in other model studies. Besides, there was a tendency that discrimination was better in Caucasians than Asians no matter the model was developed in Asian or Caucasian population. The reason for such a racial disparity was currently unknown. To be mentioned, mREACH-B, aMAP and REAL-B were not widely validated in patients treated with entecavir or tenofovir, further studies on these models were needed to verify our findings. Besides, predictive performance in different subgroups should be considered in further validation.
Regarding model calibration performance, HCC-RESCUE, PAGE-B, and mPAGE-B overestimated HCC development, while AASL-HCC, aMAP, CAMD, CAGE-B, and SAGE-B underestimated HCC development. Calibration was not available in REAL-B and mREACH-B. In the previous study, PAGE-B and mPAGE-B had an overestimation which was the same as our findings, while CAMD has a slight overestimation in 3-year period [32]. To classifying patients with high risk of HCC, we figured that the overestimation was preferable than underestimation. Although overestimation may cause excessive surveillance and financial waste, underestimation would lead to the omission of possible HCC patients, putting patients’ lives in jeopardy. Nevertheless, model calibration was only done in two-thirds of the studies involved. As reported in a previous meta-analysis of HCC prediction models, publication compliance with TRIPOD was 74% [33]. Following model validation studies should pay more attention to the completeness of the article according to transparent reporting of individual prognosis or diagnostic multivariate predictive model (TRIPOD) statement.
All models exhibited a high NPV over 99% in low-risk population, indicating the ability of excluding HCC development was admirable. In the 5- or 10-year study period, almost none of the low-risk patients got HCC. Therefore, intensive supervision was not necessary for these patients, potentially reducing the risk of physical, financial, and psychological harms [16]. To our knowledge, the risk stratification proportions were occasionally reported in separate studies and had not been systematically investigated by meta-analysis. To some extent, the more patients were designated as low-risk, the more medical resources could be saved. According to our findings, HCC-RESCUE and aMAP classified over half of the population as low-risk, following by CAMD and mPAGE-B with 36.8% and 26.6%, respectively, while AASL-HCC, PAGE-B, REAL-B, CAGE-B, and SAGE-B was approximately 20%. Thus, HCC-RESCUE made more patients can be spared from HCC screening and save resources, which was especially useful in primary care and low-income areas.
Our research assessed the predictive ability of ten models at multiple time points, and included subgroup analysis based on race and cirrhosis status. We also ran a sensitivity analysis to verify that the results were reliable. The between-study heterogeneity could be partly explained by race and cirrhotic status. And the results were relatively robust in sensitivity analysis. We also used NPV to assess the accuracy of identifying individuals who would not develop HCC in a given time, and we focused on the low-risk proportion divided by each model, which had never been systematically examined before. We proposed that these two characteristics be investigated in model studies since they could play a key role in allocating HCC screening resources.
However, there were several limitations in our study. First, there were insufficient validation cohorts for mREACH-B, aMAP, and REAL-B, because the mREACH-B derivation study did not illustrate the details of the model, and the latter two were newly developed [7, 14, 15]. Second, model calibration, proportion of each risk group, and NPV in low-risk populations were not depicted in every study, which caused that some models (mREACH and REAL-B) were not analyzed for these performance in the meta-analysis and the number of studies was insufficient for some models (HCC-RESCUE, REAL-B, CAGE-B, and SAGE-B). Besides, over half of the studies had a high risk of bias for participant selection or data analysis, but sensitivity analysis showed that our findings remained stable. Finally, the subgroup analysis of cirrhosis status was incomplete because the difference between cirrhotic and non-cirrhotic patients were not displayed in most cohorts. For the similar reason, the discrimination and calibration results could not be stratified according to treatment received. Further external validation studies with more complete information were needed to confirm our findings.
Conclusion and implications
REAL-B, AASL-HCC, and HCC-RESCUE performed the best discrimination in the meta-analysis of the ten prediction models, although more validation studies of model REAL-B are needed to confirm our findings. AASL-HCC, aMAP, CAMD, CAGE-B, and SAGE-B underestimated HCC development, whereas HCC-RESCUE, PAGE-B, and mPAGE-B overestimated it. Model calibration, proportion of low-risk group, and NPVs were insufficiently reported in many researches, which should be addressed in future model derivation or validation studies. In comparison to other models, HCC-RESCUE identified the most people as low-risk, with a high NPV, indicating that it might be the most appropriate model to be used in primary clinical practice for HCC surveillance.
List of abbreviations: HBV, hepatitis B virus; HCC, hepatocellular carcinoma; CHB, chronic hepatitis B; mREACH-B, modified Risk Estimation for HCC in Chronic Hepatitis B; PAGE-B, Platelet, Age, Gender and HBV; mPAGE-B, modified PAGE-B; HCC-RESCUE, HCC-Risk Estimating Score in CHB patients Under Entecavir; CAMD, the Cirrhosis, Age, Male sex, and Diabetes Mellitus Score; AASL-HCC, Age, Albumin, Sex, Liver Cirrhosis-HCC Scoring System; CAGE-B, Cirrhosis and Age Score; SAGE-B, Stiffness and Age Score; REAL-B, Real-world Effectiveness from the Asia Pacific Rim Liver Consortium for HBV; aMAP, the Age-Male-ALBI-Platelets Score; HBeAg, hepatitis B e antigen; ALT, alanine aminotransferase; AUROC, area under the receiver operator characteristic curve; CI, confidence interval; O:E ratio, observed events versus expected events ratio; NPV, negative predictive value.
Acknowledgements
None.
Declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare no competing interests.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
„Kalte“ Tumoren werden heiß – CD28-kostimulatorische Antikörper sollen dies ermöglichen. Am besten könnten diese in Kombination mit BiTEs und Checkpointhemmern wirken. Erste klinische Studien laufen bereits.
Eine perioperative Therapie mit Nivolumab reduziert das Risiko für Rezidive und Todesfälle bei operablem NSCLC im Vergleich zu einer alleinigen neoadjuvanten Chemotherapie um über 40%. Darauf deuten die Resultate der Phase-3-Studie CheckMate 77T.
Immunchemischer Stuhltest positiv, Koloskopie negativ – in solchen Fällen kann die Blutungsquelle auch weiter proximal sitzen. Ein Forschungsteam hat nachgesehen, wie häufig und in welchen Lokalisationen das der Fall ist.
Möglicherweise hängt es von der Art der Diabetesmedikamente ab, wie hoch das Risiko der Betroffenen ist, dass sich sehkraftgefährdende Komplikationen verschlimmern.
Update Innere Medizin
Bestellen Sie unseren Fach-Newsletter und bleiben Sie gut informiert.