Transurethral water vapour thermal therapy for benign prostatic hyperplasia under local anaesthesia alone: initial experience in Chinese patients

Hong Kong Med J 2024 Jun;30(3):227–32 | Epub 10 May 2024
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Transurethral water vapour thermal therapy for benign prostatic hyperplasia under local anaesthesia alone: initial experience in Chinese patients
KL Lo, MB, ChB, FRCSEd; Alex Mok, MB, ChB; Ivan CH Ko, MB, ChB; Steffi KK Yuen, MB, BS, FRCSEd; Peter KF Chiu, MB, BS, FRCSEd; CF Ng, MD, FRCSEd
SH Ho Urology Centre, Department of Surgery, The Chinese University of Hong Kong, Hong Kong SAR, China
 
Corresponding author: Prof CF Ng (ngcf@surgery.cuhk.edu.hk)
 
 Full paper in PDF
 
Abstract
Introduction: This study evaluated the perioperative and early postoperative outcomes of transurethral water vapour thermal therapy (WVTT) under local anaesthesia alone for benign prostatic enlargement in Chinese patients.
 
Methods: This retrospective review of transurethral WVTT for benign prostatic enlargement focused on 50 Chinese patients who exhibited clinical indications (acute retention of urine or symptomatic lower urinary tract symptoms due to benign prostatic enlargement) for surgical treatment between June 2020 and December 2021 in Hong Kong. Exclusion criteria included active urinary tract problems and urological malignancies. Follow-up was conducted at 3 months postoperatively.
 
Results: The median patient age was 71.5 years. The mean preoperative prostatic volume was 56.7 mL. The mean operation time was 25.1 minutes. All procedures were performed under local anaesthesia alone. The mean pain scores for transrectal ultrasound probe insertion, transperineal local anaesthesia injection, and transurethral WVTT were 2, 5, and 4, respectively. Forty-nine patients (98%) were discharged on the same day with a urethral catheter. Forty-eight patients (96%) successfully completed a trial without catheter within 3 weeks postoperatively. Five patients (10%) had unplanned hospital admission within 30 days postoperatively due to surgical complications (Clavien–Dindo grade 1).
 
Conclusion: Transurethral WVTT, an advanced surgical treatment for benign prostatic enlargement, is a safe procedure that relieves lower urinary tract symptoms with minimal hospital stay. It can be performed in an office-based setting under local anaesthesia, maximising utilisation of the surgical theatre.
 
 
New knowledge added by this study
  • This is the first study concerning the efficacy and safety profile of water vapour thermal therapy (WVTT) in Asian patients. It can relieve lower urinary tract symptoms with minimal hospital stay.
  • This is the first study of WVTT in an office-based setting under local anaesthesia, maximising utilisation of the surgical theatre.
Implications for clinical practice or policy
  • Water vapour thermal therapy is an effective and safe alternative for patients who have high surgical risk of benign prostatic enlargement under general or spinal anaesthesia.
 
 
Introduction
Benign prostatic hyperplasia (BPH) is characterised by a non-malignant growth in the prostate gland that can cause a wide range of lower urinary tract symptoms (LUTS). These symptoms can greatly reduce a patient’s quality of life (QoL) and may eventually lead to acute retention of urine (AROU).
 
Current standard treatments for BPH include conservative, pharmacological, and surgical approaches. For patients who fail to successfully complete a trial without catheter (TWOC) after AROU secondary to BPH, surgical intervention remains the main therapeutic approach. Surgical treatment options for BPH have evolved from electrosurgical resection to enucleation, ablation of the prostate, and other techniques.1 Transurethral resection of the prostate (TURP), first performed over 90 years ago, continues to be regarded as the gold standard for the treatment of BPH with prostatic volumes of 30 to 80 mL.2 Although TURP results in statistically significant improvements in symptom scores and maximum urinary flow rate (Qmax), it has some limitations. Perioperative morbidities and complications of TURP include infection, bleeding, urinary retention, incontinence, urethral stricture, erectile dysfunction, and ejaculatory dysfunction. Additionally, TURP requires general or spinal anaesthesia and postoperative hospitalisation. Through technological advancements, several minimally invasive procedures (eg, UroLift and prostatic artery embolisation) have been developed for specific groups of patients with BPH to minimise the aforementioned limitations.3 4 5 Among these new-generation BPH surgical approaches, transurethral water vapour thermal therapy (WVTT) provides some of the best surgical outcomes.
 
Transurethral WVTT uses the thermodynamic principle of convective energy transfer, whereas other techniques (eg, transurethral microwave thermotherapy or transurethral needle ablation of the prostate) involve conductive heat transfer.6 The thermal therapy system consists of a generator with a radiofrequency power supply that creates water vapour from sterile water, as well as a disposable transurethral delivery device. The tip of the delivery device contains an 18-gauge needle with 12 small emitter holes circumferentially arranged for water vapour dispersion into the targeted prostatic tissue. The release of thermal energy causes tissue necrosis. The most important characteristic of this technique is that, during treatment of the transitional zone, energy is only deposited within this specific region of the prostate. Reviews of histological evidence and magnetic resonance images have revealed that thermal lesions are limited to the transitional zone without affecting the peripheral zone, bladder, rectum, or striated urinary sphincter.7 8 At 6 months after treatment, the total prostatic volume is reduced by 28.9% and the resolution of thermal lesions, as determined by gadolinium-enhanced magnetic resonance imaging, is almost complete.8
 
A pilot study showed that transurethral WVTT can serve as a safe and effective treatment in men with LUTS due to BPH.8 In the first multicentre, randomised controlled study, 197 men were enrolled and randomised in a 2:1 ratio to treatment with the transurethral WVTT or a sham procedure.9 The sham procedure consisted of rigid cystoscopy with sound effects that mimicked the thermal treatment. The primary efficacy endpoint was met at 3 months: relief of symptoms, measured as a change in International Prostate Symptom Score (IPSS), was detected in 50% of patients in the thermal treatment group compared with 20% of patients in the sham procedure group (P<0.0001). In the thermal treatment group, the Qmax increased by 63%—from 9.9 mL/s to 16.1 mL/s (P<0.0001)—after 3 months. This clinical benefit was sustained throughout the study period, with a 54% improvement at the 12-month follow-up. In the most recent update regarding 5-year outcomes, the improvement in voiding (as measured by IPSS and uroflowmetry) had persisted for 5 years, with a surgical retreatment rate of 4.4%.10
 
Thus far, studies of transurethral WVTT for BPH have mainly focused on Caucasian populations. To provide information regarding its tolerability and effectiveness in the Chinese population, this study investigated the safety profile and efficacy of transurethral WVTT under local anaesthesia alone for BPH among Chinese patients in Hong Kong.
 
Methods
Study protocol
This retrospective study investigated transurethral WVTT for benign prostatic enlargement. The inclusion criteria included Chinese ethnicity and clinical indications for surgical treatment, including AROU or symptomatic LUTS due to benign prostatic enlargement. Exclusion criteria included active urinary tract problems such as infection, bleeding disorder, bladder pathologies (eg, bladder stones and neurogenic bladder), and urethral stricture, as well as urological malignancies including bladder and prostate cancer.
 
Intervention
The procedure was performed with perioperative antibiotic prophylaxis. Patients were placed in the dorsal lithotomy position. After local anaesthesia, cystoscopy was performed to assess the anatomy of the bladder and prostate. A specialised handpiece with an optical lens was inserted under direct visual guidance into the prostate channel. Treatment began with the needle tip visually positioned and inserted approximately 1 cm distal to the bladder neck. Each treatment lasted for 9 seconds. After 9 seconds, an audible signal was produced by the system and the treatment needle was retracted. The handpiece was then repositioned 1 cm distal to the previous treatment site; repositioning was repeated until reaching a treatment site immediately proximal to the verumontanum. During each water vapour injection, the majority of the targeted tissue was treated. All treatment cycles involving one lateral lobe were completed as a group to utilise residual heat from prior treatments involving that lobe. Subsequently, the contralateral lateral lobe was treated in a similar manner. An enlarged median lobe could be treated by positioning the needle at a 45-degree angle towards the targeted lobe using the same technique. After the procedure, a 14-Fr Foley catheter was inserted. A 1-week course of antibiotic treatment was administered after surgery.10 Patients were discharged with the urethral catheter and readmitted for a TWOC at approximately 1 to 2 weeks after surgery. Upon satisfactory completion of the TWOC, patients were scheduled for follow-up at 3 months postoperatively.
 
Statistical analysis
Preoperative parameters and perioperative outcomes were collected and tabulated using SPSS software (Windows version 28.0; IBM Corp, Armonk [NY], United States). Descriptive statistics were used to summarise the demographic data and perioperative patient characteristics. Paired sample t tests were used to compare continuous variables with normal distributions; the Mann-Whitney U test was used to compare continuous variables with skewed distributions, and the Chi squared test was used to compare categorical variables. Two-sided P values of <0.05 were considered statistically significant.
 
Results
Demographics
Between June 2020 and December 2021, 50 eligible patients were included in this study. The median age was 71.5 years (interquartile range [IQR]=64-75.25). In terms of indications, 27 patients (54%) had symptomatic BPH, 13 patients (26%) had AROU with a urethral catheter, and 10 patients (20%) had AROU without a urethral catheter. Of the 50 patients, 39 (78%) were categorised as American Society of Anesthesiologists class 2, whereas the remaining 11 were categorised as class 3. Most patients (68%) did not use any antiplatelet or anticoagulant therapy. The numbers of patients using aspirin, clopidogrel, dual antiplatelet therapy, and apixaban were 11 (22%), three (6%), one (2%), and one (2%), respectively. All antiplatelet and anticoagulant agents were temporarily discontinued before the operation (Table 1).
 

Table 1. Use of antiplatelet or anticoagulant therapy (n=50)
 
Operation
The mean preoperative prostatic volume was 56.7 mL (standard deviation [SD]=24.6; range, 29.2-119.0). The mean operation time was 25.1 minutes (SD=8.4). All procedures were conducted under local anaesthesia alone. Lignocaine 1% with adrenaline was injected into the periprostatic space using a transperineal approach. The mean pain scores for transrectal ultrasound probe insertion, transperineal local anaesthesia injection, and transurethral WVTT were 2, 5, and 4, respectively.
 
Postoperative course
Only one patient (2%) required bladder irrigation for 5 days postoperatively; that patient had been taking apixaban before surgery. All other patients were discharged on the same day with a urethral catheter. A TWOC was planned at around 1 week (for the AROU without urethral catheter or symptomatic BPH group) to 2 weeks (for the AROU with urethral catheter group) after surgery. Forty-eight patients (96%) in our study successfully completed a TWOC within 3 weeks postoperatively; the median time was 7 days (IQR=7-14). The median successful TWOC times were 14 days (IQR=8-21) for the AROU with urethral catheter group and 7 days (IQR=7-12) for the AROU without urethral catheter or symptomatic BPH group. Two patients (4%) with an initially unsuccessful TWOC began temporary clean intermittent self-catheterisation; they were subsequently weaned from this management approach on postoperative days 40 and 45, respectively.
 
Five patients (10%) had unplanned hospital admission within 30 days postoperatively due to surgical complications (Clavien–Dindo grade 1). The reasons for readmission are listed in Table 2.
 

Table 2. Reasons for readmission (n=50)
 
There were significant differences in preoperative and 3-month postoperative parameters, including prostate-specific antigen level, post-void residual urine (PVRU) level, Qmax, IPSS, and QoL assessment. Table 3 shows the medians and IQRs of these data. As indicated in Table 4, the mean differences in PVRU, Qmax, IPSS, and QoL score were -41 mL (SD=107), +6.6 mL/s (SD=5.4), -10.9 points (SD=5.8), and -2.2 points (SD=1.5), respectively.
 

Table 3. Descriptive statistics of preoperative and postoperative parameters
 

Table 4. Mean differences of parameters 3 months after the procedure
 
No patients in this study exhibited de novo retrograde ejaculation or stress urinary incontinence at 3 months postoperatively. However, there were three reported cases (6%) of new-onset erectile dysfunction postoperatively. All three patients had temporary erectile dysfunction that resolved within 6 months postoperatively without requiring medication.
 
Discussion
Our study is the first to focus on the application of transurethral WVTT (Rezūm therapy) under local anaesthesia alone among Chinese men with BPH. Our results demonstrated clinically significant outcomes comparable to other treatments for BPH. Transurethral WVTT provided effective symptomatic improvement, as illustrated by a decrease in PVRU of 41 mL, an increase in Qmax of 6.6 mL/s, and a substantial decrease in IPSS of 10.9 at the 3-month follow-up (Table 4). These results were also comparable to outcomes in a recent international study of this therapy.10 The postoperative outcome was favourable, with a successful TWOC rate of 96% within 3 weeks postoperatively. Moreover, all patients with urethral catheters before surgery successfully completed a TWOC after transurethral WVTT. The median successful TWOC time was 7 days postoperatively. However, compared with data from other studies (4.1 to 5 days),11 12 our centre had a longer duration of catheterisation, which could be explained by our centre’s policy of scheduling a TWOC on postoperative days 7 and 14 for patients without and with a urethral catheter before surgery, respectively. Five patients were readmitted within 30 days after surgery due to haematuria, post-obstructive diuresis, recurrent AROU, and urinary tract infection with AROU (Table 2). All were uneventfully discharged without further readmission; none of them developed postoperative urinary incontinence. Three patients reported de novo erectile dysfunction, higher than the rate observed in the recent international study.10 However, the rate remained significantly lower than that associated with TURP.13 Considering the minimally invasive nature of this procedure, it could revolutionise future management of BPH.
 
The current management algorithm for BPH does not include transurethral WVTT as a first-line treatment due to the relative lack of evidence regarding its mid- to long-term efficacy and safety.2 However, it has considerable potential in the management of BPH because of unique advantages compared with TURP. Transurethral WVTT can be an office-based procedure with a short learning curve. If a surgeon completes 10 cases of transurethral WVTT under supervision, he/she will become independent from a surgical trainer. Because BPH is a particularly common urological disease and often requires surgical management,14 the minimally invasive nature of transurethral WVTT can help reduce the number of patients waiting for operations in overcrowded hospital facilities. The results of our study provide initial evidence that transurethral WVTT is well-tolerated among patients under local anaesthesia alone. We did not administer any sedation to the patients because they might move during the operation, resulting in a high risk of water vapour leakage. Such leakage would lead to inadequate treatment.
 
In Hong Kong, total health costs represent about 19% of the total government budget,15 and public in-patient health costs in 2021/2022 constituted 32% of total health costs.16 Operation time is one of the most important factors affecting in-patient costs. According to a meta-analysis by Mamoulakis et al17 in 2009, the mean operation time for TURP ranged from 39 to 79 minutes. In the present study, the mean operation time was 25.1 minutes. Thus far, no studies have directly assessed the cost-effectiveness of transurethral WVTT in the Chinese population. In the United States, a cost-effectiveness analysis of six therapies for BPH, published in 2018,18 showed that transurethral WVTT was more cost-effective than other minimally invasive therapies, such as combination medical treatment and UroLift. Moreover, McVary et al10 reported that the 5-year retreatment rate after transurethral WVTT was 4.4%, which was significantly lower than that after UroLift therapy (13.6%) reported by Roehrborn et al.5
 
Notably, transurethral WVTT leads to lower incidences of bleeding, urgency, urge incontinence, and ejaculatory dysfunction compared with TURP.19 The more favourable side-effect profile has resulted in considerable interest concerning its potential to replace TURP as the first-line surgical treatment in the future. No head-to-head trials have compared other surgical modalities with transurethral WVTT. Indirect comparison through a meta-analysis revealed that TURP outperformed transurethral WVTT by providing greater relief of LUTS,19 although it carried a greater cost and higher complication rate.18 Although pharmacological treatment is currently the first-line treatment for moderate to severe LUTS, it is associated with complications such as dizziness, postural hypotension, reduced libido, and erectile dysfunction. Gupta et al20 compared standard medical therapy with transurethral WVTT using cohort data from the MTOPS trial (Medical Therapy of Prostatic Symptoms); they showed that transurethral WVTT had superior outcomes in terms of QoL, IPSS, and prostatic volume reduction. Considering these advantages, transurethral WVTT can be regarded as a first-line treatment option for patients with symptomatic LUTS who prefer a short operation, rather than lifelong pharmacological treatment.
 
Limitations
There were some limitations in this study. First, a substantial proportion of our patients had been catheterised preoperatively (74%) and thus could not undergo uroflowmetry studies before the operation. Due to the coronavirus disease 2019 pandemic and the associated community isolation policy, some other patients did not complete uroflowmetry studies. However, all IPSS data were able to be collected via telemedicine, ensuring the inclusion of those data in the analysis. Second, our study did not have a sufficient number of patients to allow subgroup analysis of patients with different indications for transurethral WVTT; future studies should explore treatment outcomes among patients with different indications for transurethral WVTT. Third, our inclusion period was prolonged, partly due to the coronavirus disease 2019 pandemic and partly because transurethral WVTT mainly was regarded as a self-financed item in our centre; these aspects led to some difficulty in accumulating a sufficient number of patients for analysis. Finally, this study used a single-arm design with a relatively short follow-up period; additional studies are needed to assess long-term treatment outcomes and retreatment rates after transurethral WVTT under local anaesthesia alone.
 
Conclusion
Transurethral WVTT is a safe and effective treatment for benign prostatic hyperplasia in the Chinese population. It can also be conducted in an office setting under local anaesthesia alone, avoiding use of the surgical theatre and its associated costs.
 
Author contributions
Concept or design: KL Lo, CF Ng.
Acquisition of data: KL Lo.
Analysis or interpretation of data: A Mok, ICH Ko.
Drafting of the manuscript: KL Lo, A Mok, ICH Ko.
Critical revision of the manuscript for important intellectual content: All authors.
 
All authors had full access to the data, contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
As an editor of the journal, CF Ng was not involved in the peer review process. Other authors have disclosed no conflicts of interest.
 
Funding/support
This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
 
Ethics approval
This research was approved by the Joint Chinese University of Hong Kong–New Territories East Cluster Clinical Research Ethics Committee, Hong Kong (Ref No.: 2019.662). The patients were treated in accordance with the tenets of the Declaration of Helsinki and have provided written informed consent for all treatments and procedures and consent for publication.
 
References
1. Kahokehr A, Gilling PJ. Landmarks in BPH—from aetiology to medical and surgical management. Nat Rev Urol 2014;11:118-22. Crossref
2. Gravas S, Cornu JN, Gacci M, et al. EAU guidelines on management of non-neurogenic male lower urinary tract symptoms (LUTS), incl. benign prostatic obstruction (BPO). 2022. Available from: https://d56bochluxqnz.cloudfront.net/documents/full-guideline/EAU-Guidelines-on-Non-Neurogenic-Male-LUTS-2022.pdf. Accessed 8 May 2024.
3. Teoh JY, Chiu PK, Yee CH, et al. Prostatic artery embolization in treating benign prostatic hyperplasia: a systematic review. Int Urol Nephrol 2017;49:197-203. Crossref
4. Abt D, Hechelhammer L, Müllhaupt G, et al. Comparison of prostatic artery embolisation (PAE) versus transurethral resection of the prostate (TURP) for benign prostatic hyperplasia: randomised, open label, non-inferiority trial. BMJ 2018;361:k2338. Crossref
5. Roehrborn CG, Barkin J, Gange SN, et al. Five-year results of the prospective randomized controlled prostatic urethral L.I.F.T. study. Can J Urol 2017;24:8802-13.
6. Magistro G, Chapple CR, Elhilali M, et al. Emerging minimally invasive treatment options for male lower urinary tract symptoms. Eur Urol 2017;72:986-97. Crossref
7. Dixon CM, Rijo Cedano E, Mynderse LA, Larson TR. Transurethral convective water vapor as a treatment for lower urinary tract symptomatology due to benign prostatic hyperplasia using the Rezūm(®) system: evaluation of acute ablative capabilities in the human prostate. Res Rep Urol 2015;7:13-8. Crossref
8. Mynderse LA, Hanson D, Robb RA, et al. Rezūm system water vapor treatment for lower urinary tract symptoms/benign prostatic hyperplasia: validation of convective thermal energy transfer and characterization with magnetic resonance imaging and 3-dimensional renderings. Urology 2015;86:122-7. Crossref
9. McVary KT, Gange SN, Gittelman MC, et al. Minimally invasive prostate convective water vapor energy ablation: a multicenter, randomized, controlled study for the treatment of lower urinary tract symptoms secondary to benign prostatic hyperplasia. J Urol 2016;195:1529-38. Crossref
10. McVary KT, Gittelman MC, Goldberg KA, et al. Final 5-year outcomes of the multicenter randomized sham-controlled trial of a water vapor thermal therapy for treatment of moderate to severe lower urinary tract symptoms secondary to benign prostatic hyperplasia. J Urol 2021;206:715-24. Crossref
11. Dixon CM, Cedano ER, Pacik D, et al. Two-year results after convective radiofrequency water vapor thermal therapy of symptomatic benign prostatic hyperplasia. Res Rep Urol 2016;8:207-16. Crossref
12. Rowaiee R, Akhras A, Lakshmanan J, Sikafi Z, Janahi F. Rezum therapy for benign prostatic hyperplasia: Dubai’s initial experience. Cureus 2021;13:e18083. Crossref
13. Pavone C, Abbadessa D, Scaduto G, et al. Sexual dysfunctions after transurethral resection of the prostate (TURP): evidence from a retrospective study on 264 patients. Arch Ital Urol Androl 2015;87:8-13. Crossref
14. Lee YJ, Lee JW, Park J, et al. Nationwide incidence and treatment pattern of benign prostatic hyperplasia in Korea. Investig Clin Urol 2016;57:424-30. Crossref
15. Hong Kong SAR Government. The 2023-24 Budget. Budget speech. Available from: https://www.budget.gov.hk/2023/eng/budget30.html. Accessed 5 May 2024.
16. Health Bureau, Hong Kong SAR Government. Hong Kong’s domestic health accounts (DHA) 2021/22. Available from: https://www.healthbureau.gov.hk/statistics/en/dha.htm. Accessed 5 May 2024.
17. Mamoulakis C, Ubbink DT, de la Rosette JJ. Bipolar versus monopolar transurethral resection of the prostate: a systematic review and meta-analysis of randomized controlled trials. Eur Urol 2009;56:798-809. Crossref
18. Ulchaker JC, Martinson MS. Cost-effectiveness analysis of six therapies for the treatment of lower urinary tract symptoms due to benign prostatic hyperplasia. Clinicoecon Outcomes Res 2018;10:29-43. Crossref
19. Tanneru K, Jazayeri SB, Alam MU, et al. An indirect comparison of newer minimally invasive treatments for benign prostatic hyperplasia: a network meta-analysis model. J Endourol 2021;35:409-16. Crossref
20. Gupta N, Rogers T, Holland B, Helo S, Dynda D, McVary KT. Three-year treatment outcomes of water vapor thermal therapy compared to doxazosin, finasteride and combination drug therapy in men with benign prostatic hyperplasia: cohort data from the MTOPS trial. J Urol 2018;200:405-13. Crossref

Amniotic fluid gamma-glutamyl transferase for prediction of biliary atresia in cases of non-visualisation of the fetal gallbladder: a retrospective study using a validated analytical platform and local reference range

Hong Kong Med J 2024 Jun;30(3):218–26 | Epub 5 Jun 2024
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Amniotic fluid gamma-glutamyl transferase for prediction of biliary atresia in cases of non-visualisation of the fetal gallbladder: a retrospective study using a validated analytical platform and local reference range
Tommy YT Cheung, MB, ChB1,2; Natalie KL Wong, MB, ChB, MRCOG3; Daljit Singh Sahota, PhD3; Shreenidhi Ranganatha Subramaniam, MB, ChB1; SL Lau, MB, ChB, MRCOG3; X Zhu, MB, BS, PhD3; WT Lui, MPhil3; Edwin KW Chan, FHKAM (Surgery), FRCSEd (Paed)4; Yvonne KY Kwok, PhD3; KW Choy, PhD3; TY Leung, MD, FRCOG3; Michael HM Chan, FHKCPath, FHKAM (Pathology)5; Felix CK Wong, MB, BS, FRCPA5,6 #; YH Ting, FHKAM (Obstetrics and Gynaecology), FRCOG3 #
1 Department of Chemical Pathology, The Chinese University of Hong Kong, Hong Kong SAR, China
2 Department of Chemical Pathology, Princess Margaret Hospital, Hong Kong SAR, China
3 Department of Obstetrics and Gynaecology, The Chinese University of Hong Kong, Hong Kong SAR, China
4 Division of Paediatric Surgery and Paediatric Urology, Department of Surgery, Prince of Wales Hospital, Hong Kong SAR, China
5 Department of Chemical Pathology, Prince of Wales Hospital, Hong Kong SAR, China
6 Division of Chemical Pathology, Department of Pathology, Queen Mary Hospital, Hong Kong SAR, China
# Equal contribution
 
Corresponding author: Dr YH Ting (tingyh@cuhk.edu.hk)
 
 Full paper in PDF
 
Abstract
Introduction: The level of amniotic fluid gamma-glutamyl transferase (AFGGT) may help identify biliary atresia (BA) in cases of non-visualisation of the fetal gallbladder (NVFGB). This study aimed to validate a serum/plasma matrix–based gamma-glutamyl transferase (GGT) assay for amniotic fluid (AF) samples, establish a local gestational age–specific AFGGT reference range, and evaluate the efficacy of AFGGT for predicting fetal BA in pregnancies with NVFGB using the constructed reference range.
 
Methods: The analytical performance of a serum/ plasma matrix–based GGT assay on AF samples was evaluated using a Cobas c502 analyser. Amniotic fluid gamma-glutamyl transferase levels in confirmed euploid singleton pregnancies (16+0 to 22+6 weeks of gestation) were determined using the same analyser to establish a local gestational age–specific reference range (the 2.5th to 97.5th percentiles). This local reference range was used to determine the positive predictive value (PPV) and negative predictive value (NPV) of AFGGT level <2.5th percentile for identifying fetal BA in euploid pregnancies with NVFGB.
 
Results: The serum/plasma matrix–based GGT assay was able to reliably and accurately determine GGT levels in AF samples. Using the constructed local gestational age–specific AFGGT reference range, the NPV and PPV of AFGGT level <2.5th percentile for predicting fetal BA in pregnancies with NVFGB were 100% and 25% (95% confidence interval=0, 53), respectively.
 
Conclusion: In pregnancies with NVFGB, AFGGT level ≥2.5th percentile likely excludes fetal BA. Although AFGGT level <2.5th percentile is not diagnostic of fetal BA, fetuses with AFGGT below this level should be referred for early postnatal investigation.
 
 
New knowledge added by this study
  • A serum/plasma matrix–based gamma-glutamyl transferase (GGT) assay reliably and accurately determines GGT levels in amniotic fluid samples.
  • Using amniotic fluid gamma-glutamyl transferase (AFGGT) level <2.5th percentile to identify biliary atresia (BA) in cases of non-visualisation of the fetal gallbladder (NVFGB), the negative and positive predictive values were 100% and 25%, respectively.
Implications for clinical practice or policy
  • A local gestational age–specific AFGGT reference range (the 2.5th to 97.5th percentiles) is available for clinical use.
  • In pregnancies with NVFGB, AFGGT level ≥2.5th percentile likely excludes fetal BA; although AFGGT level <2.5th percentile is not diagnostic of BA, it is an important indicator of the need for early postnatal investigation.
 
 
Introduction
Non-visualisation of the fetal gallbladder (NVFGB) in the second trimester is a rare condition affecting 0.1% of pregnancies.1 It might be a transient finding—the gallbladder is visible later in pregnancy or after birth in 70% of cases.2 Persistent NVFGB may be associated with benign conditions (eg, gallbladder agenesis); it may also be a manifestation of serious disorders, such as biliary atresia (BA), cystic fibrosis, or chromosomal abnormalities.3 Although chromosomal abnormalities and cystic fibrosis can be identified prenatally via chromosomal microarray (CMA) and sequencing of the cystic fibrosis transmembrane conductance regulator gene, respectively, it remains challenging to diagnose BA before birth because no diagnostic prenatal test is currently available. Biliary atresia is a devastating condition and is the leading indication for liver transplantation in childhood.4 Prenatal suspicion of BA allows prompt postnatal assessment and early diagnosis, permitting timely intervention via Kasai hepatoportoenterostomy and resulting in improved outcomes.4
 
The measurement of gamma-glutamyl transferase (GGT) in amniotic fluid (AF) has been suggested as a method for prenatal detection of BA. This transferase is secreted by the fetal biliary tract, passes into the intestines, and is ultimately excreted into the amniotic cavity. It becomes detectable in AF at around 14 weeks of gestation upon maturation of the intestinal villi and opening of the cloacal membrane.5 6 7 Biliary tract obstructions, such as BA, hinder the passage of GGT into the intestines and AF, leading to reduced levels of amniotic fluid gamma-glutamyl transferase (AFGGT). Muller et al5 first reported extremely low AFGGT levels in three fetuses with extrahepatic bile duct obstruction at 18 to 19 weeks of gestation; they concluded that a low AFGGT level could be a useful indicator of BA. Subsequently, Burc et al,6 Dreux et al,8 and Bardin et al9 reported similar findings. Nevertheless, existing platforms for analysis of GGT in serum or plasma samples have not been thoroughly validated for use with AF samples. Furthermore, an appropriate reference range, essential for the interpretation of AFGGT results, is difficult to establish due to the limited availability of AF samples from normal pregnancies. Thus, publications regarding gestational age–specific reference ranges for AFGGT have been scarce.
 
Burc et al6 established reference values for five AF enzymes (AFEs), including GGT, using the Hitachi 911 analyser (Roche Diagnostics). Despite a large sample size of 508, no separate reference range was constructed for each gestational age from 20 to 24 weeks, limiting clinical use of the findings.6 Bardin et al7 established another reference range for AFGGT, from 16 to 22 weeks, using the Integra 800 analyser (Roche). However, the numbers of samples at weeks 16, 21, and 22 were small and had relatively large standard deviations, precluding clinical application.7 Both reference ranges were derived from Caucasian populations. Because the GGT level in adult blood varies according to ethnicity,10 it is likely that AFGGT levels also vary according to ethnicity; thus, there is a need to establish a local AFGGT reference range. Furthermore, both previous reference ranges covered the 5th to 95th percentiles. A wider reference range, from the 2.5th to 97.5th percentiles, would allow greater flexibility in selecting cut-off values for clinical application.
 
In this study, we aimed to validate a serum/plasma matrix–based GGT assay for AF samples, establish a local gestational age–specific reference range for AFGGT (from the 2.5th to 97.5th percentiles), and evaluate the efficacy of AFGGT for predicting fetal BA in pregnancies with NVFGB using the constructed reference range.
 
Methods
This retrospective study used archived AF supernatant obtained from amniocentesis procedures that had been conducted to exclude fetal chromosomal abnormalities between January 2012 and December 2021. All amniocenteses were performed under ultrasound guidance with aseptic technique by fetal-maternal medicine specialists who also performed detailed ultrasound examinations to document the presence or absence of fetal structural abnormalities. All AF samples were centrifuged at 100 g for 10 minutes; 1 mL of the resulting supernatant was stored at -80℃ in plain aliquots without additives (Axygen; BioGene, Union City [CA], United States), whereas the pellet was used for either CMA and/or karyotyping by G-banding analysis. The cytogenetic laboratory information system recorded the ultrasound findings, indication for amniocentesis, gestational age at amniocentesis, and CMA and/or karyotype results. Pregnancy and fetal outcomes, including the presence of any abnormalities at birth or autopsy findings, were recorded in each patient’s electronic records.
 
Patients
Archived AF samples with known fetal and pregnancy outcomes, taken at 16+0 to 22+6 weeks of gestation from singleton pregnancies with a euploid fetus (confirmed by CMA or karyotype) during the period from January 2018 to December 2020, were retrieved to establish a gestational age–specific reference range for GGT. Pregnancies with fetal chromosomal, gastrointestinal, or hepatobiliary anomalies (particularly BA) polyhydramnios, or oligohydramnios were excluded to eliminate potential confounding effects of these conditions.
 
Non-visualisation of the fetal gallbladder was incidentally detected during fetal morphology scans in pregnancies with risk factors for fetal abnormalities because the fetal gallbladder was not assessed in low-risk routine anomaly scans, in accordance with the International Society of Ultrasound in Obstetrics and Gynecology guideline.11 It was defined as failure to visualise the fetal gallbladder on two targeted ultrasound examinations performed 1 week apart. Isolated NVFGB was defined as NVFGB in the absence of other abnormal ultrasound findings. Pregnant women were counselled regarding possible differential diagnoses and offered amniocentesis for chromosomal analysis, as well as repeated ultrasound scans until the fetal gallbladder was visible. After birth, babies with persistent NVFGB were referred to paediatricians for hepatobiliary ultrasound and liver function tests. If the gallbladder was visible during prenatal scans, paediatricians did not order further tests in the absence of clinical suspicion. We followed the progress of all babies until the time of writing, using electronic hospital records for those delivered in public hospitals and phone calls for those delivered in private hospitals. With parental consent, post-mortem examinations were arranged in pregnancies terminated for serious associated fetal abnormalities.
 
Validation and analytical performance evaluation
Amniotic fluid samples were removed from -80℃ storage in batches, thawed, and equilibrated to room temperature immediately prior to analysis. Gamma-glutamyl transferase levels were determined using an International Federation of Clinical Chemistry and Laboratory Medicine–standardised L-gamma-glutamyl-3-carboxy-4-nitroanilide (GGCNA) enzymatic colorimetric assay on a Cobas c502 analyser (Roche, Basel, Switzerland). Internal quality controls were performed before and after each batch.
 
Details of the AFGGT assay validation and analytical performance, including matrix effects; linearity; intra- and inter-run precision at various GGT levels; interference due to haemolysis, icterus, and lipaemia; and sample stability, are summarised in online supplementary Appendix 1.
 
Establishment of reference range
An AFGGT reference range was developed using the Generalised Additive Models for Location (μ), Scale (υ) and Shape (σ) [GAMLSS] package in R statistical software (version 3.3.2; R Foundation for Statistical Computing, Vienna, Austria). All GGT values were transformed to their natural log equivalent before model construction; the final model balanced between percentile smoothness, goodness-of-fit, and simplicity. Model fit was assessed using the generalised Akaike information criterion and by inspection of residuals with quantile-quantile plots for all measurements, detrending of quantile-quantile plots, and comparison of empirical percentiles to fitted percentiles. Empirical percentiles were determined for comparative purposes by grouping GGT levels according to gestational age (in weeks).
 
The final model was used to determine reference values for the 2.5th, 5th, 50th, 95th (z = ± 1.645), and 97.5th percentiles (z = ± 1.964). Percentiles were determined using the expression μ × (1 + zpυσ)1/υ, where zp is the percentile of interest and μ, υ, and σ are dependent on the time covariate, gestational age. The GGT reference range was constructed using R statistical software and Microsoft Excel (Microsoft Corporation, Redmond [WA], United States).
 
We estimated a priori that 293 AFGGT measurements were needed to achieve a standard error of 10% of the gestational age–specific standard deviation for the 2.5th and 97.5th (z=1.96) reference percentiles, assuming that the standard error of the percentile of interest is expressed as a multiple of standard deviation using the formula , where each gestational age between 16+0 and 22+6 weeks has a minimum of 42 measurements.
 
Performance evaluation
Measured levels of AFGGT in pregnancies with NVFGB were transformed to their gestational age–specific percentile values using the final model. We then determined the positive predictive value (PPV) and negative predictive value (NPV) of AFGGT level <2.5th percentile for identifying BA in euploid pregnancies with NVFGB.
 
Results
Validation and analytical performance
The performance of the GGT assay using AF is summarised in online supplementary Appendix 2. Validation studies indicated that the GGCNA enzymatic colorimetric assay for determination of GGT activity in AF had linearity, precision, recovery, interference profiles, and stability comparable to the values reported for measurement of GGT in plasma and serum samples. The verified analytical measurement range was 10 to 1200 U/L. No significant interference was observed in the presence of 0.25 g/dL haemoglobin, 103 μmol/L bilirubin, or 16.8 mmol/L triglyceride. The limits of haemolysis/icterus/lipaemia indices above which interference occurred were significantly higher than the degrees of those indices in all analysed samples.
 
Gestational age–specific reference range
A database search identified 518 stored amniotic samples (502 [97%] Chinese and 16 [3%] other Asian ethnicities) suitable for use in establishing a local gestational age–specific AFGGT reference range. The median number of samples per week was 65; there were 2 weeks with <42 samples (27 and 28 samples in the 16th and 19th week of gestation, respectively). The online supplementary Table lists the regressed values of AFGGT levels according to percentile for each week of gestation.
 
The simplest best-fit model indicated a linear relationship between natural log-transformed GGT and gestational age. Table 1 and the online supplementary Figure show the final smoothing equations for median, coefficient of variation, and skewness, along with gestational age–specific smoothed percentile curves and values of AFGGT for our local population. Residuals of the final model had a mean skewness of 0 and a variance of 1; they were almost mesokurtic, with kurtosis values close to 3.
 

Table 1. Smoothing equations determined by modelling for median, coefficient of variation, and skewness used to compute z-scores and percentiles of the natural logarithm of gestational age–specific gamma-glutamyl transferase level
 
Performance evaluation
The database search identified 32 pregnancies with NVFGB from 2012 to 2021, of which 18 had an available AF sample (four isolated NVFGB and 14 non-isolated NVFGB). There were no cases of cystic fibrosis. Nine cases were excluded from analysis, including five with chromosomal abnormalities, one with renal hypoplasia and oligohydramnios, one with hydrops and polyhydramnios, and two in which the biliary tract anatomy could not be identified. The nine remaining cases for analysis included four with isolated NVFGB and five with non-isolated NVFGB (Table 2). The median gestational age at amniocentesis was 21.0 weeks (interquartile range=20.7-22.1).
 

Table 2. Characteristics of the nine cases of non-visualisation of the fetal gallbladder included in the current study
 
The Figure depicts the AFGGT levels in the nine pregnancies for analysis compared with our local gestational age–specific AFGGT reference range. Four cases had AFGGT level <2.5th percentile, including one with BA (AFGGT level of 27 U/L at 20+3 weeks) and three with transient non-visualisation (AFGGT levels of 32, 68, and 37 U/L at 20+0, 20+6, and 22+2 weeks, respectively). Five had AFGGT level ≥2.5th percentile, including one with gallbladder agenesis and four with transient non-visualisation (Table 2). Using AFGGT level <2.5th percentile as the cut-off, the NPV and PPV for identifying fetal BA in pregnancies with NVFGB were 100% and 25% (95% confidence interval=0, 53), respectively (Table 3). Repeated analysis using AFGGT level <5th percentile or the reference ranges from Burc et al6 or Bardin et al7 yielded identical results.
 

Figure. Amniotic fluid gamma-glutamyl transferase (AFGGT) levels in cases with non-visualisation of the fetal gallbladder compared with the gestational age–specific AFGGT reference range
 

Table 3. Reported efficacies of amniotic fluid enzymes for the prediction of biliary atresia in cases of non-visualisation of the fetal gallbladder
 
Discussion
Biliary atresia is a rare congenital anomaly, with a prevalence of 1 in 15 000 to 20 000 live births among Caucasian populations.12 However, BA is more common in East Asians; the prevalence is 1 in 5000 to 7000 among Chinese populations.13 14 Untreated BA is a progressive and devastating disease that can cause cirrhosis and death by 2 years of age.14 This outcome can be prevented by early intervention via palliative Kasai hepatoportoenterostomy, which is essential for re-establishing biliary drainage. If biliary drainage cannot be re-established, liver transplantation is necessary. Indeed, BA is the most common indication for liver transplantation in children, contributing to 75% of transplantations in children before 2 years of age.12 It is therefore imperative to diagnose BA early, preferably during the prenatal period. Nevertheless, prenatal diagnosis of BA is challenging because ultrasound cannot directly examine the patency of fetal bile ducts. When NVFGB is associated with a hepatic hilar cyst or heterotaxy, it is highly suggestive of BA.15 Non-visualisation of the fetal gallbladder with a hepatic hilar cyst is an indicator of cystic BA, a rare subtype representing 5% to 10% of BA cases16; therefore, this prenatal combination is uncommon. Heterotaxy is another rare condition with a prevalence of 1 in 10 000 live births,17 and concurrent BA is present in only 10.4% of left atrial isomerism cases18; therefore, this prenatal combination is even less common. Consequently, NVFGB may be the only prenatal sign indicating the possibility of BA. However, in cases of isolated NVFGB, it is difficult to differentiate between BA and gallbladder agenesis. The discovery of the association between low AFGGT levels and fetal BA has led to interest regarding the role of AFGGT in the management of NVFGB.
 
Validation and analytical performance
To our knowledge, this study is the first to validate the International Federation of Clinical Chemistry and Laboratory Medicine–standardised GGCNA enzymatic colorimetric assay on the Cobas c502 analyser, a common locally available plasma and serum analyser, for use with AF. We have demonstrated that accurate and precise measurements of GGT can be achieved with AF samples, enabling adoption in clinical settings. We have also established that the analytical measurement range is 10 to 1200 U/L. This is particularly important for an AFGGT assay because AFGGT levels in euploid pregnancies can vary across multiple orders of magnitude, whereas plasma GGT levels in healthy individuals usually remain below 100 U/L. This verification of the lower limit of quantification and linearity range improves confidence in our measurements. Furthermore, we have excluded potential interference, particularly from haemoglobin, because AF samples may sometimes contain maternal blood. We were initially concerned about GGT stability because some samples had been stored for several years; however, consistent with the World Health Organization report that GGT is stable for years in frozen serum and plasma samples,19 we found that AF stored frozen in plain bottles without additives at -20℃ or -80℃ remained stable for at least 6 months (online supplementary Appendix 1). This finding indicates that supernatants can be stored and subsequently retrieved for GGT assays, an important consideration if amniocentesis is performed in the early second trimester but an indication for AFGGT testing is identified during a mid-trimester morphology scan.
 
Gestational age–specific reference range
We have established a reference range for AFGGT levels at 16+0 to 22+6 weeks of gestation using a large local reference population of 518 samples (all Asian, 97% Chinese); this reference population is the largest compared with similar previous publications. The presence of an adequate sample size for each week of gestation allowed us to establish a reference range for each gestational age, thus overcoming the aforementioned limitations regarding clinical use of the two previous reference ranges.6 7 With respect to the two previous reference ranges, the 5th, 50th, and 95th percentiles in our study are similar to those reported by Bardin et al7 but higher at most gestational ages than those reported by Burc et al.6 The larger sample size in our study permitted the calculation of the 2.5th and 97.5th percentiles, such that a 95% central reference range could be established; this allows greater flexibility in selecting cut-off values for clinical application.
 
It has been reported that AFGGT levels are increased in oesophageal atresia and duodenal atresia, whereas they are decreased in anal atresia without fistula.20 21 Although duodenal atresia can easily be diagnosed prenatally by detection of the double bubble sign, this sign usually appears after 24 weeks of gestation.22 Prenatal diagnosis of oesophageal atresia relies on the indirect sign of non-visualisation of the stomach bubble and polyhydramnios; the sensitivities of these ultrasound findings range from 8.9% to 42%.20 Additionally, prenatal detection of anal atresia without fistula depends on the absence of the perianal muscular complex, but the anal sphincter does not fully mature until after 28 weeks of gestation.23 Further research regarding the use of AFGGT for early detection of these congenital gastrointestinal tract obstructions is valuable, and the availability of a local gestational age–specific reference range is crucial for supporting such research.
 
Performance evaluation
In the present study, the NPV and PPV of AFGGT level <2.5th percentile for identifying fetal BA in NVFGB were 100% and 25%, respectively (Table 3). Bardin et al9 assessed the efficacy of low AFGGT levels in predicting BA among cases of NVFGB between 17 and 22 weeks of gestation. Of the 26 cases with AFGGT level ≥5th percentile, none had BA; of the four cases with AFGGT level <5th percentile, three had BA. The corresponding NPV and PPV were 100% and 75%, respectively. The PPV in their study may have been higher because amniocentesis was performed before 22 weeks of gestation in all cases.9 In our cohort, after exclusion of the two cases in which amniocentesis was performed after 22 weeks of gestation, the PPV only marginally improved to 33.3%. Our figures are more consistent with those of Dreux et al,8 who analysed the efficacy of AFEs for predicting BA in NVFGB before and after 22 weeks of gestation. In that study, abnormal AFE was defined as GGT and/or intestinal alkaline phosphatase level <0.5 multiples of the median.8 Before 22 weeks of gestation, there were three cases of BA among seven cases with abnormal AFE and no cases of BA among 16 cases with normal AFE. The corresponding NPV and PPV were 100% and 43%, respectively. However, after 22 weeks of gestation, there was only one case of BA among six cases with abnormal AFE and four cases of BA among 56 cases with normal AFE. The corresponding NPV and PPV were 93% and 17%, respectively (Table 3). These findings confirmed the expected decrease in AFE efficacy for predicting BA after 22 weeks of gestation. By that gestational age, the passage of GGT from the intestine into the AF is impeded by mature anal sphincter muscles in normal fetuses; thus, the AFGGT level is very low after 22 weeks of gestation, and it is difficult to distinguish between a low level due to BA and a low level related to normal development.5 6 7
 
In our cohort, there were three fetuses without BA who had AFGGT level ≤2.5th percentile; one of these fetuses had an extremely low AFGGT level (Case 4) [Table 2]. In addition to its association with fetal BA, low AFGGT is linked to chromosomal abnormalities, cystic fibrosis, anal atresia without fistula, and polyhydramnios. However, none of the three fetuses had any of these conditions. Although one fetus (Case 2) [Table 2] had a small choledochal cyst which might have impeded biliary drainage and caused a mild decrease in AFGGT, we could not find a potential explanation for the low AFGGT levels in the other two fetuses.
 
Limitations
Limitations of our study include its retrospective nature and small cohort size. The incidence of NVFGB is low (0.1%).1 In the largest systematic review concerning the outcomes of NVFGB, encompassing seven studies, the total number of cases was 280; among 170 cases of isolated NVFGB, only six (3.5%) had BA.2 In Hong Kong, the fetal gallbladder is not routinely assessed in low-risk routine anomaly scans, in accordance with the International Society of Ultrasound in Obstetrics and Gynecology guideline.11 24 Among the 32 cases of NVFGB in our cohort, one (3.1%) had BA; this incidence is comparable to the findings in the aforementioned review. However, only 18 cases had an AF sample available for AFGGT testing; after the exclusion of cases with abnormalities that might affect the AFGGT level, we included nine cases in the analysis. Because there was only one case of BA in our cohort and 11 more were described in the literature (Table 3), clinical application of these research findings requires caution, as well as careful pre- and post-test counselling.
 
In addition to the one case of BA in this cohort, two additional cases of BA were not included in this study because prenatal ultrasound scans did not indicate NVFGB. The AFGGT levels in all three cases were <30 U/L and <2.5th percentile (27, 28, and 29 U/L at 20+3, 21+5, and 21+6 weeks, respectively), whereas the AFGGT levels in all samples used to establish our reference range were >30 U/L. Thus, an AFGGT level <30 U/L may be a useful absolute cut-off for the prediction of BA. Notably, all three cases of extrahepatic bile duct obstruction reported by Muller et al5 also had an AFGGT level <30 U/L (20 U/L for all three cases [<1st percentile]). Further research with a larger cohort is required to confirm the efficacy of using an AFGGT level <30 U/L as the absolute cut-off for predicting BA in NVFGB.
 
Conclusion
Based on the present findings and published literature, we conclude that AFGGT testing is useful for the exclusion of fetal BA in pregnancies with NVFGB. With a consistent NPV of 100% across all published series, AFGGT level ≥2.5th percentile can provide reassurance for parents that the fetus is unlikely to have BA. However, considering its PPV of 33.3% to 75% before 22 weeks of gestation and 17% to 43% after 22 weeks of gestation, AFGGT level <2.5th percentile cannot be considered diagnostic for BA. Instead, it serves as a warning sign, indicating the need for prompt postnatal investigation of possible BA. Because NVFGB is also associated with chromosomal abnormalities, amniocentesis is recommended; the advantages of detecting underlying chromosomal abnormalities by CMA and excluding BA through AFGGT testing likely outweigh the 0.3% risk of procedure-related miscarriage.25 Follow-up prenatal ultrasound scans to visualise the fetal gallbladder should be arranged. Paediatricians should also be alerted for prompt postnatal assessment to facilitate early detection of BA. Timely performance of the Kasai operation can reduce the need for liver transplantation in childhood and improve the rate of overall survival into adulthood to 90%.26 27
 
Author contributions
Concept or design: YH Ting, DS Sahota, FCK Wong, TYT Cheung.
Acquisition of data: TYT Cheung, SR Subramaniam, FCK Wong, MHM Chan, YH Ting, NKL Wong, SL Lau, X Zhu, WT Lui, EKW Chan, YKY Kwok, KW Choy, TY Leung.
Analysis or interpretation of data: TYT Cheung, DS Sahota, FCK Wong, YH Ting, NKL Wong.
Drafting of the manuscript: YH Ting, TYT Cheung, DS Sahota, NKL Wong, FCK Wong, SR Subramaniam.
Critical revision of the manuscript for important intellectual content: DS Sahota, YH Ting, TYT Cheung, FCK Wong, NKL Wong.
 
All authors had full access to the data, contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
All authors have disclosed no conflicts of interest.
 
Funding/support
This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
 
Ethics approval
This research was approved by the Joint Chinese University of Hong Kong—New Territories East Cluster Clinical Research Ethics Committee, Hong Kong (Ref No.: CRE 2020.060). Informed consent for amniocentesis in the current study and storage and use of excess amniotic fluid in future research was obtained from the patients at the time of amniocentesis.
 
Supplementary material
The supplementary material was provided by the authors and some information may not have been peer reviewed. Accepted supplementary material will be published as submitted by the authors, without any editing or formatting. Any opinions or recommendations discussed are solely those of the authors and are not endorsed by the Hong Kong Academy of Medicine and the Hong Kong Medical Association. The Hong Kong Academy of Medicine and the Hong Kong Medical Association disclaim all liability and responsibility arising from any reliance placed on the content.
 
References
1. Blazer S, Zimmer EZ, Bronshtein M. Nonvisualization of the fetal gallbladder in early pregnancy: comparison with clinical outcome. Radiology 2002;224:379-82. Crossref
2. Di Pasquo E, Kuleva M, Rousseau A, et al. Outcome of non-visualization of fetal gallbladder on second-trimester ultrasound: cohort study and systematic review of literature. Ultrasound Obstet Gynecol 2019;54:582-8. Crossref
3. Ting YH, So PL, Cheung KW, Lo TK, Ma TW, Leung TY. Non-visualisation of fetal gallbladder in a Chinese cohort. Hong Kong Med J 2022;28:116-23. Crossref
4. Parolini F, Boroni G, Milianti S, et al. Biliary atresia: 20-40-year follow-up with native liver in an Italian centre. J Pediatr Surg 2019;54:1440-4. Crossref
5. Muller F, Gauthier F, Laurent J, Schmitt M, Boué J. Amniotic fluid GGT and congenital extrahepatic biliary damage. Lancet 1991;337:232-3. Crossref
6. Burc L, Guibourdenche J, Luton D, et al. Establishment of reference values of five amniotic fluid enzymes. Analytical performances of the Hitachi 911. Application to complicated pregnancies. Clin Biochem 2001;34:317-22. Crossref
7. Bardin R, Danon D, Tor R, Mashiach R, Vardimon D, Meizner I. Reference values for gamma-glutamyl-transferase in amniotic fluid in normal pregnancies. Prenat Diagn 2009;29:703-6. Crossref
8. Dreux S, Boughanim M, Lepinard C, et al. Relationship of non-visualization of the fetal gallbladder and amniotic fluid digestive enzymes analysis to outcome. Prenat Diagn 2012;32:423-6. Crossref
9. Bardin R, Ashwal E, Davidov B, Danon D, Shohat M, Meizner I. Nonvisualization of the fetal gallbladder: can levels of gamma-glutamyl transpeptidase in amniotic fluid predict fetal prognosis? Fetal Diagn Ther 2016;39:50-5. Crossref
10. Stewart SH, Connors GJ, Hutson A. Ethnicity and gamma-glutamyltransferase in men and women with alcohol use disorders. Alcohol Alcohol 2007;42:24-7. Crossref
11. Salomon LJ, Alfirevic Z, Berghella V, et al. Practice guidelines for performance of the routine mid-trimester fetal ultrasound scan. Ultrasound Obstet Gynecol 2011;37:116-26. Crossref
12. Hartley JL, Davenport M, Kelly DA. Biliary atresia. Lancet 2009;374:1704-13. Crossref
13. Hsiao CH, Chang MH, Chen HL, et al. Universal screening for biliary atresia using an infant stool color card in Taiwan. Hepatology 2008;47:1233-40. Crossref
14. Zhan J, Chen Y, Wong KK. How to evaluate diagnosis and management of biliary atresia in the era of liver transplantation in China. World J Paediatr Surg 2018;1:e000002. Crossref
15. Chalouhi GE, Muller F, Dreux S, Ville Y, Chardot C. Prenatal non-visualization of fetal gallbladder: beware of biliary atresia! Ultrasound Obstet Gynecol 2011;38:237-8. Crossref
16. Rahamtalla D, Al Rawahi Y, Jawa ZM, Wali Y. Cystic biliary atresia in a neonate with antenatally detected abdominal cyst. BMJ Case Rep 2022;15:e246081. Crossref
17. Lin AE, Ticho BS, Houde K, Westgate MN, Holmes LB. Heterotaxy: associated conditions and hospital-based prevalence in newborns. Genet Med 2000;2:157-72. Crossref
18. Gottschalk I, Stressig R, Ritgen J, et al. Extracardiac anomalies in prenatally diagnosed heterotaxy syndrome. Ultrasound Obstet Gynecol 2016;47:443-9. Crossref
19. Ehret W. Use of Anticoagulants in Diagnostic Laboratory Investigations. Geneva: World Health Organization; 1999.
20. Czerkiewicz I, Dreux S, Beckmezian A, et al. Biochemical amniotic fluid pattern for prenatal diagnosis of esophageal atresia. Pediatr Res 2011;70:199-202. Crossref
21. Muller C, Czerkiewicz I, Guimiot F, et al. Specific biochemical amniotic fluid pattern of fetal isolated esophageal atresia. Pediatr Res 2013;74:601-5. Crossref
22. The Fetal Medicine Foundation. Duodenal atresia. Available from: https://fetalmedicine.org/education/fetal-abnormalities/gastrointestinal-tract/duodenal-atresia. Accessed 19 Jan 2023.
23. Ochoa JH, Chiesa M, Vildoza RP, Wong AE, Sepulveda W. Evaluation of the perianal muscular complex in the prenatal diagnosis of anorectal atresia in a high-risk population. Ultrasound Obstet Gynecol 2012;39:521-7. Crossref
24. Chen M, Leung TY, Sahota DS, et al. Ultrasound screening for fetal structural abnormalities performed by trained midwives in the second trimester in a low-risk population—an appraisal. Acta Obstet Gynecol Scand 2009;88:713-9. Crossref
25. Salomon LJ, Sotiriadis A, Wulff CB, Odibo A, Akolekar R. Risk of miscarriage following amniocentesis or chorionic villus sampling: systematic review of literature and updated meta-analysis. Ultrasound Obstet Gynecol 2019;54:442-51. Crossref
26. Serinet MO, Wildhaber BE, Broué P, et al. Impact of age at Kasai operation on its results in late childhood and adolescence: a rational basis for biliary atresia screening. Pediatrics 2009;123:1280-6. Crossref
27. Davenport M. Biliary atresia: clinical aspects. Semin Pediatr Surg 2012;21:175-84. Crossref

Healthy Eating Report Card for Pre-school Children in Hong Kong

Hong Kong Med J 2024 Jun;30(3):209–17 | Epub 21 May 2024
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Healthy Eating Report Card for Pre-school Children in Hong Kong
Alison WL Wan, MSc; Kevin KH Chung, PhD; JB Li, MEd, PhD; Derwin KC Chan, MSc, PhD
Department of Early Childhood Education, The Education University of Hong Kong, Hong Kong SAR, China
 
Corresponding author: Dr Derwin KC Chan (derwin@eduhk.hk)
 
 Full paper in PDF
 
Abstract
Introduction: This study aimed to develop the Healthy Eating Report Card for Pre-school Children in Hong Kong for evaluating the prevalence of healthy eating behaviours and favourable family home food environments (FHFEs) among pre-school children in Hong Kong.
 
Methods: In this cross-sectional study, 538 parent-child dyads from eight kindergartens in Hong Kong were recruited. Parents or guardians completed a questionnaire comprising Report Card items. The Report Card included two indicators of Children’s Eating Behaviours (ie, Children’s Dietary Patterns and Children’s Mealtime Behaviours) and three indicators of FHFEs (ie, Parental Food Choices and Preparation, Avoidance of Unhealthy Foods, and Family Mealtime Environments). Each indicator and its specific items were assigned a letter grade representing the percentage of participants achieving the predefined benchmarks. The grades were defined as A (≥80%, Excellent); B (60%-79%, Good); C (40%-59%, Fair); D (20%-39%, Poor); and F (<20%, Very poor). Plus (+) and minus (-) signs were used to indicate the upper or lower 5% of each grade.
 
Results: Overall, Children’s Eating Behaviours were classified as Fair (average grade of ‘C’), whereas FHFEs were classified as Good (average grade of ‘B’). The sub-grades ranged from ‘C’ to ‘A-’, as follows: Children’s Dietary Patterns, ‘C+’; Children’s Mealtime Behaviours, ‘C’; Parental Food Choices and Preparation, ‘C+’; Avoidance of Unhealthy Foods, ‘B’; and Family Mealtime Environments, ‘A-’.
 
Conclusion: The findings highlight areas for improvement in healthy eating among children. The Healthy Eating Report Card could offer novel insights into intervention tools that promote healthy eating.
 
 
New knowledge added by this study
  • Eating behaviours among pre-school children in Hong Kong were classified as Fair (average grade of ‘C’).
  • Among those children, family home food environments (FHFEs) were classified as Good (average grade of ‘B’).
  • There is considerable potential for improvement in children’s dietary patterns, children’s mealtime behaviours, and parental food choices and preparation.
Implications for clinical practice or policy
  • The Healthy Eating Report Card for Pre-school Children can be considered a useful tool for evaluating the prevalence of healthy eating behaviours and favourable FHFEs among pre-school children.
  • The grades provided by the Report Card offer valuable guidance concerning how healthy eating behaviours and favourable FHFEs among pre-school children could be promoted at the family, school, and community levels.
 
 
Introduction
Unhealthy dietary patterns, generally characterised by low dietary diversity, skipping breakfast, low consumption of fruits and vegetables, and frequent consumption of energy-dense/nutrient-poor foods and sugar-sweetened beverages, are common among children worldwide.1 2 It is particularly important for young children to adopt healthy dietary patterns because eating habits and food preferences in childhood can influence dietary patterns in adulthood.3 Similar to children in other regions, Hong Kong children have a high prevalence of unhealthy dietary patterns.4 A cross-sectional survey evaluating infant and young child feeding practices in Hong Kong identified numerous dietary problems, including dietary imbalance (eg, high protein but low fibre intake), overdependence on the use of formula milk, inadequate intake of vegetables and fruits, and unhealthy snacking and sugary beverage habits.5 A previous study revealed a need for dietary improvement among Hong Kong pre-school children.6 Key recommendations included a balanced diet, better nutritional adequacy, and greater independence during mealtimes (ie, self-feeding).6 Indeed, inappropriate behaviours during mealtimes, such as lack of self-feeding, food refusal, picky eating, and prolonged meals, are common in young children.7 8 Children with inappropriate mealtime behaviours may be susceptible to insufficient nutrient and/or energy intake.9 However, there is limited information available regarding the mealtime behaviours of typically developing pre-school children in Hong Kong.
 
Family home food environments (FHFEs), ie, parental food choices and preparation, avoidance of unhealthy foods, and family mealtime environments, have strong associations with children’s dietary habits and body weight.10 11 12 Parental use of nutrition labels to make healthier food choices has been linked to a lower probability of overweight or obesity in their children.13 Similarly, children with limited access to unhealthy foods are reportedly more likely to maintain a normal body weight.14 A review focusing on the effects of family and social environment on children’s dietary patterns found that a structured mealtime environment—namely, regular meals with family members and screen-free mealtimes—was associated with healthy food consumption patterns in children.15 In summary, children’s mealtime behaviours and FHFEs are important for healthy eating among young children. Thus far, no studies have provided an overview of eating behaviours and FHFEs among pre-school children in Hong Kong.
 
The use of a report card at the country-/region-level can provide a valuable overview of the prevalence of health behaviours through a conventional letter grading system (ranging from A+ to F).16 This framework has been used to evaluate various health-related behaviours (eg, physical activity,17 18 19 sedentary behaviour,17 18 19 smoking behaviour,20 and dietary patterns20 21 22) in children and youth. The findings of such report cards offer insights concerning the extent to which health behaviours are adopted in specific communities and provide targeted recommendations for health behaviours at the individual or public level.16 23 Moreover, the publication of these report cards can facilitate health promotion awareness and catalyse policy changes that motivate individuals to commit to health behaviours.16 19 24 Thus far, only one published report card (the Healthy Active Kids South Africa [HAKSA] Report Card) has revealed dietary patterns among children and youth.21 However, the HAKSA Report Card only covered the intake of fruits, vegetables, and unhealthy snacks; other essential aspects of healthy eating (eg, daily breakfast consumption, dietary variety, mealtime behaviours, and FHFEs) were not considered. Additionally, because the evidence underlying the grading criteria for various aspects of healthy eating was not explicitly stated, the findings of the HAKSA Report Card might not provide useful benchmarks concerning how well individuals adhere to healthy eating standards or recommendations adopted by health authorities. Therefore, we aimed to address the aforementioned research gaps through a cross-sectional study that developed the Healthy Eating Report Card for Pre-school Children in Hong Kong, using a grading scale and evidence-based benchmarks to assess the current prevalence of healthy eating behaviours and favourable FHFEs among pre-school children in Hong Kong.
 
Our Report Card assesses various indicators of Children’s Eating Behaviours (ie, Children’s Dietary Patterns and Children’s Mealtime Behaviours) and FHFEs (ie, Parental Food Choices and Preparation, Avoidance of Unhealthy Foods, and Family Mealtime Environments). Each indicator and its specific items are assigned letter grades based on predefined benchmarks, revealing how well pre-school children in Hong Kong meet the recommendations and standards established by the government and published literature concerning pre-school children’s healthy eating behaviours and FHFEs (Tables 125 26 27 28 29 30 31 and 223). The findings may enhance the understanding of eating behaviours and FHFEs among pre-school children in Hong Kong. Additionally, the Healthy Eating Report Card established in our study will be useful for future studies examining healthy eating among young children in other countries or regions.
 
Methods
Participants
We recruited 538 Hong Kong parent-child dyads from eight local kindergartens in three main regions of Hong Kong (29.55% of children from the New Territories, 62.64% of children from Kowloon, and 7.81% of children from Hong Kong Island). The children were aged between 2 and 6 years with a mean age of 4.10 years (standard deviation=0.92); 49.63% of the children were boys. Of the children, 33.77%, 29.46%, and 36.77% were in grades K1, K2, and K3, respectively. Respondents were mainly mothers (85.63%), followed by fathers (13.25%) and other legal guardians (1.12%). The mean respondent age was 36.62 years (standard deviation=5.84).
 
Procedures
This cross-sectional study examined the prevalence of healthy eating behaviours and favourable FHFEs among Hong Kong pre-school children in October 2021. We sent invitation letters to 89 randomly selected local kindergartens across 18 districts in Hong Kong (excluding international schools and special needs schools). Eight kindergartens agreed to participate in this study and distribute our questionnaire to eligible parents. The inclusion criteria required participants to: (1) be Chinese parents or guardians; (2) have at least one child in grades K1 to K3; and (3) have sufficient Chinese reading ability to complete the questionnaire. Schools and parents were both asked to provide written informed consent. Parents completed a parent-reported questionnaire (comprising the Healthy Eating Report Card items) about their children’s eating behaviours and FHFEs, which typically required 15 minutes to complete. Respondents received a HK$50 supermarket voucher as a token of appreciation for their participation.
 
Report Card questionnaire
The design of our Healthy Eating Report Card was based on the conceptual framework established by the Report Card on Physical Activity for Children and Youth, which offers a comprehensive grading framework and benchmarks to evaluate health behaviours.16 17 18 19 20 21 22 23 The Global Matrix 3.0 Physical Activity Report Card for Children and Youth has been used in 49 countries to evaluate the prevalence of physical activity behaviours among children and youth.23 Using a similar assessment framework, our Healthy Eating Report Card evaluated the prevalence of healthy eating behaviours and favourable FHFEs among Hong Kong pre-school children. The Healthy Eating Report Card Questionnaire consisted of 21 items which were developed based on the healthy eating guidelines and recommendations of the Department of Health of Hong Kong.25 26 The items were aligned with the five indicators of our Report Card, including two Children’s Eating Behaviours indicators (ie, Children’s Dietary Patterns and Children’s Mealtime Behaviours) and three FHFEs indicators (ie, Parental Food Choices and Preparation, Avoidance of Unhealthy Foods, and Family Mealtime Environments), to determine whether the children adhered to healthy eating behaviours and were involved in healthy FHFEs. Participants responded to questionnaire items using a 5-point Likert scale (ranging from ‘always’ to ‘never’), yes/no questions, open-ended questions, and multiple-choice questions. The questionnaire is provided in online supplementary Appendix 1.
 
Data analysis
Benchmarks for each indicator were established in accordance with recommendations and guidelines for healthy eating behaviours and FHFEs from the Hong Kong SAR Government and published literature.25 26 27 28 29 30 31 Based on the results of the questionnaire, we used a benchmark framework to determine the letter grade for each indicator that reflected the percentage of participants who met the predetermined benchmarks. A sub-grade was also determined for each indicator item. The questionnaire items and criteria of questionnaire answers related to the benchmarks are displayed in online supplementary Appendix 2.25 26 28 29 30 31 The grading system for the Healthy Eating Report Card was derived from the grading rubric of the Global Matrix 3.0 Physical Activity Report Card for Children and Youth, ie, A (≥80%, Excellent); B (60%-79%, Good); C (40%-59%, Fair); D (20%-39%, Poor); and F (<20%, Very poor).23 Plus (+) and minus (-) signs were used to show the upper or lower 5% of each grade.23 The proposed benchmarks and the grading scheme are shown in Tables 125 26 27 28 29 30 31 and 2,23 respectively.
 

Table 1. Indicators and benchmarks in the Healthy Eating Report Card for Pre-school Children in Hong Kong
 

Table 2. Grading system of the Healthy Eating Report Card for Pre-school Children in Hong Kong
 
Due to the percentage calculation, responses to 5-point Likert scale questions in the questionnaire were converted to binary variables; responses of ‘sometimes’, ‘often’ and ‘always’ were categorised as ‘yes’, whereas responses of ‘never’ and ‘rarely’ were categorised as ‘no’.32 All statistical analyses were performed using SPSS software (Windows version 26.0; IBM Corp, Armonk [NY], United States). Missing values ranged from 0.19% to 2.79% for questionnaire items because of nonresponses to some items. The descriptive statistics revealed valid percentages for each indicator item, along with 95% confidence intervals (CIs). The arithmetic mean for each indicator was calculated by summing the valid percentage for each item of the indicator, then dividing that value by the number of corresponding items. This arithmetic mean represents the average percentage of participants who met predefined benchmarks for that indicator.
 
Results
The letter grades of our Healthy Eating Report Card are summarised in Table 3. The descriptive statistics of the Report Card are displayed in online supplementary Appendix 3. The average grades of Children’s Eating Behaviours indicators and FHFEs indicators were ‘C’ and ‘B’, respectively, showing that the eating behaviours of Hong Kong pre-school children were classified as Fair, whereas FHFEs were classified as Good (Table 3). On average, nearly half of the pre-school children (52.88%; 95% CI=51.26%-54.50%) adhered to healthy eating behaviours, whereas more than half of the parents (70.87%; 95% CI=69.34%-72.40%) provided their children with favourable FHFEs.
 

Table 3. Letter grades assigned to indicators and their items in the Healthy Eating Report Card for Pre-school Children in Hong Kong
 
Children’s Dietary Patterns
Nearly all children (97.58%) had three regular meals daily. However, only 24.34% of the children ate a variety of foods. Approximately half of the children had adequate vegetable and fruit intakes (50.84% and 58.40%, respectively). Moreover, 55.76% did not have a formula milk-drinking habit. Additionally, around half of the children had low consumption of unhealthy snacks and sugary beverages (52.25% and 58.87%, respectively) [online supplementary Appendix 3]. Overall, Children’s Dietary Patterns was graded ‘C+’ (Fair), signifying that approximately half of the children (55.37%; 95% CI=53.98%-56.75%) adhered to healthy dietary patterns.
 
Children’s Mealtime Behaviours
Fewer than half of the children (43.02%) did not require parental assistance to finish a meal. Only 38.32% of the children could remain seated during mealtimes. Of the children, 49.15% and 67.98% did not exhibit picky eating behaviours and were willing to try new foods, respectively. Nearly half of the children (49.35%) did not exhibit slow-eating behaviours (online supplementary Appendix 3). Children’s Mealtime Behaviours was graded ‘C’ (Fair), showing that approximately half of the children (49.92%; 95% CI=46.28%-55.07%) exhibited desirable mealtime behaviours.
 
Parental Food Choices and Preparation
More than half of the parents (63.69%) used nutrition labels. Similarly, more than half of the parents (70.26%) considered low oil/salt/sugar food options for their children. Approximately half of the parents used low-fat cooking methods and reduced the use of sugar- or salt-containing condiments (53.53% and 43.58%, respectively) [online supplementary Appendix 3]. Parental Food Choices and Preparation was graded ‘C+’ (Fair), revealing that approximately half of the parents (57.88%; 95% CI=55.49%-60.28%) made healthy food choices and prepared healthy meals for their children.
 
Avoidance of Unhealthy Foods
Half of the parents (52.04%) did not reward their children with unhealthy snacks or drinks. A large majority of the parents (90.15%) limited the frequency and quantity of unhealthy foods (online supplementary Appendix 3). Therefore, Avoidance of Unhealthy Foods was graded ‘B’ (Good), signifying that more than half of the parents (71.10%; 95% CI=68.66%-73.53%) controlled their children’s access to unhealthy foods.
 
Family Mealtime Environments
A large majority of the children dined with their parents or family members and ate the same food as other family members (93.47% and 95.50%, respectively). More than half of the parents (61.08%) did not allow their children to use screen devices during mealtimes (online supplementary Appendix 3). Family Mealtime Environments was graded ‘A-’ (Excellent), showing that most children (83.46%; 95% CI=81.63%-85.29%) were involved in structured family mealtime environments.
 
Discussion
This study aimed to develop the Healthy Eating Report Card for Pre-school Children in Hong Kong, which assessed the prevalence of healthy eating behaviours and favourable FHFEs among pre-school children in Hong Kong. We established evidence-based benchmarks to guide the process of grading Children’s Eating Behaviours indicators and FHFEs indicators, then utilised letter grades to illustrate how well children adhered to healthy eating behaviours. Children’s Dietary Patterns and Children’s Mealtime Behaviours were graded ‘C+’ and ‘C’, respectively, signifying that eating behaviours in Hong Kong pre-school children were Fair; the average overall grade was ‘C’. Our findings indicate that approximately half of the pre-school children in Hong Kong adhered to healthy eating behaviours. Parental Food Choices and Preparation, Avoidance of Unhealthy Foods, and Family Mealtime Environments were graded ‘C+’, ‘B’, and ‘A-’, respectively, showing that FHFEs were Good; the average overall grade was ‘B’. Our results revealed that more than half of the pre-school children in Hong Kong were involved in a healthy home eating environment. Taken together, these findings enhance the understanding of pre-school children’s eating behaviours and FHFEs in Hong Kong.
 
Comparison of Report Card findings with Hong Kong’s previous data
Our Report Card showed that the eating behaviours of Hong Kong pre-school children were classified as Fair; some unhealthy dietary patterns and undesirable mealtime behaviours were prevalent because nearly half of the children did not meet predefined benchmarks for Children’s Eating Behaviours indicators. Previous research33 concerning the eating habits of Hong Kong pre-school children showed that more than half of the surveyed children (78.8%) had a habit of eating breakfast. Nevertheless, fewer than half of the children achieved the recommended daily intakes of vegetables (19.6%) and fruits (47.3%), respectively.33 Some studies identified a high prevalence of formula milk drinking among Hong Kong pre-school–aged children, such that 77% of 4-year-old children continue to drink formula milk.5 34 Overdependence on formula milk may reduce appetite in children and impede their development of healthy eating habits.35 Lo et al33 found that, on average, children consumed high energy-dense foods (eg, candy/chocolate, sweet crackers, and sugary beverages) more than twice per week; accordingly they suggested that such children should minimise their consumption of these foods. Additionally, an investigation regarding undesirable mealtime behaviours among Hong Kong pre-school children revealed that approximately 70% of the children required >30 minutes to finish a meal; these children often were unwilling to self-feed or finish their meals.6 The present study are consistent with previous research, indicating that there is considerable potential for improvement in eating behaviours among Hong Kong children.
 
Comparison of Report Card findings with international data
As mentioned above, the HAKSA Report Card assessed a few aspects of the dietary patterns of South African children and youth aged 3 to 8 years in 2018. Although using a grading rubric similar to our Report Card, the HAKSA Report Card only involved two indicators, namely Fruit and Vegetable Intake (graded ‘D’) and Snacking, Sugar-Sweetened Beverages, Dietary Sodium, and Fast Food Intake (graded ‘F’).21 Our Report Card may provide better coverage of healthy dietary patterns because it included assessments of regular daily meals, food variety, and formula milk-drinking habits, with a particular focus on pre-school children aged 2 to 6 years. Despite the discrepancies between studies, the healthy dietary patterns observed in our Report Card were more favourable than patterns observed in the HAKSA Report Card.21 This difference also implies that standards for healthy eating behaviours among children differ between South Africa and Hong Kong. Future studies should develop a cross-ountry/region–level Report Card and establish a global benchmark to comprehensively analyse global variations in healthy eating, thereby raising global awareness and stimulating global discussion regarding the promotion of healthy eating.
 
Novel findings on family home food environments
Very little is known about FHFEs of children in Hong Kong; therefore, the present study provides initial information concerning parental food choices and preparation, avoidance of unhealthy foods, and family mealtime environments among Hong Kong pre-school children. Based on our findings, the FHFEs of the pre-school children in Hong Kong were classified as Good. This result suggests that parents in Hong Kong attempt to promote healthy diets by limiting their children’s consumption of unhealthy foods. A previous study revealed that Hong Kong parents tended to adopt the ‘control over eating’ approach to feed their children, whereby parents primarily determine the amounts of food that children should eat, including unhealthy snacks.33 Moreover, parents often used food to reward or comfort children.36 The indicator of Family Mealtime Environments in our Report Card showed that most pre-school children had a structured family mealtime environment where they dined with their family and shared the same food with their family members. This finding may be attributed to the Chinese cultural emphasis on shared meals with family members.37 Thus, the results of the present study are consistent with previous research findings.
 
However, this high grade for Family Mealtime Environments may have increased the average grade for FHFEs indicators. Indeed, parents’ food choices, purchases, and preparation directly influence children’s home food environment and food consumption.38 When we specifically focused on the Parental Food Choices and Preparation indicator, the grade decreased to Fair (letter grade of ‘C+’), signifying that only half of the parents complied with healthy eating practices for their children, such as the use of nutrition labels and adoption of healthy cooking methods. Thus, it is important to promote healthy eating at home, which will facilitate healthy eating behaviours among children. A pilot FHFE intervention of the Healthy Home Offerings via the Mealtime Environment Plus programme, which provided parents and children with nutrition education and meal preparation training and activities, successfully promoted a structured mealtime environment at home and helped to improve dietary intake patterns.39 40 Accordingly, future studies might utilise nutrition education interventions to improve the FHFEs of pre-school children in Hong Kong.
 
Strengths and limitations
This study had some strengths. In particular, we collected primary data to enhance the understanding of eating behaviours and FHFEs among pre-school children in Hong Kong. Moreover, to our knowledge, this is the first use of a report card framework to comprehensively evaluate the prevalence of healthy eating behaviours among pre-school children.41 The Report Card can serve as an effective awareness-raising tool that provides novel insights concerning the promotion of healthy eating behaviours, as well as recommendations for healthy eating policies and healthy food environments.23
 
However, this study also had several limitations. First, its cross-sectional design precluded the identification of changes in children’s eating behaviours and FHFEs over time. Future studies could perform longitudinal measurements of variables in the Report Card to reveal changes or stability in healthy eating behaviours among young children; such measurements could also determine whether Report Card scores are predictive of health outcomes in children. An individual-level Healthy Eating Report Card should be developed in future studies to examine the effectiveness of the Report Card on parental intentions towards healthy eating, as well as children’s healthy eating behaviours and favourable FHFEs. Second, although the items of the Healthy Eating Report Card Questionnaire were developed based on the guidelines and recommendations of the Hong Kong SAR Government,25 26 the development of the questionnaire did not include evaluations of its content validity and psychometric properties. Future studies should examine the psychometric properties and other validity aspects of the Report Card questionnaire (eg, factorial validity, convergent validity, and discriminant validity).42 Third, the study relied on parent-reported questionnaires of children’s eating behaviours and FHFEs, which may be susceptible to response biases, social desirability bias, and general response tendencies.43 44 Validation studies comparing parent-reported questionnaires with more objective measures of children’s food intake and observational assessments of mealtime behaviours could be conducted in the future. Finally, the current Report Card does not reflect several components of children’s eating behaviours (eg, the frequency of dining out and the variety of vegetable and fruit consumption) and FHFEs (eg, parental feeding practices and accessibility of healthy food at home). Future studies should investigate whether these components could be included within the Healthy Eating Report Card to provide a more holistic assessment of healthy eating among children.
 
Conclusion
This study developed the Healthy Eating Report Card for Pre-school Children in Hong Kong to reflect the prevalence of healthy eating behaviours and favourable FHFEs among pre-school children in Hong Kong. The Report Card revealed that Children’s Eating Behaviours were classified as Fair (average grade of ‘C’), whereas FHFEs were classified as Good (average grade of ‘B’). There is considerable potential for improvement in children’s eating behaviours (ie, healthy dietary patterns and appropriate mealtime behaviours) and FHFEs (particularly concerning parental healthy food choices and preparation). We believe that the Report Card can serve as a useful tool for evaluating the prevalence of healthy eating behaviours and favourable FHFEs in young children; it could offer novel insights into strategies for promotion of healthy eating in pre-school setting.
 
Author contributions
Concept or design: All authors.
Acquisition of data: AWL Wan, DKC Chan.
Analysis or interpretation of data: AWL Wan, DKC Chan.
Drafting of the manuscript: AWL Wan, DKC Chan.
Critical revision of the manuscript for important intellectual content: All authors.
 
All authors had full access to the data, contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
All authors have disclosed no conflicts of interest.
 
Acknowledgement
The authors thank Ms Kiko KH Leung, Ms Roni MY Chiu, and Ms Tracy CW Tang from the Department of Early Childhood Education of The Education University of Hong Kong for their assistance in preparing study materials and collecting data. The authors also thank the eight participating kindergartens for aiding in the distribution and collection of questionnaires from parents.
 
Funding/support
This research was funded by the Research Impact Cluster Fund of the Department of Early Childhood Education, Faculty of Education and Human Development, The Education University of Hong Kong, through an award to the corresponding author. The funder had no role in study design, data collection/analysis/interpretation or manuscript preparation.
 
Ethics approval
The study protocol of this research was approved by the Human Research Ethics Committee of The Education University of Hong Kong (Ref No.: 2020-2021-0420). All schools and parents provided written informed consent for participation in this research and have also consented to the publication of its findings.
 
Supplementary material
The supplementary material was provided by the authors and may include some information that was not peer reviewed. Accepted supplementary material will be published as submitted by the authors, without any editing or formatting. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by the Hong Kong Academy of Medicine and the Hong Kong Medical Association. The Hong Kong Academy of Medicine and the Hong Kong Medical Association disclaim all liability and responsibility arising from any reliance placed on the content.
 
References
1. World Health Organization. Healthy diet. 2020. Available from: https://www.who.int/news-room/fact-sheets/detail/healthy-diet. Accessed 9 Jul 2022.
2. Kupka R, Siekmans K, Beal T. The diets of children: overview of available data for children and adolescents. Glob Food Sec 2020;27:100442. Crossref
3. Movassagh EZ, Baxter-Jones AD, Kontulainen S, Whiting SJ, Vatanparast H. Tracking dietary patterns over 20 years from childhood through adolescence into young adulthood: the Saskatchewan Pediatric Bone Mineral Accrual Study. Nutrients 2017;9:990.Crossref
4. Department of Health. Hong Kong SAR Government. Diet, physical activity and health: Hong Kong situation. 2010. Available from: https://www.change4health.gov.hk/filemanager/common/image/strategic_framework/action_plan/action_plan_2_e.pdf. Accessed 5 Jul 2022.
5. Woo J, Chan R, Li L, Luk WY. A survey of infant and young child feeding in Hong Kong: diet and nutrient intake. 2012. Available from: https://www.fhs.gov.hk/english/reports/files/Diet_nutrientintake_executive%20summary_2504.pdf. Accessed 5 Jul 2022.
6. Yip PS, Chan VW, Lee QK, Lee HM. Diet quality and eating behavioural patterns in preschool children in Hong Kong. Asia Pac J Clin Nutr 2017;26:298-307. Crossref
7. Crist W, Napier-Phillips A. Mealtime behaviors of young children: a comparison of normative and clinical data. J Dev Behav Pediatr 2001;22:279-86. Crossref
8. Archer LA, Rosenbaum PL, Streiner DL. The children’s eating behavior inventory: reliability and validity results. J Pediatr Psychol 1991;16:629-42. Crossref
9. Samuel TM, Musa-Veloso K, Ho M, Venditti C, Shahkhalili-Dulloo Y. A narrative review of childhood picky eating and its relationship to food intakes, nutritional status, and growth. Nutrients 2018;10:1992. Crossref
10. Boswell N, Byrne R, Davies PS. Family food environment factors associated with obesity outcomes in early childhood. BMC Obes 2019;6:17. Crossref
11. Vereecken C, Haerens L, De Bourdeaudhuij I, Maes L. The relationship between children’s home food environment and dietary patterns in childhood and adolescence. Public Health Nutr 2010;13:1729-35. Crossref
12. Rex SM, Kopetsky A, Bodt B, Robson SM. Relationships among the physical and social home food environments, dietary intake, and diet quality in mothers and children. J Acad Nutr Diet 2021;121:2013-20.e1. Crossref
13. Chang HH, Nayga RM Jr. Mother’s nutritional label use and children’s body weight. Food Policy 2011;36:171-8. Crossref
14. Nepper MJ, Chai W. Associations of the home food environment with eating behaviors and weight status among children and adolescents. J Nutr Food Sci 2015(S12):1-5. Crossref
15. Patrick H, Nicklas TA. A review of family and social determinants of children’s eating patterns and diet quality. J Am Coll Nutr 2005;24:83-92. Crossref
16. Coppinger T, Milton K, Murtagh E, et al. Global Matrix 3.0 Physical Activity Report Card for Children and Youth: a comparison across Europe. Public Health 2020;187:150-6. Crossref
17. Huang WY, Wong SH, Sit CH, et al. Results from the Hong Kong’s 2018 Report Card on physical activity for children and youth. J Exerc Sci Fit 2019;17:14-9. Crossref
18. Barnes JD, Colley RC, Tremblay MS. Results from the Active Healthy Kids Canada 2011 Report Card on Physical Activity for Children and Youth. Appl Physiol Nutr Metab 2012;37:793-7. Crossref
19. Tremblay MS, Barnes JD, Bonne JC. Impact of the Active Healthy Kids Canada report card: a 10-year analysis. J Phys Act Health 2014;11 Suppl 1:S3-20. Crossref
20. De Villiers A, Steyn N, Coopoo Y, et al. Healthy Active Kids. South Africa Report Card 2010. 2010. Available from: https://www.activehealthykids.org/wp-content/uploads/2017/03/south-africa-report-card-long-form-2010.pdf. Accessed 5 Jul 2022.
21. Draper CE, Tomaz SA, Bassett SH, et al. Results from the Healthy Active Kids South Africa 2018 Report Card. SAJCH 2019;13:130-6. Crossref
22. Draper C, Basset S, de Villiers A, Lambert EV; HKKSA Writing Group. Results from South Africa’s 2014 Report Card on Physical Activity for Children and Youth. J Phys Act Health 2014;11 Suppl 1:S98-104. Crossref
23. Aubert S, Barnes JD, Abdeta C, et al. Global Matrix 3.0 Physical Activity Report Card grades for children and youth: results and analysis from 49 countries. J Phys Act Health 2018;15:S251-73. Crossref
24. Tremblay MS, Gray CE, Akinroye K, et al. Physical activity of children: a global matrix of grades comparing 15 countries. J Phys Act Health 2014;11 Suppl 1:S113-25. Crossref
25. Family Health Service, Department of Health, Hong Kong SAR Government. Healthy eating for preschool children (2 years to 5 years old). 2022. Available from: https://www.fhs.gov.hk/english/health_info/child/12185.html. Accessed 5 Jul 2022.
26. Centre for Health Protection, Department of Health, Hong Kong SAR Government. Nutrition guidelines for children aged 2 to 6. Revised 2018. Available from: https://www.startsmart.gov.hk/files/pdf/nutritional_guide_en.pdf. Accessed 5 Jul 2022.
27. United Nations World Food Programme. Food consumption score nutritional quality analysis (FCS-N) guidelines. 2015. Available from: https://docs.wfp.org/api/documents/WFP-0000007074/download/. Accessed 5 May 2022.
28. Bui C, Lin LY, Wu CY, Chiu YW, Chiou HY. Association between emotional eating and frequency of unhealthy food consumption among Taiwanese adolescents. Nutrients 2021;13:2739. Crossref
29. Family Health Service, Department of Health, Hong Kong SAR Government. Healthy eating for 6 to 24 month old children (3) ready to go (12-24 months). 2022. Available from: https://www.fhs.gov.hk/english/health_info/child/16301.html. Accessed 5 Jul 2022.
30. Centre for Health Protection, Department of Health, Hong Kong SAR Government. Ideas for healthy cooking. 2017. Available from: https://www.chp.gov.hk/en/static/90039.html. Accessed 5 Jul 2022.
31. Family Health Service, Department of Health, Hong Kong SAR Government. Do your 0-5-years old children need electronic screen products? 2022. Available from: https://www.fhs.gov.hk/english/health_info/child/30047.html. Accessed 5 Jul 2022.
32. Shim JE, Kim J, Mathai RA; STRONG Kids Research Team. Associations of infant feeding practices and picky eating behaviors of preschool children. J Am Diet Assoc 2011;111:1363-8. Crossref
33. Lo K, Cheung C, Lee A, Tam WW, Keung V. Associations between parental feeding styles and childhood eating habits: a survey of Hong Kong pre-school children. PLoS One 2015;10:e0124753. Crossref
34. Yeung S, Chan R, Li L, Leung S, Woo J. Bottle milk feeding and its association with food group consumption, growth and socio-demographic characteristics in Chinese young children. Matern Child Nutr 2017;13:e12341. Crossref
35. Family Health Service, Department of Health, Hong Kong SAR Government. Healthy eating for infants and young children—milk feeding. 2022. Available from: https://www.fhs.gov.hk/english/health_info/child/12549.html. Accessed 5 Aug 2022.
36. Wong EM, Sit JW, Tarrant MA, Cheng MM. The perceptions of obese school children in Hong Kong toward their weight-loss experience. J Sch Nurs 2012;28:370-8. Crossref
37. Ma G. Food, eating behavior, and culture in Chinese society. J Ethn Foods 2015;2:195-9. Crossref
38. Nowicka P, Keres J, Ek A, Nordin K, Sandvik P. Changing the home food environment: parents’ perspectives four years after starting obesity treatment for their preschool aged child. Int J Environ Res Public Health 2021;18:11293. Crossref
39. Fulkerson JA, Neumark-Sztainer D, Story M, et al. The Healthy Home Offerings via the Mealtime Environment (HOME) Plus study: design and methods. Contemp Clin Trials 2014;38:59-68. Crossref
40. Fulkerson JA, Friend S, Horning M, et al. Family home food environment and nutrition-related parent and child personal and behavioral outcomes of the Healthy Home Offerings via the Mealtime Environment (HOME) Plus program: a randomized controlled trial. J Acad Nutr Diet 2018;118:240-51. Crossref
41. Olstad DL, Raine KD, Nykiforuk CI. Development of a report card on healthy food environments and nutrition for children in Canada. Prev Med 2014;69:287-95. Crossref
42. Wan AW, Chung KK, Li JB, Xu SS, Chan DK. An assessment tool for the international healthy eating report card for preschool-aged children: a cross-cultural validation across Australia, Hong Kong, Singapore, and the United States. Front Nutr 2024;11:1340007. Crossref
43. Chan DK, Ivarsson A, Stenling A, Yang SX, Chatzisarantis NL, Hagger MS. Response-order effects in survey methods: a randomized controlled crossover study in the context of sport injury prevention. J Sport Exerc Psychol 2015;37:666-73. Crossref
44. Chan DK, Stenling A, Yusainy C, et al. Editor’s Choice: Consistency tendency and the theory of planned behavior: a randomized controlled crossover trial in a physical activity context. Psychol Health 2020;35:665-84. Crossref

Changes in cardiovascular disease risk predicted by the Framingham risk model in the Hong Kong population between 2003-2005 and 2014-2015: data from Population Health Surveys

Hong Kong Med J 2024 Jun;30(3):202–8 | Epub 29 May 2024
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Changes in cardiovascular disease risk predicted by the Framingham risk model in the Hong Kong population between 2003-2005 and 2014-2015: data from Population Health Surveys
Brian YC Sung, MB, BS1; Eric HM Tang, BSc2; Laura Bedford, BSc2; Carlos KH Wong, MPhil, PhD2,3; Emily TY Tse, MB, BS, FHKAM (Family Medicine)2; Esther YT Yu, MB, BS, FHKAM (Family Medicine)2; Bernard MY Cheung, MB, BChir, PhD1; Cindy LK Lam, MB, BS, MD2
1 Department of Medicine, The University of Hong Kong, Hong Kong SAR, China
2 Department of Family Medicine and Primary Care, The University of Hong Kong, Hong Kong SAR, China
3 Department of Pharmacology and Pharmacy, The University of Hong Kong, Hong Kong SAR, China
 
Corresponding author: Prof Bernard MY Cheung (mycheung@hku.hk)
 
 Full paper in PDF
 
Abstract
Introduction: The Framingham risk model estimates a person’s 10-year cardiovascular disease (CVD) risk. This study used this model to calculate the changes in sex- and age-specific CVD risks in the Hong Kong Population Health Survey (PHS) 2014/15 compared with two previous surveys conducted during 2003 and 2005, namely, PHS 2003/2004 and Heart Health Survey (HHS) 2004/2005.
 
Methods: This study included individuals aged 30 to 74 years from PHS 2014/15 (n=1662; n=4 445 868 after population weighting) and PHS 2003/2004 and HHS 2004/2005 (n=818; n=3 495 074 after population weighting) with complete data for calculating the risk of CVD predicted by the Framingham model. Sex-specific CVD risks were calculated based on age, total cholesterol and high-density lipoprotein cholesterol levels, mean systolic blood pressure, smoking habit, diabetic status, and hypertension treatment. Mean sex- and age-specific CVD risks were calculated; differences in CVD risk between the two surveys were compared by independent t tests.
 
Results: The difference in 10-year CVD risk from 2003-2005 to 2014-2015 was not statistically significant (10.2% vs 10.6%; P=0.29). After age standardisation according to World Health Organization world standard population data, a small decrease in CVD risk was observed, from 9.4% in 2003-2005 to 8.8% in 2014-2015. Analysis according to age-group showed that more participants aged 65 to 74 years were considered high risk in 2003 to 2005 (2003-2005: 66.8% vs 2014-2015: 53.1%; P=0.028). This difference may be due to the decrease in smokers among men (2003-2005: 30.5% vs 2014-2015: 24.0%; P<0.001).
 
Conclusion: From 2003-2005 to 2014-2015, there was a small decrease in age-standardised 10-year CVD risk. A holistic public health approach simultaneously targeting multiple risk factors is needed to achieve greater decreases in CVD risk.
 
 
New knowledge added by this study
  • The stagnation in 10-year cardiovascular disease (CVD) risk between 2003 and 2015 suggests that despite improvements in treatment, more effective prevention strategies (eg, improvements in diet and physical activity levels) are needed.
  • The effect of the decreasing number of current smokers was not strongly reflected in the change in 10-year CVD risk. This may be due to an increased prevalence of diabetes and an increased proportion of participants receiving antihypertensive medications.
Implications for clinical practice or policy
  • The findings suggest that the use of lipid-lowering drugs and antihypertensive medications does not effectively translate into overall cardiovascular risk reduction in the Hong Kong population, unless these treatments are simultaneously paired with prevention strategies targeting CVD risk factors.
 
 
Introduction
Cardiovascular disease (CVD) constitutes a spectrum of diseases that affect the heart and blood vessels. Worldwide, CVD is the leading cause of death as well as a major cause of premature death and chronic disability in numerous regions.1 In 2017, CVD was responsible for an estimated 17.8 million deaths, representing 31% of all global deaths.2 Hypertension, smoking status, hyperlipidaemia, and diabetes mellitus are prominent risk factors for CVD.3 The global prevalence of CVD risk factors is increasing. Currently, an estimated 15% of the world’s population (1.13 billion people) has hypertension, and the prevalence is expected to increase to 29% by 2025.4 5 Additionally, the global prevalence of diabetes increased from 211 million in 1990 to 476 million in 2017, representing a 129.7% increase in nearly three decades.6 The prevalences of CVD and its risk factors are expected to continue to increase in the near future due to industrialisation and population ageing. Fortunately, the World Health Organization (WHO) has estimated that premature CVD is preventable in >75% of cases, and risk factor amelioration can help reduce the burden caused by CVD.7
 
The risk of CVD over the next 10 years for an individual can be estimated using prediction models. Cardiovascular disease risk prediction is important at the individual and population levels. At the individual level, CVD risk prediction allows primary care medical professionals to identify high-risk patients. Risk factors for CVD, such as hypertension, can be treated accordingly to reduce the patient’s future risk of CVD. At the population level, CVD risk trends allow health policy planners to make evidence-based decisions and review current public health strategies used in CVD prevention.8
 
Changes in CVD risk can serve as a reference to indicate changes in public health status within a population. Two studies have evaluated changes in the 10-year CVD risk in the United States (US) population. In a study by Ajani and Ford,9 risk models adopted by the National Cholesterol Education Program Adult Treatment Panel III were utilised to estimate coronary heart disease risk, rather than the more precise Framingham formulae for estimation of overall CVD risk. Notably, their analysis lacked age stratification. On the other hand, Lopez-Jimenez et al10 evaluated the changes in CVD risk between 1976 and 2004. To our knowledge, no previous study has evaluated the change in 10-year CVD risk in an Asian population, and insights are needed regarding the cardiovascular health status of the population in a more recent time period.
 
In this study, we aimed to calculate and compare the changes in sex- and age-specific CVD risks predicted by the Framingham model in a Hong Kong general population by using the Hong Kong Population Health Survey (PHS) 2014/15 in combination with two previous surveys conducted in 2003 to 2005, namely, PHS 2003/2004 and Heart Health Survey (HHS) 2004/2005.
 
Methods
Study design and sampling
The data in this study were sourced from PHS 2003/200411 and PHS 2014/15.12 Population Health Surveys are territory-wide cross-sectional surveys conducted by the Department of Health of the Hong Kong SAR Government. These surveys target the land-based non-institutional population of individuals aged ≥15 years in Hong Kong, excluding foreign domestic helpers and visitors. Systematic replicate sampling was adopted to select living quarters that were representative of the Hong Kong general population. All domestic households in the selected living quarters and household members in the target population were individually surveyed. Written consent was obtained from individuals who agreed to participate in a PHS. Participants were invited to complete a face-to-face interview, where they provided information about their socio-demographic characteristics, disease status, and daily lifestyle habits. After the interview, consenting participants aged 15 to 84 years were randomly selected to undergo a health examination that included physical measurements and biochemical testing.12 Health examinations for participants of PHS 2003/2004 were conducted as part of the HHS 2004/2005.
 
The STROBE (Strengthening the Reporting of Observational Studies in Epidemiology) checklist was followed for preparing this manuscript.
 
Predicted risk of cardiovascular disease over the next 10 years
The main outcome of this study was the predicted risk of CVD over the next 10 years among people aged 30 to 74 years. Thus far, no specific prediction models have been developed for the Hong Kong population. In this study, we adopted the prediction model for primary care developed by the Framingham Heart Study Cohort in the general adult population aged 30 to 74 years.13 The Framingham CVD risk model was validated and can be applied to the Chinese population, but requires recalibration in men.14 A CVD event was defined as a composite of coronary heart disease, cerebrovascular events, peripheral artery disease, and heart failure in the Framingham model. The predicted CVD risk over the next 10 years was calculated for each participant using the following information: age, total cholesterol level, high-density lipoprotein cholesterol level, systolic blood pressure, use of antihypertensive medication, current smoking status, and diabetes status.
 
Statistical analysis
A complete-case analysis approach was utilised in this study. Descriptive statistics were used to present the characteristics of included individuals. Sex- and age-specific predicted CVD risks in 2003-2005 and 2014-2015 were calculated to summarise the change in risk according to sex and age-group. Significant differences in the above factors between the two PHSs were tested by independent t test or Chi squared test, as appropriate.
 
To summarise results at the population level, population weighting established by the Department of Health was applied according to age-group and sex for participants in each PHS. To compare results at the population level, age-standardised predicted CVD risk was calculated using WHO 2000-2025 world standard population data15 and the US population census data in 2000.16
 
All statistical analyses were performed with SPSS software (Windows version 26.0; IBM Corp, Armonk [NY], US). All significance tests were two-tailed, and P values <0.05 were considered statistically significant.
 
Results
In total, 1662 and 818 participants were included in the PHS 2014/15 cohort and the PHS 2003/2004 and HHS 2004/2005 cohort, respectively. These individuals represented populations of 4 445 868 and 3 495 074, respectively. Table 1 shows the baseline characteristics of the PHS and HHS cohorts according to Framingham predictors. The mean age was similar in both cohorts (2003-2005: 48.6 years vs 2014-2015: 50.8 years) and the proportions of male and female participants were similar (female participants in 2003-2005: 54.1% vs 2014-2015: 53.0%).
 

Table 1. Summary statistics for risk factors among the participants in the health surveys
 
Predicted CVD risk increases with age and is higher in men (Table 2). In PHS 2003/2004 and HHS 2004/2005, the mean CVD risk increased with age in both sexes, from 1.6% among women aged 30-44 years to 17.8% among women aged 65-74 years, and from 5.1% among men aged 30-44 years to 33.4% among men aged 65-74 years. Between the two surveys, there was no significant difference in overall 10-year CVD risk (2003-2005: 10.2% vs 2014-2015: 10.6%; P=0.29). After age standardisation according to WHO world standard population data, the age-standardised predicted CVD risks in the 2003-2005 cohort and the 2014-2015 cohort were 9.4% and 8.8%, respectively (Table 3). The small decrease in predicted CVD risk may be due to the decrease in smokers among men (30.5% vs 24.0%; P<0.001) [Table 1].
 

Table 2. Mean weighted 10-year cardiovascular disease risk predicted by the Framingham model among participants aged 30 to 74 years in the health surveys according to age-group and sex
 

Table 3. Age-standardised estimates of cardiovascular disease risk for the participants of the health surveys
 
The risk of cardiovascular events over the next 10 years was classified as low (CVD risk <10%), medium (CVD risk ≥10% and <20%), and high (CVD risk ≥20%); the distributions of risk groups are presented in Table 4. Among participants aged 30 to 74 years, the risk group distributions were similar in both cohorts. When analysed according to sex, 29.1% of men and 5.1% of women were classified as high-risk in PHS 2014/15, whereas 28.2% of men and 6.4% of women were classified as high-risk in PHS 2003/2004 and HHS 2004/2005. When analysed according to age-group, more participants aged 65 to 74 years were classified as high-risk in PHS 2003/2004 and HHS 2004/2005 (66.8% vs 2014-2015: 53.1%; P=0.028).
 

Table 4. Ten-year cardiovascular disease risk level among non-institutionalised individuals aged 30 to 74 years according to age-group based on the Framingham risk model
 
Discussion
Using representative samples from the Hong Kong PHS and HHS, we found that the 10-year CVD risk increases with age and is consistently higher in men (men: 15.5% vs women: 6.2%; P<0.001, in PHS 2014/15) [Table 2]. This trend is consistent with previous reports; for example, an analysis of NHANES (National Health and Nutrition Examination Survey) III data showed that men have a significantly higher CVD risk (11.8% in men vs 5.1% in women) and that CVD risk is higher in older age-groups (16.2% for participants aged 60-74 years vs 11.2% for participants aged 50-59 years).10 Thus, risk management for older adults may represent a challenge to the current health infrastructure. The burden of CVD is directly associated with increased morbidity and mortality in patients, and it translates to substantial healthcare costs. In Hong Kong, the number of people aged ≥65 years is predicted to reach 2.58 million (35.9% of the population) by 2064.17 Thus, there is a critical need to achieve a more comprehensive understanding of the aetiologies associated with CVD in older adults.
 
The results have extensive public health implications. Although age-standardised rates of death from CVD in Hong Kong greatly decreased from 93.4 per 100 000 standard population in 2001 to 56.0 per 100 000 standard population in 2017,18 there was no significant difference in overall 10-year CVD risk between 2003-2005 and 2014-2015 (10.2% vs 10.6%; P=0.29) [Table 2]. After age standardisation according to WHO world standard population data, a small decrease in CVD risk was observed, from 9.4% in 2003-2005 to 8.8% in 2014-2015 [Table 3]. A study of NHANES data revealed a similar trend, consisting of a small decrease in CVD risk from 1988 to 2004 (7.9% to 7.4%; P<0.001).10 The stagnation in 10-year CVD risk between 2003-2005 and 2014-2015 suggests that despite improvements in treatment, more effective prevention strategies (eg, improvements in diet and physical activity levels) are needed.19 Primary care intervention to manage modifiable risk factors, such as hypertension, dyslipidaemia, and diabetes, can complement population-based policies. Efforts to ensure access to appropriate healthcare and affordable medications will help control abnormal risk factor levels.20 Furthermore, despite the importance of targeting individual risk factors to reduce prevalence, the long-term objective should be reduced overall risk of CVD. This objective requires a holistic approach simultaneously targeting multiple risk factors.
 
Our results highlight the need for overall risk assessment, in addition to targeted efforts focused on specific CVD risk factors. During the past decade, numerous public health initiatives (eg, smoking cessation campaigns) have been implemented to reduce the prevalence of established risk factors for CVD. It is reasonable to expect that CVD risk would decrease over time in conjunction with the decreased prevalences of various risk factors, such as the declining number of current smokers. Surprisingly, these changes were not strongly reflected in the change in 10-year CVD risk. One explanation is that decreases in the prevalence of some risk factors are offset by increases in others. For example, population ageing in Hong Kong may shift more adults into the high-risk group. However, because the Framingham model was limited to individuals aged 30 to 74 years, the change in mean age between the two surveys is inconsequential. Other possible changes in risk factors include increases in the prevalence of diabetes and hypertension, as reflected by the increased proportion of participants receiving antihypertensive medications (Table 1). Although CVD mortality has considerably decreased, it is concerning that CVD risk has not substantially diminished over the past decade. This lack of improvement in CVD risk may lead to an increasing community burden of CVD in the near future, especially in the context of population ageing.
 
The overall 10-year CVD risk for participants aged 30 to 74 years in 2003-2005 after age adjustment to US census population data in 2000 was 10.7%. During a similar time period (1999-2004), the US population had a 10-year CVD risk of 7.4%.10 When Hong Kong men were stratified according to risk group in 2014-2015, 48.8% were classified as low-risk, 22.1% were classified as medium-risk, and 29.1% were classified as high-risk (Table 4). For comparison, among men in the United Kingdom population during 2012, 46.5% were classified as low-risk, 39.9% were classified as medium-risk, and 13.6% were classified as high-risk using the National Institute for Health and Care Excellence Framingham risk model.21 In terms of risk distribution, a greater proportion of Hong Kong men were classified as high-risk compared with men in the United Kingdom in the early 2010s. In terms of age-standardised risk, the Hong Kong population had a higher 10-year CVD risk than the US population in 2003.
 
Analysis according to age showed that the proportion of low-risk participants increased from 2003-2005 to 2014-2015, particularly in the age-groups of 55-64 and 65-74 years (2003-2005: 28.5% vs 2014-2015: 44.7% for participants aged 55-64 years, and 6.1% vs 15.4% for participants aged 65-74 years) [Table 4]. This increased proportion may be the result of primary care clinicians recognising age as a prominent risk factor for CVD, leading to more aggressive treatment of modifiable risk factors. Furthermore, there were fewer male smokers in 2014-2015 than in 2003-2005 (24.0% vs 30.5%; P<0.001) [Table 1], suggesting that the success of recent anti-smoking campaigns also contributed to this paradigm shift.
 
Contrary to a previous report addressing changes in several CVD risk factors,22 we used the widely validated Framingham risk model to predict the risk of CVD. By measuring the net change in cardiovascular risk, we more comprehensively estimated the impacts of prevention strategies; this is particularly important because some risk factors worsened, some improved, and some remained stable over time. To our knowledge, this is the first study to evaluate the change in 10-year CVD risk in an Asian population. The results reinforce the need for more aggressive community-wide and clinic-based preventions, with an emphasis on increased exercise hours during leisure time, dietary changes that promote higher intake of vegetables and fruits, as well as lower intake of salt and saturated fats, weight maintenance, and smoking cessation. The findings also suggest that the discovery and use of lipid-lowering drugs and antihypertensive medications does not effectively translate into overall risk reduction in an Asian population, unless these treatments are simultaneously paired with prevention strategies targeting CVD risk factors.
 
Strengths and limitations
This study was based on two Hong Kong population health surveys, and thus the sample is highly representative of the general population. Baseline data were collected through laboratory tests and face-to-face interviews, suggesting that these data are highly reliable. The long interval between the two health surveys (2003-2005 to 2014-2015) also provides insights regarding the effectiveness of current CVD prevention strategies. Limitations of the current study involve its use of the Framingham risk prediction model and the PHS and HHS. Similar to other surveys, the PHS and HHS are susceptible to participation bias because the sample data might not provide an accurate representation of the overall population. Many PHS variables are self-reported and may lead to reporting bias. Considering the effort required to complete the long questionnaires, the collected data may be susceptible to non-response bias and recall bias. Furthermore, because the Framingham cohort primarily consisted of Caucasians, the predicted 10-year CVD risk in the Hong Kong population calculated using the Framingham model should be interpreted cautiously. The Framingham risk model was developed for people without a history of CVD; thus, a small proportion of the overall sample lacking a history of CVD might have been included in the study, causing inaccuracies in the results. Additionally, the model may overestimate the 10-year CVD risk for men in Hong Kong,14 and a recalibrated model with greater predictive power may be required. Although several limitations and assumptions may hinder the prediction of CVD risk, these potential sources of bias were present in both surveys. Therefore, the effects of such biases may be less important when comparing the change in predicted 10-year CVD risk between the two time points.
 
Conclusion
During the period from 2003-2005 to 2014-2015, the change in predicted 10-year CVD risk was not statistically significant. However, the proportion of low-risk participants within older age-groups was higher in PHS 2014/15 than in PHS 2003/2004 and HHS 2004/2005. More aggressive CVD prevention strategies and primary care interventions are needed to address CVD risk factors.
 
Author contributions
Concept or design: BMY Cheung, CLK Lam.
Acquisition of data: BYC Sung, EHM Tang, BMY Cheung, CLK Lam.
Analysis or interpretation of data: BYC Sung, EHM Tang.
Drafting of the manuscript: BYC Sung, EHM Tang.
Critical revision of the manuscript for important intellectual content: L Bedford, CKH Wong, ETY Tse, EYT Yu, BMY Cheung, CLK Lam.
 
All authors had full access to the data, contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
As advisers of the journal, CKH Wong and EYT Yu were not involved in the peer review process. Other authors have disclosed no conflicts of interest.
 
Acknowledgement
The authors thank the Department of Health of the Hong Kong SAR Government for granting approval to use the data from Population Health Survey 2003/2004, Heart Health Survey 2004/2005, and Population Health Survey 2014/15 for this research. The authors also thank Dr Weinan Dong and Ms Tingting Wu from the Department of Family Medicine and Primary Care of The University of Hong Kong for their assistance in data interpretation and revising the final draft of the manuscript.
 
Declaration
Part of this study was presented as a poster at the ESC Congress 2021 – The Digital Experience (virtual), 27-30 August 2021, and was published as an abstract (Sung BY, Tang EH, Bedford L, et al. Change in Framingham cardiovascular disease risk between 2003 and 2014 in the Hong Kong Population Health Survey [abstract]. Eur Heart J 2021;42:1).
 
Funding/support
This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
 
Ethics approval
The requirement for ethics approval for this research was waived by the Institutional Review Board of The University of Hong Kong/Hospital Authority Hong Kong West Cluster, Hong Kong as this study involved secondary analysis of de-identified governmental data.
 
References
1. Roth GA, Johnson C, Abajobir A, et al. Global, regional, and national burden of cardiovascular diseases for 10 causes, 1990 to 2015. J Am Coll Cardiol 2017;70:1-25. Crossref
2. GBD 2017 Causes of Death Collaborators. Global, regional, and national age-sex-specific mortality for 282 causes of death in 195 countries and territories, 1980-2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet 2018;392:1736-88. Crossref
3. Romero CX, Romero TE, Shlay JC, Ogden LG, Dabelea D. Changing trends in the prevalence and disparities of obesity and other cardiovascular disease risk factors in three racial/ethnic groups of USA adults. Adv Prev Med 2012;2012:172423. Crossref
4. World Health Organization. Hypertension. 2019. Available from: https://www.who.int/news-room/fact-sheets/detail/hypertension. Accessed 8 Jan 2021.
5. Kearney PM, Whelton M, Reynolds K, Muntner P, Whelton PK, He J. Global burden of hypertension: analysis of worldwide data. Lancet 2005;365:217-23. Crossref
6. Lin X, Xu Y, Pan X, et al. Global, regional, and national burden and trend of diabetes in 195 countries and territories: an analysis from 1990 to 2025. Sci Rep 2020;10:14790. Crossref
7. World Health Organization. Cardiovascular diseases. 2016. Available from: https://www.who.int/health-topics/cardiovascular-diseases#tab=tab_1. Accessed 8 Jan 2021.
8. World Health Organization. Prevention of cardiovascular disease. Guidelines for assessment and management of cardiovascular risk. 2007. Available from: https://apps.who.int/iris/bitstream/handle/10665/43685/9789241547178_eng.pdf?sequence=1&isAllowed=y. Accessed 8 Jan 2021.
9. Ajani UA, Ford ES. Has the risk for coronary heart disease changed among U.S. adults? J Am Coll Cardiol 2006;48:1177-82. Crossref
10. Lopez-Jimenez F, Batsis JA, Roger VL, Brekke L, Ting HH, Somers VK. Trends in 10-year predicted risk of cardiovascular disease in the United States, 1976 to 2004. Circ Cardiovasc Qual Outcomes 2009;2:443-50. Crossref
11. Department of Health, Hong Kong SAR Government; Department of Community Medicine, The University of Hong Kong. Population Health Survey 2003/2004 2015. Available from: https://www.chp.gov.hk/files/pdf/report_on_population_health_survey_2003_2004_en.pdf. Accessed 8 Jan 2021.
12. Surveillance and Epidemiology Branch, Centre for Health Protection, Department of Health, Hong Kong SAR Government. Report of Population Health Survey 2014/2015. 2017. Available from: https://www.chp.gov.hk/files/pdf/dh_phs_2014_15_full_report_eng.pdf. Accessed 8 Jan 2021.
13. D’Agostino RB Sr, Vasan RS, Pencina MJ, et al. General cardiovascular risk profile for use in primary care: the Framingham Heart Study. Circulation 2008;117:743-53. Crossref
14. Lee CH, Woo YC, Lam JK, et al. Validation of the Pooled Cohort equations in a long-term cohort study of Hong Kong Chinese. J Clin Lipidol 2015;9:640-6.e2. Crossref
15. World Health Organization. Age standardization of rates: a new WHO standard. 2001. Available from: https://cdn.who.int/media/docs/default-source/gho-documents/global-health-estimates/gpe_discussion_paper_series_paper31_2001_age_standardization_rates.pdf. Accessed 6 May 2024.
16. United States Census Bureau. Introduction to Census 2000 Data Products. Available from: https://usa.ipums.org/usa/resources/voliii/pubdocs/2000/mso-01icdp.pdf. Accessed 1 May 2024.
17. Census and Statistics Department, Hong Kong SAR Government. Hong Kong population projections 2017-2066. 2017. Available from: https://www.censtatd.gov.hk/media_workers_corner/pc_rm/hkpp2017_2066/index.jsp. Accessed 8 Jan 2021.
18. Department of Health, Hong Kong SAR Government. Non-Communicable Diseases Watch. Overview of cardiovascular diseases. September 2018. Available from: https://www.chp.gov.hk/files/pdf/ncd_watch_sep_2018.pdf. Accessed 8 Jan 2021.
19. National Prevention Council, United States Government. National Prevention Strategy. America’s plan for better health and wellness. June 2011. Available from: https://www.hhs.gov/sites/default/files/disease-prevention-wellness-report.pdf. Accessed 8 Jan 2021.
20. Chow CK, Nguyen TN, Marschner S, et al. Availability and affordability of medicines and cardiovascular outcomes in 21 high-income, middle-income and low-income countries. BMJ Glob Health 2020;5:e002640. Crossref
21. Collins GS, Altman DG. Predicting the 10-year risk of cardiovascular disease in the United Kingdom: independent and external validation of an updated version of QRISK2. BMJ 2012;344:e4181. Crossref
22. Carroll MD, Lacher DA, Sorlie PD, et al. Trends in serum lipids and lipoproteins of adults, 1960-2002. JAMA 2005;294:1773-81. Crossref

Detection of significant liver fibrosis in Chinese psoriasis patients receiving methotrexate: a comparison between transient elastography and liver histology

© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Detection of significant liver fibrosis in Chinese psoriasis patients receiving methotrexate: a comparison between transient elastography and liver histology
Christina SM Wong, FRCP (Edin), FHKAM (Medicine)# 1; Loey LY Mak, MD, FRCP (Glasg)# 2; Victor KH Lee, FRCR, FHKAM (Radiology)3; Regina CL Lo, MD, FHKAM (Pathology)4; Martin MH Chung, MRCP, FHKAM (Medicine)1; Ferdinand Chu, FRCR, FACLM5,6; CK Yeung, MD, FRCP1; MF Yuen, PhD, FRCP2; Henry HL Chan, MD, PhD1
1 Division of Dermatology, Department of Medicine, School of Clinical Medicine, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
2 Division of Gastroenterology and Hepatology, Department of Medicine, School of Clinical Medicine, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
3 Imaging and Interventional Radiology Centre, CUHK Medical Centre, Hong Kong SAR, China
4 Department of Pathology, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
5 Department of Radiology, Queen Mary Hospital, Hong Kong SAR, China
6 St Vincent’s Hospital, Sydney, Australia
# Equal contribution
 
Corresponding author: Prof Henry HL Chan (hhlchan@hku.hk)
 
 Full paper in PDF
 
Abstract
Introduction: Methotrexate (MTX) is effective for treating psoriasis and psoriatic arthritis, but its potential hepatoxicity remains a concern. Liver biopsy, the gold standard for detecting MTX-induced liver injury, is invasive and carries considerable risk. Transient elastography (TE) offers a non-invasive alternative for detecting advanced liver fibrosis. This study investigated the performance of TE in detecting MTX-induced liver fibrosis among Chinese psoriasis patients, compared with liver biopsy.
 
Methods: This study included adult patients with clinical psoriasis. Liver stiffness measurement using TE was performed in patients receiving MTX. Exclusion criteria were known liver cirrhosis, positive viral hepatitis carrier status, or conditions influencing TE performance. Liver biopsy was performed when liver stiffness was ≥7.1 kilopascals (kPa) or when the total cumulative dose (TCD) of MTX was ≥3.5 g.
 
Results: A total of 228 patients were screened; among 34 patients who met the inclusion criteria, nine (26.5%) had significant liver fibrosis (Roenigk grade ≥3a). The area under the receiver operating characteristic curve was 0.76 (95% confidence interval=0.59-0.93; P=0.021), indicating that TE had satisfactory performance in detecting liver fibrosis. A cut-off value of 7.1 kPa of liver stiffness yielded 100% sensitivity and 68% specificity. Liver fibrosis was not correlated with the TCD of MTX or the duration of MTX use; it was significantly correlated with obesity and diabetes status (body mass index ≥30 kg/m2, waist circumference ≥138 cm, and glycated haemoglobin level ≥7.8%).
 
Conclusions: Transient elastography is reliable and superior to the TCD for detecting liver fibrosis in Chinese psoriasis patients receiving MTX. Liver biopsy should be reserved for high-risk patients or patients with liver stiffness ≥11.7 kPa on TE.
 
 
New knowledge added by this study
  • Transient elastography (TE) exhibits satisfactory performance in the detection of methotrexate (MTX)-induced liver fibrosis among Chinese psoriasis patients receiving MTX.
  • Although current guidelines state that liver biopsy should be considered if the total cumulative dose of MTX is ≥3.5 g, it is shown that the dose was not correlated with the degree of liver fibrosis in Chinese psoriasis patients. Importantly, liver stiffness (LS) measurement by TE is a more appropriate method for monitoring MTXinduced liver fibrosis.
  • Chinese patients with obesity who exhibit a high body mass index and large abdominal circumference have a higher risk of liver fibrosis and should be closely monitored when receiving MTX.
Implications for clinical practice or policy
  • Transient elastography can be used to monitor MTX-induced liver fibrosis, whereas liver biopsy should be reserved for high-risk psoriasis patients.
  • Yearly TE monitoring is recommended for patients with LS ≥7.1 kilopascals (kPa), while liver biopsy should be considered for patients with LS ≥11.7 kPa.
 
 
Introduction
Methotrexate (MTX) is an effective immunosuppressive drug for moderate to severe psoriasis and psoriatic arthritis.1 2 It is considered relatively safe and cost-effective for long-term use. However, prolonged used of MTX can potentially cause liver fibrosis and fatty changes without alterations in liver enzymes. The mechanism of MTX-induced liver fibrosis is suspected to involve the production of extracellular adenosine, a pro-fibrotic agent.3
 
Although liver fibrosis can be evaluated by imaging or biochemical parameters, liver biopsy remains the gold standard for diagnosing liver fibrosis.4 However, it has limitations such as sampling error and poor feasibility for serial assessments. The rates of liver biopsy–associated morbidity and mortality are approximately 1% and 0.01% to 0.1%, respectively.5 6 The Roenigk scale is generally used to grade MTX-induced liver fibrosis,7 but other systems (eg, Ishak and METAVIR) have also been used for fibrosis assessment.8 9
 
The measurement of serum level of specific biomarkers such as the amino terminal of type III procollagen peptide can be used as alternative methods to assess liver fibrosis.10 11 12 13 However, these measurements are not widely available in many regions. Therefore, a sensitive and specific non-invasive technique is required to detect MTX-induced liver injury.
 
Transient elastography (TE) is a non-invasive method for assessing liver ‘hardness’ or stiffness that involves measuring the velocity of a vibration wave (ie, a shear wave) when travelling to a particular depth inside the liver, based on the principle that the wave velocity is greater in fibrotic tissue than in normal liver tissue. This technique has been used as a screening tool for liver cirrhosis in various conditions.14 15 16 17 In contrast to liver biopsy, TE allows the simultaneous assessment of a larger sampling area, is easily reproducible, and has low inter-observer variability and a low failure rate (<5%).15 Generally, a liver stiffness (LS) of ≤5 kilopascals (kPa) suggests a low probability of fibrosis, whereas a value of ≥7 kPa suggests a high likelihood of advanced fibrosis in patients with various chronic liver disorders.14 15 16 17 18 Thus far, there are limited data regarding TE accuracy and cut-off threshold in terms of detecting MTX-induced liver injury among Chinese psoriasis patients.
 
This study aimed to evaluate the performance (reliability) of TE in detecting significant liver fibrosis among psoriasis patients in the Chinese population, compared with gold-standard liver biopsy assessment using the Roenigk classification.
 
Methods
This cross-sectional single-centre study was conducted from 1 December 2019 to 31 March 2021. Patients with psoriasis and/or psoriatic arthritis undergoing regular follow-up in the dermatological clinic at a major tertiary hospital in Hong Kong were consecutively screened for inclusion (Fig 1). The inclusion criteria were as follows: age ≥18 years, clinical diagnosis of psoriasis with or without arthritis, and current use of MTX with a cumulative dosage of ≥150 mg or minimum duration of 6 months. Patients were excluded if they had known liver cirrhosis or concomitant chronic liver disease (including chronic hepatitis B or C infection, alcohol use disorder, Wilson’s disease, autoimmune hepatitis, primary biliary cholangitis, primary sclerosing cholangitis, and a history of hepatocellular carcinoma), had a condition that could influence TE accuracy (eg, ascites, severe renal disease, peritoneal dialysis, or severe cardiovascular insufficiency),14 15 16 17 18 refused to give consent, were pregnant, or had not been receiving MTX. After the acquisition of informed consent, all participants provided a thorough history and underwent a comprehensive examination.
 

Fig 1. Participant recruitment
 
Clinical and laboratory data
Upon recruitment, clinical and metabolic measurements, including body weight, body mass index (BMI), systolic and diastolic blood pressure, and waist circumference, were recorded. The duration and cumulative dosage of MTX treatment, disease severity (measured by the Psoriasis Area and Severity Index), and body surface area (BSA) were documented. Laboratory parameters assessed included hepatitis B/C infection status, complete blood count, serum fasting glucose level, glycated haemoglobin (HbA1c) level, triglyceride level, low- and high-density lipoprotein cholesterol levels, liver enzyme levels (eg, aspartate aminotransferase and alanine aminotransferase), and renal function. Patient demographic data, co-morbidities (hypertension, diabetes, cardiovascular disease, chronic liver/renal disease, and alcoholic liver disease), and concomitant medications were collected from electronic medical records.
 
Liver stiffness measurement by transient elastography
Transient elastography assessments were conducted within 1 month after recruitment. Liver stiffness was evaluated using the FibroScan ultrasonic imaging device (Echosens, Paris, France) and expressed as the median value (in kPa) after at least 10 successful acquisitions. Measurements were considered reliable if the success rate was ≥60%, combined with an interquartile range (IQR) of ≤30%.15 16 For patients with a BMI of <30 kg/m2, an M probe was used; for those with a BMI of ≥30 kg/m2, an XL probe was used. The controlled attenuation parameter (CAP) was also recorded to estimate liver steatosis, expressed in decibels per metre (dB/m).16 The investigators (senior research assistants trained to perform TE) were blinded to the patients’ clinical characteristics, MTX dosage, and previous liver ultrasound or biopsy results. Patients were instructed to fast for 4 hours prior to LS measurements. In previous studies involving Asian populations, significant liver fibrosis or cirrhosis was indicated by LS values of ≥7.1 kPa19 and >14.0 kPa,20 respectively. Therefore, in our study, significant liver fibrosis on TE was defined as LS ≥7.1 kPa.
 
Liver biopsy
A liver biopsy (to assess the degree of liver fibrosis by histology) was performed within 3 months after TE in patients who had significant liver fibrosis (ie, LS ≥7.1 kPa on TE) or a total cumulative dose (TCD) of MTX ≥3.5 g, with or without additional risk factors. This protocol was adopted based on international guidelines1 2 18 19 20 21 22 and previous studies in Asian populations.21 22 Ultrasound-guided core-needle liver biopsy procedures were performed by an experienced radiologist using a percutaneous approach.5 6
 
Histological assessments were conducted by a pathologist with expertise in hepatobiliary disorders, who was blinded to the TE results. The Roenigk scale was used to grade liver fibrosis/cirrhosis, defined as grade 1 (no fibrosis, no or mild fatty infiltration, no or mild nuclear variability, and no or mild portal inflammation), grade 2 (no fibrosis, but moderate to severe fatty infiltration, nuclear pleomorphism, and portal inflammation), grade 3a (mild fibrosis, moderate to severe fatty infiltration, portal tract enlargement, and lobular necrosis), grade 3b (moderate to severe fibrosis), and grade 4 (cirrhosis).7 For patients with MTX-induced liver injury (Roenigk grade 3a), MTX could be continued, with follow-up liver biopsy repeated in 6 months. For those with significant liver fibrosis or cirrhosis (Roenigk grades 3b or 4), MTX was discontinued and alternative treatment was initiated.2
 
Data analysis and statistics
For statistical analysis, continuous variables were expressed as median (range or IQR, as specified) or mean (± standard deviation) values, as appropriate. Receiver operating characteristic (ROC) curves were used to determine the predictive ability of TE-based LS relative to histopathology (Roenigk grade ≥3a), with 95% confidence intervals (CIs). Correlations between two variables were calculated using Pearson or Spearman rank correlation coefficients. The Chi squared test or Fisher’s exact test was used for comparisons of categorical variables, as appropriate. Quantitative variables were subjected to normality assessment via the Shapiro–Wilk test; non–normally distributed variables were compared between groups using the Wilcoxon rank-sum test. Statistical analyses were conducted using SPSS software (Windows version 26.0; IBM Corp, Armonk [NY], United States). P values <0.05 were considered statistically significant.
 
Results
Baseline characteristics
In total, 785 psoriasis patients were recruited into the study; 234 fulfilled the screening criteria and were included in the analysis. Six patients subsequently withdrew their consent (Fig 1).
 
Among 228 patients who underwent TE evaluation, 140 were diagnosed with psoriasis and 88 were diagnosed with psoriatic arthritis (Table 1). Fifty-eight patients, who either had LS ≥7.1 kPa or received a TCD of MTX ≥3.5 g, were advised to undergo liver biopsy; 24 patients who refused or had contraindications to liver biopsy were excluded from the study (Fig 1). Thus, 34 patients (24 fulfilling the TE criteria and 15 fulfilling the TCD criteria of MTX) who had undergone both TE and liver biopsies were included in further analyses. The clinical and demographic details of the study participants are summarised in Table 1.
 
Among the 34 patients, the median values of LS and CAP were 8.80 kPa (IQR, 7.1-12.4; range, 3.5-30.4) and 321.0 dB/m (IQR, 262-357; range, 200-400), respectively. Furthermore, 24 patients (70.6%) had a high LS value (ie, ≥7.1 kPa, indicative of significant fibrosis); 24 patients (70.6%) had a CAP value of ≥268 dB/m, indicative of moderate to severe steatosis.
 

Table 1. Demographic and clinical characteristics of patients who underwent transient elastography and those who underwent liver biopsy assessments
 
Histology evaluation revealed liver fibrosis in nine of 34 (26.5%) patients: six had Roenigk grade 3a, three had Roenigk grade 3b, and none had Roenigk grade 4 (cirrhosis).
 
Correlations of liver stiffness with clinical and laboratory parameters
Liver stiffness measurements via TE showed moderate correlations with BMI (r=0.441; P=0.009), waist circumference (r=0.437; P=0.01), and body weight (r=0.456; P=0.007); there were no correlations with the TCD of MTX or duration of MTX use (Table 2).
 

Table 2. Correlation of liver stiffness values with clinical and laboratory parameters (n=34)
 
Patients with LS ≥7.1 kPa had a higher BMI (P=0.01), body weight (P=0.01), and waist circumference (P=0.03). They also exhibited greater disease severity with a higher Psoriasis Area and Severity Index score (P=0.02) and more extensive BSA involvement (P=0.03). Furthermore, patients with LS ≥7.1 kPa had higher fasting glucose (P=0.03) and HbA1c levels (P=0.02), but they did not show significant differences in serum lipid levels (Table 3).
 

Table 3. Comparison of psoriasis patients with and without liver fibrosis according to transient elastography and liver histology (n=34)
 
In terms of histology findings, patients with a Roenigk grade ≥3a had significantly greater BMI (P=0.01), waist circumference (P=0.04), BSA (P=0.04), and HbA1c values (P=0.03), compared with those who exhibited lower stages of histological fibrosis. Notably, there were no significant differences in the TCD of MTX, lipid profiles, or liver and renal biochemistries between the two groups. Patients with Roenigk grade ≥3a (LS: 11.7 kPa; range, 7.8-15.7) had higher LS values than those with Roenigk grade <3a (LS: 8.6 kPa; range, 5.9-11) [P=0.018] (Table 3).
 
Performance characteristics of transient elastography for diagnosing significant/advanced liver fibrosis
Comparison of TE-based LS values (in kPa) with histopathology revealed an area under the ROC curve of 0.76 (95% CI=0.59-0.93; P=0.021) [Fig 2], which demonstrated the satisfactory performance of TE in detecting significant liver fibrosis. An LS cut-off value of 7.1 kPa yielded 100% sensitivity and 68% specificity for diagnosing Roenigk grade ≥3a.
 

Fig 2. Performance of transient elastography as reflected by area under the receiver operating characteristic (ROC) curve comparing liver fibrosis detection between liver stiffness measurement (in kilopascals) and liver biopsy (Roenigk grade ≥3a)
 
In a subgroup analysis of patients with LS ≥7.1 kPa, excluding patients who only met the TCD criteria of MTX (n=24), the area under the ROC curve with reference to Roenigk grade ≥3a was 0.702 (95% CI=0.47-0.94); an LS cut-off value of 10.7 kPa yielded 86% sensitivity and 59% specificity. In contrast, when ROC analysis was focused on the subgroup of patients with a TCD of MTX ≥3.5 g (n=14, excluding patients who only met the TE criteria), the area under the ROC curve was 0.622 (95% CI=0.31-0.94); an LS cut-off value of 8.2 g yielded 60% sensitivity and 67% specificity.
 
Discussion
Use of transient elastography to monitor liver fibrosis in psoriasis patients receiving methotrexate
International guidelines recommend using TE for routine monitoring of MTX therapy.2 4 According to the Australasian position statement, TE monitoring is recommended every 3 years if initial LS is <7.5 kPa and yearly if LS is ≥7.5 kPa; liver biopsy is recommended if LS is >9.5 kPa.23
 
Currently, there is no guideline for monitoring MTX-induced liver fibrosis among Chinese psoriasis patients in Hong Kong. On the basis of previous studies,2 4 23 the present study utilised a modified approach that reflects our department’s routine practice of conducting liver biopsy for patients who have TE-based LS ≥7.1 kPa or a TCD of MTX ≥3.5 g, with or without abnormal liver biochemistry.
 
Our study confirmed the robust performance of TE in detecting significant liver fibrosis among Chinese psoriasis patients receiving MTX; the LS cut-off value of 7.1 kPa yielded 100% sensitivity and 68% specificity. These findings are consistent with a recent review and studies in other countries (Table 4).7 19 20 According to these published data, TE demonstrated fair to good performance in detecting liver fibrosis, with high negative predictive values (NPVs) [83% to 96%] but generally low positive predictive values (PPVs),19 20 21 22 23 24 which might be explained by the overall low prevalence of significant liver fibrosis in this population and the higher rate of TE failure among patients with obesity. Obesity might also influence diagnostic performance; a recent review noted that obesity could substantially reduce TE accuracy.19 Lee et al25 showed that, compared with control participants, independent risk factors for liver fibrosis included diabetes mellitus (odds ratio [OR]=30.4), obesity with high BMI (OR=8.3), and overweight (OR=6.3). Our results supported previous observations that TE-based LS measurements were not correlated with the TCD of MTX, although they were associated with BMI, diabetes mellitus, and obesity.19 24 Rongngern et al19 analysed 41 Asian psoriasis patients receiving MTX; they demonstrated that TE had good performance in detecting MTX-induced liver injury, with an area under the ROC curve of 0.78.19
 

Table 4. Comparison of studies about the diagnostic performance of transient elastography in detecting methotrexate-induced liver fibrosis among psoriasis patients, compared with liver histology
 
In this same study,19 using an LS cut-off value of 7.1 kPa, for detecting MTX-induced liver injury, defined as Roenigk grade ≥3a, TE provided a sensitivity and specificity of 50% and 83.9%, respectively, and a PPV of 50% and a NPV of 84%; in addition, the use of TE values ≥7.1 yielded 50% sensitivity and 76.9% specificity for detecting significant liver fibrosis, defined as METAVIR stage ≥F2; and giving a PPV of 10% and a NPV of 96.8%. Similarly, our study showed a PPV of 26.7% and a NPV of 100%. In the study of 53 psoriasis patients receiving MTX, Khandpur et al20 identified median LS values of 5.3 kPA (range, 2.7-17.8); TE could only detect 4 (21%) of 19 patients with liver fibrosis (Ishak stage ≥F1).20
 
Because the median LS for our patients with significant liver fibrosis (Roenigk grade ≥3a) was 11.7 kPa (IQR, 7.8-15.7), we recommend yearly TE monitoring for patients with LS ≥7.1 kPa. Liver biopsy should be considered for patients with LS ≥11.7 kPa, instead of a TCD ≥3.5 g or LS ≥7.1 kPa; earlier biopsy is suggested for high-risk patients (eg, patients with high BMI, abdominal obesity, or diabetes mellitus).
 
Associations of body mass index, abdominal obesity, and glycated haemoglobin level with liver fibrosis
The median LS in this study (n=228) was 6.91 kPa (IQR, 4.5-7.1). Among the 34 patients who underwent liver biopsy, the median LS was 8.80 kPa (IQR, 7.1-12.4). Overall, 9 of 34 patients (26.5%) had Roenigk grade 3a (mild)/3b (moderate to severe) liver fibrosis, whereas 14 (41.2%) patients had moderate to severe fatty infiltration (Roenigk grade 2) [Table 1]. The prevalences of liver fibrosis and steatotic changes in our study were higher than the rates reported in a 2015 review, where histology showed that only 5% of patients (range, 0%-33%) receiving MTX had developed significant liver fibrosis.14
 
Impacts of body mass index and coexisting non-alcoholic steatohepatitis or fatty liver disease on liver stiffness measurement
The aetiology of liver fibrosis can be multifactorial. Previous studies have shown that obesity, combined with other metabolic risk factors, is associated with liver fibrosis in non-alcoholic fatty liver disease (NAFLD) patients and psoriasis patients.26 27 28 The potential contributions of coexisting non-alcoholic steatohepatitis or NAFLD and metabolic syndrome to liver fibrosis have been suggested. In our cohort, >40% of psoriasis patients had a CAP of ≥268 dB/m, indicative of moderate to severe steatosis. The liver fibrosis could be attributed to coexisting non-alcoholic steatohepatitis or NAFLD, in addition to MTX-induced changes. The work of Wong et al26 demonstrated that NAFLD patients with BMI ≥30 kg/m2 had higher LS compared with normal healthy individuals. Our findings corroborate these observations; we found that BMI, body weight, and waist circumference were moderately correlated with TE-based LS measurements (all r>0.40; all P<0.05) [Table 2]. Intriguingly, although 70.6% of our biopsy cohort had moderate to severe hepatic steatosis (Table 1), CAP values were not significantly associated with LS >7.1 kPa or Roenigk grade ≥3a (Table 3). Therefore, the adverse effect of BMI on LS cannot be explained by concomitant hepatic steatosis alone.
 
In our study, patients with clinically significant liver fibrosis more frequently displayed characteristics of metabolic syndrome compared with patients lacking histologically confirmed liver fibrosis, as evidenced by significantly greater BMI (33.5 ± 0.68 kg/m2 vs 29.4 ± 6.18 kg/m2; P=0.01), waist circumference (138.0 ± 52.0 cm vs 101.2 ± 14.9 cm; P=0.04), and HbA1c values (7.86 ± 2.02% vs 6.15 ± 1.16%; P=0.03) [Table 3]. Sub-analysis showed that all patients with histologically confirmed liver fibrosis had a BMI ≥25 kg/m2 (Table 5), suggesting that psoriasis patients should maintain a normal BMI to decrease the risk of liver fibrosis.
 

Table 5. Liver stiffness values according to body mass index, psoriasis severity, and liver histology among psoriasis patients receiving methotrexate
 
Role of transient elastography: liver stiffness measurements to guide liver biopsy considerations
Despite the recommendation to perform a screening liver biopsy for exclusion of possible liver cirrhosis among patients receiving long-term MTX therapy (with a TCD >3.5 g),1 2 our study did not identify a positive correlation between significant liver fibrosis and the TCD of MTX. Similarly, a study involving 420 patients with inflammatory arthritis receiving MTX revealed no significant correlation between cumulative MTX dosage and TE-based LS measurements.24 In the present study, although the TCD of MTX was higher in patients with liver fibrosis (5.23 g vs 4.79 g in those without), the difference was not statistically significant. In this context, LS measurements may be superior to the TCD of MTX when considering the need for liver biopsy to rule out advanced liver fibrosis.
 
Contrary to concerns about hepatotoxicity during long-term MTX therapy, we did not find a correlation between LS values and the duration of MTX use. The duration of MTX use in patients without liver fibrosis was nearly 10 years, indicating that the drug is generally well-tolerated in psoriasis patients. Patients who tended to continue MTX treatment belonged to an MTX-responsive group without significant adverse events, such as liver derangement identified via blood tests. In contrast, patients experiencing liver derangement or other adverse events might have discontinued MTX treatment earlier and were thus excluded from the study. Although we excluded patients with chronic hepatitis B, TE-based LS measurement is a widely validated non-invasive tool for real-world assessments of liver fibrosis in such patients; it allows the prediction of advanced fibrosis and disease progression.29 Therefore, TE should also be considered a valuable tool in guiding treatment for psoriasis patients with chronic hepatitis B who are receiving MTX.
 
In addition to LS measurement, TE can assess the degree of steatosis through CAP. This assessment was beneficial among patients with psoriasis in the present study; 26.5% (9 of 34) of the patients had metabolic syndrome and were predisposed to concomitant hepatic steatosis, regardless of MTX use. The median CAP value in our study was 321 ± 95 dB/m (range, 200-400). The area under the ROC curve of TE was 0.783 (95% CI=0.61-0.95; P<0.01); the cut-off of 254 dB/m for detecting steatosis yielded 91% sensitivity and 60% specificity. However, the presence of simple hepatic steatosis alone does not warrant liver biopsy, and management decisions should follow appropriate clinical guidelines.30 31 32
 
Limitations
This study had some limitations. Notably, its sample size was small. Although liver fibrosis is generally uncommon in patients with psoriasis receiving MTX therapy, a larger sample size may be required for more definitive conclusions. Additionally, sampling error and inter- and intra-observer variabilities in histological assessment of liver tissue may have influenced the findings.
 
Conclusion
Transient elastography is a reliable screening tool for detecting significant liver fibrosis in Chinese psoriasis patients receiving MTX. When considering liver biopsy to rule out the possibility of clinically significant liver fibrosis, TE-based LS measurements provide superior reference information, compared with the TCD of MTX. Patients with high BMI, body weight, and abdominal obesity have a higher risk of liver fibrosis. Therefore, these factors should be considered when monitoring MTX-related liver fibrosis in psoriasis patients.
 
Author contributions
Concept or design: CSM Wong, LLY Mak, CK Yeung, HHL Chan, MF Yuen.
Acquisition of data: CSM Wong, MMH Chung, LLY Mak.
Analysis or interpretation of data: CSM Wong, MMH Chung, LLY Mak.
Drafting of the manuscript: CSM Wong, LLY Mak, F Chu, VKH Lee, RCL Lo.
Critical revision of the manuscript for important intellectual content: CK Yeung, HHL Chan, MF Yuen.
 
All authors had full access to the data, contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
All authors have disclosed no conflicts of interest.
 
Acknowledgement
The authors thank Ms Rachael Yu, Ms Davis Wong, and Ms Ivy Cheng from Division of Dermatology, Department of Medicine, School of Clinical Medicine, Li Ka Shing Faculty of Medicine, The University of Hong Kong for their assistance with patient recruitment, data acquisition, and analysis.
 
Declaration
The preliminary data of this research were presented as ePoster presentation in the 31st European Academy of Dermatology and Venereology Congress 2022 (7-10 September 2022, Milan, Italy and online).
 
Funding/support
This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
 
Ethics approval
This study was approved by the Institutional Review Board of The University of Hong Kong and Hong Kong West Cluster, Hospital Authority, Hong Kong (Ref No.: UW19-390) and was conducted in full compliance with the International Council for Harmonisation E6 guideline for Good Clinical Practice and the principles of the Declaration of Helsinki. Patient consent has been obtained for all clinical information and images reported in this article. All participant information has been deidentified and remains anonymous.
 
References
1. Kalb RE, Strober B, Weinstein G, Lebwohl M. Methotrexate and psoriasis: 2009 National Psoriasis Foundation Consensus Conference. J Am Acad Dermatol 2009;60:824-37. Crossref
2. Warren RB, Weatherhead SC, Smith CH, et al. British Association of Dermatologists’ guidelines for the safe and effective prescribing of methotrexate for skin disease 2016. Br J Dermatol 2016;175:23-44. Crossref
3. Themido R, Loureiro M, Pecegueiro M, Brandão M, Campos MC. Methotrexate hepatotoxicity in psoriatic patients submitted to long-term therapy. Acta Derm Venereol 1992;72:361-4. Crossref
4. Kremer JM, Alarcón GS, Lightfoot RW Jr, et al. Methotrexate for rheumatoid arthritis. Suggested guidelines for monitoring liver toxicity. American College of Rheumatology. Arthritis Rheum 1994;37:316-28. Crossref
5. Howlett DC, Drinkwater KJ, Lawrence D, Barter S, Nicholson T. Findings of the UK national audit evaluating image-guided or image-assisted liver biopsy. Part II. Minor and major complications and procedure-related mortality. Radiology 2013;266:226-35. Crossref
6. Regev A, Berho M, Jeffers LJ, et al. Sampling error and intraobserver variation in liver biopsy in patients with chronic HCV infection. Am J Gastroenterol 2002;97:2614-8. Crossref
7. Berends MA, Snoek J, de Jong EM, et al. Biochemical and biophysical assessment of MTX-induced liver fibrosis in psoriasis patients: FibroTest predicts the presence and FibroScan predicts the absence of significant liver fibrosis. Liver Int 2007;27:639-45. Crossref
8. Ishak K, Baptista A, Bianchi L, et al. Histological grading and staging of chronic hepatitis. J Hepatol 1995;22:696-9. Crossref
9. Goodman ZD. Grading and staging systems for inflammation and fibrosis in chronic liver diseases. J Hepatol 2007;47:598-607. Crossref
10. Carneiro SC, Cássia FF, Lamy F, Chagas VL, Ramos-e-Silva M. Methotrexate and liver function: a study of 13 psoriasis cases treated with different cumulative dosages. J Eur Acad Dermatol Venereol 2008;22:25-9. Crossref
11. Chou R, Wasson N. Blood tests to diagnose fibrosis or cirrhosis in patients with chronic hepatitis C virus infection: a systematic review. Ann Intern Med 2013;158:807-20. Crossref
12. Fung J, Lai CL, Fong DY, Yuen JC, Wong DK, Yuen MF. Correlation of liver biochemistry with liver stiffness in chronic hepatitis B and development of a predictive model for liver fibrosis. Liver Int 2008;28:1408-16. Crossref
13. Martyn-Simmons CL, Rosenberg WM, Cross R, Wong T, Smith CH, Barker JN. Validity of noninvasive markers of methotrexate-induced hepatotoxicity: a retrospective cohort study. Br J Dermatol 2014;171:267-73. Crossref
14. Brener S. Transient elastography for assessment of liver fibrosis and steatosis: an evidence-based analysis. Ont Health Technol Assess Ser 2015;15:1-45.
15. Foucher J, Castéra L, Bernard PH, et al. Prevalence and factors associated with failure of liver stiffness measurement using FibroScan in a prospective study of 2114 examinations. Eur J Gastroenterol Hepatol 2006;18:411-2. Crossref
16. Karlas T, Petroff D, Sasso M, et al. Individual patient data meta-analysis of controlled attenuation parameter (CAP) technology for assessing steatosis. J Hepatol 2017;66:1022-30. Crossref
17. Castéra L, Vergniol J, Foucher J, et al. Prospective comparison of transient elastography, Fibrotest, APRI, and liver biopsy for the assessment of fibrosis in chronic hepatitis C. Gastroenterology 2005;128:343-50. Crossref
18. Friedrich-Rust M, Ong MF, Martens S, et al. Performance of transient elastography for the staging of liver fibrosis: a meta-analysis. Gastroenterology 2008;134:960-74. Crossref
19. Rongngern P, Chularojanamontri L, Wongpraparut C, et al. Diagnostic performance of transient elastography for detection of methotrexate-induced liver injury using Roenigk classification in Asian patients with psoriasis: a retrospective study. Arch Dermatol Res 2017;309:403-8. Crossref
20. Khandpur S, Yadav D, Jangid B, et al. Ultrasound liver elastography for the detection of liver fibrosis in patients with psoriasis and reactive arthritis on long-term methotrexate therapy: a cross-sectional study. Indian J Dermatol Venereol Leprol 2020;86:508-14. Crossref
21. Marsh RL, Kelly S, Mumtaz K, Kaffenberger J. Utility and limitations of transient elastography to monitor hepatic steatosis, hepatic fibrosis, and methotrexate-associated hepatic disease in psoriasis: a systematic review. J Clin Aesthet Dermatol 2021;14:24-8.
22. Cheng HS, Rademaker M. Monitoring methotrexate-induced liver fibrosis in patients with psoriasis: utility of transient elastography. Psoriasis (Auckl) 2018;8:21-9. Crossref
23. Rademaker M, Gupta M, Andrews M, et al. The Australasian Psoriasis Collaboration view on methotrexate for psoriasis in the Australasian setting. Australas J Dermatol 2017;58:166-70. Crossref
24. Darabian S, Wade JP, Kur J, Wade SD, Sayre EC, Badii M. Using FibroScan to assess for the development of liver fibrosis in patients with arthritis on methotrexate: a single-center experience. J Rheumatol 2022;49:558-65. Crossref
25. Lee JH, Loo CH, Tan WC, Lee CK, Jamil A, Khor YH. Comparison of noninvasive screening tools for hepatic fibrosis, association with methotrexate cumulative dose, and risk factors in psoriasis patients. Dermatol Ther 2022;35:e15203. Crossref
26. Wong GL, Chan HL, Choi PC, et al. Association between anthropometric parameters and measurements of liver stiffness by transient elastography. Clin Gastroenterol Hepatol 2013;11:295-302.e1-3. Crossref
27. Rodríguez-Zúñiga MJ, García-Perdomo HA. Systematic review and meta-analysis of the association between psoriasis and metabolic syndrome. J Am Acad Dermatol 2017;77:657-66.e8. Crossref
28. Malatjalian DA, Ross JB, Williams CN, Colwell SJ, Eastwood BJ. Methotrexate hepatotoxicity in psoriatics: report of 104 patients from Nova Scotia, with analysis of risks from obesity, diabetes and alcohol consumption during long term follow-up. Can J Gastroenterol 1996;10:369-75. Crossref
29. Wong GL. Non-invasive assessments for liver fibrosis: the crystal ball we long for. J Gastroenterol Hepatol 2018;33:1009-15. Crossref
30. Cusi K, Isaacs S, Barb D, et al. American Association of Clinical Endocrinology Clinical Practice Guideline for the diagnosis and management of nonalcoholic fatty liver disease in primary care and endocrinology clinical settings: co-sponsored by the American Association for the Study of Liver Diseases (AASLD). Endocr Pract 2022;28:528-62. Crossref
31. Chalasani N, Younossi Z, Lavine JE, et al. The diagnosis and management of nonalcoholic fatty liver disease: practice guidance from the American Association for the Study of Liver Diseases. Hepatology 2018;67:328-57. Crossref
32. European Association for the Study of the Liver; Clinical Practice Guideline Panel; Berzigotti A, et al. EASL Clinical Practice Guidelines on non-invasive tests for evaluation of liver disease severity and prognosis—2021 update. J Hepatol 2021;75:659-89. Crossref

Comparison of clinical characteristics between ACOSOG Z0011–eligible cohort and sentinel lymph node–positive breast cancer patients in Hong Kong

Hong Kong Med J 2024 Apr;30(2):139–46 | Epub 25 Mar 2024
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Comparison of clinical characteristics between ACOSOG Z0011–eligible cohort and sentinel lymph node–positive breast cancer patients in Hong Kong
Vivian Man, FCSHK, FRCSEd; Ava Kwong, FCSHK, FRCS
Department of Surgery, Queen Mary Hospital, Hong Kong SAR, China
 
Corresponding author: Prof Ava Kwong (avakwong@hku.hk)
 
 Full paper in PDF
 
Abstract
Introduction: The American College of Surgeons Oncology Group (ACOSOG) Z0011 trial resulted in de-escalation of axillary surgery among early-stage breast cancer patients with low-volume sentinel lymph node (SLN) disease undergoing breast-conserving surgery and radiation therapy. Nevertheless, the mastectomy rate in the Chinese population remains high. This study compared the clinical characteristics of the ACOSOG Z0011–eligible cohort with SLN-positive breast cancer patients in Hong Kong.
 
Methods: This retrospective analysis of a prospectively maintained database at a university-affiliated breast cancer centre in Hong Kong was performed from June 2014 to May 2019. The database included all patients with clinical tumour (T) stage T1 or T2 invasive breast carcinoma, no palpable adenopathy, one or two positive SLNs on histological examination, and no prior neoadjuvant systemic treatment. Comparisons were made between the mastectomy and breast-conserving treatment groups in our cohort, along with the sentinel-alone arm in the ACOSOG Z0011 trial.
 
Results: One hundred and seventy-one patients met the inclusion criteria: 112 underwent mastectomy and 59 underwent breast-conserving treatment. Our mastectomy group had higher prevalences of T2 tumours (P<0.001), lymphovascular invasion (P<0.001), and SLN macrometastases (P=0.004) compared with the ACOSOG Z0011 cohort. However, in our patient population, mean pathological size slightly differed between the mastectomy and breast-conserving treatment groups (2.2 cm vs 1.8 cm; P=0.005). Other histopathological features were similar.
 
Conclusion: This study demonstrated that clinicopathological features were comparable between SLN-positive breast cancer patients undergoing mastectomy and those undergoing breast-conserving treatment. Low-risk SLN-positive mastectomy patients may safely avoid completion axillary lymph node dissection.
 
 
New knowledge added by this study
  • Despite the high rate of mastectomy in Hong Kong, a small proportion of node-positive breast cancer patients met the American College of Surgeons Oncology Group (ACOSOG) Z0011 eligibility criteria to forgo axillary lymph node dissection.
  • Sentinel lymph node–positive breast cancer patients undergoing mastectomy displayed clinicopathological features similar to those undergoing breast-conserving treatment in Hong Kong.
Implications for clinical practice or policy
  • By expanding the AMAROS trial (After Mapping of the Axilla: Radiotherapy Or Surgery?) eligibility to include ACOSOG Z0011–ineligible mastectomy patients, more patients could avoid axillary lymph node dissection with adjuvant radiotherapy, potentially reducing morbidity.
  • Further studies are necessary to explore when adjuvant axillary radiotherapy is indicated among mastectomy patients with low axillary nodal burden.
 
 
Introduction
The evolution of optimal axillary management for breast cancer patients has led to emphasis on the de-escalation of axillary surgery and minimisation of surgical morbidity. Favourable results from the American College of Surgeons Oncology Group (ACOSOG) Z0011 phase 3 randomised clinical trials have redefined the indications for completion axillary lymph node dissection (ALND) in patients with positive sentinel lymph nodes (SLNs). Early-stage breast cancer patients who undergo upfront breast-conserving surgery and have one or two positive SLNs can safely forgo ALND while maintaining good overall survival and disease-free survival.1 2 Consequently, the ASCO (American Society of Clinical Oncology)3 and the NCCN (National Comprehensive Cancer Network)4 have revised their clinical practice guidelines to recommend against completion ALND in this subset of patients. Although this guidance has led to a significant decline in the rate of completion ALND among ACOSOG Z0011–eligible patients,5 6 7 a similar reduction was observed among patients undergoing mastectomy.8 9 This reduction was particularly pronounced among patients with SLN micrometastases.8 Further evidence was obtained in the phase 3 IBCSG (International Breast Cancer Study Group) 23-01 randomised controlled trials, where approximately 10% of patients with SLN micrometastases underwent mastectomy; subgroup analysis demonstrated that disease-free survival among patients without axillary dissection was non-inferior to those with axillary dissection after 10 years of follow-up.10 11 Similarly, in the AMAROS trial (After Mapping of the Axilla: Radiotherapy Or Surgery?), 17% of patients with tumour (T) staging T1 to T2 primary breast cancer underwent mastectomy.12 Axillary radiotherapy led to an oncological outcome comparable to completion ALND but was associated with a lower rate of lymphoedema.
 
In Hong Kong, factors such as the relatively small breast sizes among Chinese women13 and more conservative cultural attitudes13 14 have contributed to a higher rate of mastectomy. The decision to perform mastectomy has prevented a substantial number of breast cancer patients from meeting the ACOSOG Z0011 criteria. Our previous study evaluated the applicability of ACOSOG Z0011 criteria in Hong Kong.15 Patients with clinical nodal (N) staging N0 breast cancer and one or more positive SLNs were stratified into eligible and ineligible groups according to the ACOSOG Z0011 criteria, with 93% of patients in the ineligible group underwent mastectomy.15 Importantly, only 24% of patients in that study met the ACOSOG Z0011 criteria and could potentially avoid ALND.15 Therefore, it is important to identify a low-risk subset of SLN-positive mastectomy patients who could benefit from this non-ALND approach. This retrospective study was conducted to compare the clinical characteristics of SLN-positive breast cancer patients in Hong Kong with the ACOSOG Z0011–eligible cohort.
 
Methods
Patient recruitment
This retrospective analysis of a prospectively maintained database was conducted at Queen Mary Hospital, a university-affiliated tertiary breast cancer centre in Hong Kong, from June 2014 to May 2019. Potentially eligible patients in the database were identified by an independent research assistant according to whether they met the ACOSOG Z0011 criteria, irrespective of breast surgery type. Patients were excluded if they had positive non-SLNs or positive SLNs only detected by immunohistochemical staining. Relevant data were extracted in July 2020 and missing information was verified using the Clinical Management System, a central computer system for medical records across public hospitals in Hong Kong. Recruited patients were divided into two groups, namely, the mastectomy group and the breast-conserving treatment (ie, ACOSOG Z0011–eligible) group.
 
Clinical management and pathological assessment
All breast cancer patients underwent mammography and ultrasound of the breasts and axillae for clinical tumour and nodal staging. Sentinel lymph node biopsy (SLNB), offered to patients with clinically node-negative disease, was performed with a dual tracer of radioisotope and patent blue dye. Sentinel lymph nodes were defined as lymph nodes with ex vivo gamma probe counts exceeding 10% of the highest ex vivo reading or lymph nodes that displayed blue staining. Non-SLNs were defined as suspicious nodes that were neither hot (high gamma probe counts) nor blue-stained during SLNB, or nodes that were removed during completion ALND. During the study period, intraoperative frozen sections of SLNs or suspicious non-SLNs were routinely collected; these were analysed by standard haematoxylin and eosin staining. Immunohistochemistry was performed in cases of suspected nodal metastasis. Completion ALND was conducted if frozen or paraffin sections showed evidence of nodal metastasis. All final pathological results were reviewed in multidisciplinary meetings. The pathologies of SLNs were considered normal or containing one of the following: macrometastases (>2 mm), micrometastases (>0.2 to ≤2 mm), or isolated tumour cells (≤0.2 mm). For patients undergoing breast-conserving surgery, ‘no ink on tumour’ was regarded as an adequate resection margin16; alternatively, a second operation was performed to ensure a clear resection margin. Adjuvant treatment was administered by breast oncology specialists according to decisions made in multidisciplinary meetings.
 
Statistical analysis
Patient demographic characteristics and tumour characteristics were retrieved from database records; percentages were calculated. Missing information was evaluated and managed by pairwise deletion. Comparisons were made between the mastectomy and breast-conserving treatment groups in our cohort, along with the sentinel-alone arm in the ACOSOG Z0011 trial (n=436, in intention to treat).1 2 Analyses followed the per-protocol approach and calculations were performed with SPSS software (Windows version 24.0; IBM Corp, Armonk [NY], United States). Comparisons between cohorts were conducted with Student’s t test or the Chi squared test, as appropriate. Human epidermal growth factor receptor 2 (HER2) status was not assessed in the ACOSOG Z0011 study; therefore, HER2 statuses were only compared within our cohort. The Memorial Sloan Kettering Cancer Center (MSKCC) breast cancer nomogram,17 a well-validated prediction tool to assess the likelihood of non–sentinel node metastases18 19 20 (including external validation in the Chinese population19 20), was used to calculate probability through an online calculator that considered nine variables; comparisons were made between the breast-conserving treatment and mastectomy groups. P values <0.05 were considered statistically significant.
 
Results
In our centre, the ACOSOG Z0011 criteria have been used to manage patients undergoing breast-conserving surgery since June 2019. From June 2014 to May 2019, 1249 breast cancer patients underwent SLNB in our institution; 171 patients (13.7%) met the study inclusion criteria of clinical T1 or T2 invasive breast cancer and one or two positive SLNs. One hundred and twelve patients (65.5%) underwent mastectomy and 59 patients (34.5%) underwent breast-conserving treatment. The median follow-up period was 58 months (range, 25-84).
 
Our mastectomy group versus the sentinel-alone arm in the ACOSOG Z0011 trial
Patient demographic characteristics and tumour characteristics of our mastectomy group and the sentinel-alone arm in the ACOSOG Z0011 trial are presented in Table 1. Invasive ductal carcinoma was more common in our patient population than in the ACOSOG Z0011 group. A higher prevalence of clinical T2 breast cancers (~50%) was observed in our mastectomy group (P<0.001). There were also significantly more patients with lymphovascular invasion in our cohort than in the sentinel-alone arm in the ACOSOG Z0011 trial (P<0.001). Although nearly half of the original ACOSOG Z0011 cohort had micrometastatic SLNs, approximately 70% of mastectomy patients had macrometastatic SLNs (P=0.004). These findings suggested that the clinicopathological profile was more aggressive in patients requiring mastectomy.
 

Table 1. Clinical characteristics of patients with mastectomy in this study and the sentinel-alone arm in the ACOSOG (American College of Surgeons Oncology Group) Z0011 trial
 
Our mastectomy group versus our breast-conserving treatment group
In our patient cohort, the mastectomy group exhibited many clinicopathological characteristics similar to the breast-conserving treatment group (Table 2). There were no statistically significant differences in terms of age, tumour grade, lymphovascular invasion status, oestrogen receptor/progesterone receptor status, or HER2 status. The mastectomy group had relatively larger tumours than the breast-conserving treatment group (mean: 2.2 cm vs 1.8 cm; P=0.005). Although the difference was not statistically significant, the mastectomy group tended to have larger proportions of patients with two metastatic SLNs (24.1% vs 13.6%; P=0.1) and SLN macrometastases (70.5% vs 57.6%; P=0.11) than the breast-conserving treatment group. Furthermore, the MSKCC probability for additional metastatic non-SLNs was slightly higher in the mastectomy group than in the breast-conserving treatment group (37.1% vs 31.4%; P=0.03) [Table 2].
 

Table 2. Clinical characteristics of patients with mastectomy and breast-conserving treatment in this study
 
Ninety-seven patients (86.6%) in the mastectomy group and 45 patients (76.3%) in the breast-conserving treatment group underwent completion ALND. Among patients who underwent mastectomy and completion ALND, 26 patients (26.8%) had additional non-SLN metastases (range, 1-18). In contrast, eight patients (17.8%) in the breast-conserving treatment group had additional non-SLN metastases (range, 1-8). There was no statistically significant difference in the rate of non-SLN metastases between the two treatment arms (P=0.24). Twenty-nine patients (17.9%) underwent SLNB alone; 15 of these patients were in the mastectomy group. Most patients with SLNB alone had micrometastatic SLNs (89.7%) and one patient had isolated tumour cells. None of the patients with SLNB alone experienced recurrence.
 
Adjuvant treatment
In the mastectomy group, 97 patients (86.6%) underwent post-mastectomy irradiation targeting the chest wall and third field regional nodes. Third field regional nodes refer to level III axillary and supraclavicular lymph node regions. None of these patients developed chest wall or axillary recurrence during the follow-up period. Among the 15 patients who did not undergo post-mastectomy irradiation, eight (53.3%) had micrometastatic SLNs and six (40.0%) had macrometastatic SLNs. There were two recurrences (13.3%). First, a 38-year-old patient with one macrometastatic SLN developed ipsilateral chest wall recurrence 4 years after the index operation; this recurrence was managed by a second operation. Second, a patient with two macrometastatic SLNs refused adjuvant systemic treatment and died of breast cancer&dash;related distant metastases. One hundred and ten patients in the mastectomy group (98.2%) received adjuvant systemic treatment: 10 patients (8.9%) required chemotherapy only, 22 patients (19.6%) required hormonal treatment only, and 78 patients (69.6%) required both of these treatments. Seven patients (6.3%) in the mastectomy group developed distant recurrence, and there were three (2.7%) breast cancer–related deaths.
 
In the breast-conserving treatment group, 58 of the 59 patients underwent adjuvant whole-breast irradiation; 61.0% of these patients underwent additional third field nodal irradiation. Fifty-eight patients (98.3%) in the breast-conserving treatment group received adjuvant systemic treatment involving hormonal therapy and/or chemotherapy. Three patients (5.1%) had distant recurrence; among them, one (1.7%) died at 39 months after the initial diagnosis. One patient experienced ipsilateral breast recurrence at 30 months and underwent completion mastectomy.
 
Discussion
The favourable oncological results of the ACOSOG Z0011 trial1 2 have challenged the conventional approach of performing completion ALND in patients with SLN metastases. Patients with one or two SLN metastases who underwent breast-conserving surgery, whole-breast irradiation, and adjuvant systemic treatment could safely forgo completion ALND. This paradigm shift has led to substantial de-escalation of axillary surgery worldwide.5 A meta-analysis by Schmidt-Hansen et al,21 which involved 2020 patients and findings from the IBCSG 23-0110 11 and the AATRM (Agència d’Avaluació de Tecnologia i Recerca Mèdiques) 048/13/200022 trials, concluded that SLNB alone was sufficient for locoregional control in early breast cancer, without adverse effects on survival.
 
Limitations of the ACOSOG Z0011 study
Despite widespread adoption of the ACOSOG Z0011 criteria, the study has been criticised in several ways. The low locoregional relapse rate of 1.5% indicates that the study was underpowered.23 Furthermore, significant deviation in the radiotherapy protocol, such that 18.9% of patients received ‘high tangents’ radiotherapy, has raised questions concerning the oncological safety of SLNB alone in patients without third field nodal irradiation.24 Combined with the insufficient numbers of mastectomy patients in the IBCSG 23-01,10 11 AMAROS,12 and AATRM 048/13/200022 trials, it has been unclear whether this non-ALND approach can be extrapolated to SLN-positive breast cancer patients who undergo mastectomy with or without radiotherapy.
 
Aggressive tumour characteristics among mastectomy patients and local or regional failure rate
In this study, we compared the clinicopathological characteristics among our mastectomy group, our breast-conserving treatment group, and the sentinel-alone arm in the original ACOSOG Z0011 study. Unsurprisingly, our mastectomy group exhibited more aggressive tumour characteristics than the sentinel-alone arm in the Western population; specifically, it had a larger tumour size, more frequent lymphovascular invasion, and a greater proportion of patients with SLN macrometastases. These differences in clinicopathological features have also been reported in Western populations. For example, Hennigs et al8 analysed a large German cohort that included 4093 SLN-positive mastectomy patients. Compared with the entire study cohort of 166 074 patients, T2 tumour and lymphovascular invasion were more commonly found in patients requiring mastectomy. Additionally, the study by Milgrom et al25 included 535 early-stage breast cancer patients with a positive SLNB and no ALND. In their mastectomy group, patients had significantly larger tumours and more frequently displayed multifocal/multicentric disease. However, these adverse pathological features among mastectomy patients did not justify a more aggressive axillary approach. Similarly, the low rates of local and regional failure observed in our cohort were consistent with previous reports, suggesting that axillary-specific treatment can be considered in this group of patients with low-volume SLN disease.25 26 27 28 Debate persists regarding the comparatively large proportions of patients with micrometastatic disease in the original ACOSOG Z0011 trial1 2 and other studies.25 26 Cowher et al29 published a retrospective analysis of patients who underwent mastectomy and conservative axillary regional excision (ie, removal of SLNs and other palpable nodes). Among 144 patients with pathological N1 disease, a small proportion (24%) had micrometastatic disease; only three axillary recurrences (2.1%) were reported.29 Notably, the low locoregional failure rate was not attributed to post-mastectomy irradiation25 26 27 28 29 or increased use of chemotherapy.26 27 28
 
Intrinsic differences in tumour characteristics between different patient populations
In our previous study, we demonstrated differences in clinical characteristics between Asian and Western populations.15 In the present study, our breast-conserving treatment group had a higher rate of clinical T2 tumours and more frequent lymphovascular invasion compared with the Western population. Similar findings were observed in Korean30 and Japanese31 studies, which revealed larger and higher-grade tumours, increased lymphovascular permeation, and more frequent SLN macrometastases. Despite these disparities, the Korean30 and Japanese31 studies both demonstrated safe application of ACOSOG Z0011 criteria in Asia, with low incidences of disease recurrence. These intrinsic differences in tumour characteristics between Eastern and Western populations have presumably reduced the gap in clinicopathological features between patients undergoing mastectomy and those undergoing breast-conserving surgery. In the head-to-head comparison between our mastectomy cohort and our breast-conserving treatment group, the only notable difference involved the mean pathological size of the invasive focus (2.2 cm vs 1.8 cm; P=0.005); the clinical tumour stage distribution did not differ (P=0.69) [Table 2]. The small difference in mean MSKCC breast cancer nomogram probability (37.1% vs 31.4%; P=0.03) could also be related to the difference in pathological size, which is one of the nine variables considered in the nomogram. Therefore, we believe that a non-ALND approach in this low-risk subset of SLN-positive mastectomy patients is acceptable.
 
Residual non–sentinel lymph node metastasis in non–axillary lymph node dissection approach
The primary concern regarding extrapolation of this non-ALND approach is the risk of undertreatment for patients with an extensive nodal burden. The original ACOSOG Z0011 trial revealed a non-SLN macrometastasis rate of 27.3% in the ALND group.1 2 The AMAROS trial also showed that 33% of patients in the ALND group had additional positive lymph nodes.12 Importantly, the axillary recurrence rate remained low in both of these studies. In our SLN-positive mastectomy and breast-conserving treatment groups, the proportions of patients with additional non-SLN metastases were 26.8% and 17.8%, respectively. Among patients undergoing adjuvant irradiation and adjuvant systemic treatment, it is likely that some non-SLN metastases do not progress to clinically detectable disease.
 
Limitations of this study
This study had several limitations. First, its retrospective design could result in recall bias and the potential for missing clinical information. Although data from the ACOSOG Z0011 trial were limited with respect to HER2 status, extracapsular extension, and multifocality, we attempted to mitigate this issue by including some of the affected variables in the comparison of our mastectomy and breast-conserving treatment groups. Second, we could not address the need for post-mastectomy irradiation among patients in this study. The value of such irradiation for breast cancer patients with <4 positive lymph nodes remains controversial. The meta-analysis by the Early Breast Cancer Trialists’ Collaborative Group,32 which included 1314 breast cancer patients with one to three positive nodes after mastectomy and ALND, suggested that radiotherapy provided oncological benefit in terms of locoregional recurrence, overall recurrence, and breast cancer mortality. However, this meta-analysis has been criticised for including some very early studies from the 1970s, in which the reported recurrence rates were much higher than rates in later studies. In 2016, a focused update by the American Society of Clinical Oncology, American Society for Radiation Oncology, and Society of Surgical Oncology acknowledged the use of post-mastectomy radiotherapy for this group of patients but recommended clinical judgement for patients with a low risk of locoregional recurrence.33 In our centre, post-mastectomy irradiation was generally administered to patients with pathological N1 disease during the study period; 86.6% of patients in the present study underwent adjuvant radiotherapy. Considering the similarities in clinicopathological features and adjuvant systemic treatment use between our SLN-positive mastectomy and breast-conserving treatment groups, we suspect that it is safe for selected low-risk SLN-positive mastectomy patients to forgo ALND through the expansion of AMAROS eligibility12 to ACOSOG Z0011–ineligible patients. Several ongoing randomised studies, such as the English POSNOC (POsitive Sentinel NOde: adjuvant therapy alone versus adjuvant therapy plus Clearance or axillary radiotherapy)34 and the Dutch BOOG 2013-07,35 are recruiting breast cancer patients who undergo mastectomy and have a maximum of two to three positive SLNs; these studies aim to compare completion axillary treatment (ALND or axillary radiotherapy) and the lack of completion axillary treatment. Additionally, the SINODAR-ONE trial36 recently published their subgroup analysis and found non-inferior overall survival and recurrence-free survival among mastectomy patients receiving SLNB and ALND. The ongoing studies are expected to provide more robust evidence concerning the optimal treatment for SLN-positive mastectomy patients.
 
Conclusion
This study demonstrated the clinicopathological similarities between SLN-positive mastectomy and breast-conserving treatment groups among breast cancer patients in Hong Kong. Cautious application of the non-ALND approach in mastectomy patients with low-volume SLN disease is reasonable, considering the low locoregional recurrence rate. However, additional research is needed to standardise the adjuvant post-mastectomy radiotherapy protocol, especially among patients who forego ALND.
 
Author contributions
Concept or design: V Man.
Acquisition of data: V Man.
Analysis or interpretation of data: V Man.
Drafting of the manuscript: Both authors.
Critical revision of the manuscript for important intellectual content: A Kwong.
 
Both authors had full access to the data, contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
Both authors have disclosed no conflicts of interest.
 
Declaration
This study has been presented and awarded the Young Investigator Award Best Scientific Paper in the Hong Kong Society of Breast Surgeons 5th Annual Scientific Meeting (19 September 2021, Hong Kong).
 
Funding/support
This study received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
 
Ethics approval
This study was approved by the Institutional Review Board of The University of Hong Kong/Hospital Authority Hong Kong West Cluster, Hong Kong (Ref No.: HKU/HA HKW UW 09-045). Written informed consent was obtained from patients for all treatments, procedures, and publication.
 
References
1. Giuliano AE, Hunt KK, Ballman KV, et al. Axillary dissection vs no axillary dissection in women with invasive breast cancer and sentinel node metastasis: a randomized clinical trial. JAMA 2011;305:569-75. Crossref
2. Guiliano AE, Ballman KV, McCall L, et al. Effect of axillary dissection vs no axillary dissection on 10-year overall survival among women with invasive breast cancer and sentinel node metastasis: the ACOSOG Z0011 (Alliance) randomized clinical trial. JAMA 2017;318:918-26. Crossref
3. Lyman GH, Termin S, Edge SB, et al. Sentinel lymph node biopsy for patients with early-stage breast cancer: American Society of Clinical Oncology clinical practice guideline update. J Clin Oncol 2014;32:1365-83. Crossref
4. Gradishar WJ, Anderson BO, Balassanian R, et al. NCCN guidelines insights: breast cancer, version 1.2017. J Natl Compr Canc Netw 2017;15:433-51. Crossref
5. Camp MS, Greenup RA, Taghian A, et al. Application of ACOSOG Z0011 criteria reduces perioperative costs. Ann Surg Oncol 2013;20:836-41. Crossref
6. Delpech Y, Bricou A, Lousquy R, et al. The exportability of the ACOSOG Z0011 criteria for omitting axillary lymph node dissection after positive sentinel lymph node biopsy findings: a multicenter study. Ann Surg Oncol 2013;20:2556-61. Crossref
7. Verheuvel NC, Voogd AC, Tjan-Heijnen VC, Roumen RM. Potential impact of application of Z0011 derived criteria to omit axillary lymph node dissection in node positive breast cancer patients. Eur J Surg Oncol 2016;42:1162-8. Crossref
8. Hennigs A, Riedel F, Feißt M, et al. Evolution of the use of completion axillary lymph node dissection in patients with T1/2N0M0 breast cancer and tumour-involved sentinel lymph nodes undergoing mastectomy: a cohort study. Ann Surg Oncol 2019;26:2435-43. Crossref
9. Poodt IG, Spronk PE, Vugts G, et al. Trends on axillary surgery in nondistant metastatic breast cancer patients treated between 2011 and 2015: a Dutch population-based study in the ACOSOG-Z0011 and AMAROS era. Ann Surg 2018;268:1084-90. Crossref
10. Galimberti V, Cole BF, Zurrida S, et al. Axillary dissection versus no axillary dissection in patients with sentinel-node micrometastases (IBCSG 23-01): a phase 3 randomised controlled trial. Lancet Oncol 2013;14:297-305. Crossref
11. Galimberti V, Cole BF, Viale G, et al. Axillary dissection versus no axillary dissection in patients with breast cancer and sentinel-node micrometastases (IBCSG 23-01): 10-year follow-up of a randomised, controlled phase 3 trial. Lancet Oncol 2018;19:1385-93. Crossref
12. Donker M, van Tienhoven G, Straver ME, et al. Radiotherapy or surgery of the axilla after a positive sentinel node in breast cancer (EORTC 10981-22023 AMAROS): a randomised, multicentre, open-label, phase 3 non-inferiority trial. Lancet Oncol 2014;15:1303-10. Crossref
13. Sinnadurai S, Kwong A, Hartman M, et al. Breast-conserving surgery versus mastectomy in young women with breast cancer in Asian settings. BJS Open 2018;3:48-55. Crossref
14. Chan SW, Cheung C, Chan A, Cheung PS. Surgical options for Chinese patients with early invasive breast cancer: data from the Hong Kong Breast Cancer Registry. Asian J Surg 2017;40:444-52.Crossref
15. Man V, Lo MS, Kwong A. The applicability of the ACOSOG Z0011 criteria to breast cancer patients in Hong Kong. Chin Clin Oncol 2021;10:27. Crossref
16. Moran MS, Schnitt SJ, Giuliano AE, et al. Society of Surgical Oncology–American Society for Radiation Oncology consensus guideline on margins for breast-conserving surgery with whole-breast irradiation in stages I and II invasive breast cancer. Ann Surg Oncol 2014;21:704-16. Crossref
17. Memorial Sloan Kettering Cancer Center. Breast cancer nomogram: breast additional non-SLN metastases. Available from: https://nomograms.mskcc.org/breast/BreastAdditionalNonSLNMetastasesPage.aspx. Accessed 19 Mar 2024.
18. Gur AS, Unal B, Ozbek U, et al. Validation of breast cancer nomograms for predicting the non–sentinel lymph node metastases after a positive sentinel lymph node biopsy in a multi-center study. Eur J Surg Oncol 2010;36:30-5. Crossref
19. Kuo YL, Chen WC, Yao WJ, et al. Validation of Memorial Sloan-Kettering Cancer Center nomogram for prediction of non–sentinel lymph node metastasis in sentinel lymph node positive breast cancer patients: an international comparison. Int J Surg 2013;11:538-43. Crossref
20. Bi X, Wang Y, Li M, et al. Validation of the Memorial Sloan Kettering Cancer Center nomogram for predicting non–sentinel lymph node metastasis in sentinel lymph node–positive breast-cancer patients. Onco Targets Ther 2015;8:487-93. Crossref
21. Schmidt-Hansen M, Bromham N, Hasler E, Reed MW. Axillary surgery in women with sentinel node–positive operable breast cancer: a systematic review with meta-analyses. Springerplus 2016;5:85. Crossref
22. Solá M, Alberro JA, Fraile M, et al. Complete axillary lymph node dissection versus clinical follow-up in breast cancer patients with sentinel node micrometastasis: final results from the multicenter clinical trial AATRM 048/13/2000. Ann Surg Oncol 2013;20:120-7. Crossref
23. Huang TW, Kuo KN, Chen KH, et al. Recommendation for axillary lymph node dissection in women with early breast cancer and sentinel node metastasis: a systematic review and meta-analysis of randomized controlled trials using the GRADE system. Int J Surg 2016;34:73-80. Crossref
24. Jagsi R, Chadha M, Moni J, et al. Radiation field design in the ACOSOG Z0011 (Alliance) trial. J Clin Oncol 2014;32:3600-6. Crossref
25. Milgrom S, Cody H, Tan L, et al. Characteristics and outcomes of sentinel node–positive breast cancer patients after total mastectomy without axillary-specific treatment. Ann Surg Oncol 2012;19:3762-70. Crossref
26. FitzSullivan E, Bassett RL, Kuerer HM, et al. Outcomes of sentinel lymph node–positive breast cancer patients treated with mastectomy without axillary therapy. Ann Surg Oncol 2017;24:652-9. Crossref
27. Snow R, Reyna C, Johns C, et al. Outcomes with and without axillary node dissection for node-positive lumpectomy and mastectomy patients. Am J Surg 2015;210:685-93. Crossref
28. Joo JH, Kim SS, Son BH, et al. Axillary lymph node dissection does not improve post-mastectomy overall or disease-free survival among breast cancer patients with 1-3 positive nodes. Cancer Res Treat 2019;51:1011-21. Crossref
29. Cowher MS, Grobmyer SR, Lyons J, O’Rourke C, Baynes D, Crowe JP. Conservative axillary surgery in breast cancer patients undergoing mastectomy: long-term results. J Am Coll Surg 2014;218:819-24. Crossref
30. Jung J, Han W, Lee ES, et al. Retrospectively validating the results of the ACOSOG Z0011 trial in a large Asian Z0011-eligible cohort. Breast Cancer Res Treat 2019;175:203-15. Crossref
31. Kittaka N, Tokui R, Ota C, et al. A prospective feasibility study applying the ACOSOG Z0011 criteria to Japanese patients with early breast cancer undergoing breast-conserving surgery. Int J Clin Oncol 2018;23:860-6. Crossref
32. EBCTCG (Early Breast Cancer Trialists’ Collaborative Group); McGale P, Taylor C, et al. Effect of radiotherapy after mastectomy and axillary surgery on 10-year recurrence and 20-year breast cancer mortality: meta-analysis of individual patient data for 8135 women in 22 randomised trials. Lancet 2014;383:2127-35. Crossref
33. Recht A, Comen EA, Fine RE, et al. Postmastectomy radiotherapy: an American Society of Clinical Oncology, American Society for Radiation Oncology, and Society of Surgical Oncology focused guideline update. Pract Radiat Oncol 2016;6:e219-34. Crossref
34. Goyal A, Dodwell D. POSNOC: a randomised trial looking at axillary treatment in women with one or two sentinel nodes with macrometastases. Clin Oncol (R Coll Radiol) 2015;27:692-5. Crossref
35. van Roozendaal LM, de Wilt JH, van Dalen T, et al. The value of completion axillary treatment in sentinel node positive breast cancer patients undergoing a mastectomy: a Dutch randomized controlled multicentre trial (BOOG 2013-07). BMC Cancer 2015;15:610. Crossref
36. Tinterri C, Canavese G, Gatzemeier W, et al. Sentinel lymph node biopsy versus axillary lymph node dissection in breast cancer patients undergoing mastectomy with one to two metastatic sentinel lymph nodes: sub-analysis of the SINODAR-ONE multicentre randomized clinical trial and reopening of enrolment. Br J Surg 2023;110:1143-52. Crossref

Exploration of clinical and ethical issues in an expanded newborn metabolic screening programme: a qualitative interview study of healthcare professionals in Hong Kong

Hong Kong Med J 2024 Apr;30(2):120–9 | Epub 9 Apr 2024
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Exploration of clinical and ethical issues in an expanded newborn metabolic screening programme: a qualitative interview study of healthcare professionals in Hong Kong
Olivia MY Ngan, PhD1,2; Ching Janice Tam, MSc (Medical Genetics), BNurs3; CK Li, MD, FRCPCH4,5
1 Medical Ethics and Humanities Unit, School of Clinical Medicine, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
2 Centre for Medical Ethics and Law, The University of Hong Kong, Hong Kong SAR, China
3 Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong SAR, China
4 Department of Paediatrics, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong SAR, China
5 Hong Kong Hub of Paediatric Excellence, The Chinese University of Hong Kong, Hong Kong SAR, China
 
Corresponding author: Dr Olivia MY Ngan (olivian1@hku.hk)
 
 Full paper in PDF
 
Abstract
Introduction: The Newborn Screening Programme for Inborn Errors of Metabolism (NBSIEM) enables early intervention and prevents premature mortality. Residual dried bloodspots (rDBS) from the heel prick test are a valuable resource for research. However, there is minimal data regarding how stakeholders in Hong Kong view the retention and secondary use of rDBS. This study aimed to explore views of the NBSIEM and the factors associated with retention and secondary use of rDBS among healthcare professionals in Hong Kong.
 
Methods: Between August 2021 and January 2022, semi-structured interviews were conducted with 30 healthcare professionals in obstetrics, paediatrics, and chemical pathology. Key themes were identified through thematic analysis, including views towards the current NBSIEM and the retention and secondary use of rDBS.
 
Results: After implementation of the NBSIEM, participants observed fewer patients with acute decompensation due to undiagnosed inborn errors of metabolism. The most frequently cited clinical utilities were early detection and improved health outcomes. Barriers to rDBS storage and its secondary use included uncertain value and benefits, trust concerns, and consent issues.
 
Conclusion: This study highlighted healthcare professionals’ concerns about the NBSIEM and uncertainties regarding the handling or utilisation of rDBS. Policymakers should consider these concerns when establishing new guidelines.
 
 
New knowledge added by this study
  • After implementation of the Newborn Screening Programme for Inborn Errors of Metabolism, participants observed fewer patients with acute decompensation due to undiagnosed inborn errors of metabolism.
  • The obligation to know more about a child’s health and the drive for an altruistic contribution to science were factors supporting the retention of residual dried bloodspots (rDBS) for secondary research use.
  • Uncertain value and benefits of rDBS, along with concerns regarding trust, privacy, and consent, were cited as barriers to the retention of rDBS for secondary research use.
Implications for clinical practice or policy
  • The retention of rDBS requires inherent trust based on public support, with strict clinical and ethical parameters.
  • Concerns about privacy and consent issues related to genomic information should be addressed before next-generation sequencing is integrated into clinical care for newborns.
 
 
Introduction
Inborn errors of metabolism (IEM) are rare genetic diseases arising from congenital deficiencies of certain enzymes or cofactors. The accumulation of excessive toxic substances and the absence of essential metabolites may damage vital organs, impair normal metabolism, or increase risks of morbidity and mortality. A small proportion of IEM cases can be diagnosed and treated early through dietary interventions. Patients substantially benefit from early diagnosis and appropriate disease monitoring.
 
The incidence of IEM in Hong Kong is 1 in 1682 newborns.1 In response to public health concerns, a territory-wide free voluntary Newborn Screening Programme for IEM (NBSIEM) was implemented for all newborns born in public birthing units, beginning in 2017.2 This programme covers 27 conditions, including severe combined immunodeficiency (SCID).3 Spots of blood are collected from newborns within 24 to 72 hours after birth, preferably following 24 hours of milk feeding, using a heel prick test; these samples are discarded after hospital laboratories perform quality control and assurance monitoring (online supplementary Table 1).
 
The materials on dried bloodspots provide clinical benefits and lifelong healthcare research opportunities that are advantageous to individuals and the population. However, the retention and use of residual DBS (rDBS) has led to controversies regarding privacy, transparency, consent, misuse of, and unauthorised access to information, unclear research purposes, and the absence of data management and governance protocols.4 Despite these concerns, it is common for rDBS to be routinely stored and used for research purposes in some regions. For example, in Denmark, a nationwide newborn screening programme was implemented in 1975; it currently screens for 17 diseases.5 Samples are stored indefinitely with consent in the Danish Newborn Screening Biobank at the State Serum Institute.6 Similarly, a programme in the Netherlands screens for 31 conditions.7 Although participation is voluntary, the participation rate has reached 99.3%.8 In the Netherlands, rDBS samples are stored for 1 year to facilitate quality control. Most samples are stored for an additional 4 years for secondary uses, such as disease-specific biomedical research and patient-specific diagnostic purposes, after the acquisition of parental consent.9 The International Society for Neonatal Screening compared national newborn screening policies, revealing great variation in programme acceptance, consent procedure, storage, and length of storage.8 The Society’s findings highlight the importance of incorporating local views during policy development.
 
Two empirical studies in Hong Kong revealed that parents were unaware of the expanded newborn screening programme and the potential value of rDBS.10 11 The secondary use of rDBS in medical and health research is well-supported, mainly on the basis of altruism. However, participation does not provide direct individual benefits. Factors contributing to parental support towards retention and secondary use of rDBS include parental consent and trust in the relevant authority. If explicit permission is obtained, parents are more willing to contribute their child’s rDBS card. An opt-out approach and broad consent for unspecified use were considered unfavourable options.11 Although multiple rDBS-focused studies in Hong Kong have included public stakeholders, few have assessed the attitudes of healthcare professionals (HCPs)12; none have been conducted since implementation of the territory-wide screening programme. To address this gap, the present study explored views of the NBSIEM and the factors associated with retention and secondary use of rDBS cards among HCPs in Hong Kong.
 
Methods
Sampling and recruitment
Semi-structured interviews were conducted among 30 HCPs in obstetrics, paediatrics, and chemical pathology practising in eight public and two private institutions in Hong Kong between August 2021 and January 2022. Purposeful sampling was used to select key stakeholders involved in the NBSIEM according to disciplines and responsibilities. Initial invitations were sent through the two local medical universities (ie, The University of Hong Kong and The Chinese University of Hong Kong) and associated hospitals. Referrals via snowballing were also performed to recruit additional participants.
 
The study inclusion criteria were academics and HCPs (eg, medical, nursing, and laboratory staff) involved in IEM-related research or clinical work. Individuals who did not meet the criteria were excluded. Overall, this study recruited participants who were involved in recruitment and counselling within the NBSIEM, laboratory data analysis and interpretation, or academic research related to IEM.
 
Each participant received a detailed description of the research. All participants gave written informed consent before taking part in the study. The second author conducted the interviews at locations convenient for participants, such as meeting rooms, offices, and coffee shops. The interview length ranged from 37 to 71 minutes. Upon completion of the interview, each participant received a supermarket voucher for HK$200.
 
Interview guide
A semi-structured topic guide was developed based on existing literature concerning newborn screening and ethical considerations (Table).4 11 12 13 14 15 Prior to data collection, the guide was reviewed by a senior paediatrician to ensure its content validity, relevance, clarity, and cultural sensitivity.
 

Table. Sample interview questions4 11 12 13 14 15
 
Data analysis
Interviews were digitally recorded, transcribed verbatim in the original language (Cantonese), and then translated into English. Transcripts were anonymised and assigned an identification code. The interviewer and another member of the research team reviewed the transcription accuracy. Two independent researchers read and coded the transcripts and audio recordings via thematic analysis, in which textual data were coded and labelled in an inductive manner. New thematic codes that did not fit into predetermined categories were created and refined, as necessary. Codes were compared and discussed among research members until a consensus was reached. Reflexivity was maintained during the discussion and data analysis process. The entire research team identified emerging themes from the early and intermediate stages of interviews, then recruited HCPs to represent each new theme until theoretical saturation was achieved (ie, no new themes emerged in the discussions and existing themes were consistently observed).
 
Results
Interviewee characteristics
Thirty HCPs were recruited and interviewed. The study sample was diverse. Among the interviewees, 15 (50.0%) were doctors and 13 (43.3%) were nurses or midwives; 13 (43.3%) worked in obstetrics and gynaecology and 13 (43.3%) worked in paediatrics; 28 worked in the public sector (93.3%); and 14 (46.7%) had >10 years of clinical experience (online supplementary Table 2). Two major thematic themes were identified, namely, views towards the current NBSIEM and views towards the retention and secondary use of rDBS. Illustrative quotations are used to support the themes.
 
Theme 1: views towards the current newborn screening programme
Perceived clinical utility
The implementation of the NBSIEM was a public health achievement, and interviewees showed a positive attitude towards the programme. Early detection and improved health outcomes were the most commonly reported clinical utility outcomes (n=21, 70.0%). Some interviewees, primarily paediatric HCPs, noted a difference between the periods before and after territory-wide test implementation. Before NBSIEM, it was not uncommon for doctors working in intensive care units to encounter cases of acute decompensation due to undiagnosed IEM. Considering the broad phenotypes involved in IEM, such conditions may not be recognised in early stages.
 
‘I have been working in the ward for ages and observed that many patients with IEM deteriorated to an irreversible stage. With this NBSIEM, we can screen out IEM cases and offer treatment. The patients achieve normal development like others. In other words, the screening helped many people.’ (Interview 25, paediatric nurse)
 
Affected families previously endured a long wait for diagnosis before the NBSIEM; there was a substantial psychological burden involved. Parental distress was observed.
 
‘From hospital admission to disease diagnosis, it takes 2 weeks. The whole process involved hospital transfer from the (suspected IEM clinic) to the specialised team at (Hong Kong) Children’s Hospital, conducting blood investigation, and making a diagnosis.’ (Interview 20, paediatric doctor)
 
Some limitations were noted, including false-positive and false-negative results, as well as call-back rates. Interviewees cited the reduction of recall caseloads as an advantage associated with implementation of second-tier tests.
 
‘It is best if we can eliminate the false-positives. To achieve this purpose, we implement aggressive second-tier testing. Our pathologists also make stringent interpretations. Without pathologists working in laboratories, clinicians can only draw reference from the pre-set levels.’ (Interview 30, pathology doctor)
 
Screening panel
The programme covers 27 conditions, including SCID.3 Interviewees generally agreed that the benefits of screening for lethal IEM conditions outweigh the costs of screening, despite the very low incidence of IEMs. When asked about their views on the current panel, interviewees emphasised that disease selection must be based on public health principles, supported by Wilson and Jungner’s screening criteria.16 Diseases in the panel should be treatable and have a high prevalence in Hong Kong.
 
‘I support the NBSIEM. I believe (the experts) came up with the 27 conditions based on robust considerations, including incidence and prevalence, availability of treatment, mortality prevention, cost-effectiveness, etc. It is beneficial to patients.’ (Interview 13, obstetrics and gynaecology doctor)
 
Healthcare professionals have encountered parents who have completed the publicly funded IEM test and selected an additional private IEM test solely based on the number of conditions. For such parents, the underlying motivation is that ‘screening more conditions is perceived to be more definitive’.
 
If I were a mother with a child, I would like to know whether my child was affected by these diseases. The more conditions the panel includes, the better. One can prevent the onset of disease. Parents are helpless when diseases occur suddenly.’ (Interview 24, obstetrics and gynaecology doctor)
 
‘Some mothers compared the list of conditions between the public and private sectors. I am talking about the difference between 26 and 30 conditions, respectively. They would rather pay out of pocket and send the baby to retake the test in the private sector.’ (Interview 16, obstetrics and gynaecology academic)
 
One paediatrician questioned whether a genetic test with a larger number of conditions contributes to enhanced parental control and confidence regarding the newborn’s health. She was aware of some urine tests available in the direct-to-consumer market that screen for so-called ‘non-diseases’—short-/branched-chain acyl-coenzyme A dehydrogenase and 3-methylcrotonyl-coenzyme A carboxylase deficiency—although such conditions do not require follow-up. She highlighted the importance of periodically reviewing conditions on the panel according to locality-specific factors, including disease prevalence, clinical sensitivity and specificity, treatment, and cost-effectiveness.
 
‘Running an analysis on a rDBS card is not difficult. What is more challenging is the post-analysis follow-up. Compared with other NBSIEMs, the United States screens for the greatest number of conditions, maybe 40, while the United Kingdom screens for five conditions. Instead of adding conditions, should we also consider taking out some (non-disease) conditions from the list?’ (Interview 27, paediatric doctor)
 
Source of information
Three HCPs (10.0%) reported that most Hospital Authority staff received NBSIEM information through departmental seminars and training sessions, which prepared them to complete the consent procedure with parents. When asked about their understanding of IEMs, knowledge levels varied among frontline staff involved in the NBSIEM. Some were uncertain what the test evaluated.
 
‘What is it (NBSIEM) testing for…? Is it checking for chromosomal defects? Or is it checking for lack of (metabolites)?’ (Interview 15, obstetrics and gynaecology nurse)
 
Some also mistakenly thought that the NBSIEM analysed genes.
 
‘It is a filter paper with some dried bloodspots, testing IEM genes.’ (Interview 29, obstetrics and gynaecology doctor)
 
Some frontline staff wanted additional information beyond the procedure. They felt unprepared for questions about the diseases, symptoms, test procedures, and care for patients with IEMs. They felt anxious or uncomfortable explaining these aspects to parents with some level of understanding.
 
‘After implementing this programme, what the patients will undergo, where they will be referred to, what to do with a confirmed diagnosis… to be honest, I learned everything from the protocol. In actual settings, parents asked many practical questions, such as “what to be cautious about during daily care”—I do not know how to answer them. These are not common diseases observed in the ward, but we are asked to counsel parents.’ (Interview 24, obstetrics and gynaecology doctor)
 
A senior doctor responsible for providing educational seminars noted that training should not be an isolated event; periodic refresher training should be provided.
 
‘We provide intensive training for nurses, all of them. (In the training), we give clear explanations, conduct videotaping (for review), and address inquiries and questions. In addition, we also plan to host refresher courses every few years. Hong Kong requires more observation before moving forward.’ (Interview 27, paediatric doctor)
 
Experience with parental counselling and consent procedure
Interviewees felt that the educational pamphlet and consent material are easy to read. During parental counselling, HCPs were prepared to answer parents’ enquiries. Most parents supported the NBSIEM. Parents wanted to identify IEM conditions in their newborns because they felt that early detection could facilitate autonomous decision-making related to their child and other family members.
 
‘I observed that most parents enrol in the NBSIEM as they would like to know sooner if their babies are affected. If a diagnosis is confirmed, we run a genetic test to predict the risk of recurrence. Only a few refuse to take part in the programme.’ (Interview 23, paediatric doctor)
 
Midwives played an important role in obtaining parental consent in antenatal clinics. Refusals of the current NBSIEM are infrequent. Five frontline staff (16.7%) involved in the recruitment process observed that only a small number of parents, who were sceptical about medical interventions or had religious affiliation–based reservations, declined to join the programme.
 
‘Some people who advocate minimal medicalisation do not consent to procedures in our hospital. For example, they refuse vaccinations and vitamin K injections.’ (Interview 10, obstetrics and gynaecology nurse)
 
Participants involved in recruitment recognised that informed consent procedures were intended to enable parents to make informed choices. The current opt-in consent approach allows HCPs to obtain explicit permission from parents. The participants observed that parents, especially Hong Kong Chinese individuals, were vocal about the patient’s right to know.
 
‘Nowadays, patients put a strong emphasis on patient’s rights, thinking that “you need my consent before carrying out a procedure".’ (Interview 5, obstetrics and gynaecology nurse)
 
In particular, six HCPs (20.0%) speculated that opt-in consent was more accepted by parents and thus easier to obtain. It provided parents with a sense of personal control by allowing them to give explicit permission. Opt-in consent has been used in many medical settings. It is more familiar to and accepted by community members with respect to studies of genetic material.
 
‘Must I choose a consent model for handling genetic materials? It would be an opt-in approach.’ (Interview 6, obstetrics and gynaecology doctor)
 
Seven HCPs (23%) felt that consent was needed because of the invasiveness of the procedure, but some HCPs felt that the procedure involved minimal harm.
 
‘(The phlebotomists) perform an invasive procedure on the infant, which may cause discomfort or pain. Opt-in is preferable to an opt-out approach. (Interview 27, paediatric doctor)
 
‘It is just a heel prick test and will not affect the baby. I cannot see the downsides (of the screening).’ (Interview 16, obstetrics and gynaecology academic)
 
Opt-in consent was perceived to be more efficient. There may be opposition to an opt-out approach. Some HCPs (n=6, 20.0%) felt that an opt-out approach would increase sample sizes and contribute to advances in medical research.
 
‘Inborn errors of metabolism would be a prevalent issue, and therefore, opt-out is better than the opt-in approach. Like an human immunodeficiency virus test with an opt-out approach, one can refuse to take the test for a valid reason. There are treatments for IEMs; opt-out is a desirable consent model.’ (Interview 24, obstetrics and gynaecology doctor)
 
One doctor pondered the adoption of different methods in obtaining informed consent because opt-in and opt-out approaches are ‘like two sides of the same coin’. He stated that the key aspect of selecting an appropriate consent approach is the parental counselling process. He also emphasised that the consent procedure is not absent from the opt-out approach and that effective communication remains important.
 
Disclosure of confirmed results of inborn errors of metabolism to affected families
When a confirmed IEM diagnosis was disclosed to an affected family, parents often felt shocked, stressed, and guilty about having an ‘abnormal’ baby. They then began to explore the financial implications; for example, some worried about treatment costs and uncertainties. Counselling is limited to discussing the diagnosis; it also includes psychological support through follow-up care involving a multidisciplinary team.
 
‘Many families were worried when they heard about the IEM diagnosis, as they knew it was a life-long condition. It is tough to handle (bad news). Their child will be different from other peers, and finances will be affected. They must self-finance the drugs.’ (Interview 28, paediatric nurse)
 
On some occasions, the NBSIEM is beneficial to the newborn and has implications for the entire family. Some interviewees reported disclosing an IEM result relevant to the mother, rather than the infant. In one case, the HCP informed the involved family members and provided follow-up care for the newborn’s siblings.
 
‘Sometimes, a secondary finding is related to the mother instead of the child. When maternal blood contamination is present (the baby is not affected), the mother is referred to relevant specialists for medical follow-up.’ (Interview 23, paediatric doctor)
 
‘We had a positive result for citrullinemia deficiency. The newborn had three brothers and a sister with the same disease. Now we are following them.’ (Interview 27, paediatric doctor)
 
Theme 2: acceptance of retention and secondary use of residual dried bloodspots
Motivations for storage
Interviewees were asked about their views of rDBS storage. Healthcare professionals supported the long-term storage of rDBS through the NBSIEM to facilitate advances in public health epidemiology, forensic purposes, familial disease analysis, and development of other screening tests. Some HCPs highlighted the importance of rDBS in supporting scientific advances. Stored rDBS could be used to enhance healthcare management and clinical testing, such as establishment of local reference standards.
 
‘We did not know how to define the cut-off values at first. Within the United States, the cut-off values differ by state. The initial cut-off values we chose may not reflect local needs. Residual dried bloodspots (storage) is essential to develop a large data pool that supports a control pool when technology advances.’ (Interview 19, pathology doctor)
 
Healthcare professionals observed that most parents demonstrated substantial interest in knowledge about their children. They thought that parents would like to have the right to obtain medical information regarding their children.
 
‘I believe that most parents would agree to save (their) genetic material...perhaps... they would not mind if the laboratory preserved the DNA material and let them know the findings of future screening tests.’ (Interview 17, paediatric doctor)
 
Barriers to storage
Uncertain value of retention
Four HCPs (13.3%) were worried that the public lacked an understanding of how rDBS could generate knowledge. This lack of awareness may be linked to an unwillingness among parents to permit the use of their children’s rDBS samples.
 
‘Many laypeople may not understand why they should engage in research studies. They may be reluctant to take part in research studies due to their own beliefs.’ (Interview 3, obstetrics and gynaecology nurse)
 
Two-fifths of HCPs (n=12) questioned the need for the long-term storage programme.
 
‘Several ongoing studies on population genetics use a wide-consent approach, supported by government funding. Does every newborn have to provide data (to support this research)? I doubt it.’ (Interview 18, paediatric doctor)
 
‘With strong opposition, I dissent to the storage (of rDBS) as I see no value at all. Perhaps it offers convenience for research, but it provides no personal benefits.’ (Interview 30, pathology doctor)
 
Interviewees believed that genetic material is very stable and does not easily degrade, despite long storage periods. Although storage is possible, some concerns were raised about its cost-effectiveness.
 
‘I heard researchers (scientists) mention that proper sample storage incurs a considerable cost.’ (Interview 13, obstetrics and gynaecology doctor)
 
No direct benefit to patients or parents
Around one-fourth of HCPs (n=7, 23.3%) would only support clinical research if the findings could be used to help their children and patients. Parents were not expected to be interested in research, especially if it did not provide direct clinical benefit to their children.
 
‘Parents care about whether the disease can be treated or not. Knowledge of disease aetiology is only relevant to public health or research institutes.’ (Interview 2, obstetrics and gynaecology nurse)
 
‘Would I receive the data if I donated a sample? I would donate a sample if the researcher would return the data. I must know every single conclusion or diagnosis from data generated using the rDBS. I would refuse if no data were returned.’ (Interview 13, obstetrics and gynaecology doctor)
 
Trust and privacy concerns regarding responsible authorities
Another recurring theme was trust in the context of primary privacy concerns, such as data leakage and misuse of private information generated from sensitive genetic materials.
 
‘It may not be desirable to store (genetic materials) for a long time. The longer it is stored, the more concerns arise. Immediate disposal would be more reassuring in terms of the protection of privacy.’ (Interview 12, paediatric nurse)
 
‘Some people may steal genetic information (rDBS cards) for illegal (or unauthorised) purposes.’ (Interview 12, paediatric nurse)
 
Obligation to return research findings
Generally, around 30% of the interviewed doctors and researchers (n=9) believed they have a duty to warn research participants upon finding abnormalities, enabling parents to take appropriate action after receiving relevant test results, including secondary findings.
 
There is an obligation to inform the patients (of medically actionable findings) because we work in this profession. First, we do no harm. If a significant finding warrants medical attention, we should be responsive and responsible.’ (Interview 24, obstetrics and gynaecology doctor)
 
Issues with obtaining consent for storage purposes
The importance of consent was acknowledged, but there was disagreement concerning the need for broad or specific consent. Interviewees frequently noted that broad consent is convenient for researchers.
 
‘Our understanding of IEMs or diagnostic tests increases as time goes by. The advantage of broad consent is that we do not need to obtain consent when new technology evolves. Like the recently added SCID, we do not need to redesign or implement a new consent procedure again when adding new conditions to the panel.’ (Interview 9, paediatric doctor)
 
Despite the view that broad consent may permit more efficient use of biospecimens and relevant data, there were concerns about public acceptance. Some interviewees stated that it would be challenging to obtain consent for all future research and explain the need for a change in consent approach.
 
‘Broad consent implies uncertainty in the research scope, which leads to parental concern. Parents are uncertain how the rDBS will be used or handled. I feel uneasy during counselling. I am not sure how their blood will be used in a research project, but in short, it will be helpful.’ (Interview 7, paediatric doctor)
 
‘Broad consent entails an unknown. As such, parents might be unwilling to sign the consent form (and contribute the rDBS).’ (Interview 27, paediatric doctor)
 
For HCPs, the legitimacy and scope of consent are key considerations. Specific consent is commonly exercised in clinical or research procedures in Hong Kong. It is recognised as the most appropriate procedure because it ensures patients receive information about the study. A few interviewees mentioned that no existing framework recommends the use of broad consent; thus, they favoured the use of specific consent.
 
‘Specific consent may not provide sufficient coverage of all possible research. If there is a breach in the protocol, it may bring about ethical and legal issues.’ (Interview 26, obstetrics and gynaecology doctor)
 
‘I found that specific consents were more protective for HCPs.’ (Interview 15, obstetrics and gynaecology nurse)
 
‘I have never sought ethical approval for broad consent from institutional research boards.’ (Interview 28, paediatric nurse)
 
The level of public knowledge regarding the NBSIEM requires further analysis. Education and counselling might be intended to address problems that arise from long-term storage. One doctor emphasised that proper counselling on tests involving genetic material should be considered best practice. Some interviewees expressed a desire to prepare themselves to address parents’ concerns.
 
‘The drawbacks of the NBSIEM should be discussed, apart from privacy and personal genetic information. Parents should be aware that there are many unknowns in genetics. (As medical professionals) we have, of course, fewer concerns. Suppose I have to conduct genetic counselling for an IEM test. In that case, I will cover all the aspects, including the basic understanding of genetics, even if it is a selected target gene panel. I do not see much difference in terms of counselling across all forms of genetic tests.’ (Interview 9, paediatric doctor)
 
Discussion
This study explored the voices of HCPs from various backgrounds and discussed clinical and ethical issues during the early implementation phase of the NBSIEM. Similar to professionals in the United Kingdom,13 HCPs in Hong Kong did not exhibit extensive knowledge and awareness of IEM conditions, which may have detrimental effects on patient-centred care. First, parental autonomy might be undermined because parents are not adequately informed about the test procedure and conditions. Second, a lack of understanding regarding IEMs can lead to suboptimal clinical care. Children with IEMs attend multiple specialist clinics to manage multiple co-morbidities. Caregivers encounter difficulties, such as miscommunication or inconsistent information about medications or dietary restrictions, when attending non–IEM-specific clinics.14 They face numerous psychosocial challenges in caring for their children,15 17 and increased awareness of these stressors among healthcare providers could improve communication for the entire family. More than four-fifths of individuals in Hong Kong attend medical services at public hospitals,18 and many parents are expected to participate in the NBSIEM. The establishment of training or educational interventions and a centralised pipeline to coordinate care are essential considerations for patient-centred care that focuses on caregivers of children with IEMs. Because hospitals are expanding screening for other uncommon disorders, such as SCID,19 the results of the present study may inform the development of a family-oriented framework for IEM management.
 
Development of the current NBSIEM was based on a stringent infrastructure and second-tier testing pipeline.20 Samples with borderline or ambiguous results were sent for further genetic tests to confirm the diagnosis and carrier status. Carnitine deficiency, citrin deficiency, methylmalonic acidaemia, and glutaric aciduria type I are examples of diseases with relatively high incidences of false-positives or false-negatives.1 21 After the implementation of stringent second-tier tests, the recall rate has declined to 0.3% to 0.4%, similar to the standards of international IEM programmes.22 This work has been successful, and the retention of rDBS to create a large-scale genetic biobank will be the next focus of public health dialogue.
 
It is important to note that territory-wide biobanks are not common; biobank platforms in Hong Kong currently are operated by individual hospitals or institutions. A notable example is Children of 1997, a population-based birth cohort study of local infants.23 Other existing platforms include disease-oriented biobanks,24 which support quality assurance and conduct epidemiology studies; they also identify risk factors, novel molecular markers, and genetic variants associated with diabetes and related complications. The establishment of biobanks at separate institutions has led to non-standardised informed consent practices. Many ethical and legal issues remain unresolved in efforts to harmonise all regional biobanks. Public awareness of the value of rDBS has been low11; improvements in public acceptance and engagement are needed for broad support of rDBS storage or biobanks.
 
The present study highlighted common ethical, legal, and social concerns as barriers to the storage of rDBS. Trust and low awareness of the potential value of rDBS were cited as primary barriers. In contrast to the assumptions of HCPs, parents generally agree with academic researchers and doctors accessing their children’s rDBS and health data after explicit consent has been provided.11 The optimal consent model for the use of rDBS outside of screening purposes depends on cultural and social characteristics that vary among regions. In the past three decades, some countries have stored rDBS without consent, leading to public controversy and lawsuits.25 Considering these situations, the retention of rDBS requires inherent trust based on public support, with strict clinical and ethical parameters. Essential factors in establishing trust are consent to participate in the NBSIEM, as well as consent for rDBS retention and secondary uses; questions remain regarding the optimal approach to obtaining consent.25 Other factors involved in decision-making concerning rDBS retention and secondary uses include timing of consent, adequate communication and discussion of potential uses, protection of privacy, and responsible governance.9 11 26 These factors should be considered in public policy initiatives.
 
Concerns about privacy issues and discrimination related to genomic information will be amplified as next-generation sequencing is integrated into clinical care for newborns.27 28 In South Korea, next-generation DNA sequencing has been evaluated for use in primary newborn screening.29 In the United Kingdom, Genomics England plans to offer whole-genome sequencing to newborns, identifying actionable genetic conditions that may impact infants in early childhood.30 There is evidence that sequencing data provides information about conditions not currently assessed in newborns, as well as information with unclear clinical significance.29 The previous regime agreed that it was appropriate to disclose incidental research findings if they would directly benefit the child after considering risk and benefit. However, this approach may differ in cultural and social settings when considering the child’s future and non-therapeutic genomic information.
 
Limitations and strengths of the study
Most participants in this study were HCPs working in the public sector; they may have different views regarding clinical utility, value, and perceived cost-benefit, compared with stakeholders from other healthcare settings. However, this study used various sampling strategies to recruit a heterogeneous group of HCPs with diverse specialities, roles, and responsibilities in the screening programme, as well as years of experience. A longitudinal study would provide long-term insights concerning the NBSIEM. Knowledge of heterogeneous IEMs and perception of rDBS storage among HCPs could be analysed via quantitative methods.
 
Conclusion
This study highlighted HCPs’ concerns about the NBSIEM and uncertainties regarding the handling or utilisation of rDBS. Policymakers should consider these concerns when establishing new guidelines. Future investigations should explore parents’ experiences with screening for rare metabolic conditions and communication of positive results.
 
Author contributions
Concept or design: OMY Ngan.
Acquisition of data: CJ Tam.
Analysis or interpretation of data: All authors.
Drafting of the manuscript: OMY Ngan, CJ Tam.
Critical revision of the manuscript for important intellectual content: CK Li.
 
All authors had full access to the data, contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
All authors declared no conflicts of interest.
 
Acknowledgement
The authors thank all healthcare professionals who participated in the study.
 
Funding/support
This research was supported by the Direct Grant for Research from the Faculty of Medicine at The Chinese University of Hong Kong (2020/2021) [Ref No.: 2020.081]. The funder had no role in study design, data collection/analysis/interpretation or manuscript preparation.
 
Ethics approval
The research was approved by the Survey and Behavioural Research Ethics Committee of The Chinese University of Hong Kong (Ref No.: SBRE-20-846). All participants provided written consent for interview and publication of the study.
 
Supplementary material
The supplementary material was provided by the authors and some information may not have been peer reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by the Hong Kong Academy of Medicine and the Hong Kong Medical Association. The Hong Kong Academy of Medicine and the Hong Kong Medical Association disclaim all liability and responsibility arising from any reliance placed on the content.
 
References
1. The Task Force on the Pilot Study of Newborn Screening for Inborn Errors of Metabolism. Evaluation of the 18-month “Pilot Study of Newborn Screening for Inborn Errors of Metabolism” in Hong Kong. Hong Kong J Paediatr (new series) 2020;25:16-22.
2. Belaramani KM, Chan TC, Hau EW, et al. Expanded newborn screening for inborn errors of metabolism in Hong Kong: results and outcome of a 7-year journey. Int J Neonatal Screen 2024;10:23. Crossref
3. Hospital Authority. Newborn Screening Programme for Inborn Errors of Metabolism (IEM). October 2023. Available from: https://www21.ha.org.hk/smartpatient/SPW/MediaLibraries/SPW/SPWMedia/Education-pamphlet_NBS-IEM_Eng-(Oct-2023).pdf?ext=.pdf. Accessed 27 Mar 2024.
4. Ngan OM, Li CK. Ethical issues of dried blood spot storage and its secondary use after newborn screening programme in Hong Kong. Hong Kong J Paediatr (new series) 2020;25:8-15.
5. Lund A, Wibrand F, Skogstrand K, et al. Danish expanded newborn screening is a successful preventive public health programme. Dan Med J 2020;67:A06190341.
6. Nordfalk F, Ekstrøm CT. Newborn dried blood spot samples in Denmark: the hidden figures of secondary use and research participation. Eur J Hum Genet 2019;27:203-10. Crossref
7. Jansen ME, Klein AW, Buitenhuis EC, Rodenburg W, Cornel MC. Expanded neonatal bloodspot screening programmes: an evaluation framework to discuss new conditions with stakeholders. Front Pediatr 2021;9:635353. Crossref
8. Loeber JG, Platis D, Zetterström RH, et al. Neonatal screening in Europe revisited: an ISNS perspective on the current state and developments since 2010. Int J Neonatal Screen 2021;7:15. Crossref
9. Jansen ME, van den Bosch LJ, Hendriks MJ, et al. Parental perspectives on retention and secondary use of neonatal dried bloodspots: a Dutch mixed methods study. BMC Pediatr 2019;19:230. Crossref
10. Mak CM, Lam CW, Law CY, et al. Parental attitudes on expanded newborn screening in Hong Kong. Public Health 2012;126:954-9. Crossref
11. Hui LL, Nelson EA, Deng HB, et al. The view of Hong Kong parents on secondary use of dried blood spots in newborn screening program. BMC Med Ethics 2022;23:105. Crossref
12. Mak CM, Law EC, Lee HH, et al. The first pilot study of expanded newborn screening for inborn errors of metabolism and survey of related knowledge and opinions of health care professionals in Hong Kong. Hong Kong Med J 2018;24:226-37. Crossref
13. Moody L, Atkinson L, Kehal I, Bonham JR. Healthcare professionals’ and parents’ experiences of the confirmatory testing period: a qualitative study of the UK expanded newborn screening pilot. BMC Pediatr 2017;17:121. Crossref
14. Siddiq S, Wilson BJ, Graham ID, et al. Experiences of caregivers of children with inherited metabolic diseases: a qualitative study. Orphanet J Rare Dis 2016;11:168. Crossref
15. Zeltner NA, Landolt MA, Baumgartner MR, et al. Living with intoxication-type inborn errors of metabolism: a qualitative analysis of interviews with paediatric patients and their parents. JIMD Rep 2017;31:1-9. Crossref
16. Petros M. Revisiting the Wilson–Jungner criteria: how can supplemental criteria guide public health in the era of genetic screening? Genet Med 2012;14:129-34. Crossref
17. Grant S, Cross E, Wraith JE, et al. Parental social support, coping strategies, resilience factors, stress, anxiety and depression levels in parents of children with MPS III (Sanfilippo syndrome) or children with intellectual disabilities (ID). J Inherit Metab Dis 2013;36:281-91. Crossref
18. Hospital Authority. Hospital Authority Strategic Plan 2022-2027: Towards Sustainable Healthcare. Available from: https://www.ha.org.hk/haho/ho/ap/HA_StrategicPlan2022-2027_Eng_211216.pdf. Accessed 16 Dec 2021.
19. Leung D, Lee PP, Lau YL. Review of a decade of international experiences in severe combined immunodeficiency newborn screening using T-cell receptor excision circle. Hong Kong J Paediatr (new series) 2020;25:30-41.
20. Tsang KY, Chan TC, Yeung MC, Wong TK, Lau WT, Mak CM. Validation of amplicon-based next generation sequencing panel for second-tier test in newborn screening for inborn errors of metabolism. J Lab Med 2021;45:267-74. Crossref
21. Hennermann JB, Roloff S, Gellermann J, Grüters A, Klein J. False-positive newborn screening mimicking glutaric aciduria type I in infants with renal insufficiency. J Inherit Metab Dis 2009;32 Suppl 1:S355-9. Crossref
22. la Marca G. Mass spectrometry in clinical chemistry: the case of newborn screening. J Pharm Biomed Anal 2014;101:174-82. Crossref
23. Schooling CM, Hui LL, Ho LM, Lam TH, Leung GM. Cohort profile: ‘Children of 1997’: a Hong Kong Chinese birth cohort. Int J Epidemiol 2012;41:611-20. Crossref
24. Ng IH, Cheung KK, Yau TT, Chow E, Ozaki R, Chan JC. Evolution of diabetes care in Hong Kong: from the Hong Kong Diabetes Register to JADE-PEARL Program to RAMP and PEP Program. Endocrinol Metab (Seoul) 2018;33:17-32. Crossref
25. Cunningham S, O’Doherty KC, Sénécal K, Secko D, Avard D. Public concerns regarding the storage and secondary uses of residual newborn bloodspots: an analysis of print media, legal cases, and public engagement activities. J Community Genet 2015;6:117-28. Crossref
26. Hendrix KS, Meslin EM, Carroll AE, Downs SM. Attitudes about the use of newborn dried blood spots for research: a survey of underrepresented parents. Acad Pediatr 2013;13:451-7. Crossref
27. Pereira S, Robinson JO, Gutierrez AM, et al. Perceived benefits, risks, and utility of newborn genomic sequencing in the BabySeq project. Pediatrics 2019;143(Suppl 1):S6-13. Crossref
28. Johnston J, Lantos JD, Goldenberg A, et al. Sequencing newborns: a call for nuanced use of genomic technologies. Hastings Cent Rep 2018;48 Suppl 2:S2-6. Crossref
29. Woerner AC, Gallagher RC, Vockley J, Adhikari AN. The use of whole-genome and exome sequencing for newborn screening: challenges and opportunities for population health. Front Pediatr 2021;9:663752. Crossref
30. Biesecker LG, Green ED, Manolio T, Solomon BD, Curtis D. Should all babies have their genome sequenced at birth? BMJ 2021;375:n2679. Crossref

Stevens–Johnson syndrome and toxic epidermal necrolysis in Hong Kong

Hong Kong Med J 2024 Apr;30(2):102–9 | Epub 26 Mar 2024
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Stevens–Johnson syndrome and toxic epidermal necrolysis in Hong Kong
Christina MT Cheung, MB, ChB, MRCP1; Mimi M Chang, MB, ChB, FRCP1; Joshua JX Li, MB, ChB, FHKCPath2; Agnes WS Chan, MB, ChB, MRCP1
1 Department of Medicine and Therapeutics, Prince of Wales Hospital, The Chinese University of Hong Kong, Hong Kong SAR, China
2 Department of Anatomical and Cellular Pathology, Prince of Wales Hospital, The Chinese University of Hong Kong, Hong Kong SAR, China
 
Corresponding author: Dr Agnes WS Chan (agneswschan@cuhk.edu.hk)
 
 Full paper in PDF
 
Abstract
Introduction: Stevens–Johnson syndrome (SJS) and toxic epidermal necrolysis (TEN) [hereafter, SJS/TEN] are uncommon but severe mucocutaneous reactions. Although they have been described in many populations worldwide, data from Hong Kong are limited. Here, we explored the epidemiology, disease characteristics, aetiology, morbidity, and mortality of SJS/TEN in Hong Kong.
 
Methods: This retrospective cohort study included all hospitalised patients who had been diagnosed with SJS/TEN in Prince of Wales Hospital from 1 January 2004 to 31 December 2020.
 
Results: There were 125 cases of SJS/TEN during the 17-year study period. The annual incidence was 5.07 cases per million. The mean age at onset was 51.4 years. The mean maximal body surface area of epidermal detachment was 23%. Overall, patients in 32% of cases required burns unit or intensive care unit admission. Half of the cases involved concomitant sepsis, and 23.2% of cases resulted in multiorgan failure or disseminated intravascular coagulation. The mean length of stay was 23.9 days. The cause of SJS/TEN was attributed to a drug in 91.9% of cases, including 84.2% that involved anticonvulsants, allopurinol, antibiotics, or analgesics. In most cases, patients received treatment comprising either best supportive care alone (35.2%) or combined with intravenous immunoglobulin (43.2%). The in-hospital mortality rate was 21.6%. Major causes of death were multiorgan failure and/or fulminant sepsis (81.5%).
 
Conclusion: This study showed that SJS/TEN are uncommon in Hong Kong but can cause substantial morbidity and mortality. Early recognition, prompt withdrawal of offending agents, and multidisciplinary supportive management are essential for improving clinical outcomes.
 
 
New knowledge added by this study
  • Stevens–Johnson syndrome (SJS) and toxic epidermal necrolysis (TEN) are rare severe cutaneous adverse reactions in Hong Kong, with a combined annual incidence of 5.07 cases per million.
  • Stevens–Johnson syndrome and TEN cause considerable burdens on the Hong Kong healthcare system due to their prolonged length of stay, high demand for intensive care, and substantial mortality.
Implications for clinical practice or policy
  • Clinicians should be aware of the early signs and symptoms of SJS and TEN to enable rapid recognition of the disease and prompt withdrawal of culprit drugs.
  • Dedicated multidisciplinary teams should be established in tertiary centres to optimise patient outcomes.
 
 
Introduction
Stevens–Johnson syndrome (SJS) and toxic epidermal necrolysis (TEN) are uncommon but potentially life-threatening severe mucocutaneous reactions characterised by extensive epidermal necrosis and detachment. Both entities are considered variants of a single disease continuum and are classified according to the percentage of body surface area (BSA) with epidermal detachment.1 2 Although SJS and TEN (hereafter, SJS/TEN) have been described in all ethnicities worldwide,3 studies of these reactions in Hong Kong have been limited.4 5 The incidence, clinical characteristics, aetiology, treatment regimen, morbidity, and mortality in the territory are largely unknown. This pilot study aimed to review cases of SJS/TEN over a 17-year period at a tertiary referral centre in Hong Kong, and to aid future research in Hong Kong focused on severe cutaneous adverse reactions.
 
Methods
This retrospective cohort study included all hospitalised patients who had been diagnosed with SJS/TEN and were treated in Prince of Wales Hospital (PWH), a major regional hospital under the New Territories East Cluster (NTEC), from 1 January 2004 to 31 December 2020.
 
Patient identification
Patients with clinical and histological diagnoses of SJS/TEN were identified from the Hospital Authority database using International Classification of Diseases, Ninth Revision, Clinical Modification diagnosis codes and the database of the Department of Anatomical and Cellular Pathology of PWH, respectively.
 
Inclusion criteria
Diagnoses of SJS/TEN were based on consensus guidelines.1 Patients were diagnosed with SJS, SJS/TEN overlap, and TEN when they exhibited epidermal detachment levels of <10%, 10% to 30%, and >30%, respectively, with consistent histological features (if skin biopsy was performed). Consistent histological features were regarded as partial- to full-thickness epidermal necrosis.
 
Exclusion criteria
Patients were excluded if they had an alternative diagnosis, such as severe cutaneous adverse reactions other than SJS/TEN (eg, drug reaction with eosinophilia and systemic symptoms syndrome/acute generalised exanthematous pustulosis/generalised bullous fixed drug eruption), erythema multiforme major, autoimmune blistering disease, acute graft-versus-host disease, and infections such as staphylococcal scalded skin syndrome.
 
Data collection and statistical analysis
Clinical characteristics were collected from electronic records and, when available, hospital case notes. The following clinical characteristics were recorded and analysed: age at onset, sex, ethnicity, maximal BSA of detached or detachable skin, SCORTEN (Severity-of-Illness Score for Toxic Epidermal Necrolysis) prognostic score,6 mucosa involved, histology results if available, causative drugs, time from exposure to onset, time from onset to admission and treatment, treatment regimen, disease complications, mortality and its cause, and length of stay. Efforts to identify causative drugs were guided by the ALDEN (algorithm of drug causality for epidermal necrolysis) score,7 which was retrospectively calculated by two independent investigators. All clinical data were expressed as percentages or means ± standard deviations unless otherwise specified.
 
This article was written in compliance with the STROBE (Strengthening the Reporting of Observational Studies in Epidemiology) reporting guidelines.
 
Results
Using the International Classification of Diseases, Ninth Revision, Clinical Modification diagnosis codes for SJS/TEN, 164 potential patients with 166 cases of SJS/TEN during the period from January 2004 to December 2020 were initially identified. Six additional patients with SJS/TEN were identified from the database of the Department of Anatomical and Cellular Pathology of PWH. Forty-seven patients were excluded due to alternative diagnoses. In total, 123 patients with 125 cases of SJS/TEN were included in the study (Fig).
 

Figure. Flowchart of patient identification and exclusion
 
Demographic characteristics and disease classification
Among the 123 patients with SJS/TEN, 53 were men and 70 were women; the female-to-male ratio was 1.32:1 (Table 1). The mean age at onset was 51.4 years, and most patients were Chinese. Of the 125 cases, 59 were SJS, 27 were SJS-TEN overlap, and 39 were TEN. A small number of patients (n=18, 14.4%) were admitted for other medical issues but developed SJS/TEN after hospitalisation.
 

Table 1. Demographic characteristics of patients (n=123) and disease classification in each case (n=125)
 
Clinical characteristics and clinical course
The mean time from disease onset to hospitalisation was 4.9 days (Table 2). Fever was present on admission in 88 cases (70.4%). The mean maximal BSA of epidermal detachment was 23%. Mucosal involvement was common; only five cases (4.0%) lacked mucosal involvement. Skin biopsy was performed in 87 cases (69.6%) and the mean SCORTEN prognostic score was 2.17.
 

Table 2. Clinical characteristics and clinical course (n=125)
 
Burns unit or intensive care unit admission was required in 40 cases (32.0%) and half of these cases required invasive mechanical ventilation. In total, 63 cases (50.4%) involved concomitant infection from various sources. Multiorgan failure or disseminated intravascular coagulation occurred in 29 cases (23.2%). The mean length of stay in the hospital was 23.9 days (Table 2).
 
Aetiology
Stevens–Johnson syndrome and TEN onset was attributed to a drug in 114 of 124 cases (91.9%); one patient developed a second case of SJS/TEN upon accidental re-exposure to the same culprit drug (paracetamol). Four cases (3.2%) were caused by infection, and no cause was identified in six cases (4.8%). The identified culprit drugs are shown in Table 3. The mean time from initiation of the culprit drug to onset of SJS/TEN was 20.5 ± 16.7 days (range, 1-87; median, 15.5).
 

Table 3. Culprit drugs for Stevens–Johnson syndrome and toxic epidermal necrolysis identified (n=114)
 
Treatment
All patients received the best supportive medical care available. In some cases, patients received additional treatment. The numbers and proportions of cases treated with different regimens are shown in Table 4. The mean time between disease onset and active treatment initiation was 7.4 ± 6.1 days (range, 1-34; median, 6).
 

Table 4. Treatment regimens (n=125)
 
Intravenous immunoglobulin (IVIG) was administered in 54 cases. The mean total dose of IVIG was 3.2 g/kg (range, 1.5-6; administered over 2-6 days). High-dose IVIG, defined as ≥3 g/kg, was administered in 40 cases. Systemic steroid regimens considerably varied, with daily doses of prednisolone ranging from 20 to 120 mg (or an equivalent dose). The cyclosporine regimen was 3 mg/kg/day, tapered over 20 to 30 days.
 
Mortality
There were 27 deaths in the study cohort, and the overall mortality rate was 21.6%. The mean time from SJS/TEN onset to death was 23.6 ± 19.0 days (range, 5-76). Most patients died from multiorgan failure and/or fulminant sepsis (n=22, 81.5%); other causes of death were acute coronary syndrome (n=2), liver failure (n=1), and sudden cardiac arrest (n=2).
 
The observed mortality rates were 16.9%, 22.2%, and 28.2% for SJS, SJS-TEN overlap, and TEN, respectively. The SCORTEN-based predicted mortality rates were 14.1%, 28.5%, and 36.9% for SJS, SJS-TEN overlap, and TEN, respectively. Inpatient-onset SJS/TEN had a high mortality rate of 77.8%: 14 deaths among 18 patients who developed SJS/TEN after admission.
 
Discussion
Epidemiology
Stevens–Johnson syndrome and TEN are recognised worldwide, with several epidemiological studies conducted in Europe and the US. In the 1990s, Roujeau et al8 reported that the annual incidence of TEN in France was 1.2 cases per million; during the same period, the estimated overall annual incidences of SJS/TEN were 1.89 cases per million in Germany9 and 4.2 cases per million in the US.10 In the past decade, two large epidemiological studies in the US11 and United Kingdom12 revealed that the overall annual incidences of SJS/TEN were 12.7 and 5.76 cases per million, respectively. In contrast, the epidemiology of SJS/TEN in Asia is not well-documented.13 In Singapore, based on a small retrospective hospital-based study of 20 patients with TEN, the estimated annual incidence of TEN was 1.4 cases per million14; in Korea, a large population-based study indicated that the overall annual incidence of SJS/TEN was 4.9 to 6.5 cases per million.15 In the present study, there were 125 cases of SJS/TEN during the 17-year study period. Notably, 13 of these cases were transferred from hospitals outside of the NTEC: one was SJS, three were SJS/TEN overlap, and nine were TEN. The NTEC serves a population of 1.3 million.16 The estimated annual incidence of TEN alone and combined annual incidence of SJS, SJS-TEN overlap, and TEN were 1.36 and 5.07 cases per million, respectively; these incidences are comparable with findings from studies in other countries.
 
Stevens–Johnson syndrome is approximately threefold more common than TEN.15 17 However, in the current study, fewer than half of the cases (47.2%) were SJS, whereas 31.2% were TEN. This may be related to referral bias, whereby more severe cases were transferred to the study hospitals, whereas ‘milder’ cases were managed in regional hospitals where the patients were initially admitted. Prince of Wales Hospital is a tertiary referral centre and one of the few hospitals in Hong Kong with both on-site dermatologists and burns unit support. In our cohort, 31 cases (24.8%) were transferred from peripheral hospitals: 18 (14.4%) arrived from hospitals within the NTEC, whereas 13 (10.4%) arrived from hospitals outside of the NTEC.
 
Aetiology
Stevens–Johnson syndrome and TEN are most often drug-induced, and a culprit drug is identified in approximately 85% of cases.7 18 The reactions usually occur between 7 days and 8 weeks after drug ingestion.19 However, upon rechallenge with the culprit drug, SJS/TEN can develop within hours.17 19 Efforts to identify causative drugs were guided by the ALDEN score.7 In cases of SJS/TEN, the most commonly implicated high-risk medications are anticonvulsants, allopurinol, antimicrobials, and oxicam non-steroidal anti-inflammatory drugs.19 20 In the present study, SJS/TEN onset was attributed to a drug in 114 of 124 cases (91.9%). The mean time between drug initiation and SJS/TEN onset was 20.5 days. Among these 114 cases, 81.6% were caused by the high-risk medications listed above. These findings are comparable with previous reports.
 
Mortality
Stevens–Johnson syndrome and TEN are associated with high mortality rates, with 1% to 5% in cases of SJS and 25% to 30% in cases of TEN. Survival analyses in multinational European studies (EuroSCAR [European Study of Severe Cutaneous Adverse Reactions] and RegiSCAR [Registry of Severe Cutaneous Adverse Reactions]) have indicated that the overall mortality rate in cases of SJS/TEN is approximately 22% to 23%.18 20 21 22 23 In Asia, reported overall mortality rates vary from 12.3% to 25%.24 25 26 27 Sepsis leading to multiorgan failure is the most common cause of death.21 Despite the substantial mortality, there currently is no therapeutic regimen with a clear benefit for patients with SJS/TEN.18 21 Considering the rarity of these diseases, it is difficult to conduct randomised trials. Early recognition, rapid withdrawal of offending agents, and best supportive care remain the primary components of clinical management.
 
In the current study, the overall mortality rate was 21.6%; in 81.5% of these cases, the patient died of fulminant sepsis or multiorgan failure. These findings are consistent with existing literature. However, the mortality rate in cases of SJS was much higher in the present study than in previous studies. In the 59 cases of SJS, there were 10 deaths; the mortality rate was 16.9%. Among the 10 patients who died, six experienced complete skin re-epithelisation before death from other medical conditions, which include massive duodenal ulcer bleeding, acute coronary syndrome, metastatic lung cancer, acute liver and renal failure due to herbs, aspiration pneumonia, and sudden cardiac arrest. The remaining four patients had inpatient-onset SJS; they were initially admitted for traumatic intracranial haemorrhage, post-hepatectomy liver failure, convulsions caused by metastatic lung cancer, and post-stroke seizure, respectively. These patients exhibited skin-specific improvements but soon died of aspiration pneumonia and acute renal failure, liver failure, metastatic lung cancer with respiratory failure and liver failure, and sudden cardiac arrest, respectively. The high mortality rate among patients with SJS in the present study could be related to referral bias (as noted in the Epidemiology subsection above); specifically, more severe cases of SJS with co-morbidities and/or complications may have been transferred to our tertiary centre for medical care, whereas less severe cases of SJS might have been managed in regional hospitals where the patients were initially admitted. Indeed, the predicted mortality rate (based on the SCORTEN prognostic score) among cases of SJS in our cohort was 14%; this rate was similar to the observed mortality rate.
 
In the present study, inpatient-onset SJS/TEN had a high mortality rate (77.8%). Although high mortality of inpatient-onset SJS/TEN was not previously described in the literature, we speculate that this high mortality was related to the underlying medical conditions for which patients were initially admitted. The clinical characteristics of the 14 patients who died are presented in Table 5.
 

Table 5. Subgroup analysis of patients with inpatient-onset Stevens–Johnson syndrome and toxic epidermal necrolysis
 
In addition to high mortality, SJS/TEN were associated with high rates of burns unit/intensive care unit admission (32%) and prolonged length of stay (mean=23.9 days) [Table 2], placing a considerable burden on the public healthcare system.
 
Limitations and strengths
As a retrospective cohort study, the present study had some intrinsic limitations. Some hospital case notes (ie, from earlier in the study period) were no longer retrievable. Clinical characteristics such as the exact date of disease onset, precise total BSA involved, and detailed drug history (including over-the-counter medications/medications prescribed by private doctors) might not have been available for some of these older cases. Skin biopsies were performed in 70% of cases and might have been omitted in cases of terminal illness. Many patients with milder cases were lost to follow-up after discharge; thus, long-term sequelae were not well-documented.
 
Additionally, referral bias may have been present because PWH is a tertiary referral centre. Such bias could have led to underestimation of the true incidence of SJS and overestimation of the incidence of TEN; milder cases of SJS might have been managed in regional hospitals, whereas more severe cases of TEN were transferred to our centre for better care. Similarly, there may have been overestimation of various outcome measures including length of stay, complications, and mortality.
 
Nonetheless, this study had several strengths. To our knowledge, this is one of the largest single-centre studies regarding SJS/TEN in Asia; it included a homogenous group of predominantly Chinese patients. The patients were managed by the same dermatology team with a consistent diagnostic and therapeutic approach throughout the study period. Data collection was adequate, and exhaustive evaluation of drug history was feasible for cases with access to both electronic records and hospital case notes. To ensure accurate identification of causative drugs, the ALDEN score was retrospectively evaluated by two independent dermatology doctors during the study.
 
Conclusion
This is the first large study in Hong Kong to provide data regarding the epidemiology, disease characteristics and clinical course, aetiology, treatment regimen, and mortality of SJS/TEN. Although uncommon, SJS/TEN is associated with substantial morbidity and mortality. Therefore, in addition to increasing awareness of SJS/TEN among patients and clinicians, efforts should be made to optimise inpatient care among public hospitals in Hong Kong by establishing dedicated multidisciplinary teams that are experienced in the management of SJS/TEN.
 
Author contributions
Concept or design: CMT Cheung, MM Chang.
Acquisition of data: All authors.
Analysis or interpretation of data: All authors.
Drafting of the manuscript: CMT Cheung, AWS Chan.
Critical revision of the manuscript for important intellectual content: All authors.
 
All authors had full access to the data, contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
All authors have disclosed no conflicts of interest.
 
Funding/support
This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
 
Ethics approval
This research was approved by the Joint Chinese University of Hong Kong–New Territories East Cluster Clinical Research Ethics Committee (Ref No.: 2017.424). The requirement for informed consent was waived by the Committee due to the retrospective nature of the research.
 
References
1. Bastuji-Garin S, Rzany B, Stern RS, Shear NH, Naldi L, Roujeau JC. Clinical classification of cases of toxic epidermal necrolysis, Stevens–Johnson syndrome, and erythema multiforme. Arch Dermatol 1993;129:92-6. Crossref
2. Roujeau JC. Stevens–Johnson syndrome and toxic epidermal necrolysis are severity variants of the same disease which differs from erythema multiforme. J Dermatol 1997;24:726-9. Crossref
3. Roujeau JC, Chosidow O, Saiag P, Guillaume JC. Toxic epidermal necrolysis (Lyell syndrome). J Am Acad Dermatol 1990;23:1039-58. Crossref
4. Ying S, Ho W, Chan HH. Toxic epidermal necrolysis: 10 years experience of a burns centre in Hong Kong. Burns 2001;27:372-5. Crossref
5. Yeung CK. Intravenous immunoglobulin treatment for Stevens–Johnson syndrome and toxic epidermal necrolysis [dissertation]. Queen Mary Hospital, The University of Hong Kong; 2004.
6. Bastuji-Garin S, Fouchard N, Bertocchi M, Roujeau JC, Revuz J, Wolkenstein P. SCORTEN: a severity-of-illness score for toxic epidermal necrolysis. J Invest Dermatol 2000;115:149-53. Crossref
7. Sassolas B, Haddad C, Mockenhaupt M, et al. ALDEN, an algorithm for assessment of drug causality in Stevens–Johnson syndrome and toxic epidermal necrolysis: comparison with case-control analysis. Clin Pharmacol Ther 2010;88:60-8. Crossref
8. Roujeau JC, Guillaume JC, Fabre JP, Penso D, Fléchet ML, Girre JP. Toxic epidermal necrolysis (Lyell syndrome). Incidence and drug etiology in France, 1981-1985. Arch Dermatol 1990;126:37-42. Crossref
9. Rzany B, Mockenhaupt M, Baur S, et al. Epidemiology of erythema exsudativum multiforme majus, Stevens–Johnson syndrome, and toxic epidermal necrolysis in Germany (1990-1992): structure and results of a population-based registry. J Clin Epidemiol 1996;49:769-73. Crossref
10. Chan HL, Stern RS, Arndt KA, et al. The incidence of erythema multiforme, Stevens–Johnson syndrome, and toxic epidermal necrolysis. A population-based study with particular reference to reactions caused by drugs among outpatients. Arch Dermatol 1990;126:43-7. Crossref
11. Hsu DY, Brieva J, Silverberg NB, Silverberg JI. Morbidity and mortality of Stevens–Johnson syndrome and toxic epidermal necrolysis in United States adults. J Invest Dermatol 2016;136:138797. Crossref
12. Frey N, Jossi J, Bodmer M, et al. The epidemiology of Stevens–Johnson syndrome and toxic epidermal necrolysis in the UK. J Invest Dermatol 2017;137:1240-7. Crossref
13. Lee HY, Martanto W, Thirumoorthy T. Epidemiology of Stevens–Johnson syndrome and toxic epidermal necrolysis in Southeast Asia. Dermatologica Sinica 2013;31:217-20. Crossref
14. Chan HL. Toxic epidermal necrolysis in Singapore, 1989 through 1993: incidence and antecedent drug exposure. Arch Dermatol 1995;131:1212-3. Crossref
15. Yang MS, Lee JY, Kim J, et al. Incidence of Stevens–Johnson syndrome and toxic epidermal necrolysis: a nationwide population-based study using national health insurance database in Korea. PLoS One 2016;11:e0165933. Crossref
16. Hospital Authority, Hong Kong SAR Government. New Territories East Cluster biennial report 2018-2020. Available from: https://www3.ha.org.hk/ntec/clusterreport/clusterreport2018-20/index.html. Accessed 8 Feb 2024.
17. Schwartz RA, McDonough PH, Lee BW. Toxic epidermal necrolysis: Part I. Introduction, history, classification, clinical features, systemic manifestations, etiology, and immunopathogenesis. J Am Acad Dermatol 2013;69:173. e1-13; quiz 185-6. Crossref
18. Creamer D, Walsh SA, Dziewulski P, et al. UK guidelines for the management of Stevens–Johnson syndrome/toxic epidermal necrolysis in adults 2016. J Plast Reconstr Aesthet Surg 2016;69:e119-53. Crossref
19. Roujeau JC, Kelly JP, Naldi L, et al. Medication use and the risk of Stevens–Johnson syndrome or toxic epidermal necrolysis. N Engl J Med 1995;333:1600-7. Crossref
20. Mockenhaupt M, Viboud C, Dunant A, et al. Stevens–Johnson syndrome and toxic epidermal necrolysis: assessment of medication risks with emphasis on recently marketed drugs. The EuroSCAR study. J Invest Dermatol 2008;128:35-44. Crossref
21. Schwartz RA, McDonough PH, Lee BW. Toxic epidermal necrolysis: Part II. Prognosis, sequelae, diagnosis, differential diagnosis, prevention, and treatment. J Am Acad Dermatol 2013;69:187.e1-16; quiz 203-4. Crossref
22. Roujeau JC, Stern RS. Severe adverse cutaneous reactions to drugs. N Engl J Med 1994;331:1272-85. Crossref
23. Sekula P, Dunant A, Mockenhaupt M, et al. Comprehensive survival analysis of a cohort of patients with Stevens–Johnson syndrome and toxic epidermal necrolysis. J Invest Dermatol 2013;133:1197-204. Crossref
24. Barvaliya M, Sanmukhani J, Patel T, Paliwal N, Shah H, Tripathi C. Drug-induced Stevens–Johnson syndrome (SJS), toxic epidermal necrolysis (TEN), and SJS-TEN overlap: a multicentric retrospective study. J Postgrad Med 2011;57:115-9. Crossref
25. Roongpisuthipong W, Prompongsa S, Klangjareonchai T. Retrospective analysis of corticosteroid treatment in Stevens–Johnson syndrome and/or toxic epidermal necrolysis over a period of 10 years in Vajira Hospital, Navamindradhiraj University, Bangkok. Dermatol Res Pract 2014;2014:237821. Crossref
26. Suwarsa O, Yuwita W, Dharmadji HP, Sutedja E. Stevens–Johnson syndrome and toxic epidermal necrolysis in Dr Hasan Sadikin General Hospital Bandung, Indonesia from 2009-2013. Asia Pac Allergy 2016;6:43-7. Crossref
27. Lee HY, Fook-Chong S, Koh HY, Thirumoorthy T, Pang SM. Cyclosporine treatment for Stevens–Johnson syndrome/toxic epidermal necrolysis: retrospective analysis of a cohort treated in a specialized referral center. J Am Acad Dermatol 2017;76:106-13. Crossref

Impact of a novel pre-hospital stroke notification programme on acute stroke care key performance indicators in Hong Kong: a multicentre prospective cohort study with historical controls

Hong Kong Med J 2024 Apr;30(2):94-101 | Epub 5 Apr 2024
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE  CME
Impact of a novel pre-hospital stroke notification programme on acute stroke care key performance indicators in Hong Kong: a multicentre prospective cohort study with historical controls
KY Cheng, FHKCEM, FHKAM (Emergency Medicine)1; Ellen LM Yu, BSc, MSc (Epi/Biostat)2; Tafu Yamamoto, MB, ChB1; Julie CL Kwong, BHS, MBA3; YK Ho, MB, BS, FHKAM (Emergency Medicine)4; HK Ngan, MB, BS, FHKAM (Emergency Medicine)1; WH Lin, MB, BS1; Jessica MT Lau, FHKCEM, FHKAM (Emergency Medicine)5; CH Cheung, MB, ChB, MRCP (UK)6; Gordon PC Lee, FHKCEM, FHKAM (Emergency Medicine)4; LH Siu, FHKAM (Medicine), FHKCP3; Bun Sheng, MSc, MB, ChB6; Winnie WY Wong, FHKAM (Medicine), FRCP3; WY Man, BNurs, MSc6; Cathy CC Cheung, BNurs, MSc5; CT Tse, MB, BS, FHKAM (Medicine)6
1 Department of Accident and Emergency, Yan Chai Hospital, Hong Kong SAR, China
2 Clinical Research Centre, Kowloon West Cluster, Hospital Authority, Hong Kong SAR, China
3 Division of Neurology, Department of Medicine and Geriatrics, Caritas Medical Centre, Hong Kong SAR, China
4 Department of Accident and Emergency, Caritas Medical Centre, Hong Kong SAR, China
5 Department of Accident and Emergency, North Lantau Hospital, Hong Kong SAR, China
6 Division of Neurology, Department of Medicine and Geriatrics, Princess Margaret Hospital, Hong Kong SAR, China
 
Corresponding author: Dr KY Cheng (pkycheng31@fellow.hkam.hk)
 
 Full paper in PDF
 
Abstract
Introduction: Early identification and initiation of reperfusion therapy is essential for suspected acute ischaemic stroke. A pre-hospital stroke notification (PSN) protocol using FASE (facial drooping, arm weakness, speech difficulties, and eye palsy) was implemented to improve key performance indicators (KPIs) in acute stroke care delivery. We assessed KPIs and clinical outcomes before and after PSN implementation in Hong Kong.
 
Methods: This prospective cohort study with historical controls was conducted in the Accident and Emergency Departments of four public hospitals in Hong Kong. Patients were screened using the PSN protocol between August 2021 and February 2022. Suspected stroke patients between August 2020 and February 2021 were included as historical controls. Door-to-needle (DTN) and door-to–computed tomography (DTC) times before and after PSN implementation were compared. Clinical outcomes including National Institutes of Health Stroke Scale score at 24 hours and modified Rankin Scale score at 3 months after intravenous recombinant tissue-type plasminogen activator (IV-rtPA) were also assessed.
 
Results: Among the 715 patients (266 PSN and 449 non-PSN) included, 50.8% of PSN patients and 37.7% of non-PSN patients had a DTC time within 25 minutes (P<0.001). For the 58 PSN and 134 non-PSN patients given IV-rtPA, median DTN times were 67 and 75.5 minutes, respectively (P=0.007). The percentage of patients with a DTN time within 60 minutes was higher in the PSN group than in the non-PSN group (37.9% vs 21.6%; P=0.019). No statistically significant differences in clinical outcomes were observed.
 
Conclusion: Although the PSN protocol shortened DTC and DTN times, clinical outcomes did not significantly differ.
 
 
New knowledge added by this study
  • This study validates findings from a previous study that pre-hospital stroke notification (PSN) improves key performance indicators among stroke patients in Hong Kong.
  • It is unclear whether PSN improves overall clinical outcomes among stroke patients.
Implications for clinical practice or policy
  • Further research is warranted to assess whether PSN improves patient outcomes and other acute care parameters.
  • Considering the resource-intensive nature of PSN, its cost-effectiveness requires additional investigation.
 
 
Introduction
In Hong Kong, approximately 3000 stroke-related deaths occur annually; stroke is among the top three reasons for hospital admission.1 Strokes lead to prolonged hospital stays, and affected patients are likely to require long-term residential care.2 Early accurate identification of acute ischaemic stroke and initiation of reperfusion therapy have been associated with significant improvements in functional outcomes and a lower likelihood of hospital mortality.3 4 Therefore, efforts to shorten any steps within the stroke onset-to-treatment cascade can enhance outcomes for these patients.
 
The 2019 update to the American Stroke Association (ASA) 2018 guidelines for the management of acute ischaemic stroke recommends early stroke recognition and notification during initial medical contact using validated screening tools in suspected stroke patients.5 Pre-hospital notification to the receiving hospital allows early resource mobilisation prior to arrival of the suspected stroke patient, ensuring timely management. In Hong Kong, a recent study demonstrated improvements in several major benchmarks for acute stroke care.6 In August 2021, a pre-hospital stroke notification (PSN) protocol using the FASE protocol (facial drooping, arm weakness, speech difficulties, and eye palsy) was implemented across the Kowloon West Cluster, the largest service cluster in Hong Kong, which serves nearly 2 million residents.7 The inclusion of eye palsy in FASE aims to detect often-missed cases of posterior stroke8 9 and aid the identification of large vessel occlusion (LVO).10 In this study, we aimed to assess key performance indicators (KPIs) and clinical outcomes before and after the implementation of PSN.
 
Methods
Study design
This multicentre prospective cohort study with historical controls involved four Accident and Emergency Departments (AEDs) in the Kowloon West Cluster, namely, Princess Margaret Hospital, North Lantau Hospital, Caritas Medical Centre and Yan Chai Hospital, and their respective neurology divisions. Prior to implementation of the PSN FASE protocol, there was no established emergency medical services (EMS) ambulance protocol for pre-hospital notification of suspected stroke patients. The non-PSN FAST protocol (facial drooping, arm weakness, speech difficulty, and time) was used at the AED to screen suspected stroke patients. In this study, all suspected stroke patients between August 2021 and February 2022 were screened using the PSN FASE protocol and included in the PSN group; similar patients between August 2020 and February 2021 served as historical controls in the non-PSN group. Data were collected from each hospital’s neurology division and clinical data system; accuracy was confirmed by two independent authors. Suspected LVO was defined as the presence of clinical signs and symptoms compatible with internal carotid artery, middle cerebral artery, or basilar artery infarcts, along with radiological evidence from computed tomography (CT) brain scans, as reviewed by a neurologist. Confirmed LVO was defined as the presence of LVO on computed tomography angiography (CTA).
 
Patients
The PSN FASE protocol was implemented during initial contact by EMS personnel during ambulance transfer. This protocol specifies that the patient must be aged ≥18 years and exhibits acute stroke symptoms of facial weakness, unilateral arm and/or leg weakness, speech disturbance, or eye palsy within 4 hours. Protocol exclusion criteria included symptoms with suspected trauma aetiology, Glasgow Coma Scale score ≤8, systolic blood pressure <100 mm Hg, previous medical history of seizure/epilepsy, or long-term chairbound or bedbound status. If a patient meets inclusion criteria with no exclusion criteria, EMS personnel activate the PSN protocol by informing the closest AED to prepare for the incoming stroke patient. In the present study, patients transported with this protocol constituted suspected stroke patients in the PSN group.
 
In contrast, the non-PSN FAST protocol is activated by a physician in the AED. This protocol requires the patient to display acute stroke symptoms of facial asymmetry, limb weakness, or speech disturbance, while meeting all of the following criteria: (1) age ≥18 years; (2) onset of stroke symptoms within 3.5 hours before the request for intravenous recombinant tissue-type plasminogen activator (IV-rtPA) administration; (3) signs and symptoms compatible with acute stroke; and (4) reasonable premorbid functional status (at least not bedbound). Protocol exclusion criteria included active internal bleeding, recent severe head trauma or intracranial/spinal surgery within the preceding 3 months, clinical presentation suggestive of subarachnoid haemorrhage or aortic dissection, acute stroke symptoms in the context of infective endocarditis, intra-axial intracranial neoplasm, coagulopathy (platelet count <100 × 109/L or international normalised ratio >1.7), or ongoing use of anticoagulant medication.
 
FASE protocol of pre-hospital stroke notification
In the PSN FASE protocol, EMS personnel are trained to screen potentially IV-rtPA–eligible stroke patients and to notify the receiving AED about patients with thrombolytic eligibility. An AED physician and a nursing team are prepared for immediate assessment upon patient arrival; an experienced on-duty stroke nurse is notified prior to arrival. The AED physician immediately determines whether the patient should be considered for thrombolytic therapy. If the thrombolytic therapy criteria are met, the patient undergoes a plain CT brain scan and assessment by an on-call neurologist for intravenous thrombolytic therapy. If IV-rtPA treatment is approved by the on-call neurologist, IV-rtPA is administered to the patient; this administration was similar for both historical and prospective groups.
 
Outcomes measurement
The primary outcome in this study was door-to-needle (DTN) time, which the ASA recommends to be within 60 minutes. The secondary outcomes were onset-to-door (OTD) and door-to-CT (DTC) times. The recommended DTC time is within 25 minutes, but no specific recommendation exists for OTD time.11 The National Institutes of Health Stroke Scale (NIHSS) score at 24 hours post-rtPA and modified Rankin Scale (mRS) score at 3 months post-rtPA were also recorded. A good clinical outcome was defined as a reduction of ≥4 in NIHSS score at 24 hours post-rtPA or an mRS score of 0 to 1 at 3 months post-rtPA.
 
Statistical analysis
Baseline characteristics, KPIs, and clinical outcomes were presented as count (%), mean ± standard deviation, or median (interquartile range). The Pearson Chi squared test, Fisher’s exact test, Mann-Whitney U test, and independent t test were used to compare the PSN and non-PSN groups. Further comparisons between the two groups were performed after one-to-one matching based on hospital, sex, age-group (≤80 years and >80 years), and NIHSS score at onset. Sensitivity, specificity, accuracy, positive predictive value (PPV) and negative predictive value (NPV), along with 95% confidence intervals, were computed for the PSN group using the FAS protocol (facial drooping, arm weakness, and speech difficulties) with or without eye palsy, as well as eye palsy alone. The PPVs of the protocols were compared using relative predictive values in a paired study design, as proposed by Moskowitz and Pepe.12 Statistical analyses were performed using SPSS software (Windows version 26.0; IBM Corp, Armonk [NY], United States) and the DTComPair package in R software (version 3.6.1). P values <0.05 were considered statistically significant.
 
Results
In total, 715 suspected stroke patients were included, with 449 in the non-PSN group and 266 in the PSN group. Intravenous recombinant tissue-type plasminogen activator was administered to 134 (29.8%) patients and 58 (21.8%) patients in the non-PSN and PSN groups, respectively (P=0.019) [Table 1]. Among the remaining 208 patients (78.2%) not given IV-rtPA in the PSN group, 43 patients were beyond the IV-rtPA window, and 46 patients had alternative unknown diagnoses at AED attendance. Twenty-one patients had symptoms that resolved or improved by the time of AED attendance (Fig).
 

Table 1. Comparison of baseline characteristics and key performance indicators of suspected stroke patients between pre-hospital stroke notification (PSN) and non-PSN groups
 

Figure. Summary of stroke patients screened using the pre-hospital stroke notification protocol
 
Comparison in all suspected stroke patients
Demographic characteristics were compared between the non-PSN and PSN groups, as shown in Table 1. Age and hyperlipidaemia significantly differed between the two groups. The median ages were 69.7 years in the non-PSN group and 72.4 years in the PSN group (P=0.022). The percentages of patients with hyperlipidaemia were 53.9% in the non-PSN group and 43.9% in the PSN group (P=0.010). Door-to-CT time was significantly shorter in the PSN group than in the non-PSN group (24.5 vs 31 minutes; P<0.001). The percentage of patients achieving the DCT time goal of 25 minutes was greater in the PSN group than in the non-PSN group (50.8% vs 37.7%; P<0.001). However, the median OTD time was longer in the PSN group than in the non-PSN group (97 vs 85.5 minutes; P=0.003).
 
Comparison in patients given intravenous recombinant tissue-type plasminogen activator
Among stroke patients given IV-rtPA, sex, hypertension, and hyperlipidaemia significantly differed between the two groups, as illustrated in Table 2. In the non-PSN group, 53.7% of patients were men, compared with 69.0% in the PSN group (P=0.049). Regarding key risk factors for ischaemic stroke, the respective prevalences of hypertension and hyperlipidaemia were 74.6% and 61.9% in the non-PSN group, whereas they were 53.4% and 44.8% in the PSN group. The NIHSS scores at symptom onset were similar between the non-PSN and PSN groups. The percentages of patients with suspected LVO were also similar between the PSN and non-PSN groups (40.4% vs 37.3%; P=0.759), as were the percentages of patients with CTA-confirmed LVO (52.6% vs 60.0%; P=1.000).
 

Table 2. Comparison of baseline characteristics and key performance indicators of stroke patients given intravenous recombinant tissue-type plasminogen activator between pre-hospital stroke notification (PSN) and non-PSN groups
 
The DTN time was shorter in the PSN group than in the non-PSN group (67 vs 75.5 minutes; P=0.007). Additionally, the percentage of patients achieving the DTN time goal of 60 minutes was greater in the PSN group (37.9% vs 21.6%; P=0.019). However, there were no differences in median DTC time and percentage of patients achieving the DTC time goal of 25 minutes (Table 2). As shown in Table 3, the percentages of patients with good clinical outcomes after IV-rtPA were similar between non-PSN and PSN groups, as indicated by a reduction of ≥4 in NIHSS score at 24 hours (50.8% vs 49.0%; P=0.829) and an mRS score of 0 to 1 at 90 days (43.3% vs 35.4%; P=0.342).
 

Table 3. Comparison of short-term and long-term clinical outcomes of stroke patients given intravenous recombinant tissue-type plasminogen activator between pre-hospital stroke notification (PSN) and non-PSN group
 
Matched comparison of patients given intravenous recombinant tissue-type plasminogen activator
The non-PSN and PSN groups were matched based on hospital, sex, age-group, and NIHSS score at onset. After matching, the percentage of patients achieving the DTC time goal of 25 minutes was greater in the PSN group than in the non-PSN group (64.0% vs 44.0%; P=0.045). The median DTN time was also shorter in the PSN group (65.5 vs 76.5 minutes; P=0.003). Moreover, the percentage of patients achieving the DTN time goal of 60 minutes was greater in the PSN group than in the non-PSN group (42.0% vs 18.0%; P=0.009) [Table 2]. Finally, the percentages of patients with good clinical outcomes after IV-rtPA were similar between non-PSN and PSN groups, as evidenced by a reduction of ≥4 in NIHSS score at 24 hours (47.7% vs 48.9%; P=0.913) and an mRS score of 0 to 1 at 90 days (36.0% vs 33.3%; P=0.789) [Table 3].
 
Predictive value of eye palsy assessment in the pre-hospital stroke notification protocol
Among the 22 patients with eye palsy in the PSN group, 18 patients had either facial drooping, arm weakness or speech difficulties; seven patients were administered IV-rtPA. In the PSN group, the PPVs for using FAS, eye palsy alone, and FAS with eye palsy to identify stroke patients eligible for IV-rtPA were 22.14%, 31.82%, and 38.89%, respectively (Table 4). Compared with the PPV of FAS, the PPV of FAS with eye palsy was significantly higher (P=0.046), whereas the PPV of eye palsy alone did not significantly differ (P=0.223).
 

Table 4. Eye palsy as a predictive factor for intravenous recombinant tissue-type plasminogen activator among suspected stroke patients in pre-hospital stroke notification group only (n=266)
 
Discussion
The AHA and ASA recommend specific time goals for KPIs in stroke patients, such as OTD, DTC, and DTN times. Early recognition of stroke and utilisation of PSN for these patients are emphasised in the recent ASA guidelines as recommendations that can facilitate achievement of these goals. The recent adoption of a PSN protocol by the public hospital system in Hong Kong is intended to improve these KPIs and, ultimately, clinical outcomes among stroke patients.
 
In the present study, the PSN FASE protocol resulted in shorter DTC and DTN times, compared with the non-PSN protocol. A shorter DTN is associated with improved patient outcomes3 4 and enables more patients to receive IV-rtPA within the therapeutic window.13 However, the onset-to-needle time did not differ between the two groups (144.5 vs 159 minutes; P=0.525) [Table 2], which may be explained by the longer OTD time in the PSN group than in the non-PSN group (97 vs 85.5 minutes; P=0.003) [Table 1]. To control for potential confounding factors, we matched the non-PSN and PSN stroke patients based on multiple variables; the results confirmed that DTC and DTN times were shorter in the PSN group.
 
However, these improvements in KPIs did not lead to statistically significant improvements in clinical outcomes, as evidenced by a reduction of ≥4 in NIHSS score at 24 hours post-rtPA (50.8% vs 49.0%; P=0.829) and an mRS score of 0 to 1 at 90 days (43.3% vs 35.4%; P=0.342) [Table 3]. The results of previous studies have suggested favourable mRS score outcomes in 33% to 41% of stroke patients given IV-rtPA12 14; the absence of favourable neurological outcomes in the present study may be attributed to the higher baseline level of neurological improvement in the non-PSN group. Moreover, the relatively small sample sizes in the PSN and non-PSN groups (58 vs 134; P=0.019) [Table 1] may explain the lack of statistically significant clinical benefit in this study; future studies with larger sample sizes may provide further insights. The longer OTD time in the PSN group compared with the non-PSN group suggests that patients in the PSN group were administered IV-rtPA later than patients in the non-PSN group, potentially resulting in worse clinical outcomes. Finally, the lack of statistically significant improvements in clinical outcomes may be explained by the higher NIHSS score at onset in the PSN group (15 vs 11; P=0.573) [Table 2]; regardless of matching to control for potential confounding factors, we did not observe any statistically significant improvement in clinical outcomes.
 
A higher percentage of stroke patients received IV-rtPA in the non-PSN group compared with the PSN group, which may differ from the findings in some recent studies.6 15 This discrepancy may be attributed to the learning curve associated with the new FASE protocol in the PSN group; EMS personnel may have engaged in ‘over-activation’ for borderline suspected stroke patients during early implementation. Additionally, because screening in the PSN group was performed by EMS personnel, it may have been less accurate than screening by physicians (ie, in the non-PSN group). The longer OTD time in the PSN group suggested that patients in the PSN group presented to the AED later than patients in the non-PSN group, increasing the likelihood that they would miss the 4-hour window for IV-rtPA administration.
 
The inclusion of eye palsy in the FASE protocol is intended to identify potential cases of posterior stroke8 9 and aid the identification of LVO.10 Although we found that the FASE protocol had a higher PPV (compared with the FAST protocol) for identifying stroke patients eligible for IV-rtPA, we did not assess whether the FASE protocol reliably identified patients with posterior strokes. Future studies validating the FASE protocol would provide additional insights. Considering the role of conjugate eye deviation in identifying LVO strokes,16 17 research exploring the ability of the FASE protocol to identify these patients would be valuable. Investigations of EMS personnel accuracy in eye palsy recognition may also be useful.
 
Limitations
Possible limitations of this study include the potential for experimenter bias, considering that most investigators were also clinicians involved in patient management. However, it may be difficult to address this bias due to staffing constraints in peripheral acute hospitals, where researchers also serve as clinicians. Furthermore, the findings in this study are consistent with the results of other studies regarding pre-hospital notification protocols for suspected stroke patients.
 
We also included the percentage of stroke patients with CTA-confirmed LVO to provide a more comprehensive analysis, considering that LVO strokes have been linked to worse clinical outcomes compared with non-LVO strokes.18 We observed no statistically significant differences in the percentages of suspected LVO and CTA-confirmed LVO strokes between the two study groups. However, because logistical considerations and resource limitations hindered our ability to perform diagnostic CTA for all patients, the true number of CTA-confirmed LVO strokes may be underestimated. Finally, the relatively small sample size may restrict our capacity to draw definitive conclusions.
 
Conclusion
This study validated the previous finding that a PSN protocol improves multiple stroke KPIs in Hong Kong. It also improves the understanding of whether a PSN protocol directly improves overall clinical outcomes among stroke patients, an area with limited evidence in current literature.19 The implementation of a PSN protocol using the new FASE assessment guideline shortened DTN and DTC times compared with a non-PSN protocol. However, this study did not reveal any statistically significant improvement in overall clinical neurological outcomes between these two protocols. Further research may be warranted to assess whether PSN improves patient outcomes and other acute care parameters.
 
Author contributions
Concept or design: KY Cheng, ELM Yu.
Acquisition of data: KY Cheng, T Yamamoto.
Analysis or interpretation of data: KY Cheng, ELM Yu.
Drafting of the manuscript: KY Cheng, ELM Yu.
Critical revision of the manuscript for important intellectual content: All authors.
 
All authors had full access to the data, contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
All authors have disclosed no conflicts of interest.
 
Acknowledgement
The authors thank Mr Kin-wah Tam from North Lantau Hospital for data retrieval at the Hospital.
 
Funding/support
This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
 
Ethics approval
This research was approved by the Kowloon West Cluster Research Ethics Committee of Hospital Authority, Hong Kong [Ref No.: KW/EX-21-134(163-12)]. A waiver of patient consent was granted by the Committee since the data had been collected prior to this research and the risk of identification is minimal, and no new additional data was required for the research.
 
References
1. Centre for Health Protection, Department of Health, Hong Kong SAR Government. Physical activity: a major strategy for stroke prevention. Non-communicable diseases watch. October 2022. Available from: https://www.chp.gov.hk/files/pdf/ncd_watch_oct_2022.pdf. Accessed 28 Mar 2024.
2. Woo J, Ho SC, Goggins W, Chau PH, Lo SV. Stroke incidence and mortality trends in Hong Kong: implications for public health education efforts and health resource utilisation. Hong Kong Med J 2014;20(3 Suppl 3):S24-9.
3. Saver JL, Fonarow GC, Smith EE, et al. Time to treatment with intravenous tissue plasminogen activator and outcome from acute ischemic stroke. JAMA 2013;309:2480-8. Crossref
4. Man S, Xian Y, Holmes DN, et al. Association between thrombolytic door-to-needle time and 1-year mortality and readmission in patients with acute ischemic stroke. JAMA 2020;323:2170-84. Crossref
5. Powers WJ, Rabinstein AA, Ackerson T, et al. 2018 guidelines for the early management of patients with acute ischemic stroke: a guideline for healthcare professionals from the American Heart Association/American Stroke Association. Stroke 2018;49:e46-110. Crossref
6. Leung WC, Teo KC, Kwok WM, et al. Pre-hospital stroke screening and notification of patients with reperfusion-eligible acute ischaemic stroke using modified Face Arm Speech Time test. Hong Kong Med J 2020;26:479-85. Crossref
8. Kleindorfer DO, Miller R, Moomaw CJ, et al. Designing a message for public education regarding stroke: does FAST capture enough stroke? Stroke 2017;38:2864-8. Crossref
9. Aroor S, Singh R, Goldstein LB. BE-FAST (Balance, Eyes, Face, Arm, Speech, Time): reducing the proportion of strokes missed using the FAST mnemonic. Stroke 2017;48:479-81. Crossref
10. Beume L, Hieber M, Kaller CP, et al. Large vessel occlusion in acute stroke. Stroke 2018;49:2323-9. Crossref
11. Matsuo R, Yamaguchi Y, Matsushita T, et al. Association between onset-to-door time and clinical outcomes after ischemic stroke. Stroke 2017;48:3049-56. Crossref
12. Moskowitz CS, Pepe MS. Comparing the predictive values of diagnostic tests: sample size and analysis for paired study designs. Clin Trials 2006;3:272-9. Crossref
13. Wardlaw JM, Murray V, Berge E, et al. Recombinant tissue plasminogen activator for acute ischaemic stroke: an updated systematic review and meta-analysis. Lancet 2012;379:2364-72. Crossref
14. Emberson J, Lees KR, Lyden P, et al. Effect of treatment delay, age, and stroke severity on the effects of intravenous thrombolysis with alteplase for acute ischaemic stroke: a meta-analysis of individual patient data from randomised trials. Lancet 2014;384:1929-35. Crossref
15. Hsieh MJ, Tang SC, Chiang WC, et al. Effect of prehospital notification on acute stroke care: a multicenter study. Scand J Trauma Resusc Emerg Med 2016;24:57. Crossref
16. Ollikainen JP, Janhunen HV, Tynkkynen JA, et al. The Finnish Prehospital Stroke Scale detects thrombectomy and thrombolysis candidates—a propensity score–matched study. J Stroke Cerebrovasc Dis 2018;27:771-7. Crossref
17. Keenan KJ, Kircher C, McMullan JT. Prehospital prediction of large vessel occlusion in suspected stroke patients. Curr Atheroscler Rep 2018;20:34. Crossref
18. Malhotra K, Gornbein J, Saver JL. Ischemic strokes due to large-vessel occlusions contribute disproportionately to stroke-related dependence and death: a review. Front Neurol 2017;8:651. Crossref
19. Sangari A, Akhoundzadeh K, Vahedian M, Sharifipour E. Effect of pre-hospital notification on delays and neurological outcomes in acute ischemic stroke. Australas Emerg Care 2022;25:172-5. Crossref

Prediction of hospital mortality among critically ill patients in a single centre in Asia: comparison of artificial neural networks and logistic regression–based model

Hong Kong Med J 2024 Apr;30(2):130-8 | Epub 28 Mar 2024
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Prediction of hospital mortality among critically ill patients in a single centre in Asia: comparison of artificial neural networks and logistic regression–based model
Swan Lau, BSc, MB, BS1; HP Shum, MD, FRCP2; Carol CY Chan, FHKCA, FHKAM (Anaesthesiology)2; MY Man, MRCP (UK), FHKAM (Medicine)2; KB Tang, FHKCA, FHKAM (Anaesthesiology)2; Kenny KC Chan, MStat, FHKAM (Anaesthesiology)3; Anne KH Leung, FHKCA (IC), FCICM4; WW Yan, FRCP, FHKAM (Medicine)2
1 Department of Anaesthesia, Pain and Perioperative Medicine, Queen Mary Hospital, Hong Kong SAR, China
2 Department of Intensive Care, Pamela Youde Nethersole Eastern Hospital, Hong Kong SAR, China
3 Department of Anaesthesia and Intensive Care, Tuen Mun Hospital, Hong Kong SAR, China
4 Department of Intensive Care, Queen Elizabeth Hospital, Hong Kong SAR, China
 
Corresponding author: Dr Swan Lau (ls037@ha.org.hk)
 
 Full paper in PDF
 
Abstract
Introduction: This study compared the performance of the artificial neural network (ANN) model with the Acute Physiologic and Chronic Health Evaluation (APACHE) II and IV models for predicting hospital mortality among critically ill patients in Hong Kong.
 
Methods: This retrospective analysis included all patients admitted to the intensive care unit of Pamela Youde Nethersole Eastern Hospital from January 2010 to December 2019. The ANN model was constructed using parameters identical to the APACHE IV model. Discrimination performance was assessed using area under the receiver operating characteristic curve (AUROC); calibration performance was evaluated using the Brier score and Hosmer–Lemeshow statistic.
 
Results: In total, 14 503 patients were included, with 10% in the validation set and 90% in the ANN model development set. The ANN model (AUROC=0.88, 95% confidence interval [CI]=0.86-0.90, Brier score=0.10; P in Hosmer–Lemeshow test=0.37) outperformed the APACHE II model (AUROC=0.85, 95% CI=0.80-0.85, Brier score=0.14; P<0.001 for both comparisons of AUROCs and Brier scores) but showed performance similar to the APACHE IV model (AUROC=0.87, 95% CI=0.85-0.89, Brier score=0.11; P=0.34 for comparison of AUROCs, and P=0.05 for comparison of Brier scores). The ANN model demonstrated better calibration than the APACHE II and APACHE IV models.
 
Conclusion: Our ANN model outperformed the APACHE II model but was similar to the APACHE IV model in terms of predicting hospital mortality in Hong Kong. Artificial neural networks are valuable tools that can enhance real-time prognostic prediction.
 
 
New knowledge added by this study
  • An artificial neural network model outperformed the Acute Physiologic and Chronic Health Evaluation (APACHE) II model but was similar to the APACHE IV model in terms of predicting hospital mortality.
  • The three most important predictor variables were the highest sodium level, highest bilirubin level, and lowest white cell count within 24 hours of intensive care unit admission.
  • External validation studies using data from other hospitals are recommended to confirm these findings.
Implications for clinical practice or policy
  • Prediction of mortality among critically patients is challenging.
  • Artificial neural networks, along with other machine learning techniques, are valuable tools that can enhance real-time prognostic prediction.
 
 
Introduction
Intensive care treatments are primarily intended to improve patient outcomes. Considering the high operating costs of intensive care units (ICUs), a reliable, decision-supporting, risk stratification system is needed to predict patient outcomes and facilitate cost-effective use of ICU beds. Several disease severity scoring systems, such as the Acute Physiology and Chronic Health Evaluation (APACHE) system and the Simplified Acute Physiology Score system, are currently used to objectively assess outcomes and recovery potential in this complex and diverse group of patients.1 2
 
The APACHE system, one of the most commonly used benchmark severity scoring systems worldwide, can measure disease severity and predict hospital mortality among ICU patients. In the 40 years since its initial development, the APACHE system has undergone multiple revisions to improve statistical power and discrimination performance by modifying the numbers and weights of included variables.3 4 5 6 The underlying statistical principle is multivariable logistic regression based on data from an American population. The results are easy to interpret and allow robust outcome prediction for individuals with characteristics similar to the original population. However, the APACHE system has limited capacity to manage non-linear relationships between predictor and outcome variables, interactions between variables, and missing data. Although the value of the APACHE system for mortality prediction has been established, especially in Western countries, its discrimination performance and calibration are inconsistent when applied outside of the US.7 8 9 10 Since 2008, the Hospital Authority in Hong Kong has utilised the APACHE IV model to assess outcomes in critically ill patients. Nevertheless, the APACHE II model remains the most extensively validated version; it is widely used for research and reference purposes.11
 
In the early 1990s, artificial neural networks (ANNs), a type of machine learning algorithm, were proposed as alternative statistical techniques to logistic regression–based method. Similar to the organisation and data processing configurations in human brains, these networks consist of input and output layers with at least one or more intermediate (hidden) layers for pattern recognition. Each layer contains several ‘artificial neurons’, known as nodes, for data extraction; these nodes are connected with each other through variable ‘weights’.12 Artificial neural networks identify representative patterns from input data and observed output data within a training set, then fine-tune the variable weights; thus, they can predict outcomes when provided novel information. This method has considerable advantages in terms of managing non-linear relationships and multivariable interactions.13
 
A review of 28 studies comparing ANN and regression-based models showed that ANN outperformed regression-based models in 10 studies (36%), was outperformed by regression-based models in four studies (14%), and had similar performance in the remaining 14 studies (50%).14 Multiple recent studies also demonstrated that the integration of machine learning with electronic health records provided more accurate and reliable predictive performance compared with conventional prognostic models.15 16
 
This study was conducted to compare ANN performance with the performances of extensively validated and benchmark scoring systems—APACHE II and APACHE IV—in terms of predicting hospital mortality among critically ill patients in Hong Kong.
 
Methods
This retrospective analysis included all patients aged ≥18 years with first-time admissions to the ICU of Pamela Youde Nethersole Eastern Hospital between 1 January 2010 and 31 December 2019. The hospital is a 2000-bed tertiary care regional hospital that provides comprehensive services except for cardiothoracic surgery, transplant surgery, and burn management. The ICU is a 24-bed, closed, mixed medical-surgical unit with an average of 1600 patients admitted annually.
 
Demographic characteristics and hospital mortality data were retrospectively recorded. The worst value of each physiological parameter during the first 24 hours after ICU admission was used to generate an APACHE score. The predicted mortality risk was calculated based on published methods.3 5 Included parameters were age, sex, systolic and diastolic blood pressures, temperature, heart rate, respiratory rate, glucose level, blood urea nitrogen level, serum sodium level, creatinine level, haematocrit level, white cell count, albumin level, bilirubin level, pH, fraction of inspired oxygen, partial pressures of carbon dioxide and oxygen, bicarbonate, and urine output during the first 24 hours after ICU admission. For patients who had multiple ICU admissions during a single hospital stay, only the first admission was included. Patients were excluded if they died or were discharged from the ICU within 4 hours after admission.
 
Instances of incomplete data were resolved by multiple imputation using the Markov chain Monte Carlo algorithm (ie, fully conditional specification). This method fits a univariate (single dependent variable) model using all other available variables in the model as predictors, then imputes missing values for the dependent variable. The method continues until the maximum number of iterations is reached; the resulting imputed values are saved to the imputed dataset.
 
Neural network models were constructed with SPSS software (Windows version 25.0; IBM Corp, Armonk [NY], US) using the same parameters as in the APACHE IV model (online supplementary Fig); SPSS software was also used to examine model precision. The multilayer perceptron procedure, a class of feed-forward learning model, consists of ≥3 layers of nodes: input, hidden, and output.17 Automatic architecture building, which computes the best number of units in a hidden layer, was performed with SPSS software. Each hidden unit is an activation function of the weighted sum of the inputs; the values of the weights are determined by an estimation algorithm. In this study, the hidden layer consisted of 12 units (nodes). A hyperbolic tangent activation function was also employed for the hidden layers. Softmax activation and cross-entropy error functions were used for the output layer. The multilayer perceptron procedure utilised a backpropagation technique for supervised training. Learning occurred in the recognition phase for each piece of data via changes to connection weights based on the amount of error in the output compared with the expected result (gradient descent method).18
 
The training process was terminated when no further decreases in calculated error were observed. Subsequently, network weights were identified and used to compute test values. The importance of an independent variable was regarded as a measure of the extent to which network model–predicted values differed from observed values. Normalised importance, expressed as a percentage, constituted the ratio between the importance of each predictor variable and the largest importance value. Model stability was assessed by tenfold cross-validation. Oversampling of minority classes was performed via duplication to manage imbalances in outcome data.
 
Categorical and continuous variables were expressed as numbers (percentages) and medians (interquartile ranges). The Chi squared test or Fisher’s exact test was used for comparisons of categorical data; the Mann-Whitney U test was used for comparisons of continuous data. The performances of ANN, APACHE II, and APACHE IV models were evaluated in terms of discrimination and calibration power. Discrimination, which constitutes the ability of a predictive model to separate data into classes (eg, death or survival), was evaluated using the area under the receiver operating characteristic curve (AUROC). The AUROCs of the models were compared using the DeLong test. Calibration, which represents the closeness of model probability to the underlying probability of the study population, was evaluated using the Brier score, Hosmer–Lemeshow statistic, and calibration curves.19 All P values were two-sided, and values < 0.05 were considered statistically significant. All analyses were performed with SPSS software and MedCalc statistical software (version 19.6.1).
 
Results
In total, 14 503 patients were included. The demographic characteristics and hospital mortality data of the study cohort were shown in Table 1, while the physiological and laboratory parameters required to generate an APACHE score were presented in Table 2. Among the recruited patients, 4.93% had at least one missing data point, and the overall rate of missing data was 0.48%. Furthermore, 1400 (9.7%) of the recruited patients were randomly assigned to the validation set; the remaining patients (n=13 103, 90.3%) were assigned to the model development set. With respect to the ANN model, 70% and 30% of the development set were used for training and testing purposes, respectively. The median age was 67 years (interquartile range [IQR]=54-78), median APACHE II score was 18 (IQR=13-25), and median APACHE IV score was 66 (IQR=46-91). The overall hospital and ICU mortality rates were 19.3% (n=2799) and 9.6% (n=1392), respectively.
 

Table 1. Patient characteristics and outcome parameters
 

Table 2. Physiological and laboratory parameters during the first 24 hours after admission to the intensive care unit
 
The baseline co-morbidities, source of admission, disease category, APACHE II score, and APACHE IV score were similar in the test and validation sets (Table 1). More patients in the validation set received continuous renal replacement therapy (18.3% vs 16.1%; P=0.04). Concerning the worst physiological and laboratory parameters within the first 24 hours (Table 2), there were almost no significant differences between the development and validation sets; notably, the haemoglobin level was lower in the validation set (11.3 g/dL vs 11.5 g/dL; P=0.02).
 
In the development set, the ANN model (AUROC=0.89, 95% confidence interval [CI]=0.88-0.92, Brier score=0.10; P in Hosmer–Lemeshow test=0.34) outperformed the APACHE II model (AUROC=0.80, 95% CI=0.79-0.81, Brier score=0.15; P<0.001) and APACHE IV model (AUROC=0.84, 95% CI=0.83-0.85, Brier score=0.12; P<0.001) for prediction of hospital mortality. The cross-validation accuracy ranged from 0.98 to 1 (mean=0.99), indicating that our ANN model had good stability. There was no statistically significant difference between our ANN model and an ANN model created by oversampling of minority classes (AUROC=0.89, 95% CI=0.89-0.90; P=0.103).
 
In the validation set, the ANN model (AUROC=0.88, 95% CI=0.86-0.90, Brier score=0.10, P in Hosmer–Lemeshow test=0.37) was superior to the APACHE II model (AUROC=0.85, 95% CI=0.80-0.85, Brier score=0.14; P<0.001 for both comparisons of AUROCs and Brier scores) but similar to the APACHE IV model (AUROC=0.87, 95% CI=0.85-0.89, Brier score=0.11; P=0.34 for comparison of AUROCs, and P=0.05 for comparison of Brier scores) [Fig 1].
 

Figure 1. Receiver operating characteristic curves for different models (validation set)
 
The calibration curve for the validation set showed that the ANN model (Fig 2a) outperformed the APACHE IV model (Fig 2b) and the APACHE II model (Fig 2c).
 

Figure 2. Calibration curves for different models (validation set). (a) Artificial neural network model. (b) Acute Physiology and Chronic Health Evaluation (APACHE) IV model. (c) APACHE II model
 
The importances of the predictor variables in predictions of hospital mortality using the ANN model were evaluated. Within 24 hours of ICU admission, the highest sodium level was the most important variable, followed by the highest bilirubin level and the lowest white cell count. Details regarding the normalised importance of each covariate are presented in online supplementary Tables 1 and 2.
 
Discussion
To our knowledge, this is the first study in Asia to assess the performance of ANN and compare it with the performances of two extensively validated and benchmark scoring systems—APACHE II and APACHE IV—in terms of predicting hospital mortality among critically ill patients. We found that the ANN model provided better discrimination and calibration compared with the APACHE II model. However, the difference between the ANN and APACHE IV models was less prominent. Calibration was slightly better with the ANN model, but discrimination was similar between the ANN and APACHE IV models.
 
Conventional logistic regression–based APACHE systems often lose calibration over time and require regular updates to maintain performance.6 11 20 21 22 The original APACHE II model was developed over 30 years ago using data from 13 different hospitals in the US; it was validated in the country before clinical application.2 Studies in Hong Kong7 and Singapore23 have shown that the APACHE II model has good discrimination but poor calibration for ICU patients in Asia. Calibration remained suboptimal regardless of customisation as demonstrated by Lew et al,23 indicating the need for a new prognostic prediction model. Wong and Young24 showed that the APACHE II model had equivalent performance status compared with an ANN model that had been trained and validated using the original APACHE II data. In a medicalneurological ICU in India, an ANN model trained on an Indian population (with or without redundant variables) demonstrated better calibration compared with the APACHE II model.25 The authors speculated that this finding was partly related to differences in standards of care and resources between American and Indian ICUs.25 Overall, differences in case mix, advances in medical technology, and the use of more recent data may explain the superiority of our ANN model compared with the APACHE II model.
 
Compared with ICU patients in the US, it is fivefold more common for Hong Kong ICU patients to begin renal replacement therapy.26 More than 50% of critically ill patients in Hong Kong require mechanical ventilation, compared with 28% in the US.26 27 A recent population-based study of all patients admitted to adult ICUs in Hong Kong between 2008 and 2018 showed that the APACHE IV standardised mortality ratio decreased from 0.81 to 0.65 during the study period, implying a gradual decline in the performance of the APACHE IV model.26 This model, which was established using data derived from >100 000 ICU patients in 45 US hospitals between 2002 and 2003,5 also tends to overestimate hospital mortality among ICU patients in Hong Kong. In contrast to our study population, where Asian ethnicities were most common, 70% of the patients in APACHE IV reference population were Caucasian.5 The subtle differences in performance between our ANN model and the APACHE IV model could be related to differences in timing during the development of the models. Nevertheless, our ANN model trained on a Hong Kong population was better calibrated for prediction in such a population, compared with the APACHE IV model. This improved calibration could be related to differences in target population (Asian vs Caucasian), epidemiology, and disease profile.
 
The selection of appropriate variables is a key aspect of model development. The inclusion of additional predictor variables does not necessarily improve a model’s overall performance. Redundant variables may result in overfitting and produce a complicated predictive model without additional benefits. A recently published large national cohort study from Sweden showed that a simplified ANN model with eight parameters outperformed the Simplified Acute Physiology Score III model in terms of discrimination and calibration.28 Among the eight parameters, age and leukocyte count were the most and least important variables, respectively. Notably, leukocyte count was the most important variable in terms of predicting mortality among patients on continuous renal replacement therapy.29 Similar to the present study, Kang et al29 found that age was the 12th most important variable. The overall performance of an ANN model trained with APACHE II parameters in an Indian population could be maintained with the 15 highest information gain variables, including serum sodium level and leukocyte count.25
 
Among the 53 parameters in our ANN model, the highest sodium level, highest bilirubin level, and lowest white cell count within 24 hours of ICU admission were the top three most important predictor variables (online supplementary Table 1). The association between acquired hypernatraemia and increased hospital mortality among critically patients has consistently been demonstrated in multiple studies.30 31 Hyperbilirubinaemia, another complication in patients with sepsis, was associated with the onset of acute respiratory distress syndrome.32 Sepsis and gastrointestinal/hepatobiliary diseases caused ICU admission in approximately 40% of our patients, possibly explaining the importance of hyperbilirubinaemia in our ANN model. Although the importance of leukocyte count has been demonstrated in other mortality prediction models, the previous models did not specify whether the count was high or low.25 28 29 In the present study, the lowest white cell count was more important than the highest white cell count. Another intriguing observation was that age constituted the 11th most important predictor in our ANN model (online supplementary Table 1). Age is a predictor of survival in many prognostic models.3 5 28 Increasing biological age is often associated with multiple co-morbidities and a progressive decline in physiological reserve, leading to increased mortality. However, a recently published systematic review of 129 studies showed large variations in ICU and hospital mortality rates among older ICU patients, ranging from 1% to 51% in single-centre retrospective studies and 6% to 28% in multicentre retrospective studies.33 These results could be related to differences in admission policies, premorbid functional status, and the intensity of provided to older critically ill patients.
 
Our ANN model was trained and internally validated on a large number of representative data samples that included most patients admitted to a tertiary ICU in Hong Kong over the past decade. This approach addressed the small sample size limitation that was common in previous studies.24 25 34 All data were automatically collected by a computer system, eliminating the risk of human error during data extraction. Healthcare system digitalisation and advances in information technology have enabled effortless generation of abundant clinical data (eg, physiological parameters, laboratory results, and radiological findings), which can facilitate data collection and development of a new risk prediction model via machine learning.35 36 We hope that generalisability to other ICUs in Asia can be achieved through external validation studies.
 
Limitations
This study had some limitations. Although the sample size was large, all data were collected from a single centre; in contrast, data for the APACHE scoring system were derived from multiple large centres. Because the primary objective of the present study was comparison of performance between our ANN model and the APACHE II and APACHE IV models using identical parameters, we did not attempt to determine the optimal subset of parameters that would maintain high ANN performance.25 28 Furthermore, our ANN model may not be applicable to other centres with different case mixes and medical approaches. The lack of external validation may lead to concerns about overfitting, which is a common challenge in ANN model development. Because mortality prediction among ICU patients is a dynamic process, other limitations include the use of static data and the lack of a fixed time point for mortality assessment.
 
Conclusion
Mortality prediction among critically patients is a challenging endeavour. Our ANN model, which was trained with representative data from a Hong Kong population, outperformed the internationally validated APACHE II model with respect to critically ill patients in Hong Kong. In contrast to the APACHE IV model, our ANN model demonstrated better calibration but similar discrimination performance. External validation studies using data from other hospitals are recommended to confirm our findings. Future studies should explore the feasibility of reducing the number of variables while preserving the discrimination and calibration power of the ANN model. The widespread use of computerised information systems, rather than paper records, in ICU and general ward settings has led to increased data availability. Artificial neural networks, along with other machine learning techniques, are valuable tools that can enhance real-time prognostic prediction.
 
Author contributions
Concept or design: S Lau, HP Shum, CCY Chan.
Acquisition of data: S Lau, HP Shum, CCY Chan.
Analysis or interpretation of data: S Lau, HP Shum, CCY Chan.
Drafting of the manuscript: S Lau.
Critical revision of the manuscript for important intellectual content: MY Man, KB Tang, KKC Chan, AKH Leung, WW Yan.
 
All authors had full access to the data, contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
All authors have disclosed no conflicts of interests.
 
Declaration
Part of the research was presented at the 34th Annual Congress of the European Society of Intensive Care Medicine (3-6 October 2021, virtual) and the Annual Scientific Meeting 2021 of Hong Kong Society of Critical Care Medicine (12 December 2021, virtual).
 
Funding/support
This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.
 
Ethics approval
The study protocol complies with the Declaration of Helsinki and was approved by the Hong Kong East Cluster Research Ethics Committee of Hospital Authority, Hong Kong (Ref No.: HKECREC-2021-024). The requirement for patient consent was waived by the Committee due to the retrospective nature of the study.
 
Supplementary material
The supplementary material was provided by the authors and some information may not have been peer reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by the Hong Kong Academy of Medicine and the Hong Kong Medical Association. The Hong Kong Academy of Medicine and the Hong Kong Medical Association disclaim all liability and responsibility arising from any reliance placed on the content.
 
References
1. Keegan MT, Gajic O, Afessa B. Severity of illness scoring systems in the intensive care unit. Crit Care Med 2011;39:163-9. Crossref
2. Breslow MJ, Badawi O. Severity scoring in the critically ill: part 1—interpretation and accuracy of outcome prediction scoring systems. Chest 2012;141:245-52. Crossref
3. Knaus WA, Draper EA, Wagner DP, Zimmerman JE. APACHE II: a severity of disease classification system. Crit Care Med 1985;13:818-29. Crossref
4. APACHE III study design: analytic plan for evaluation of severity and outcome [editorial]. Crit Care Med 1989;17:S169-221.
5. Zimmerman JE, Kramer AA, McNair DS, Malila FM, Shaffer VL. Intensive care unit length of stay: benchmarking based on Acute Physiology and Chronic Health Evaluation (APACHE) IV. Crit Care Med 2006;34:2517-29. Crossref
6. Zimmerman JE, Kramer AA. Outcome prediction in critical care: the Acute Physiology and Chronic Health Evaluation models. Curr Opin Crit Care 2008;14:491-7. Crossref
7. Tan IK. APACHE II and SAPS II are poorly calibrated in a Hong Kong intensive care unit. Ann Acad Med Singap 1998;27:318-22.
8. Gupta R, Arora VK. Performance evaluation of APACHE II score for an Indian patient with respiratory problems. Indian J Med Res 2004;119:273-82.
9. Choi JW, Park YS, Lee YS, et al. The ability of the Acute Physiology and Chronic Health Evaluation (APACHE) IV score to predict mortality in a single tertiary hospital. Korean J Crit Care Med 2017;32:275-83. Crossref
10. Ghorbani M, Ghaem H, Rezaianzadeh A, Shayan Z, Zand F, Nikandish R. A study on the efficacy of APACHEIV for predicting mortality and length of stay in an intensive care unit in Iran. F1000Res 2017;6:2032. Crossref
11. Ko M, Shim M, Lee SM, Kim Y, Yoon S. Performance of APACHE IV in medical intensive care unit patients: comparisons with APACHE II, SAPS 3, and MPM0 III. Acute Crit Care 2018;33:216-21. Crossref
12. Xie J, Su B, Li C, et al. A review of modeling methods for predicting in-hospital mortality of patients in intensive care unit. J Emerg Crit Care Med 2017;1:18. Crossref
13. Tu JV. Advantages and disadvantages of using artificial neural networks versus logistic regression for predicting medical outcomes. J Clin Epidemiol 1996;49:1225-31. Crossref
14. Sargent DJ. Comparison of artificial neural networks with other statistical approaches: results from medical data sets. Cancer 2001;91(8 Suppl):1636-42. Crossref
15. Meiring C, Dixit A, Harris S, et al. Optimal intensive care outcome prediction over time using machine learning. PLoS One 2018;13:e0206862. Crossref
16. Rajkomar A, Oren E, Chen K, et al. Scalable and accurate deep learning with electronic health records. NPJ Digit Med 2018;1:18. Crossref
17. Norgaard M, Ravn O, Poulsen NK, Hansen LK. Neural Networks for Modelling and Control of Dynamic Systems: A Practitioner’s Handbook. London: Springer; 2000.
18. Ludermir TB, Yamazaki A, Zanchettin C. An optimization methodology for neural network weights and architectures. IEEE Trans Neural Netw 2006;17:1452-9. Crossref
19. Steyerberg EW, Vickers AJ, Cook NR, et al. Assessing the performance of prediction models: a framework for traditional and novel measures. Epidemiology 2010;21:128-38. Crossref
20. Lam KW, Lai KY. Evaluation of outcome and performance of an intensive care unit in Hong Kong by APACHE IV model: 2007-2014. J Emerg Crit Care Med 2017;1:16. Crossref
21. Paul E, Bailey M, Van Lint A, Pilcher V. Performance of APACHE III over time in Australia and New Zealand: a retrospective cohort study. Anaesth Intensive Care 2012;40:980-94. Crossref
22. Mann SL, Marshall MR, Holt A, Woodford B, Williams AB. Illness severity scoring for intensive care at Middlemore Hospital, New Zealand: past and future. N Z Med J 2010;123:47-65.
23. Lew CC, Wong GJ, Tan CK, Miller M. Performance of the Acute Physiology and Chronic Health Evaluation II (APACHE II) in the prediction of hospital mortality in a mixed ICU in Singapore. Proc Singapore Healthc 2019;28:147-52. Crossref
24. Wong LS, Young JD. A comparison of ICU mortality prediction using the APACHE II scoring system and artificial neural networks. Anaesthesia 1999;54:1048-54. Crossref
25. Nimgaonkar A, Karnad DR, Sudarshan S, Ohno-Machado L, Kohane I. Prediction of mortality in an Indian intensive care unit. Comparison between APACHE II and artificial neural networks. Intensive Care Med 2004;30:248-53. Crossref
26. Ling L, Ho CM, Ng PY, et al. Characteristics and outcomes of patients admitted to adult intensive care units in Hong Kong: a population retrospective cohort study from 2008 to 2018. J Intensive Care 2021;9:2. Crossref
27. Wunsch H, Angus DC, Harrison DA, Linde-Zwirble WT, Rowan KM. Comparison of medical admissions to intensive care units in the United States and United Kingdom. Am J Respir Crit Care Med 2011;183:1666-73. Crossref
28. Holmgren G, Andersson P, Jakobsson A, Frigyesi A. Artificial neural networks improve and simplify intensive care mortality prognostication: a national cohort study of 217,289 first-time intensive care unit admissions. J Intensive Care 2019;7:44. Crossref
29. Kang MW, Kim J, Kim DK, et al. Machine learning algorithm to predict mortality in patients undergoing continuous renal replacement therapy. Crit Care 2020;24:42. Crossref
30. O’Donoghue SD, Dulhunty JM, Bandeshe HK, Senthuran S, Gowardman JR. Acquired hypernatraemia is an independent predictor of mortality in critically ill patients. Anaesthesia 2009;64:514-20. Crossref
31. Olsen MH, Møller M, Romano S, et al. Association between ICU-acquired hypernatremia and in-hospital mortality: data from the medical information mart for intensive care III and the electronic ICU collaborative research database. Crit Care Explor 2020;2:e0304. Crossref
32. Zhai R, Sheu CC, Su L, et al. Serum bilirubin levels on ICU admission are associated with ARDS development and mortality in sepsis. Thorax 2009;64:784-90. Crossref
33. Vallet H, Schwarz GL, Flaatten H, de Lange DW, Guidet B, Dechartres A. Mortality of older patients admitted to an ICU: a systematic review. Crit Care Med 2021;49:324-34. Crossref
34. Clermont G, Angus DC, DiRusso SM, Griffin M, Linde-Zwirble WT. Predicting hospital mortality for patients in the intensive care unit: a comparison of artificial neural networks with logistic regression models. Crit Care Med 2001;29:291-6. Crossref
35. Kim S, Kim W, Park RW. A comparison of intensive care unit mortality prediction models through the use of data mining techniques. Healthc Inform Res 2011;17:232-43. Crossref
36. Bulgarelli L, Deliberato RO, Johnson AE. Prediction on critically ill patients: the role of “big data”. J Crit Care 2020;60:64-8. Crossref

Pages