Reduction of operative mortality after implementation of Surgical Outcomes Monitoring and Improvement Programme by Hong Kong Hospital Authority

Hong Kong Med J 2018 Apr;24(2):137–44 | Epub 6 Apr 2018

DOI: 10.12809/hkmj177118

ORIGINAL ARTICLE

WC Yuen, FHKAM (Surgery)¹; K Wong, MSc²; YS Cheung, FHKAM (Surgery)³; Paul BS Lai, FHKAM (Surgery)³

¹ Department of Surgery, Ruttonjee and Tang Shiu Kin Hospital, Hospital Authority, Hong Kong

² Jockey Club School of Public Health and Primary Care, Faculty of Medicine, The Chinese University of Hong Kong, Shatin, Hong Kong

³ Department of Surgery, Prince of Wales Hospital, The Chinese University of Hong Kong, Shatin, Hong Kong

Corresponding author: Prof Paul BS Lai (paullai@surgery.cuhk.edu.hk)

Full paper in PDF

Abstract

Introduction: Since 2008, the Hong Kong Hospital Authority has implemented a Surgical Outcomes Monitoring and Improvement Programme (SOMIP) at 17 public hospitals with surgical departments. This study aimed to assess the change in operative mortality rate after implementation of SOMIP.

Methods: The SOMIP included all Hospital Authority patients undergoing major/ultra-major procedures in general surgery, urology, plastic surgery, and paediatric surgery. Patients undergoing liver or renal transplantation or who had multiple trauma or massive bowel ischaemia were excluded. In SOMIP, data retrieval from the Hospital Authority patient database was performed by six full-time nurse reviewers following a set of precise data definitions. A total of 230 variables were collected for each patient, on demographics, preoperative and operative variables, laboratory test results, and postoperative complications up to 30 days after surgery. In this study, we used SOMIP cumulative 5-year data to generate risk-adjusted 30-day mortality models by hierarchical logistic regression for both emergency and elective operations. The models expressed overall performance as an annual observed-to-expected mortality ratio.

Results: From 2009/2010 to 2015/2016, the overall crude mortality rate decreased from 10.8% to 5.6% for emergency procedures and from 0.9% to 0.4% for elective procedures. From 2011/2012 to 2015/2016, the risk-adjusted observed-to-expected mortality ratios showed a significant downward trend for both emergency and elective operations: from 1.126 to 0.796 and from 1.150 to 0.859, respectively (Mann-Kendall statistic = –0.8; P<0.05 for both).

Conclusion: The Hospital Authority’s overall crude mortality rates and risk-adjusted observed-to-expected mortality ratios for emergency and elective operations significantly declined after SOMIP was implemented.

New knowledge added by this study

A Surgical Outcomes Monitoring and Improvement Programme allows monitoring of performance and fair comparison of individual Hospital Authority hospitals against the overall Hospital Authority average. It enhances the understanding of surgical performance and helps identify areas for improvement.
The Hospital Authority’s overall crude mortality rates and risk-adjusted observed-to-expected mortality ratios for emergency and elective operations significantly declined after SOMIP was implemented.

Implications for clinical practice or policy

A properly organised, risk-adjusted clinical audit can accurately measure surgical outcomes and provide information for surgeons to deliver quality improvement.

[Abstract in Chinese]

Introduction

Audits of surgical mortality are used worldwide to monitor surgical outcome and achieve quality assurance.1 By measuring and comparing properly collected, risk-adjusted surgical outcome data, quality of surgical care could be enhanced in participating institutions.2 It has been demonstrated in several countries that adoption of a national surgical audit programme can reduce mortality.2 3

The Hong Kong Hospital Authority (HA) was established in 1991. It is a government statutory body responsible for the management of 42 public hospitals and institutions, 47 specialist out-patient clinics, and 73 general out-patient clinics in Hong Kong. Seventeen HA hospitals have surgical departments; all of them provide an elective surgery service and 14 also provide an emergency surgical service. In 2016, over 80% of all hospital admissions in Hong Kong were under the management of the HA.4 Therefore, it is important for the HA to develop tools with which to measure and improve performance. For this purpose, the HA Coordinating Committee of Surgery set up a Central Surgical Audit Unit in 2001. From 2002 to 2007, the unit conducted clinical audits based on retrospective cumulative data to compare the performance of the 17 surgical departments. One to two major or ultra-major operations, such as major hepatectomy, oesophagectomy, and major lung resection, were selected each year for comparison. The risk-adjustment model used was based on the Physiological and Operative Severity Score for Enumeration of Mortality and Morbidity (POSSUM).5 6 Those audits were procedure-based and mainly focused on mortality, and hence a more comprehensive risk-adjusted outcome audit was needed.

In 2008, a new programme—namely, the Surgical Outcomes Monitoring and Improvement Programme (SOMIP)—was launched by the HA with the aim of monitoring and improving surgical quality. The programme was governed by a steering committee comprising surgeons, an anaesthetist, a physician, administrative managers, and statisticians. Risk-adjusted mortality and morbidity rates of elective and emergency major and ultra-major operations at each hospital are measured and reported yearly. An annual forum is held to disseminate the results and allow the sharing of best practices.

This study aimed to assess the changes in overall surgical outcomes for patients after implementation of SOMIP.

Methods

Between July 2008 and June 2016, SOMIP captured data of all elective and emergency major/ultra-major operations (except those in children younger than 1 year) that were performed by general surgery, urology, plastic and reconstructive surgery, and paediatric surgery teams at all 17 HA hospitals. A total of 230 variables were collected from each patient: 10 patient demographic variables, 83 preoperative and operative variables, 31 laboratory test results, and 40 postoperative events and 66 postoperative adverse events in the first 30 days after surgery. Demographic data and laboratory test results were mostly automatically retrieved from various HA clinical information systems. For data that required manual retrieval, six full-time SOMIP nurse reviewers were employed by the HA head office for this purpose. Preoperative and operative variables, as well as postoperative complications occurring up to 30 days after the index operation, were retrieved from patient records by the SOMIP nurse reviewers. Mortality at 30 days, 60 days, and 90 days were also retrieved from the HA electronic database. These data were endorsed and submitted by each surgical department’s surgical supervisor within 60 days of surgery.

Both the manually captured and automatically captured data were entered into a tailor-made SOMIP electronic database. Data variable definitions were listed in the operation manual of the programme. To ensure data validity and consistency, all nurse reviewers completed comprehensive training on data definition and criteria, and regular nurse reviewer meetings were held to clarify any queries. All data were endorsed by the surgical supervisor of the respective surgical department. When necessary, data definitions were modified.

An inter-rater reliability test was performed each May and completed within a month so as to ensure consistency among nurse reviewers as well as data accuracy. Fifty cases were sampled for evaluation using a stratified systematic sampling method. For those selected cases, an independent nurse from the SOMIP team repeated the data collection using a designated data template, without prior knowledge of the information recorded by the original nurse reviewer. Data quality was measured by comparing two sets of data, and inferred by a score defined as the percentage of agreement between nurse reviewer and the SOMIP working team for each data item. The mean score of all data items was used to assess overall performance. The overall result was satisfactory and the mean score of all data items was 99.3% (range of individual item scores, 95.2%-100%).

Among the variables collected, preoperative risk factors including demographic data; general health and lifestyle variables; and major respiratory, cardiovascular, hepatobiliary, renal, vascular, central nervous system, and immune co-morbidities were deemed particularly important. These risk factors were modified from those in the American College of Surgeons National Surgical Quality and Improvement Program (NSQIP) to suit the local context.7 8 9 Operative variables included intra-operative blood loss, American Society of Anesthesiologists physical status classification, procedure complexity score, surgical subspecialty, wound classification, operative magnitude, and operative time. Before analysis and reporting, SOMIP data were cleaned and verified by four surgeons and the SOMIP surgical supervisors of each hospital. Questionable cases were reviewed accordingly. Cases of liver transplantation, renal transplantation, multiple trauma, and major bowel ischaemia (Table 1)10 11 12 13 14 15 16 17 were excluded from the risk-adjusted model analysis.

Table 1. No. of cases excluded and reasons, from 2008/2009 to 2015/201610 11 12 13 14 15 16 17

Comparative risk-adjusted models (different models for emergency operations and elective operations) were generated using hierarchical logistic regression. The 30-day risk-adjusted mortality models expressed hospital performance as expected odds ratios. A risk-adjusted observed-to-expected mortality ratio (O/E ratio) was then calculated for each hospital. The O/E ratio is a quotient between the observed number of deaths and the expected number of deaths; the latter was calculated by a logistic regression method based on significant independent risk factors. Together with the 90% confidence intervals, O/E ratios were depicted by caterpillar plots and boxplots. Hospitals with the lower limit of the 90% confidence interval of O/E ratios greater than 1 were defined as ‘high outliers’; hospitals with the upper limit of the 90% confidence interval of O/E ratios lower than 1 were defined as ‘low outliers’. The risk-adjusted outcome of a ‘high-outlier’ hospital was probably worse than the average outcome, and that of a ‘low-outlier’ hospital was probably better than the average outcome.

In addition to risk-adjusted postoperative mortality, various general medical and surgical complications, as well as specific complications (anastomotic leakage, surgical site infection, acute myocardial infarction, pneumonia), were recorded. The list of complications recorded and the method to derive the SOMIP risk-adjustment model have been described in detail in the annual SOMIP Report.10 11 12 13 14 15 16 17 Different levels of confidence were used for different outcome variables—90% confidence interval for mortality rates, 95% confidence interval for major complications, and 99% confidence interval for morbidity rates. Using SOMIP data together with other useful information extracted from the HA Executive Information System (eg, bed occupancy, nursing manpower, intensive care unit support, and surgeon workload), root-cause analyses were performed using multilevel logistic regression, as described in the annual SOMIP Report.10 11 12 13 14 15 16 17

The discriminative power of the risk-adjusted models was measured by the C-index, area under the receiver-operating characteristic curve (AUC). The closer the C-index is to 1, the better the discriminative power of the model is: a C-index of ≥0.8 indicates excellent discriminative power.18 The calibration accuracy of the models was assessed by the Hosmer-Lemeshow goodness-of-fit test (HL test). The calibration of the model was rejected if P<0.05. The Mann-Kendall non-parametric trend test was used to identify trends (positive or negative) in the annual data series for both crude mortality rates (2009/2010 to 2015/2016) and risk-adjusted O/E ratios (2011/2012 to 2015/2016). A very high positive value of the Mann-Kendall statistic (S) indicated an increasing trend; a very low negative value indicated a decreasing trend. The test statistic Z-score was used as a measure of trend significance.

Results

Descriptive data

Age distribution trends are summarised in Table 2.10 11 12 13 14 15 16 17 From 2008/2009 to 2015/2016, the proportion of people aged 61-70 years increased by 7 percentage points (from 19% to 26%), whereas the proportions of people aged 41-50 years and 71-80 years decreased by 2 and 5 percentage points, respectively.

Table 2. Distribution of patients by age-group, from 2008/2009 to 2015/2016.10 11 12 13 14 15 16 17

Table 3 10 11 12 13 14 15 16 17 shows proportions of patients taking regular medication for diabetes mellitus and hypertension before surgery from 2009/2010 to 2015/2016, as well as those currently smoking (within 1 year) and drinking more than 2 units of alcohol per day in the previous 2 weeks. Over 70% of patients had at least one of these four conditions, whereas about 40% had a history of regular use of hypertension medication before surgery.

Table 3. Distribution of patients by habit before surgery/admission, from 2009/2010 to 2015/201610 11 12 13 14 15 16 17

Annual numbers of elective operations by specialty are summarised in Table 4.10 11 12 13 14 15 16 17 For the 10 listed elective operations, the most frequently performed were in urology, consistently constituting 28% of the caseload from 2008/09 to 2015/16. The least frequently performed procedures were parotid surgery (1%) and paediatric surgery (1%).

Table 4. Elective operations by specialty, from 2008/2009 to 2015/201610 11 12 13 14 15 16 17

Overall crude mortality rates and risk-adjusted observed-to-expected mortality ratios

From July 2008 to June 2016, eight SOMIP reports were published.10 11 12 13 14 15 16 17 They showed that the HA overall crude mortality rate approximately halved over this time. The crude 30-day mortality rate for emergency operations dropped gradually from 10.8% in the year 2009/2010 to 5.6% in 2015/2016 (Fig 1). Similarly, the crude 30-day mortality rate for elective operations more than halved: from 0.9% in 2009/2010 to 0.4% in 2015/2016 (Fig 2).

Figure 1. Crude 30-day mortality rates for emergency operations, 2009/2010 to 2015/2016

Figure 2. Crude 30-day mortality rates in elective operations, 2009/2010 to 2015/2016

In the 5-year cumulative comparison analysis (2011/2012 to 2015/2016), both models had excellent discriminative power and good calibration accuracy. For emergency operations, the AUC was >0.9 and the HL test statistic was >0.1; for elective operations, the AUC was >0.89 and the HL test statistic was >0.2. The risk-adjusted observed-to-expected 30-day mortality ratio for both types of surgery showed a similar downward trend to the crude mortality rates. For emergency operations, the risk-adjusted O/E ratios were 1.126, 1.022, 1.113, 0.921, and 0.796 across the 5 years (Fig 3). These values show a statistically significant downward (negative) trend (S= –0.8; P<0.05). The results of the Mann-Kendall analyses are summarised in Table 4. For elective operations, risk-adjusted O/E ratios were 1.150, 1.229, 0.881, 0.862, and 0.859 across the 5 years (Fig 4). These values show a statistically significant downward (negative) trend (S= –0.8; P<0.05) [Table 5].

Figure 3. Risk-adjusted observed-to-expected 30-day mortality ratio for emergency operations, 2011/2012 to 2015/2016

Figure 4. Observed-to-expected 30-day mortality ratio for elective operations, 2011/2012 to 2015/2016

Table 5. Mann-Kendall non-parametric trend test results

Discussion

Before the turn of the century, most hospital records in HA hospitals were handwritten and retained by individual hospitals. There was no convenient means by which to share patient details among hospitals. Around 2001, the HA implemented a number of clinical management electronic systems, such as the electronic patient record, Operation Theatre Record System, and Clinical Data Analysis and Reporting System, at all HA hospitals. By virtue of this infrastructure, patient records and information about diagnoses and operations could be accessed at a central level. Based on this central clinical database, a Quality Assurance Subcommittee under the Coordinating Committee of Surgery commenced small-scale comparative clinical audits for ultra-major operations in 2002, focusing on one to two ultra-major operations per year. The audits provided basic information about hospital performance for the selected operation, such as number of procedures, age distribution of patients, and mortality rate.

Subsequently, the Coordinating Committee of Surgery developed a more robust system to monitor more major operations at the same time. It was decided to follow the framework of NSQIP, which was developed in 1994 by the Veterans Affairs Hospitals in the United States to monitor risk-adjusted surgical operation outcomes. Like HA hospitals, Veterans Affairs Hospitals are managed by a central governing body and equipped with a comprehensive electronic medical records system. Studies19 20 showed a significant improvement in both mortality and morbidity over time and thus, in 2004, NSQIP was extended to private hospitals and endorsed by the American College of Surgeons. Using NSQIP as the blueprint, the HA launched SOMIP in 2008. The SOMIP adopted similar risk-adjustment variables, use of nurse reviewers to collect data, a focus on hospital performance rather than individual surgeon performance, and similar methods of data analysis and determination of outliers. Moreover, SOMIP allowed individual HA hospitals to benchmark their performance against the overall HA average.

As all surgical patients have a different health status, their operation outcomes will likewise differ. Appropriate adjustment for different patient risks is essential when interpreting hospital mortality rates. To adjust for different risk factors, over 100 patient risk factors were captured for each enrolled patient. For NSQIP, one risk-adjustment mortality model was constructed for all operations. In contrast to NSQIP, separate models were devised for emergency and elective operations in SOMIP.

Hospital outliers can be identified by O/E ratios if the confidence interval of the O/E ratio is greater than 1, meaning that after balancing the different risks of hospital patients, their clinical outcomes are most likely different from the rest. From the results of this study, it was encouraging to find a significant trend of reduction in crude mortality rates and O/E ratios for both elective and emergency operations at HA hospitals over the past 5 years. According to the significant reduction in expected odds ratios over the years, this improvement is genuine and not due to patient selection.

There are several possible reasons behind the changes: public identification as a poor performer is a strong incentive for change in HA hospitals; sharing best practices on perioperative patient care is an important educational activity that takes place annually through the SOMIP Forum; the HA Head Office invests more resources into deficient hospitals; and changing attitudes towards managing surgical complications by other colleagues from the intensive care unit are also helpful. All of these may have contributed to the change.

Limitations

The SOMIP has a number of limitations. The coverage of SOMIP is not as complete as that of NSQIP, since many surgical departments such as orthopaedics and neurosurgery are not included. Monitoring is done by retrospective annual case collection (from 1 July to 30 June) because it takes 13 months to complete case enrolment and an additional 3 months for data verification, model building, and statistical analysis. Because of the small number of events, the current programme is not able to determine the risk-adjusted outcomes of individual operations or surgeons. Furthermore, because this programme relies heavily on the HA central electronic database, it is not easy to extend it to hospitals without this information infrastructure. Although the trend of reduction in mortality was statistically significant, we were not able to demonstrate a causal relationship with SOMIP implementation.

Potential issues with data quality may have affected the outcomes. In the inter-rater reliability test, the nurse reviewers were not blinded and this may have caused information bias. Also, quality of data collection in the initial 2 years may have been unreliable. As a result, the 5-year cumulative comparison analysis for emergency and elective operations commenced from 2011/2012, rather than 2008/2009. Furthermore, data definitions are updated regularly in the operation manual and could have affected the time trend analysis. Nonetheless, the SOMIP team considered changes in data definitions to be minor and did not expect a significant impact on the risk models.

Mortality could be influenced by many factors; ensuring risk adjustments are adequate and appropriate would be a challenge. Disease factors, stage of disease, and treatment options may not be fully taken into account by the risk-adjusted models, and data readiness and availability are further constraints. Surgeon skill and experience was another aspect that could not be accommodated and was difficult to adjust for. In the HA, surgical operations are performed by a team; therefore, it would be difficult to separate individual surgeon experience and credentials from those of the whole team.

Conclusion

From 2008 to 2016, the HA’s overall crude mortality rates and risk-adjusted O/E ratios showed a significant trend of reduction for both emergency and elective operations. The SOMIP enhances understanding of surgical performance and helps identify areas for improvement. It allows individual HA hospitals to benchmark their performance against the overall HA average through risk-adjusted O/E ratios.

Acknowledgements

We thank the SOMIP Steering Committee of the Hospital Authority, the Coordinating Committee of Surgery, and the Biostatistics team of The Chinese University of Hong Kong for their contributions and helpful comments on this manuscript.

Declaration

The authors have no conflicts of interest to disclose.

References

1. Lui CW, Boyle FM, Wysocki AP, et al. How participation in surgical mortality audit impacts surgical practice. BMC Surg 2017;17:42. Crossref

2. Khuri SF, Daley J, Henderson W, et al. Risk adjustment of the postoperative mortality rate for the comparative assessment of the quality of surgical care: results of the National Veterans Affairs surgical risk study. J Am Coll Surg 1997;185:315-27. Crossref

3. Kiermeier A, Babidge WJ, McCulloch GA, Maddern GJ, Watters DA, Aitken RJ. National surgical mortality audit may be associated with reduced mortality after emergency admission. ANZ J Surg 2017;87:830-6. Crossref

4. Hospital Authority Statistical Report (2015-16). Available from: http://www.ha.org.hk/haho/ho/stat/HASR15_16.pdf. Accessed 10 Nov 2017.

5. Copeland G, Jones D, Walters M. POSSUM: A scoring system for surgical audit. Br J Surg 1991;78:355-60. Crossref

6. Copeland G. The POSSUM system of surgical audit. Arch Surg 2002;137:15-9. Crossref

7. Khuri S, Daley J, Henderson W, et al. The National VA Surgical Risk Study: risk adjustment for the comparative assessment of the quality of surgical care. J Am Coll Surg 1995;180:519-31.

8. Daley J, Khuri SF, Henderson W, et al. Risk adjustment of the postoperative morbidity rate for the comparative assessment of the quality of surgical care: results of the National Veterans Affairs surgical risk study. J Am Coll Surg 1997;185:328-40. Crossref

9. Daley J. Validating risk-adjusted surgical outcomes: site visit assessment of process and structure. J Am Coll Surg 1997;185:341-51. Crossref

10. Surgical Outcomes Monitoring & Improvement Program (SOMIP) report. Volume One: July 2008-June 2009. Hospital Authority, Hong Kong SAR Government; 2010.

11. Surgical Outcomes Monitoring & Improvement Program (SOMIP) report. Volume Two: July 2009-June 2010. Hospital Authority, Hong Kong SAR Government; 2011.

12. Surgical Outcomes Monitoring & Improvement Program (SOMIP) report. Volume Three: July 2010-June 2011. Hospital Authority, Hong Kong SAR Government; 2012.

13. Surgical Outcomes Monitoring & Improvement Program (SOMIP) report. Volume Four: July 2011-June 2012. Hospital Authority, Hong Kong SAR Government; 2013.

14. Surgical Outcomes Monitoring & Improvement Program (SOMIP) report. Volume Five: July 2012-June 2013. Hospital Authority, Hong Kong SAR Government; 2014.

15. Surgical Outcomes Monitoring & Improvement Program (SOMIP) report. Volume Six: July 2013-June 2014. Hospital Authority, Hong Kong SAR Government; 2015.

16. Surgical Outcomes Monitoring & Improvement Program (SOMIP) report. Volume Seven: July 2014-June 2015. Hospital Authority, Hong Kong SAR Government; 2016.

17. Surgical Outcomes Monitoring & Improvement Program (SOMIP) report. Volume Eight: July 2015-June 2016. Hospital Authority, Hong Kong SAR Government; 2017.

18. Ash A, Schwartz M. Evaluating the performance of risk-adjustment methods: dichotomous variables. In: Iezzoni L, editor. Risk adjustment for measuring health care outcomes. Ann Arbor, MI: Health Administration Press; 1994: 313-46.

19. Hall BL, Hamilton BH, Richards K, Bilimoria KY, Cohen ME, Ko CY. Does surgical quality improve in the American College of Surgeons National Surgical Quality Improvement Program: An evaluation of all participating hospitals. Ann Surg 2009;250:363-76. Crossref

20. Cohen ME, Liu Y, Ko CY, Hall BL. Improved surgical outcomes for ACS NSQIP hospitals over time: evaluation of hospital cohorts with up to 8 years of participation. Ann Surg 2016;263:267-73. Crossref