首页    期刊浏览 2025年02月12日 星期三
登录注册

文章基本信息

  • 标题:A new approach to measuring health system output and productivity.
  • 作者:Castelli, Adriana ; Dawson, Diane ; Gravelle, Hugh
  • 期刊名称:National Institute Economic Review
  • 印刷版ISSN:0027-9501
  • 出版年度:2007
  • 期号:April
  • 语种:English
  • 出版社:National Institute of Economic and Social Research
  • 摘要:Keywords: Health output; productivity; quality adjustments
  • 关键词:Health care industry;Medical care;Medical care quality;Multifactor productivity

A new approach to measuring health system output and productivity.


Castelli, Adriana ; Dawson, Diane ; Gravelle, Hugh 等


This paper considers methods to measure output and productivity in the delivery of health services, with an application to the NHS hospital sector. It first develops a theoretical framework for measuring quality adjusted outputs and then considers how this might be implemented given available data. Measures of input use are discussed and productivity growth estimates are presented for the period 1998/9-2003/4. The paper concludes that available data are unlikely fully to capture quality improvements.

Keywords: Health output; productivity; quality adjustments

JEL classifications: D24; 111; L8

1. Introduction and context

Previous papers in this issue have set out the methodology and results from applying the standard growth accounting approach to measuring productivity. The EU KLEMS database includes estimates of output and productivity growth for the non-market services including public administration, education and health and social services, employing data from the National Accounts in each country. However, output in these services is known to be very poorly measured with little yet by way of international consistency. This is now changing following the guidelines set out in the Eurostat (2001) Handbook on price and volume measures in National Accounts and the publication of the Atkinson Review (Atkinson 2005)--see discussion below.

Measurement of output and productivity in non-market services is not just of interest to those working within the national accounts tradition but is also important for policymakers charged with providing, funding and/or regulating these services. In March 2004 the UK Department of Health (DH) commissioned a research team from the Centre for Health Economics at the University of York and the National Institute of Economic and Social Research to develop new approaches to measuring NHS outputs and productivity. This paper summarises the main findings from that work. The main focus of this paper is to set out the methodology put forward by the research team to measure outputs and inputs and presents a few sample results for the UK NHS. Readers interested in details of the methods, data and calculations should consult the final report to DH, Dawson et al. (2005), and the method applied to selected health treatments in Castelli et al. (2007).

The Eurostat Handbook made important recommendations on measurement of non-market output. For purposes of national accounting the preferred method is to measure outputs (e.g. treatment received by a patient) rather than activities (number of operations or prescriptions), and outputs should be quality adjusted. Eurostat deemed the previously widely employed method of measuring outputs by inputs as unacceptable and as a result EU countries are moving away from this method in large areas of non-market services in the direction of the preferred method. (1) In major respects Atkinson (2005) recommended a methodology for measuring NHS output growth advocated in the preliminary reports of the York/NIESR team as applied to health services, as detailed below.

In this paper we formulate an index of health service productivity that incorporates quality change and calculate the annual value of this index for the English NHS from 1997/8 to 2003/4. The next section describes the approach taken to output measurement and we then describe the data used to populate the output index (section 3) and the results (section 4). The input index, its construction, data and sources are briefly described in section 5, which also includes estimates of NHS hospital sector productivity growth. Concluding comments are offered in section 6.

2. Measuring output

An aggregate indicator of the movement of the output of an industry which produces a range of goods is constructed by measuring the percentage volume change in each output and weighting the resulting percentage changes by the share of the value of each product in the value of total output. Thus, in Laspeyres form, where activities are valued in the base period, the index takes the form:

[I.sup.1] [J.summation over j=1][x.sub.jt+1][v.sub.jt] / [J.summation over j=1][x.sub.jt][v.sub.jt] (1)

where [x.sub.jt] is the volume of output j in period t, and [v.sub.jt] is the marginal social value of output J. For many goods and services, prices are taken to reflect the consumer's marginal willingness to pay for them. So, in the National Accounts, prices are used to value private sector output. But consumers do not face full price for goods and services that are financed by the public sector so values must be imputed in other ways. Eurostat recommends the use of unit costs for this purpose. In this case the output index becomes:

[I.sup.2] [J.summation over j=1][x.sub.jt+1][c.sub.jt] / [J.summation over j=1][x.sub.jt][c.sub.jt] (2)

where [c.sub.jt] is the cost of output j at time t. There are two major deficiencies with this approach. First, it is questionable whether costs are proportional to marginal social values. Proportionality requires the use of marginal costs and that public sector resources are allocated in line with social preferences (i.e. that the system is allocatively efficient). Second, the index takes no account of changes in quality. For example, when the health system adopts more cost effective ways of treating patients, a cost weighted output index may record a fall in output, depending on the relative magnitudes of unit cost reductions and increased treatments.

It is helpful to distinguish between activities (operative procedures, diagnostic tests, outpatient visits, consultations), outputs (courses of treatment which may require a bundle of activities), and outcomes (the characteristics of output which affect utility). The distinction between outputs and outcomes is identical to that between goods and characteristics in consumption technology models (Deaton and Muellbauer, 1980, Ch. 10; Lancaster, 1971) where consumers value goods because of the bundle of characteristics that yield utility.

Measurement in the private sector focuses on outputs rather than the characteristics they produce because of the assumption that the market price of the output measures the consumers' marginal valuation of the bundle of characteristics from consuming the output. In the private sector we do not need to concern ourselves with measuring characteristics or counting activities because they are embodied in the outputs which are produced and the prices at which they are sold. In the public sector there are no prices to reveal patients' marginal valuations of services so we have to find other means of estimating their value and we define the quality of the output as a function of the vector of outcomes it produces. This means that values can be found by combining information on outcomes with that about outputs or activities; see the discussion for the health sector in Triplett (2001) or non-market services in general in O'Mahony and Stevens (2006).

How do we combine information on outcomes with outputs? We can do so in two equivalent ways: we can measure the outputs and attempt to estimate the marginal valuations attached to them, or we can measure the outcomes produced by each unit of output and attempt to estimate marginal valuations of the outcomes. The bundle of outcomes produced by a unit of output is likely to change over time in the health sector because of, among other things, changes in technology or treatment thresholds. In a private market the price of output would change to reflect this. But in the absence of market prices for health sector outputs it is likely to be easier to calculate the change in the marginal value of output by focusing on the change in the vector of outcomes. We show below how the changing mix of outcomes (quality change) may be allowed for in principle. We discuss in section 4 how quality adjustments based on the currently available data can be incorporated into an output index and show the results of applying these methods to calculate experimental quality adjusted indices. We have made suggestions as to how the quality adjustment can be improved by the collection of additional data in the concluding section.

Health services have many characteristics which patients regard as important. These include the impact of treatment on health outcomes, the length of time waited for treatment, the degree of uncertainty attached to the waiting time, distance and travel time to services, the interpersonal skills of GPs, the range of choice and quality of hospital food, the politeness of the practice receptionist, the degree to which patients feel involved in decisions about their treatment, etc. In this paper we consider how to incorporate two important dimensions of quality in a cost-weighted index: the health outcomes arising from treatment and how long patients have to wait before receiving treatment.

The most important characteristic of health care is the contribution it makes to health status. The measure of health outcome should indicate the 'value-added' to health as a result of contact with the health system. Given the objectives of the health system, it is important to measure the health gain accruing to patients as a consequence of treatment. A unit of measurement used in a number of countries is the Quality Adjusted Life Year (QALY). This measures the change in the quality of life and the duration of the benefit and, therefore, allows both for treatments that improve the quality of life without affecting life expectancy and treatments that improve life expectancy. Health gains accruing over time are discounted and appear as the present value of the benefit of treatment. In theory the health outcome consequent on treatment is the difference between QALYs with treatment and without treatment. In figure 1, the area under the 'with treatment' curve less the area below the 'without treatment' curve gives the health benefit of the treatment. In practice, even with routinely collected data on health outcomes, it will not be possible to measure this true treatment effect. First, for ethical reasons, few patients are left without treatment. All we will observe is the difference between health state before treatment and after treatment. Second, even routinely collected outcomes data would not provide a continuous monitoring of post treatment health state. We would only have snapshot estimates at particular points in time.

[FIGURE 1 OMITTED]

Lack of data on without treatment health states may not be a serious deficiency for construction of an output index designed to measure the rate of growth of quality adjusted output. If, over time, improved treatment results in after treatment health state [h.sup.*] in year t+1 being higher than in year t, an index using observational data will record an increase in quality adjusted output. Where the aim is to measure the rate of growth of output, we are interested in whether the rate of growth of [DELTA]h = [h.sup.*] - [h.sup.o] is a reasonable approximation to the rate of growth of the effect of treatment on the discounted sum of QALYs. The important issue is how well the rate of change in measures based on the snapshots [h.sup.*], [h.sup.o] approximates the rate of change in the areas under the two time profiles of health streams with treatment [h.sup.*](s) and without treatment [h.sup.o](s).

Hence, despite the imperfections of the difference between snapshots of before and after treatment health status for calculating the level of productivity, we suggest that rates of change of measures based on snapshot measures of [h.sup.*], [h.sup.o] will improve estimates of health sector output growth compared to estimates where such information is not used.

In practice, including health outcomes in the output index has proved difficult, mainly because of a lack of data. One important outcome is whether patients survive their treatment. Survival rates, though, are only part of the story. In England, fewer than 3 per cent of NHS patients die within 30 days of their hospital treatment (Dawson et al., 2005). Routine data on health outcomes are not collected for the majority of patients treated in most health care systems so there is no information with which to quality-adjust the vast majority of health care activities.

In the absence of data on health effects of treatment we have no alternative to using unit costs to weight outputs. Hence we suggest indices that make use of currently available data to quality adjust cost weighted outputs. Setting health status when dead to zero, the expected increase in discounted QALYs from treatment j at time t is:

[q.sub.jt] = (1 - [m.sub.jt])[q.sup.*.sub.jt] - [q.sup.o.sub.jt] (3)

where [m.sub.jt] is the probability of death within a short period of treatment for condition j, [q.sup.*] are discounted QALYs if treated and [q.sup.o] are discounted QALYs if patients were left untreated. Letting a denote the survival rate (= 1 - m), the ratio of expected discounted QALYs across time period t + 1 and t, on the assumption that [q.sup.*.sub.jt] - [q.sup.*.sub.jt+1] does not change over time is given by:

[q.sub.jt+1] / [q.sub.jt] = [a.sub.jt+1][q.sup.*.sub.jt] - [q.sup.o.sub.jt] / [a.sub.jt][q.sup.*.sub.jt] - [q.sup.o.sub.jt] = [a.sub.jt+1] - [k.sub.jt] / [a.sub.jt] - [k.sub.jt] (4)

where [k.sub.jt] = [q.sup.o.sub.jt] / [q.sup.*.sub.jt]. The only reason why [q.sub.jt] changes over time is that the post-operative survival rate changes. Clearly increases in [a.sub.jt+1], other things equal, lead to a higher quality adjustment factor.

Two variants of equation (4) were used to estimate a quality adjusted output index. The first assumed that the no treatment outcome would have been death so [q.sup.0] and hence k equals 0. In this case, assuming resources are optimally allocated so unit costs are a valid measure of the value of treatment, we can calculate the survival adjusted cost weighted output index:

[I.sub.1] = [[summation].sub.j][x.sub.jt+1]([a.sub.jt+1] / [a.sub.jt])[c.sub.jt] / [summation.sub.j] [x.sub.jt][c.sub.jt] (5)

where x are activities, c are unit costs and a are the survival rates from treatment. In practice there may be conditions whereby the no treatment outcome, [q.sup.0], would have been survival but at a reduced level of health. Suppose the ratio [k.sub.jt] = [q.sup.o.sub.jt] / [q.sup.*.sub.jt]) was constant through time. Then the quality adjusted cost weighted activity index is given by:

[I.sub.2] = [[summation].sub.j][x.sub.jt+1]([a.sub.jt+1] - [k.sub.j] / [a.sub.jt] - [k.sub.j])[c.sub.jt] / [summation.sub.j] [x.sub.jt][c.sub.jt] (6)

which we call the survival adjusted cost weighted output index with health effect.

Dawson et al. (2005) show that the index (5) will underestimate the more general index (6). Recall that these indexes are merely taking account of increases in survival. Suppose we compare two patients, one whose condition is such that he has zero chance of survival if not treated (k = 0) and another who would continue to live without treatment, although at a reduced health level. Suppose there is an increase in the mortality rate from treatment. The loss to the patient who would have died anyway is small relative to the loss experienced by the patient who chooses treatment rather than enjoying her remaining QALYs had she foregone treatment. Conversely, if a new treatment increases the survival rate for both, the risk from undertaking the operation is reduced more for the patient who had some chance of survival. Sample calculations of both the simple survival adjustment and that involving health effects are presented in section 4 below.

An important observation is that unless we are willing to assume that k = 0, we cannot divorce a survival adjustment from the health effect. In addition the assumptions underlying these calculations are unlikely to hold in practice, especially given that post-treatment outcomes do not change over time. Without data on impacts on health status before and after treatment, quality adjustments based on health outcomes are likely to be minimal. The assumption underlying equations (5) and (6) that resources are allocated efficiently is also unlikely to hold in most health systems. If unit costs are not proportional to health gains, then the index is likely to be biased in directions difficult to fathom. For example an increase in survival will have a smaller effect on the index the smaller is the cost weight [c.sub.jt]. As long as unit costs are proportional to health gains this is reasonable. The increase in the health effect from an increase in survival is proportional to [q.sup.*.sub.jt] and the smaller is [c.sub.jt] the smaller is [q.sub.jt] = [a.sub.jt][a.sup.*.sub.jt] - [q.sup.o.sub.jt] and the more likely is [q.sub.jt] to be small. But if this condition does not hold, the fact that survival gains in low cost activities will have smaller effects on the index than survival gains in high cost activities is less appealing if the health gains from those low cost activities are significant.

Instead, if patients' marginal valuations of the gains from treatment were available, one could use a value weighted activity index--see examples of the application of this alternative in Castelli et al. (2007).

The index in equation (6) assumes that there is no change in life expectancy over time. We can argue that, for some treatments, changes in life expectancy after treatment and without treatment are primarily due to factors outside the control of the health system and so should not affect the calculation of the growth rate of health output from one period to the next. However, the age structure of patients treated may change over time and, if younger patients are treated, they will have longer to enjoy the increased health status post treatment. Therefore we can modify the basic index to take account of the age of treated patients--the modification to the basic equation to account for this is given in the Appendix table.

In conclusion, on health effects, in future it may be possible to estimate [q.sup.*.sub.jt], [q.sup.o.sub.jt], using new data on longer-term survival and on health status from surveys of patients before and after treatment and from the results of evaluations of different types of treatment. Until then the quality adjustments for this characteristic are likely to be a serious underestimate of the true change in output of the health sector.

The second characteristic which the research project focused most attention on was waiting times. Waits for diagnostic tests and treatment may affect individuals in two ways. First, they may dislike waiting per se irrespective of the effect of treatment on the discounted sum of their quality adjusted life year ([q.sub.jt]). Thus waiting time is regarded as a separate characteristic of health care, distinct from its effect on health. Second, longer waits delay and hence reduce the health gain from treatment. Delay may be associated with deterioration in the patient's condition and the pain and distress while waiting for treatment results in a loss of quality adjusted life years for the patients affected. Here, the waiting adjustment is akin to a scaling factor multiplying the health effect.

Two main ways of modelling waiting as a scaling factor were developed. The first is to value treatment at the time the patient is placed on the waiting list; health effects are therefore discounted to this date. The second is to value treatment at the date it is actually received; health effects are discounted to the date of treatment. The first has the property that waiting has a cost, in units of health, which increases with the length of the wait but at a decreasing rate; an extra day after a long wait costs less than an extra delay after a short wait. The second adjustment is more consistent with the timing implicit in equations (5) and (6) above--we measure activity when it takes place, which suggests that we should measure the benefit from treatment at the time it takes place. In this case discounting acts like an interest charge--the longer the wait the greater the charge, so that reducing long waits has a greater impact than reductions in short ones.

Finally we also considered incorporating waiting times as a separate characteristic where results from contingency valuation studies were used as weighting factors. These studies ask patients how much money they would be prepared to forego for both an increase in health level and a reduction in waiting times. Equations incorporating waiting times adjustments are given in the Appendix table.

3. Data for the output index

Activity data

We calculate output growth in the NHS from 1998 to 2004 (Dawson et al., 2005). It has been possible to make the NHS output index more comprehensive over this period, due to better data collection. The number of different activities for which data were available expanded from 1,321 categories in 1998/9 to 2,060 by 2003/4. In table 1, these activities are grouped by sector under broad headings for the hospital and primary & community care sectors.

Given space constraints, this paper concentrates solely on hospital activity--readers are referred to Dawson et al. (2005) for details for other activities. The best data in the UK relate to hospital inpatient activity and detailed electronic information is available for every patient treated in the NHS hospitals (the Hospital Episode Statistics, HES). We define a unit of hospital output as 'continuous inpatient spells of NHS care' (CIPS). These are defined as continuous periods of patient care anywhere within the NHS. An NHS spell might comprise multiple episodes, including the transfer from one hospital to another as well as the move from one consultant to another within a given hospital (Lakhani et al., 2005). CIPS capture the 'completed treatment' to the maximum extent that is currently feasible in the NHS.

Activity is described using 565 Healthcare Resource Groups (HRG), which are the UK equivalent to Diagnostic Resource Groups (DRG). Activity in each HRG is reported separately for elective and non-elective (emergency) treatment. Figure 2 graphs the number of episodes for each year from 1998/9 to 2003/4. It shows little change in electives up to 2001/2 with some growth thereafter. Non-electives show more significant growth, especially in the final year.

[FIGURE 2 OMITTED]

Health outcomes

The health outcomes associated with each treatment are not observed directly, so have to be estimated by piecing together the probability of short-term survival from treatment, post-treatment health status and healthy life expectancy. Thirty day survival rates are derived from HES and show a continuous improvement from 1998/9 to 2003/4 (table 2).

Healthy life expectancy is derived from life tables and the 1996 Health Survey for England (Prescott-Clarke et al., 1998) and estimated according to the average age of patients in each HRG.

Waiting times

A survey of the literature on the cost of waiting (Hurst and Siciliani, 2003) found some evidence on deterioration of health and premature death associated with waiting for cardiology treatment but little for other procedures. The main variant of our suggested scaling waiting time adjustment has a charge for waiting which represents the welfare loss as a result of not having been treated immediately. This is an offset against the benefit of the treatment which applies for the residual life span. Interest is charged on the cost of waiting and the marginal disutility of waiting increases with the length of the wait. We refer to this as 'discounting to date of treatment with charge for waiting'.

We calculate indices using a measure of inpatient waiting time termed the 'certainty equivalent wait'. This reflects the possibility that the disutility of being on a waiting list depends not just on the average wait but also on the risk of a longer than average wait. We measure the certainty equivalent wait as the waiting time for patients at the 80th percentile of the waiting time distribution for each elective HRG. Reductions in these relatively high waiting times deliver benefits to all patients on the waiting list by reducing the uncertainty of a long wait.

Waiting times are measured as the time between being placed on the inpatient waiting list and being admitted. Year-on-year changes in waiting times were volatile over the period, whether measured as the mean days spent waiting or at the 80 per cent percentile of the distribution (table 2).

Cost weights

Since 1998, every NHS provider has been required to provide information on its cost and volume of activity by HRG to the Department of Health. This information is published annually as the National Schedule of Reference Costs. These costs are the basis for setting tariffs for hospital care under 'Payment by Results' (Department of Health, 2006a). The procedure for assigning costs to CIPS is detailed in Dawson et al. (2005). The coverage of the NSRC has increased over time, to include costs of outpatient care, critical care services, accident & emergency, pathology, radiology, chemotherapy, and rehabilitation (Department of Health, 2006b).

4. Results on output growth

We first present calculations of output growth to illustrate the sensitivity of estimates to various quality adjustments to take account of survival, for inpatient activity (table 3). The first quality adjustment includes the survival adjustment alone. The next two include the health effect; assuming a value of k = 0.8 across all treatments except where mortality (m) is high (column 3) and the previous calculation but including the life expectancy adjustment (column 4). The value of k = 0.8 is derived from the sample of procedures where data on before and after treatment outcomes are available (Castelli et al., 2007). Looking at the average across the five years of the study, including a simple survival adjustment increases growth in output by a small amount. Including a health effect increases the adjustment for the reasons set out above. Finally the inclusion of the life expectancy term reduces the quality adjustment reflecting the increasing average age of patients treated in the NHS during this time (Dawson et al., 2005).

We next consider the impact of allowing for changes in waiting times. The result was that none of the waiting time adjustments had any marked effect on growth rates, which is unsurprising given the relatively small changes observed in table 2. We report summary results in table 4 in the form of average annual growth rates over the period 1998/9 to 2003/4. All are based on the same survival adjustment with k = [q.sup.0]/[q.sup.*]=0.8 and the mortality cut off set to m = 0.10. Other uniform survival adjustments made little difference to the effects of the waiting time adjustments.

The first and third rows show that there is little difference in the effect of the forms of the waiting time adjustment. Comparison of the first and second row shows that the choice of waiting time measure has no impact. Since the waiting times and life expectancy factors are non-linear, and there is a variation in waiting times and in ages within an HRG in a given year, it is possible that our use of a single waiting time and life expectancy estimate for each HRG may lead to misleading results. To explore this, we used individual-level data to compute the equivalent of the waiting time adjustment with discounting to date of treatment with a charge for waiting. The results are in the fourth row and again differ little from the same index form using the 80th percentile wait in the first row.

The results show little or no impact from adjusting for changes in waiting times, regardless of the formulae or measures of waiting time employed. The small effects of waiting time adjustments are largely driven by the lack of change in waiting times rather than the methods used. To see this, suppose waiting times for the 80th percentile were reduced by 10 per cent for all HRGs comparing 2003/4 with 2002/3. Then the discount to date of treatment with charge for wait and low discount rates equal to 1.5 per cent would add 0.16 percentage points. With the same discount rates, reducing waits at the 80th percentile by 50 per cent would add 1.12 percentage points to the growth rate in that year. While further significant reductions in waiting time will increase the growth of NHS output, it is important to note that the index measures changes in both health gain through treatment and reduced waiting time. Treatment may generate improved quality of life for ten years while a reduction in waiting from six months to three months will add only a fraction to the overall gain in output.

In addition, the impact of changes in waiting times is dependent on the cost shares used to weight activities. If reductions in waiting time are mainly concentrated in low cost procedures, they will have a lower impact on the aggregate cost weighted output index than if reductions were more uniformly distributed. This appears to be true for the data employed in this study, as illustrated in figure 3 for the final growth period. In this year 63 per cent of the elective HRGs showed a decrease in waiting times, with a smaller 52 per cent showing reduction in 80th percentile waits. Very large reductions in either mean waits or 80th percentile waits appear to be concentrated in low cost procedures.

[FIGURE 3 OMITTED]

In summary, the results above suggest that the impact of incorporating survival adjustments into a cost weighted output index depends on assumptions about the health effect but, even so, the adjustments are quite small. These results are hardly surprising since the majority of health procedures have little impact on survival. The data necessary to incorporate changes over time in health status from treatment are not yet available. Waiting time reductions do not appear to have a significant impact in the time period considered, regardless of which model or measure of waiting times is used. This is due to the small reductions in waiting times over the period and to the limited impact of these reductions on health status.

5. The input index and productivity

To calculate total factor productivity we calculate an index of total input use over the period:

[Z.sub.t] = [summation over i][[bar.[omega]].sub.it][z.sub.it] (7)

where [z.sub.it] is the amount of the i'th input used in producing the activities in period t and [[bar.omega].sub.it] is the social cost per unit of the i'th input.

As in previous papers in this issue, we divide inputs into three broad categories: labour, capital and purchased inputs. These papers emphasised that changes in the quality of factor inputs are important in explaining productivity change. Labour is by far the most important input used in producing health services, accounting for about 75 per cent of total hospital expenditures. Current practice by DH and ONS is to calculate labour input by deflating payments to labour by a wage index. This is an indirect measure of the volume of labour input. It is more usual to estimate labour inputs based on direct measures, such as the number of persons engaged or hours worked. In particular, the OECD productivity manual recommends the use of annual actual hours worked, to allow for time paid including overtime, holidays and sick leave.

In addition, labour input should be adjusted to take account of variations in types of workers employed, in particular the changing use of skilled workers. The growth accounting method employed to adjust for the greater productivity of highly skilled workers mirrors that in previous papers in this issue. This divides labour hours by skill type and then weights the growth in hours of each type by their wage bill shares. This captures the fact that more highly skilled workers get paid more than the unskilled, and, under competitive market conditions, the wage paid reflects the marginal productivity of workers of different types.

Two principal sources of data about NHS labour are available--the NHS Workforce Census, an annual census carried out by the Department of Health, and the Labour Force Survey, a quarterly sample survey carried out by ONS. The two sources show similar but not identical trends through time (Dawson et al., 2005). Data from the NHS Workforce Census was used to obtain a headcount measure of labour input, while data from the LFS was used to incorporate hours worked, to incorporate quality adjustments, and to adjust for agency staff.

To construct a quality adjusted measure of labour input, data on the proportion of workers in each skill group and their wage rates from the LFS were combined with the NHS Census data, which acted as control totals. The data show evidence of upskilling across the NHS workforce, not only among doctors, nurses and other health professionals, but also at the lower end of the skill distribution. There has also been a marked decline in the share of the workforce with no skills.

However, changes in the skill use pattern will only impact if there are also significant differences in wage rates across skill groups. This appears to be the case in the NHS - on average those with higher degrees were earning about four times the average unskilled wages.

Additional refinements to the quality adjusted index were also considered. First, a further disaggregation of doctors was carried out using data from the NHS staff earnings survey--this resulted in growth in quality adjusted doctors about 30 per cent above numbers employed, suggesting the qualification data alone (which resulted in an increase of about 4 per cent) are not sufficient to capture all quality change. However, with doctors representing less than 1.5 per cent of the NHS Trusts wage bill, this leads to only a very small adjustment for the hospital sector.

Secondly, an adjustment for training received by an individual in addition to their certified qualifications was also made. In 2003/4, for example, LFS data showed that just over 50 per cent of staff in the NHS had received some form of job related training. Clearly, the impact of training on workers' productivity will vary considerably depending on the type of training in question. The results of adjusting for job related training were to raise the quality adjusted labour input growth rate on average from 1999/2000 by about 5 per cent, a small but not insignificant impact. Therefore our estimates of quality adjusted labour were scaled up to reflect the impact of on-the-job training.

Intermediate and capital inputs

Intermediate input expenditure data for the hospital sector comes from Trust Financial Returns (TFR) and is deflated by a modified version of the DH Health Services Cost Index (HSCI) to derive a volume measure. Intermediate input was defined as all current non pay expenditure items in the TFR, and hence excluded all purchases of capital equipment and capital maintenance expenditures as these items cannot be allocated to a particular year's output. The share of hospital drugs in intermediate expenditure has been rising rapidly, from about 24 per cent in 1998/9 to 34 per cent in 2003/4. The share of external purchase of health care from non-NHS bodies has also increased through time but remains small at about 6 per cent of total intermediate expenditure in 2003/4.

These numbers for intermediate input were deflated by an aggregate price index, derived as a chain linked index of corresponding HSCI items. This resulted in a very small upward adjustment in the intermediate input deflator compared with one using all items in the HSCI, as the prices of capital items have been growing more slowly than current items and in the case of computers have been falling. Capital input for the hospital sector is measured by depreciation reported in the Trust Financial Returns, deflated by the ONS capital consumption deflator, plus an allowance for depreciation of capital purchases in the current year, deflated by a chain linked deflator for capital items in the HSCI.

It should be noted that the prices and therefore also the quantities of drugs used are measured in a way which almost certainly does not take account of improvements in quality over time. Thus, compared with a quality adjusted index of inputs, we probably understate the growth in inputs used by the NHS and overstate the rise in productivity.

Table 5 shows average period input shares and average growth in the three inputs where labour input is the quality adjusted variant. Labour represents the highest share followed by intermediate inputs. The growth rates of each of the three input categories are very close in this time period.

Productivity

Combining the input shares with growth in real inputs allows the calculation of total input growth. Subtracting this from output growth yields total factor productivity growth rates. In order to calculate hospital level productivity growth rates, it is necessary to incorporate activities other than inpatients into the analysis, to maintain consistency between output and input measures. Thus we added outpatients, accident and emergency and other procedures carried out in hospital to our cost weighted output index; the only quality adjustment for these additional activities was the inclusion of waiting times for outpatients. In addition before-and-after measures of health status were available for a handful of HRGs (Castelli et al., 2007). In order to apply equation (6) to all inpatient activities we have to assume values for k. This is included mainly for illustrative purposes. For elective procedures we assume that the ratio of before-and-after health status was 0.8, as suggested by the mean for the elective procedures for which estimates were available. For non-elective HRGs we assume [h.sup.o.sub.j]/[h.sup.*.sub.j] = 0.4 on the grounds that non-elective patients may have worse health than elective patients if not treated.

The fact that we do not have comprehensive data on before-and-after treatment outcomes means that our uniform adjustment understates health outcomes for some HRGs and overstates health outcomes for others. Most especially, some patients are admitted to hospital when the expectation of death is very high. For HRGs where the mortality rate is high, we make no health effect adjustment and use only the change in survival. Finally we included adjustments for waiting times using 80th percentile waits and by assuming discounting to date of treatment with charge for wait, with discount rates on waits and health equal to 1.5 per cent.

Table 6 shows the TFP growth rates for both the unadjusted and quality adjusted variants of our output measures. In most years this is negative, with a slight tendency to improve over time. The finding that TFP growth is negative is not unusual in the private sector, as can he gleaned from the EU KLEMS database. Negative TFP growth is most likely to occur in service sectors where output is poorly measured and quality adjustment is minimal. When inputs are measured correctly, with adjustments for quality change, then the TFP residual is close to a measure of pure technical change so long as output is also measured correctly. But as emphasised in many parts of this paper, we are only capturing part of the improvement in quality of care via our proposed adjustments for survival, health effects and waiting times. Because of this incomplete adjustment for quality change we expect to underestimate TFP growth. There are also reasons why in the short term at least we might expect negative growth rates. The literature on the impact of information technology on productivity in the private sector points to an important role of organisational changes in facilitating benefits from new technology, with the suggestion that we could observe declining TFP in the short run due to disruption of production processes. There is no doubt that the NHS is undergoing significant organisational change.

Of more consequence for the health sector is the notion that there are diminishing returns as increased activity allows treatment of more complex and hence most costly cases. Some evidence in support of this is provided by the increased average age of patients treated in hospitals, from 48.6 years in 1999/00 to 50 years in 2003/4. In addition there has been some increase in the expenditure shares of HRG categories with the title 'complex elderly' from 3.4 per cent of expenditures to 4.2 per cent over the same period. Changes in the case mix are likely to be larger within than across HRGs but we lack the necessary data to examine this. Data that identified the characteristics of patients would also be useful in identifying the extent to which changes in health sector productivity are affected by diminishing returns.

6. ONS work

The Office for National Statistics has built on the work described here (UKCEMGA, 2006) and has added a number of extra dimensions to the approach. The study rightly explores a number of issues not covered in this paper, such as benefits of improved primary health care; progress obviously needs to be made with such issues. Nevertheless they draw public attention to some issues which we feel deserve comment.

The ONS study focuses on the benefits of statins in reducing coronary heart disease. This brings to the fore a question we allude to only in passing; the benefits from statins obviously lead to an increase in the gross output of the health service. But do they reflect an increase in total factor productivity? In other words, are they an input supplied by the pharmaceutical industry or are the benefits resulting from them entirely a result of the skills of health service staff and nothing to do with the pharmaceutical industry? However this issue is to be resolved, it seems to us unlikely that the answer is that adopted in the ONS study--that the benefits are entirely due to medical staff and nothing to do with the industry which supplies the drugs. Thus we fear that the ONS study overstates the productivity gains arising from this source. To address this issue the same sort of effort needs to be put into measuring the output of the pharmaceutical industry, which has been deployed in measuring production of computers.

Secondly the study includes an element reflecting changes to 'patient experience' based on data collected from patients about how happy they have been with a range of aspects of their treatment, such as politeness of doctors and quality of hospital food. Quantifying and aggregating such data in order to produce a single index raises all sorts of problems. But even if these have been resolved there is a more important one which the ONS study does not address. How much does patient experience matter, relative to what we assume to be the core function of the health service, treating patients successfully? The ONS study assumes that a 1 per cent improvement in patient experience is of equal importance in an overall output index to a 1 per cent point increase in survival rates. It seems to us very unlikely that most patients, when presented with this sort of equivalence, would say that it reflected their views. A weighting of even one to ten comparing 'patient experience' to survival rates might seem to be on the high side for the former. In any case some thought needs to be given to the issue before it is possible to produce a satisfactory composite index.

Finally, the ONS study presents figures taking account of the Atkinson adjustment to allow for the fact that the value of health increases with time and probably broadly in line with real earnings. While we do not have any difficulty with the proposition that the value of health increases with time, we have yet to be convinced of the case for the Atkinson adjustment, noting that it is difficult to relate to an analysis of the link between national accounts, the concept of income and economic welfare (Sefton and Weale, 2006).

7. Conclusion

We have demonstrated how quality change can be accounted for in calculating indices of output and input for the health sector and explored the influence of different assumptions about the form of quality adjustment. These indices have been estimated for the hospital sector of the English NHS using the best available data. As more data become available it will be possible to obtain a clearer impression of the progress that the health sector makes in advancing social welfare.

Dawson et al. (2005) recommend routine collection of data on outcomes before and after treatment. They suggest that the following points need to be addressed in such a data collection exercise. First, since the scale of NHS activity is so broad and the potential volume of patients is so large, sampling seems a more sensible strategy than attempting to measure health effects for all NHS patients in a sector. Second, the timing of before and after health status measurement may depend on the type of activity; clinical advice could be used to determine the appropriate period for follow-up surveys. Third, given the pace of technological change in medicine, a continuous sampling of the NHS patient population will be required to capture trends in the impact of NHS services on patients.

Finally, the research team considered the feasibility of a programme of routine collection of outcomes data. They suggest this could start with a few high volume elective and medical conditions that would permit sampling rather than complete coverage. The data would also be immensely useful for other purposes, including monitoring of Trust performance and improved cost-effectiveness analysis of particular treatments.

ACKNOWLEDGEMENTS

This research was funded by the Department of Health, to whom we owe thanks. The views expressed here are not necessarily those of the Department of Health. This report has benefited greatly from discussion with numerous experts, including the members of the Steering Group Committee, Jack Triplett (consultant to the project team), Sir Tony Atkinson, Barbara Fraumeni, Andrew Jackson, Azim Lakhani, Phillip Lee, Alan Maynard, Alistair McGuire and Alan Williams. A number of individuals and organisations contributed to assemble the data used in this report. We are grateful to Kate Byram, Mike Fleming, Geoff Hardman, James Hemingway, Sue Hennessy, Sue Macran, Paula Monteith, Casey Quinn, Sarah Scobie, Bryn Shorney, Craig Spence, Karen Wagner, Chris Watson, BUPA, the Cardiff Research Consortium, Health Outcomes Group and York Hospitals NHS Trust. Any errors and omissions remain the sole responsibility of the authors.

REFERENCES

Atkinson, T. (2005), 'Measurement of government output and productivity for the national accounts', Atkinson Review: Final Report, HMSO, 31 January.

Castelli, A., Dawson, D., Gravelle, H., and Street, A. (2007), 'Improving the measurement of health system output growth', Health Economics (forthcoming).

Dawson, D., Gravelle, H., O'Mahony, M., Street, A. and Weale, M., Castelli, A., Jacobs, R., Kind, P., Loveridge, P., Martin, S., Stevens, P. and Stokes, L. (2005), 'Developing new approaches to measuring NHS outputs and productivity: final report', York: Centre for Health Economics, Research Paper 6: 2005.

Deaton, A. and Muellbauer, J. (1980), Economics and Consumer Behaviour, Cambridge, Cambridge University Press.

Department of Health (2006a), Implementing Payment by Results: Technical Guidance 2006/07: Executive Summary, London, Department of Health.

--(2006b), Reference Costs 2006 Collection: Costing and Activity Guidance and Requirements, London, Department of Health.

Eurostat (2001), Handbook on Price and Volume Measures in National Accounts, Luxembourg, Office for Official Publications of the European Communities.

Hurst, J. and Siciliani, L. (2003), 'Tackling excessive waiting times for elective surgery: a comparison of policies in twelve OECD countries', OECD Health Working Paper 6, Paris, OECD, available at www.oecd.org/health.

Lakhani, A., Coles, J., Eayres, D., Spence, C. and Rachet, B. (2005), 'Creative use of existing clinical and health outcomes data to assess NHS performance in England: Part I--performance indicators closely linked to clinical care', British Medical Journal, 330, pp.1426-31.

Lancaster, K. (1971), Consumer Demand: A New Approach, New York, Columbia University Press.

O'Mahony, M. and Stevens, P. (2006), 'International comparisons of output and productivity in public services provision: a review', in Boyne, G.A., Meier, K.A., O'Toole, L.J. and Walker, R.M. (eds), Public Service Performance: Perspectives on Measurement and Management, Cambridge, Cambridge University Press.

Prescott-Clarke, P. and Primatesta, P. (1998), Health Survey for England 1996, London, The Stationery Office.

Sefton, J. and Weale, M.R. (2006), 'The concept of income in a general equilibrium', Review of Economic Studies, 73, pp. 219-49.

Triplett, J. E. (2001), 'What's different about health? Human repair and car repair in national accounts and national health accounts', in Cutler, D. and Berndt, E. (eds), Medical Care Output and Productivity, Chicago, Chicago University Press.

UKCEMGA (2006), 'Public Service Productivity: Health', http:// www.statistics.gov.uk/cci/article.asp?ID=1429.

NOTE

(1) The use of quantity indicators is seen as feasible for services provided to individuals, such as education, health and social services, but less likely to be implementable for collective services such as defence--inputs are likely to remain used for measuring output in these services.
Table 1. Components of the NHS output index

 Activity categories Cost share
 2003/4 2001/2

Hospitai sector
Inpatients 1136 33.48
Outpatients 299 10.99
Accident & Emergency 9 2.17
Critical care services 12 3.97

Primary & Community care sector
GP & practice nurse consultations 5 12.44
Dentists 1 4.69
Prescriptions 184 16.48
Mental Health 19 9.56
Other * 99 6.20
Total 2060 100.00

Note: * includes audiological services, renal dialysis, pathology,
radiotherapy, chemotherapy, spinal injuries, rehabilitation, ambulance
journeys, ophthalmic services etc.

Table 2. Change in mortality rates and waiting times

 Waiting time (days)
 30 day
 survival rate (%) mean 80th percentile

1998/99 96.92 88.7 132.2
1999/00 96.94 80.8 117.7
2000/01 97.07 82.3 119.0
2001/02 97.01 85.2 124.4
2002/03 97.14 88.5 128.9
2003/04 97.24 85.9 126.8

Table 3

Laspeyres CWOI index, CIPS, adjusted for
survival, 30 day mortality rates. Average annual growth
rates 1998/9 to 2003/4 (% p.a.)

 Health Health effect
 effect and life
 expectancy

 Unadjusted Simple [q.sup.0]/[q.sup.*]=0.8
 survival if m<0.10,
 [q.sup.0]/q =0 otherwise

1998/9-1999/00 1.87 1.27 0.78 1.12
1999/00-2000/1 0.91 1.16 1.58 1.37
2000/1-2001/2 0.95 0.89 0.91 0.76
2001/2-2002/3 4.44 5.37 6.59 6.31
2002/3-2003/4 5.81 6.37 7.15 7.13

Average all years 2.78 2.99 3.36 3.30

Table 4. Laspeyres CWOI index, CIPS, adjustments for
changes in waiting times. Average annual growth rates
1998/9 to 2003/4 (% p.a.)

 [r.sub.W] =
Form of waiting Measure of [r.sub.L] = [r.sub.W] = 10%,
time adjustment waiting time 1.5% [r.sub.L] = 1.5%

Discount to date
 treated, with
 charge for waiting 80th percentile 3.33 3.34
Discount to date
 treated, with
 charge for waiting mean wait 3.32 3.32
Discount to date
 on list 80th percentile 3.24
Discount to date
 treated, with
 charge for waiting individual data 3.48 3.49
Survival adjustment
 only 3.30

Note: [r.sub.W] = discount rate on waits, [r.sub.L] = discount rate on
life expectancy.

Table 5. Average period input shares and average growth
in inputs, 1998/9-2003/4, hospital sector.

Total NHS Shares Input growth

Labour 0.72 4.34
Intermediate 0.20 4.93
Capital 0.08 4.76

Table 6. Total factor productivity growth, hospitals, per
cent per annum

 Unadjusted Quality adjusted

1998/9-1999/00 -2.82 -3.53
1999/00-2000/1 0.30 0.56
2000/1-2001/2 0.17 0.01
2001/2-2002/3 -2.01 -0.95
2002/3-2003/4 -1.13 -0.39

Average -1.11 -0.87
联系我们|关于我们|网站声明
国家哲学社会科学文献中心版权所有