 |
 |

Analysis of the Performance Characteristics of the University of Washington Quality of Life Instrument and Its Modification (UW-QOL-R)
Ernest A. Weymuller, Jr, MD;
Ramsey Alsarraf, MD, MPH;
Bevan Yueh, MD, MPH;
Frederic W.-B. Deleyiannis, MD, MPhil, MPH;
Marc D. Coltrera, MD
Arch Otolaryngol Head Neck Surg. 2001;127:489-493.
ABSTRACT
 |  |
Background During a 5-year period, we analyzed 3 patient subsets from the University
of Washington Quality of Life (UW-QOL) Registry and published the results.
In each instance, editorial review has raised legitimate concerns regarding
the UW-QOL instrument that deserve public comment. We present our response
to these criticisms. Since our original publication (1993), we have added
domains to the original UW-QOL instrument. These additions reflected our concern
that we might be missing important elements in the spectrum of disease-specific
response to treatment. Using the data we have accumulated in the last 5 years,
we present an analysis of the internal consistency of the UW-QOL. We have
identified those domains that are responsive (or not responsive) to treatment
effect and have revised the UW-QOL accordingly to create the UW-QOL-R, which
is recommended for future use.
Design The project began January 1, 1993, after approval by the UW Human Subjects
Committee. Critical comments offered by external review were collated and
responded to. Internal consistency was evaluated by interitem correlation
matrix (Cronbach ) testing.
Subjects All new patients presenting to the UW Medical Center (Seattle) with
a diagnosis of head and neck cancer were asked to participate in a prospective
analysis of QOL changes during and after treatment.
Intervention Patients completed the pretreatment QOL questionnaire on the day of
their initial workup. The format for the pretreatment test was an interviewer-supervised
self-administered test; the subsequent tests were self-administered and were
completed at 3, 6, 12, 24, and 36 months. Other data entered for each patient
included site, stage, treatment, histologic classification, reconstruction,
and current status. A QOL registrar was responsible for patient follow-up,
data collection, and collation. All data were entered into the departmental
relational database.
Results Criticisms by external review included the following: "it is improper
to call it [UW-QOL] a measure of quality of life"; "the summary scale is problematic
because it implies that each of the subscales are weighted or valued'
equally"; "some domain questions relate to surgery specific issues . . . while
others are specific to radiation"; "we were confused by the scoring"; and
"the UW-QOL index does not specifically address the psychological impact of
the disease and its treatment." After evaluation of internal consistency,
the UW-QOL was modified by removing 2 domains that correlated poorly with
the others. This resulted in a 10-item instrument (UW-QOL-R) with an overall
internal consistency score of 0.85.
Conclusions The UW-QOL can be effectively and accurately used to compare treatment
effects in the management of head and neck cancer. With this revised instrument,
the 10 items appear to measure the domains of overall QOL in a highly consistent
and reliable fashion over time.
INTRODUCTION
HAVING prospectively administered the University of Washington Quality
of Life instrument (UW-QOL) to more than 500 patients, we took the opportunity
to reevaluate the instrument. During a 5-year period (1993-1998), we analyzed
3 patient subsets and published the results.1-3
In each instance, editorial review has raised legitimate concerns regarding
the instrument that deserve public comment. In this article, we present our
response to these criticisms.
At our institution, we have added 3 domains since the original publication.4 Because our own analysis of the responses in each
domain suggested a domain "cancellation effect," we further assessed the internal
consistency of the UW-QOL and modified the instrument to form the revised
UW-QOL (UW-QOL-R), which we recommend for future use.
PATIENTS, MATERIALS, AND METHODS
PATIENTS
The project began January 1, 1993, after approval by the UW Human Subjects
Committee. All new patients presenting to the UW Medical Center (Seattle)
with a diagnosis of head and neck cancer were asked to participate in a prospective
analysis of QOL changes during and after treatment. Patients completed the
pretreatment QOL information on the day of their initial workup. The format
for the pretreatment test was an interviewer-supervised self-administered
test; the subsequent tests were self-administered and were completed at 3,
6, 12, 24, and 36 months. Other data entered for each patient included site,
stage, treatment, histologic classification, reconstruction, and current status.
A QOL registrar was responsible for patient follow-up, data collection, and
collation. All data were entered into the departmental relational database.
DEFINITION OF TERMS
Composite UW-QOL score is the arithmetic mean
of the 10 individual domain scores (maximum score, 100). Domain score was determined by offering participating patients a set
of options (Likert scale) for each domain. The maximum (best) score is 100,
the minimum is 0. As an example, the domain pain offers the following options:
100, I have no pain; 75, there is mild pain not requiring medication; 50,
I have moderate pain that requires regular medication (codeine or nonnarcotic);
25, I have severe pain controlled only by narcotics; and 0, I have severe
pain not controlled by antibiotics. Beginning in 1997, global score was determined by asking all patients to "consider everything
that contributes to your personal well-beinghow would you rate your
overall quality of life during the past seven days." The possible responses
were excellent, very good, good, fair, poor, or very poor.
ANALYSIS OF INTERNAL CONSISTENCY
Analysis of internal consistency was performed by calculating the interitem
correlation coefficients (Pearson r) for each of
the time points of the 36-month period and the overall collection of data
points. Cronbach coefficients were then calculated using the SPSS
computer software program (SPSS Inc, Chicago, Ill). Least correlated items
were then eliminated from the revised instrument, and these coefficients
were then recalculated for the UW-QOL-R.
RESULTS
In the process of this analysis, 2 of the 12 domains of the UW-QOL were
found to correlate poorly with the other 10 domains. These domains were "dryness"
and "employment." Both of these items had wide variations compared with the
other measured QOL items and were found to show a bimodal rather than more
normal distribution. These items were removed from our original instrument
to form the UW-QOL-R (Figure 1).
Internal consistency improved across each of the follow-up periods, with a
range of 0.78 to 0.87, and an overall internal consistency score for the UW-QOL-R
of 0.85 (Table 1). With this revised
instrument, there is no "cancellation effect," since each of the 10 items
appears to measure the domains of overall QOL in a highly consistent and reliable
fashion over time (Figure 2).
|
|
|
|
Figure 1. Revised University of Washington
Quality of Life instrument and global quality of life questions.
|
|
|
|
|
|
|
University of Washington Quality of Life (UW-QOL) Instrument Internal
Consistency*
|
|
|
|
|
|
|
Figure 2. Internal consistency of the revised
University of Washington Quality of Life instrument (n = 1831). Each line
represents 1 of the 10 quality of life variables included in the revised instrument,
demonstrating graphically a consistent response to quality of life changes
over time. In descending order (along the y-axis) these variables are shoulder
function, speech, swallowing, saliva, chewing, taste, appearance, recreation,
activity, and pain.
|
|
|
COMMENT
RESPONSES TO EDITORIAL CRITICISM
Criticism: "It is improper to call it [UW-QOL]
a measure of quality of life despite its title . . . the authors should consider
the use of the term disease-specific functional status instead of quality
of life."
Response: In his text, Guide
to Clinical Trials, Spilker comments that
there is no ideal test at present to evaluate quality of life.
A test that could become a gold standard would be rapid to complete; be reproducible;
be valid either in a single patient population or across a large number of
diseases; be widely accepted; not require excessive training of staff to administer;
be easy to interpret; [and] yield objective results. . . . The view of some
researchers is that finding a test that meets all of these criteria is as
difficult as finding the Holy Grail. It is an even greater challenge to devise
a test that would be applicable to patients in different national cultures.5(p377)
In his text, Quality of Life Assessments in Clinical
Trials, Spilker says,
The conceptual formulation which has emerged, and which is gaining
acceptance, defines quality of life functionally by patients' perception of
performance in four areas: physical and occupational function, psychological
state, social interaction, and somatic sensation. In this model the patient
serves as his own control, the comparisons being made against expectation
of function.6(p11)
Spilker also notes that
The simplest classification of quality of life tests divides
them into indexes, profiles, and batteries. . . . An index is a test that
yields a single number at its conclusion. It usually evaluates multiple domains
and often tests multiple components of each domain. The test may include measures
of the quantity as well as the quality of life.5(p371)
This description most closely fits the UW-QOL instrument.
Functional disability scales (which are discussed as a subset of QOL
assessment by Spilker) are more strictly used for
the periodic assessment of physical disabilities . . . increasingly,
clinicians and researchers need reliable and validated measures of functional
disability to measure clinical progress, evaluate programs and establish appropriate
eligibility for social and insurance programs.6(p115)
The UW-QOL instrument does not fit these criteria.
The inclusion of a global question in the process of QOL analysis is
an important adjunct. According to Spilker, "Some authors and tests focus
on objective criteria to define and measure quality of life, whereas others
stress the measurement of subjective aspects of this concept. Using both approaches
is best."5(p729) Spilker also says that "Global
quality of life questions do not need to be, and in fact cannot be, validated."5(p374) We believe that the inclusion of a global QOL
question in the UW-QOL meets the criteria identified by Spilker, who confirms
the appropriateness of using a Likert scale to address the issue of global
QOL.
Criticism: "The summary scale in a QOL instrument
is problematic because it implies that each of the subscales are weighted
or valued' equally, which is probably not a good assumption in a multi-scale
instrument."
Response: We believe that the opinion of Spilker
once again holds:
Another limitation of health status questionnaires is the unresolvable
issue of whether each item should be equally weighted. From a clinical perspective,
not all activities of daily living are equal for a patient, and the technology
of deriving weights leaves the clinician dissatisfied."6(p451)
Spilker also asks,
How does one weight the individual domain scores so as to arrive
at a reasonable overall quality of life score? At the present time no studies
resolve the issue. It may be that the relative weightings of quality of life
domain scores are themselves variable over time, and hence not amenable to
fixed weighting . . . many researchers are now more confident in the use of
quality of life subscores as probes (not as diagnostics), and suggest that
an assessment of quality of life may include both an overall score as defined
precisely for the instruments being used and component subscores. From an
analytic point of view, this makes it possible to begin to dissect out component
factors of quality of life and the variable impact treatment may have on each.6(p20)
Our conclusion is that importance weighting may add a level of complexity
that is generally not worth the trouble, since we do not believe that the
process of weighting is well defined or appropriate in this setting. When
specific research inquiry warrants importance weighting, it is reasonable
to ask the patient how he or she would weight the importance of a particular
domain. This technique was used by Deleyiannis et al2
in the analysis of postlaryngectomy QOL.
Criticism: "Some of the domain questions relate
to surgery-specific issues (shoulder function), while others are specific
to radiation effects (saliva, dryness)."
Response: It is our conclusion that since these
issues are germane to the effect of various forms of treatment for head and
neck cancer, and since we now have demonstrated the internal consistency of
the UW-QOL, it can be effectively and accurately used to compare treatment
effects in the management of head and neck cancer. It is important that future
studies that use the UW-QOL consider separate analysis of each domain to appreciate
the differential effects of the treatments under analysis.
Criticism: "We were confused by the fact that
it appears the lowest score for each item is either 20 or 25, and yet the
range is reported as being 0-100."
Response: As a result of the preceding analysis,
the UW-QOL-R will contain 10 domain questions. The maximum total QOL score
will be 1000. Each domain has a range of 0 to 100.
Criticism: "The UW-QOL index does not specifically
address the psychological impact of the disease and its treatment."
Response: As noted by D'Antonio et al,7 there is an inverse relationship between measured
QOL using disease-specific instruments and depression. Although we have considered
including items about the emotional impact of cancer, we believe the brevity
of the UW-QOL is one of its distinct advantages. We recognize that for some
studies the assessment of depression will be appropriate. In these studies,
the need to measure psychological impact may warrant the use of additional,
disease-specific scales on depression or anxiety. A number of such instruments
have been developed, including the Center for Epidemiological Studies Depression
Scale,8 the Beck Depression Inventory,9 and selected portions of the Patient Health Questionnaire.10
INTERNAL CONSISTENCY
We performed this analysis because, over time, we had added domains
to the original UW-QOL. These additions reflected our concern that we might
be missing important elements in the spectrum of disease-specific response
to treatment. In assessing internal consistency, we have identified those
domains that are responsive (or not responsive) to treatment effect.
Internal consistency refers to the reliability of each item or domain
in a given instrument to provide a QOL measurement in a fashion that is similar
to each other item or domain of that same instrument.11
In the process of this analysis, 2 of the 12 domains of this instrument were
found to correlate poorly with the other 10 domains. These domains were "dryness"
and "employment." Both of these items had wide variations compared with the
other measured QOL items and were found to show a bimodal rather than more
normal distribution. We believe that it is likely that scores for both of
these variables are influenced by factors external to the manner in which
treatment affects the other QOL domains (ie, whether or not an individual
underwent radiation therapy and whether or not an individual was employed
to begin with) and thus do not tap the changes in QOL over time as accurately
as the other domains. Thus, these items were removed from our revised instrument.
With this revised instrument, there is no "cancellation effect," because each
of the 10 items appears to measure the domains of overall QOL in a highly
consistent and reliable fashion over time (Figure 2).
The Cronbach scores of internal consistency are quite high across
the entire 36-month follow-up period for the original UW-QOL. For example,
the range of these internal consistency values was 0.74 to 0.84, and the overall
internal consistency score for the original UW-QOL was 0.81 (Table 1).
CONSIDERATION OF QOL STUDY COSTS
It is also important to consider the cost of QOL studies before embarking
on a longitudinal project. In discussing clinical trials to evaluate QOL,
Spilker indicates that
implicit in this strategy is the large volume of data that flows
from a quality of life study. Several variables are measured at each encounter
and patients are followed for a considerable time. In addition to the usual
clinical information, several items of quality of life data will be collected.
This has clear workload implication.6(p20)
We have found this statement to be painfully accurate. To pursue QOL
data for longitudinal analysis, one must be committed to thorough data collection
and account for the attendant costs. Collection of our data for 4 years generated
costs in excess of $250 000 in personnel salary alone.
CONCLUSIONS
After administering this instrument to more than 500 patients, we can
indicate that the UW-QOL meets the following desirable characteristics articulated
by Spilker: (1) it is short and rapid to complete; (2) it is reproducible,
reliable, and valid in a population of head and neck cancer patients; (3)
it does not require excessive training to administer; and (4) it is easy to
interpret and yields objective results (separation by site and stage).12
We conclude that inclusion of a global measure of posttreatment QOL
is a critical part of QOL assessment. Therefore, we recommend the UW-QOL-R
for future use.
AUTHOR INFORMATION
Accepted for publication September 22, 2000.
Dr Yueh is supported by a Career Development Award from the Health Services
Research and Development Service of the Veterans Administration. Dr Yueh is
also part of the Health Services Research and Development Service and Surgery
Service, VA Puget Sound Health Care System.
Corresponding author: Ernest A. Weymuller, Jr, MD, University of
Washington, Department of OtolaryngologyHead and Neck Surgery, Box
356515, Seattle, WA 98195 (e-mail: eaw{at}u.washington.edu).
From the Department of OtolaryngologyHead and Neck Surgery,
University of Washington, Seattle.
REFERENCES
 |  |
1. Deleyiannis FW-B, Weymuller EA Jr, Coltrera MD. Quality of life of disease-free survivors of advanced (stage III or
IV) oropharyngeal cancer. Head Neck. 1997;19:466-473.
FULL TEXT
|
ISI
| PUBMED
2. Deleyiannis FW-B, Weymuller EA Jr, Coltrera MD, Futran NF. Quality of life after laryngectomy: are functional disabilities important? Head Neck. 1999;21:319-324.
FULL TEXT
|
ISI
| PUBMED
3. Kuntz AL, Weymuller EA Jr. The impact of neck dissection on quality of life. Laryngoscope. 1999;109:1334-1338.
FULL TEXT
|
ISI
| PUBMED
4. Hassan SJ, Weymuller EA Jr. Assessment of quality of life in head and neck cancer patients. Head Neck. 1993;15:485-496.
ISI
| PUBMED
5. Spilker B. Guide to Clinical Trials. New York, NY: Raven Press; 1991.
6. Spilker B. Quality of Life Assessments in Clinical Trials. New York, NY: Raven Press; 1990.
7. D'Antonio LL, Long SA, Zimmerman GJ, Peterman AH, Petti GH, Chonkich GD. Relationship between quality of life and depression in patients with
head and neck cancer. Laryngoscope. 1998;108:806-811.
FULL TEXT
|
ISI
| PUBMED
8. Radoff LS. The CES-D Scale: a self report depression scale for research in a general
population. Appl Psychol Meas. 1977;1:385-401.
FULL TEXT
9. Beck AT, Steer RA. Internal consistencies of the original and revised Beck Depression
Inventory. J Clin Psychol. 1984;40:1365-1367.
ISI
| PUBMED
10. Spitzer RL, Kroenke K, Williams JB. Validation and utility of a self-report version of PRIME-MD: the PHQ
Primary Care Study. JAMA. 1999;282:1737-1744.
FREE FULL TEXT
11. Patrick DL, Erickson P. Health Status and Health Policy: Quality of Life
in Health Care Evaluation and Resource Allocation. New York, NY: Oxford University Press; 1993.
12. Weymuller EA Jr, Yueh B, Deleyiannis FW-B, Kuntz AL, Alsarraf R, Coltrera MD. Quality of life in patients with head and neck cancer: lessons learned
from 549 prospectively evaluated patients. Arch Otolaryngol Head Neck Surg. 2000;126:329-335.
FREE FULL TEXT
RELATED ARTICLE
Archives of OtolaryngologyHead & Neck Surgery Reader's Choice: Continuing Medical Education
Arch Otolaryngol Head Neck Surg. 2001;127(5):606-608.
FULL TEXT
THIS ARTICLE HAS BEEN CITED BY OTHER ARTICLES
 |
Relational Databases for Rare Disease Study: Application to Vascular Anomalies
Perkins and Coltrera
Arch Otolaryngol Head Neck Surg 2008;134:62-66.
ABSTRACT
| FULL TEXT
Long-term Quality-of-Life Evaluation After Head and Neck Cancer Treatment in a Developing Country
Vartanian et al.
Arch Otolaryngol Head Neck Surg 2004;130:1209-1213.
ABSTRACT
| FULL TEXT
Clinical Predictors of Quality of Life in Patients With Head and Neck Cancer
Terrell et al.
Arch Otolaryngol Head Neck Surg 2004;130:401-408.
ABSTRACT
| FULL TEXT
Concurrent Chemotherapy and Radiotherapy for Organ Preservation in Advanced Laryngeal Cancer
Forastiere et al.
NEJM 2003;349:2091-2098.
ABSTRACT
| FULL TEXT
Functional Outcomes After Treatment of Squamous Cell Carcinoma of the Base of the Tongue
Perlmutter et al.
Arch Otolaryngol Head Neck Surg 2002;128:887-891.
ABSTRACT
| FULL TEXT
|