Professional Documents
Culture Documents
Paper 124-27
Communicating the Results of Predictive Models to
Non-Technical Audiences
Andrew H. Karp
Sierra Information Services, Inc.
Sonoma, California USA
Abstract
Many statisticians, business analysts and
other quantitative analysts are required to limited, which in turn limits their ability to
present the results of their modeling efforts to understand what the data analyst is
non-quantitatively trained decision-makers, or presenting them.
to decision-makers whose level of statistical
training may be very limited. In the predictive Data analysts therefore often have trouble
modeling realm, where very sophisticated communicating with the end-users of their
statistical tools such as logistic regression, craft. The Physician-in-Chief or Vice
neural networks and Chi-Square Automatic President of Marketing probably knows very
Interaction Detection (CHAID) are commonly little about maximum likelihood estimation,
employed, the knowledge gap between the and probably prefers to not have it explained
provider and consumer of the analysis can to them. Nor are they going to "buy in" to a
become quite wide. When this occurs, the model because of the obtained value of the
fruits of the analysis may go un- or under- Akaike Information Criterion (AIC) or that the
utilized. -2LOGL measure reported by, for example,
PROC LOGISTIC, is "statistically significant."
This paper discusses several ways that the
results of predictive models can be effectively In many situations the decision-maker to
communicated to non-technical decision- whom the results of the model will be
makers. No matter how elegant your model explained may have extensive financial or
is, or how well it satisfies various statistical other resources "at risk" based on their
criteria, its usefulness within the organization decision to deploy a model. While they may
is hampered unless those who are tasked intuitively understand the value of a predictive
with implementing it can understand what the model, they are reluctant to commit scarce
model will do for them, and what the financial or other organizational resources to
anticipated benefits are of deploying the a project just because a model is "statistically
model. significant." Here are two scenarios:
1
SUGI 27 Data Presentation
treatment costs) among its members while a customer retention campaign just because
not vaccinating those for whom the risk is the LeGrange Multiplier (or SCORE) measure
low. has a low p-value. Most likely not.
2
SUGI 27 Data Presentation
diagnosis," or that "customers who have a example, can correctly "rule in" all patients
safe deposit box are twice as likely to who have the clinical condition of interest,
respond to the insurance offer than those and correctly "rules out" half of the patients
who don't. who really don't have the condition.
When a continuous independent variable is Context drives which of the two measures is
used, the odds ratios might be very small, most important in an applied modeling
depending on how your data are prepared. context. It's almost impossible to come up
For example, a one pound change in an adult with a model that has 100% sensitivity and
patient's weight might be "statistically 100% specificity. In many marketing
significant," but clinically irrelevant to health situations the analysts and project managers
care providers. But a 10 or 20 pound change want models that find (that is correctly
might be. Tools such as the UNITS option in identify) as many responders to a product
PROC LOGISTIC can create customized offer as possible. For them, high sensitivity is
odds ratios at user-specified values of the often more important than high specificity. In
independent variable. Creating customized many medical situations, however, a model
odds ratios is easy to do and can improve the which correctly rules out a diagnosis (i.e.,
"odds" your clients will be able to grasp how specificity) may be more important. In this
the model is working. context, a model which can correctly avoid
expensive (and perhaps painful),
Sensitivity/Specificity unnecessary medical procedures
The model's results can also be articulated
using the concepts of sensitivity and Financial Considerations
specificity. These measures are available One of the best ways to articulate the results
when the CTABLE (classification table) option of your model is to portray it in terms of its
is specified in PROC LOGISTIC, or can be likely financial impacts. If you know, for
computed by hand from the output generated example, the cost per customer contact and
by other SAS procedures, such as PROC the likely revenue/profit to be obtained from
FREQ. each sale, you can easily calculate the
potential revenue/profit expected from
Sensitivity measures the ability of a model to implementing your model. These amounts
correctly predict, or "rule in" the event of can be calculated within the Enterprise Miner,
interest among those observations under or by hand if other SAS System analytic tools
analysis in which the event occurred. So, a are utilized in the modeling effort.
model sensitivity of 90% means that if 100
observations in your data set HAD the event Gains/Lift Charts
of interest, the model correctly predicted that These charts are available from the
90 of them would have the event. Enterprise Miner or can be created using a
combination of Data Step programming and a
Specificity, on the other hand, measures the BASE SAS Procedure called PROC RANK.
ability of a model to "rule out" the event in A gains chart tells you how well model
those observations under analysis in which "captures" observations with the event of
the event DID NOT occur. If the model interest, and can quickly communicate how
specificity is 90%, that means that out of 100 well the model can be expected to perform.
observations that did NOT have the event of
interest, the model correctly predicted that 90 The core of a gains chart is the allocation (or
of them would not have the event. assignment, or "binning") of each observation
under analysis in to 10 (sometimes 20)
These terms are also frequently used to groups, with the observations in the first
articulate the efficacy of diagnostic medical group having the highest probability of event
tests and procedures. A diagnostic test that outcome, etc.
has 100% sensitivity and 50% specificity, for
3
SUGI 27 Data Presentation
References