Volume 2, Issue 1
October 2005    
© Copyright 2004 GUAA. All rights reserved.
Site designed by porterdesign.com

In today’s world of soaring medical costs, margins for HMOs and insurers are being squeezed more tightly than ever. The present point in the underwriting cycle requires underwriters to focus on increasing membership and margins. Passing along “standard” price increases to “low risk” accounts will not be sustainable in the coming years. Thus, actuaries and underwriters are looking for tools that can improve pricing accuracy and lead to higher earnings. Predictive modeling, a tool many companies are familiar with from its uses in disease management, intuitively seems promising.

Risk selection represents another area where the application of predictive models seems promising. Carriers have always struggled to understand how the morbidity of their enrollment from a “slice” case compares to that of the other plan offerings and how membership within a given market compares to the population in total. The introduction of consumer driven plans and the possibility that these plans attract only the healthiest employees makes this an even more pressing issue today.

Traditional underwriting uses a risk model based on age, sex and prior cost, with some additional features on occasion, such as geographic region and industry. Predictive modeling comes from health services research and adds the additional predictive factor of diagnoses. Predictive models use information included in medical and pharmacy claims (diagnoses, procedures, drug type/dosage, etc.) to estimate future costs at the individual member level. While the terminology and approach of predictive modeling may be different from pricing models underwriters have traditionally used, underwriters should not dismiss adapting predictive models for use in pricing.

Generally speaking, groups under 250 lives are too small to have a credible experience base under traditional models. Past claim costs can be an unreliable starting point for pricing these groups because of the phenomenon of "regression to the mean". Research has shown that as many as 75% of members with very high or low costs in one year will move back towards average costs in the subsequent year. This is part of the rationale for the common sales pushback that the health event driving the prior year’s claims is over and will not recur. Without knowing the reason for the cost, it is difficult for underwriting to refute this pushback. As a result, underwriters turn to medical underwriting, which can be costly and inconsistent. Predictive models can be used instead to separate those members whose high costs are driven by chronic conditions, and therefore likely to continue, from those driven by random or one-time events.

Predictive models offer underwriters a low-cost means of setting cost estimates based on illness burden and prior treatments. They can predict instances of “regression to the mean” and conversely situations when a member might become higher cost in the future. Increasingly, employers and benefits consultants are using them to evaluate health plan performance and set pricing levels. Finally, the results of predictive models are more intuitive than some other common rating factors like SIC codes.

But much of the large body of existing literature on the validity of predictive models can be frustrating to actuaries and underwriters. Until recently there was no research making a case for the underwriting value of predictive models. Thanks to articles from the Society of Actuaries’ Health Section News in August 2003 and August 2005, this has begun to change. While these articles represent a huge step towards building the case that predictive models can improve underwriting accuracy, a comparative analysis is still needed of the tools available that considers both technical and practical features.

Most predictive modeling research has assessed how well the tools do in targeting patients for disease management or risk adjusting payments among large employers and health plans. It has not asked or answered the right questions for underwriting applications, so companies may need to do additional analysis and thinking in order to choose the best model to use in their underwriting.

Technical Analysis:
Most research on predictive models, and the 2002 SOA Study , performs analysis at the wrong level for underwriting applications. Generally speaking, the research has evaluated whether the models identify those few individuals who will have the highest future cost, and whose costs can be most effectively reduced, so that their care can be better managed and those risks avoided.

In contrast, pricing tools are more concerned with costs at the employer group (or block of business) level, than at the member level. Commonly-used evaluation metrics like R-squared, which measures what percentage of the variation in results is explained by a given model, usually have very low values at the individual level. So readers conclude that these tools only explain 10-20% of the variation in results. That sounds like a poor result to actuaries and underwriters who are used to thinking about entire accounts or blocks of business – but traditional age-sex factors or a prior year’s claims experience don’t explain much of the variation in a single individual’s results either!

What is needed is additional analysis that compares predictive ability across tools for groups of 20, 50, 100 and more lives to see how results vary at various case sizes. Statisticians use the Grouped R-squared to measure the ability of the models to explain variation in expenses at the group level (account, employer, provider, etc.). It should be no surprise to actuaries and underwriters that predictive models do better at the group level as opposed to the individual member level. Whereas individual R-squared for age/sex, experience and diagnosis models are 2%, 6% and 9% respectively, the Grouped R-squared using 2-percentile cost groups is 17%, 50% and 82% respectively.

Many studies truncate very high dollar claims above a threshold of $25,000 or $50,000. One reason this is done is to prevent a small percentage error in the predicted claim value from having a disproportionate impact on the R-squared calculation. For example, if the model predicts $750,000 of claims for a member whose actual costs turn out to be $850,000, that model still has done a good job of identifying that member as high risk. However, that $100,000 of error may be very significant in pricing, especially for a relatively small group, if the costs are recurring. Truncating also removes the impact of large random claim events from future projections, but predictive models may be better able to distinguish these from large claims that are to some extent ongoing. Finally, the highest cost claims are the most important for pricing some products, like employer stop loss.

It is also common in the published research to focus on groups of members with similar claim costs (for example, the highest 10%) when choosing the best tool to predict who the future highest cost members will likely be. This approach does address the problem of focusing on member-level rather than group-level results, but does not use the kind of heterogeneous groups that exist when pricing real customers.

Finally, an analysis of whether predictive models can improve pricing accuracy needs to be put into the correct business context. The predictive power of these models must be compared to current business tools like age-sex factors, SIC loads and pricing from prior experience. These different techniques should also be tested in combinations, not just independently. This kind of work will help users define the optimal underwriting process for different business segments. And researchers should continue to move beyond hard to understand statistics like R-squared and attempt to analyze the impact of changes to quoted rates on business results like profit margins and close ratios.

Practical Analysis for Underwriters:
Research on the power of predictive models in identifying members most likely to benefit from case management also deviates from pricing and underwriting needs for reasons driven by its practical applications. Case management is focused primarily on the small percentage of members who will have the highest costs rather than the overall population. While this may be helpful in underwriting, it does not address the equally important issue of identifying members and groups who will have the lowest future costs to ensure these are not over-priced and thus lost.

Some tools may focus on correctly predicting costs for patients having those conditions where disease management interventions are available and most effective, and as a result could sacrifice predictive power across all members. The output format of a tool is an important quality as well. The model may only produce a binary indicator of whether a patient is a good case management candidate, but underwriters need some sort of relative risk score in order to derive an expected future cost or cost distribution.

Many academic studies indicate that models should be built not to reflect cost differences among different treatment choices, since these could influence physician behavior when their compensation is tied to the results of a risk model. However, these variations in treatment patterns may lead to real cost differences that should be captured by the tools when they are used for underwriting.

Some practical questions are the same whether you are considering a tool for case management or for underwriting. Is the data required available easily and on a timely basis? Does the model’s design make sense from your underwriting experience? Can you see “inside” the model or is it a “black box?” Can the model be used across varying lines of business?

But some important questions for the actuary or underwriter are not usually asked for case management uses. How will use of the tool impact the underwriting work flow? Will it reduce variation in performance among underwriters? Can it reduce administrative costs and if so, by how much? Can I explain to brokers and customers how the predictive model impacted their rates? How do I apply regulatory restrictions to the output, and is the prediction still valuable once those are taken into account? And how do I reflect varying benefit designs and provider contracts?

All of these technical and practical issues have to be factored into any decision of whether, and which, predictive model to use for underwriting. As a result, you may need to do additional analysis beyond that available in the literature today. Choosing a predictive model for pricing and underwriting is a very different decision than choosing one for case management, and some of these perspectives may lead you to different conclusions.

Check List for Reviewing Predictive Models for Underwriting:
Technical
- Are R-squared results presented at different employer group sizes?
- Are R-squared results compared to other underwriting methods?
- Are large claims truncated in the analysis? If so, at what level and with what impact on the model’s predictive power? And how are the excess claims allocated back to the overall pricing?
- Are the groups being presented in the analysis similar to real customers?
- Can you evaluate the impact on business metrics (close ratio, profit margin, etc.)?
- How, if at all, does the model use credibility?
- How, if at all, does the model incorporate traditional rating factors without “double-counting” these?
- How does the model handle new members or members with no prior claim history?
- How does the model account for incomplete incurred claims?
- How does the model take into account the lag between the experience claim period used and the underwriting year whose costs are projected?

Practical
- Does the model predict well for low and high cost members and low and high cost/risk groups?
- Does the product generate a credible risk score that can be inexpensively applied?
- Does the model reflect real cost differences from area contracting and treatment patterns?
- Does the model integrate into your underwriting process? For example, is the data required by the model available in time for your renewal processes and is the output in a usable format?
- Will the model reduce administrative costs? By how much?
- Can you explain the model’s design and rate impact, especially rate increases, to others – sales, marketing, employee benefits managers, consultants, brokers?
- How will you adjust results for regulatory limits?
- Can you use the model across products and benefit designs?
Software Design/Implementation
- Can you buy the models and link them into your current systems or do you need to purchase an entire reporting system?
- Is the software compatible with your IT environment? For example, if you use Excel, does the software produce results that can be easily exported to Excel?
- What is the model’s implementation time, including needed customization?

Ruth Ann Woodley is a consulting actuary and Vice President of Ruark Consulting, LLC, in Simsbury, CT.

Marilyn Schlein Kramer is President of DxCG, Inc., a predictive modeling company based in Boston, MA. DxCG is a business unit of ISO, Inc.
 

EDITORIAL STAFF
Chief Editor | J.B. Hiers
Munich American Re
Managing Editor | Shirley Weaver
Munich American Re
Senior Editor | Tom Kirner
The Hartford

GUAA BOARD OF DIRECTORS
President | Mark Walker
Minnesota Mutual

Vice President | Phil Lacy
Towers-Perrin
J.B. Hiers
Munich American Re
Kim Miller | Pacific Life
Carolyn Pollard | ING Re
Sky O'Callahan | Standard
Jim Wilmot | BCBS of Illinois
Ann Marie Wood | Anthem
Curt Zepeda | ING Re