Your renewal pricing model has a price elasticity coefficient. Your team uses it. It is wrong.
Not wrong as in: someone made an arithmetic error. Wrong as in: the quantity it estimates is not the quantity you need for sound pricing decisions. It measures the association between price change and lapse, and it embeds a confound that naive regression cannot remove. In observational insurance data, this confound can materially overstate true price sensitivity. A renewal optimiser built on an overstated elasticity gives back margin to customers who would have renewed anyway.
We built insurance-demand to fix this. This post explains the problem in detail and shows how the library addresses it.
The confound, precisely
Here is what your renewal data actually contains.
Each row is a renewing policy. You observe the renewal price, the price change from last year, and whether the customer renewed. You regress renewal indicator on price change and call the coefficient an elasticity.
The problem: the price change was not randomly assigned. Your pricing model set it, using risk factors. Higher-risk customers received larger premium increases because your actuarial models said they cost more. Higher-risk customers also lapse at higher rates, for reasons that are not entirely about price. They have more claims. They attract fewer alternative insurers willing to cover them. Their financial behaviour is correlated with their claims behaviour in ways your model does not fully capture.
So your regression has conflated two effects: the genuine causal effect of price on renewal probability, and the correlation between high-risk status and lapse propensity. The naive coefficient is a mixture of both. You cannot tell from the regression output how large each component is.
The standard fix is to include technical premium as a control variable. This is better than nothing. But it only removes the variation explained by your risk model, and your risk model is not perfect. Any residual risk variation that correlates with both the price change and the renewal decision will still bias the coefficient. If your pricing model misses a material risk factor - and all of them do - the confound remains.
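How large can the bias get? A toy simulation makes the mechanics concrete (the numbers are illustrative and nothing here comes from the library): a risk score drives both the premium increase and the lapse propensity, and the naive coefficient absorbs both channels.

```python
import numpy as np

rng = np.random.default_rng(42)
n = 100_000

# A risk score that drives both the pricing decision and the lapse propensity
risk = rng.normal(0.0, 1.0, n)

# The pricing model raises premiums for higher-risk customers, plus commercial noise
price_change = 5.0 + 3.0 * risk + rng.normal(0.0, 2.0, n)

# True causal effect: 0.5pp extra lapse per 1pp premium increase;
# risk also raises lapse directly, independent of price
lapse = 10.0 + 0.5 * price_change + 4.0 * risk + rng.normal(0.0, 5.0, n)

# Naive regression of lapse on price change alone
X_naive = np.column_stack([np.ones(n), price_change])
naive = np.linalg.lstsq(X_naive, lapse, rcond=None)[0][1]

# Controlling for the confounder recovers the causal effect
X_adj = np.column_stack([np.ones(n), price_change, risk])
adjusted = np.linalg.lstsq(X_adj, lapse, rcond=None)[0][1]

print(f"naive: {naive:.2f}  adjusted: {adjusted:.2f}  truth: 0.50")
```

With these parameters the naive slope comes out near 1.42, almost three times the true 0.5, and conditioning on the risk score recovers the truth. Real data differs in one crucial way: you cannot fully observe risk, which is exactly why controlling for technical premium alone is not enough.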
The same structure appears in conversion modelling. PCW business is the clearest case. Your technical premium for a given risk is high, so you quote high, so you convert poorly on that risk class. A naive regression of conversion on quoted price absorbs the risk-driven price variation and the risk-driven demand variation. The coefficient overstates price sensitivity for the high-risk segments most affected by this problem.
Why this matters commercially
A renewal optimiser built on an overstated elasticity will systematically give back too much margin.
Take a concrete example. Suppose the true causal elasticity is -0.023: a 1 percentage-point premium increase causes a 2.3 percentage-point reduction in renewal probability. Your naive GLM estimate is -0.047. Guelman and Guillén (2014) demonstrated this type of confounding using propensity score methods on automobile insurance renewal data, finding that the causal estimate was substantially lower in absolute magnitude than the naive association. If the optimiser thinks P(renewal) falls twice as fast with price as it actually does, it will undercut more aggressively than it should. On a renewal book of 200,000 policies, using the wrong elasticity can represent millions of pounds in unnecessary margin concession annually.
Post-PS21/11, this matters even more. The FCA’s GIPP remedies (PS21/11, effective January 2022) banned price-walking: you cannot charge renewing customers more than the equivalent new business price through the same channel. The direction of the optimisation is now one-sided. You can discount but you cannot surcharge. Every percentage point of discount you apply unnecessarily is pure margin loss with no regulatory upside.
The Double Machine Learning fix
Double Machine Learning (DML), from Chernozhukov et al.’s 2018 paper in The Econometrics Journal, solves the confounding problem by separating what the confounders explain from what the price explains.
The model is:
Y = theta * D + g(X) + epsilon
D = m(X) + v
Y is the outcome (conversion or renewal indicator). D is the treatment (log price ratio: log of quoted price divided by technical premium). X is the set of confounders (risk features, channel, time period). theta is the elasticity you want. g(X) and m(X) are unknown nonlinear functions.
The algorithm:
- Fit a flexible model - we use CatBoost - to predict Y from X. Compute residuals: Y_tilde = Y - predicted_Y.
- Fit a separate CatBoost model to predict D from X. Compute residuals: D_tilde = D - predicted_D.
- Regress Y_tilde on D_tilde via OLS. The coefficient is theta.
The residualised treatment D_tilde is the part of the price variation that is not explained by risk characteristics. It is the exogenous variation: the pricing decisions that were not mechanically driven by the risk profile. Seasonal rate changes, portfolio rebalancing exercises, manual underwriting adjustments. Regressing residualised outcomes on residualised treatment isolates the causal price effect.
Both residualisation steps use 5-fold cross-fitting: each nuisance model is trained on four folds and its residuals are computed on the fifth, held-out fold, so no observation's residual comes from a model that saw it during training. Cross-fitting prevents overfitting bias from leaking into the final estimate. The mathematical guarantee (Neyman orthogonality) is that errors in steps 1 and 2 produce only second-order bias in theta, not first-order. You get a valid confidence interval.
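The whole procedure can be written out by hand in a few lines of scikit-learn (gradient boosting stands in for CatBoost here; this is a sketch of the algorithm, not the library's implementation):

```python
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.model_selection import KFold

def dml_theta(y, d, X, n_folds=5, seed=0):
    """Partialling-out DML: cross-fitted residuals, then OLS of y_tilde on d_tilde."""
    y_res = np.empty(len(y))
    d_res = np.empty(len(d))
    for train, test in KFold(n_folds, shuffle=True, random_state=seed).split(X):
        # Nuisance models are fit on the other folds; residuals on the held-out fold
        y_res[test] = y[test] - GradientBoostingRegressor().fit(X[train], y[train]).predict(X[test])
        d_res[test] = d[test] - GradientBoostingRegressor().fit(X[train], d[train]).predict(X[test])
    theta = (d_res @ y_res) / (d_res @ d_res)   # OLS slope on residuals
    se = np.sqrt(np.mean((y_res - theta * d_res) ** 2) / (d_res @ d_res))
    return theta, se

# Synthetic check: treatment confounded by X, true theta = -0.3
rng = np.random.default_rng(1)
n = 5_000
X = rng.normal(size=(n, 3))
d = X[:, 0] + rng.normal(size=n)
y = -0.3 * d + 2.0 * X[:, 0] + rng.normal(size=n)
theta, se = dml_theta(y, d, X)
```

On this synthetic example the recovered theta lands close to the true -0.3 even though both nuisance models are imperfect, which is the Neyman orthogonality property doing its work.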
The full pipeline: conversion, retention, elasticity, demand curve
insurance-demand implements this as four connected components.
Conversion modelling
uv add insurance-demand
import polars as pl
from insurance_demand import ConversionModel
from insurance_demand.datasets import load_motor_quotes
df = load_motor_quotes() # 200k synthetic quotes with known data generating process
model = ConversionModel(
base_estimator="catboost",
price_col="quoted_price",
technical_premium_col="technical_premium",
feature_cols=["age_band", "vehicle_group", "ncd_years", "area", "channel",
"rank_position", "price_ratio_to_cheapest"],
price_transform="log_ratio", # models log(price / technical_premium)
)
model.fit(df.filter(pl.col("quote_date") < "2025-01-01"))
probs = model.predict_proba(df.filter(pl.col("quote_date") >= "2025-01-01"))
The price_transform="log_ratio" option is not optional in practice. A quoted price of £800 means different things for a risk with a technical premium of £700 versus one of £400. The ratio removes this ambiguity. The model is estimating how conversion responds to pricing above or below technical, which is the commercial decision you are actually making.
If you have aggregator data - competitor prices from Consumer Intelligence or eBenchmarkers - include rank_position and price_ratio_to_cheapest. In 2024, 63% of UK motor insurance switchers used a PCW. On a PCW, being near the top of results matters significantly for conversion, and no smooth function of absolute price captures that visibility effect. A conversion model without rank position is misspecified for PCW business.
Retention modelling
from insurance_demand import RetentionModel
from insurance_demand.datasets import load_motor_renewals
df_renewals = load_motor_renewals() # 100k synthetic renewal records
model = RetentionModel(
model_type="logistic",
price_col="renewal_price",
price_change_col="price_change_pct",
feature_cols=["tenure_years", "ncd_years", "payment_method",
"claim_count_3yr", "channel"],
event_col="lapsed",
)
model.fit(df_renewals)
renewal_probs = model.predict_proba(df_renewals)
For customer lifetime value modelling, the logistic single-renewal model is insufficient. You need a multi-period renewal probability, which requires survival analysis. Weibull AFT gives a parametric hazard function and handles the mid-term censoring problem correctly - policies that have not yet reached renewal are not treated as definitive non-lapses:
model = RetentionModel(
model_type="survival_weibull",
tenure_col="tenure_years",
event_col="lapsed",
price_col="renewal_price",
price_change_col="price_change_pct",
feature_cols=["ncd_years", "payment_method", "claim_count_3yr", "channel"],
)
model.fit(df_renewals)
survival_curve = model.predict_survival(df_renewals, times=[1, 2, 3, 5])
Post-PS21/11, the commercial game for renewals is CLV optimisation rather than inertia extraction. You cannot charge loyal customers more than new customers for the same risk. What you can do is identify which customers will lapse without a discount (high elasticity) and offer targeted retention discounts to them. For that, you need multi-period renewal probabilities. Survival models provide them. Logistic renewal models do not.
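The step from survival curve to customer lifetime value is a discounted sum of margins weighted by the probability of still being on the book. A minimal sketch (the margin and discount figures are illustrative, not library defaults):

```python
import numpy as np

def clv(survival_probs, annual_margin, discount_rate=0.08):
    """Present value of expected future margins, weighted by renewal survival."""
    s = np.asarray(survival_probs)
    t = np.arange(1, len(s) + 1)
    return float(np.sum(annual_margin * s / (1.0 + discount_rate) ** t))

# Survival at years 1..5 (e.g. from model.predict_survival), £90 margin per policy-year
value = clv([0.85, 0.70, 0.58, 0.49, 0.42], annual_margin=90.0)
```

A customer worth roughly £224 in expected future margin can justify a retention discount that a single-period logistic view would never sanction.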
Debiased elasticity estimation
The conversion and retention models give you naive marginal effects. The ElasticityEstimator gives you the causal estimate:
from insurance_demand import ElasticityEstimator
est = ElasticityEstimator(
outcome_col="converted",
treatment_col="log_price_ratio",
feature_cols=["age_band", "vehicle_group", "ncd_years", "area",
"channel", "month"],
n_folds=5,
heterogeneous=False, # global elasticity; see below for segment-level
)
est.fit(df_quotes)
print(est.summary())
Price Elasticity (DML)
Treatment: log_price_ratio
Outcome: converted
Estimate: -0.312
Std Error: 0.021
95% CI: (-0.353, -0.271)
N: 187,432
A 10% increase in the price-to-technical-premium ratio reduces conversion probability by approximately 3.1 percentage points at the average conversion rate, with a confidence interval of (-3.5pp, -2.7pp). This is precise enough to use in an optimiser.
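For reference, the arithmetic behind that statement (the 3.1pp figure uses the linear approximation theta × 0.10; the exact log calculation is marginally smaller):

```python
import math

theta = -0.312                                 # DML estimate on log_price_ratio
effect_exact = theta * math.log(1.10) * 100    # pp change for a 10% ratio increase
effect_linear = theta * 0.10 * 100             # linear approximation
# effect_exact is about -3.0pp; effect_linear is about -3.1pp
```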
Compare this with model.marginal_effect() from the ConversionModel. If the DML estimate and the naive estimate are within 10% of each other, confounding is not material for your portfolio and your GLM was giving you adequate guidance. If they diverge materially - which is common on PCW conversion data - you have been pricing with a biased elasticity.
For segment-level elasticity, set heterogeneous=True. This uses CausalForestDML from Microsoft Research’s econml library under the hood:
est = ElasticityEstimator(
outcome_col="converted",
treatment_col="log_price_ratio",
feature_cols=["age_band", "vehicle_group", "ncd_years", "area", "channel"],
n_folds=5,
heterogeneous=True,
)
est.fit(df_quotes)
per_customer_elasticity = est.effect(df_quotes)
The heterogeneous result will consistently show higher elasticity (in absolute magnitude) for young drivers and PCW customers, and lower elasticity for high-NCD, long-tenure customers. A single portfolio-average elasticity loses this heterogeneity, and any optimiser working from one number will misdirect discount budget.
The DML approach has practical data requirements. Minimum 50,000 quotes with meaningful price variation. Technical premium must have been stored at quote time, not recalculated retrospectively. Rate reviews generate the exogenous price variation DML needs to work - if you applied a uniform commercial loading across the whole book for an extended period, there is little variation left after residualisation and your confidence intervals will be wide.
Demand curves and pricing
Once you have an elasticity estimate, DemandCurve converts it into a callable price-to-probability function:
from insurance_demand import DemandCurve, OptimalPrice
curve = DemandCurve.from_estimator(
estimator=est,
base_conversion_rate=0.18, # current book average
form="semi_log",
)
# Conversion probability at different price levels relative to technical
import polars as pl
prices = pl.Series("price_ratio", [0.9, 1.0, 1.1, 1.2, 1.3])
print(curve.predict(prices))
# [0.217, 0.180, 0.149, 0.124, 0.102]
For profit-maximising price per risk:
opt = OptimalPrice(
demand_curve=curve,
technical_premium=650.0,
cost_per_policy=45.0,
enbp_ceiling=None, # new business: no constraint
)
result = opt.solve()
print(f"Optimal price: £{result.optimal_price:.0f}")
print(f"Expected conversion: {result.expected_conversion:.1%}")
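What the solver is maximising is expected profit per quote: conversion probability times unit margin. A grid-search sketch with the same figures, assuming a linear-in-log-ratio demand curve (a simplification; the library's solver may work differently):

```python
import numpy as np

def expected_profit(price, technical=650.0, cost=45.0, base_conv=0.18, theta=-0.312):
    """Conversion probability times unit margin, under a linear-in-log demand curve."""
    conv = np.clip(base_conv + theta * np.log(price / technical), 0.0, 1.0)
    return conv * (price - technical - cost)

grid = np.linspace(600.0, 1000.0, 4001)
best_price = float(grid[np.argmax(expected_profit(grid))])
# The optimum lands well above technical + cost, in the low £900s here
```

The optimum sits well above break-even because the demand curve at these parameters is shallow enough that the extra margin per sale pays for the lost conversions.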
For renewal pricing, the ENBP ceiling is hard. The OptimalPrice solver will never return a price above it:
opt = OptimalPrice(
demand_curve=renewal_curve,
technical_premium=650.0,
cost_per_policy=30.0,
enbp_ceiling=710.0, # new business price via same channel
)
For portfolio-level optimisation with factor-structure constraints, insurance-demand feeds its demand curves into rate-optimiser. The demand library’s job is to supply the price-to-probability functions; the rate optimiser’s job is to find the factor-level adjustments that satisfy LR, volume, and ENBP constraints simultaneously.
FCA compliance: the ENBP checker
PS21/11 is direct on this: renewal prices must not exceed the Equivalent New Business Price for the same risk profile through the same channel. The rule has been in force since January 2022. The FCA’s evaluation paper from July 2025 (EP25/2) confirmed price-walking has been substantially eliminated, and also confirmed that multi-firm reviews are ongoing - so the audit question is live.
The ENBP calculation is channel-specific. A customer who originally came via Confused.com has an ENBP calculated from your Confused.com new business price for that risk, not your direct price. If you quote differently across channels, the calculation must reflect that. Cash-equivalent incentives offered to new customers (cashback via PCW, first-month-free) must be reflected in the ENBP - the effective new business price is net of those incentives.
from insurance_demand.compliance import ENBPChecker
checker = ENBPChecker(
new_business_price_col="nb_price",
renewal_price_col="renewal_price",
channel_col="channel",
tolerance=0.0, # strict: renewal must be <= NB price
)
violations = checker.check(df_renewals)
print(violations.shape[0], "policies with renewal_price > ENBP")
print(violations.select(["policy_id", "channel", "renewal_price",
"nb_price", "excess"]))
Run this check before any renewal batch. If you have violations, fix them before renewal invitations go out. The FCA’s multi-firm reviews have found implementation failures in ENBP calculation methodology - usually because new business prices were pulled from the wrong channel or because cashback incentives were not netted off correctly. The ENBPChecker is a systematic audit against these failure modes.
PS21/11 does not prohibit demand modelling. It constrains what the optimiser objective can do with the demand model output. For renewals, you are now optimising in the space where renewal_price <= ENBP - you can discount but not surcharge. This is actually a simpler optimisation problem than the pre-2022 world: it is “who do we discount and by how much?” rather than “who do we discount and who do we surcharge?”. Demand modelling is still valuable because identifying which customers will lapse without a discount is exactly what survival-based retention models do.
What DML cannot fix
Three limitations to be explicit about.
Data quality. DML residualises on observed confounders. If technical premium was recalculated retroactively after a model refit, your log_price_ratio is wrong and the elasticity estimate is biased in ways DML cannot recover. Store technical premium at quote time, not at analysis time.
Unobserved confounders. If you ran a targeted marketing campaign in Q3 2024 while also applying rate changes in the same period, and campaign exposure is not in your dataset, the campaign effect and price effect are confounded. DML handles observed confounders; it cannot handle unobserved ones. Run est.sensitivity_analysis() before taking elasticity results to a pricing committee - it tells you how large unobserved confounding would need to be to overturn the conclusion.
Near-zero treatment variation. DML identifies elasticity from the variation in log_price_ratio after removing the part explained by confounders. If your commercial loading decisions have been very uniform across the book, the residualised treatment has near-zero variance and your confidence intervals will be too wide to be actionable. The fix is genuine exogenous variation: rate review timing effects, manual underwriting adjustments that break from the standard tariff, or competitor market shocks if you have competitor price data as an instrument.
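A quick diagnostic before trusting the confidence intervals is to measure how much treatment variance survives residualisation (a sketch; a random forest stands in for whatever nuisance learner you use):

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import cross_val_predict

def residual_treatment_share(d, X, n_folds=5):
    """Share of treatment variance left after removing what confounders explain.
    Near zero means DML has almost no exogenous variation to identify theta from."""
    d_hat = cross_val_predict(
        RandomForestRegressor(n_estimators=100, random_state=0), X, d, cv=n_folds
    )
    return float(np.var(d - d_hat) / np.var(d))

rng = np.random.default_rng(0)
X = rng.normal(size=(2_000, 3))
# Pricing almost fully determined by risk: little exogenous variation survives
share_low = residual_treatment_share(X[:, 0] + 0.1 * rng.normal(size=2_000), X)
# Substantial commercial noise on top of risk: plenty of variation survives
share_high = residual_treatment_share(0.3 * X[:, 0] + rng.normal(size=2_000), X)
```

If only a few percent of the treatment variance survives, expect confidence intervals too wide to act on, and go looking for the exogenous variation listed above before rerunning the estimator.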
Where to start
# Core library
uv add insurance-demand
# With DML elasticity estimation (requires doubleml, econml):
uv add "insurance-demand[dml]"
# With survival-based retention models (requires lifelines):
uv add "insurance-demand[survival]"
# Everything:
uv add "insurance-demand[dml,survival]"
The recommended starting point is the confounding bias check. Fit a ConversionModel on a year of PCW quote data, compute model.marginal_effect(), then run ElasticityEstimator on the same data and compare est.summary() against the naive marginal effect. The difference is the confounding bias in your current elasticity assumption.
On motor PCW datasets, the DML estimate is commonly lower in absolute magnitude than the naive estimate. The naive estimate includes risk-driven demand variation; the DML estimate strips it out. If your renewal pricing strategy was built on the naive number, you may have been discounting more than necessary.
Commercial platforms - Akur8, Earnix, Radar - implement versions of this methodology in their demand modules. The methodology is not proprietary. The insurance-demand library is the same maths in an auditable Python package with no vendor lock-in, a clean sklearn-compatible API, and a data structure that your existing Polars and CatBoost workflow already understands.
Source and issue tracker on GitHub.