Module 1: Databricks for Pricing Teams

Part of the Modern Insurance Pricing with Python and Databricks course.


What this module covers

Most Databricks tutorials aimed at actuaries teach you the Databricks UI in the abstract. This module teaches you to set up a workspace for a specific purpose: pricing insurance.

The difference is in the details. Where to put your tables. How to set retention properties that satisfy an FCA audit. Why the flat-file data pass between pricing and MI teams is the thing Databricks actually fixes. How to stop losing track of which model run produced which output.

By the end of this module you have a working pricing workspace on Databricks Free Edition with:


Files

tutorial.md - The written guide. Read this first. ~4,000 words covering the full setup from cluster configuration to MLflow model registration. Written for someone who prices motor or home insurance for a living and is being asked to move to Databricks.

notebook.py - The Databricks notebook. Import this into your Databricks workspace and run it cell by cell. Covers everything in the tutorial in runnable form. All steps work on Databricks Free Edition.


Requirements


What you will not find here


Part of the MVP bundle

This module is included in the £295 MVP bundle (modules 1, 2, 4, 6). The full course is £495. Individual modules are £79.

See burningcost.github.io/course for the full curriculum.