← index
CASE STUDY · data engineering · 2026
Strata
Public medallion data pipeline on synthetic insurance data — DuckDB, dbt, SCD2, data contracts, 42 tests, CI green.
uilt as the public proof of NDA-locked DWH work at BRILife. Full medallion architecture (bronze→silver→gold) on synthetic insurance data. Implements SCD2 slowly changing dimensions, data contracts, reconciliation, and 42 automated tests — all patterns used in production but open-sourceable here.
highlights
- +Medallion architecture: bronze/silver/gold layers with clear promotion rules
- +SCD2 implementation for policyholder dimension changes
- +Data contracts enforced at ingestion boundary
- +42 tests covering transformation logic and reconciliation
- +CI pipeline (GitHub Actions) — green on every commit
what was hard
Designing synthetic data that mirrors real insurance domain without exposing BRILife patterns; Making SCD2 testable in a local DuckDB environment without a full warehouse