← Blog
·5 min readData engineeringLakehouse

What a lakehouse actually buys you (and when you don't need one)

A plain explanation of the bronze-silver-gold idea, minus the vendor pitch.

'Lakehouse' gets thrown around like it's magic. It isn't. It's a sensible way to organise data so the people downstream stop arguing about whose number is right. Here's the idea without the slideware.

Three layers, one job each

  • Bronze: raw data, exactly as it arrived. You keep it so you can always rebuild.
  • Silver: cleaned and conformed. Types fixed, duplicates gone, joins that make sense.
  • Gold: the business-ready tables. One agreed definition of 'revenue', 'active client', 'on time'.

The win isn't the diagram. It's that everyone pulls 'revenue' from the same gold table, so two reports stop disagreeing - and when they do, you can trace it back through silver to the raw row.

When you don't need one

If you have one clean source and a handful of reports, a lakehouse is overkill. Don't build a platform to solve a spreadsheet's problem. The architecture earns its keep when you've got many sources, many consumers, and people who've stopped trusting the dashboards.

Build the smallest thing that makes the numbers agree. Sometimes that's a lakehouse. Sometimes it's a good model and a stern conversation about definitions.

That's the test we apply before recommending anything: not 'is this best practice?' but 'does your problem actually need it?'

Working on something like this?