Everything simple is false. Everything complex is unusable

Barnaby Davies

Published Mar 13, 2020

So said Ambroise-Paul-Toussaint-Jules Valéry. The British statistician George Box was making the same point when he said "All models are wrong, but some are useful".

This is the first of two posts setting out some concepts. These concepts apply whatever business you are in and whatever your level of data governance maturity. Data governance maturity is something we'll dive into at a later date.

Whatever we do in data governance, it needs to be economic, efficient and effective.

At the earliest stages of this process, it's important to consider scope and agree terms of reference. Not doing this will limit success. Try to avoid the common mistake of overlooking what you should be doing in the rush to get to how you should do it.

For almost all data governance initiatives, it's important to partition the work. Doing this lets you prioritise which work to do first while 'maximising the volume of work not done'. It will also help with stakeholder engagement and let you communicate in terms of benefits and capabilities.

However you eventually partition the work, you should keep the following model in mind.

Sensitive data is data which needs safeguarding because it would damage the interests of the business if it fell into the wrong hands. It has associated costs, risks and a corresponding value.

Redundant data is data which has no further value to the business. It has associated costs and risks but no corresponding value.

Other data is data which doesn't have a particular risk attached to it. It has associated costs and a corresponding value.
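
As a rough illustration only, the three categories above can be written down as a small lookup of which of cost, risk and value each one carries. The names used here (Profile, DataCategory, categorise) and the two yes/no questions are hypothetical, chosen for this sketch. The rule that data with no further value counts as redundant even if it is also risky is an assumption read from the definitions above, not something the model states outright.

```python
from dataclasses import dataclass
from enum import Enum


@dataclass(frozen=True)
class Profile:
    """Which of cost, risk and value the model attaches to a category."""
    has_cost: bool
    has_risk: bool
    has_value: bool


class DataCategory(Enum):
    # Profiles taken directly from the three definitions in the text.
    SENSITIVE = Profile(has_cost=True, has_risk=True, has_value=True)
    REDUNDANT = Profile(has_cost=True, has_risk=True, has_value=False)
    OTHER = Profile(has_cost=True, has_risk=False, has_value=True)


def categorise(needs_safeguarding: bool, has_further_value: bool) -> DataCategory:
    """Place a piece of data in one of the three categories.

    Assumption: value is checked first, so data with no further value is
    redundant regardless of whether it would also need safeguarding.
    """
    if not has_further_value:
        return DataCategory.REDUNDANT
    if needs_safeguarding:
        return DataCategory.SENSITIVE
    return DataCategory.OTHER


if __name__ == "__main__":
    for category in DataCategory:
        print(category.name, category.value)
```

Note that neither question asks where the data is stored or what format it is in. Every category carries a cost, and redundant data is the only one with no value to set against it.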

An important characteristic of this model is that it focuses on what the data is, not where it is or what format it is in. That's a theme we'll be revisiting regularly. Another characteristic of this model is that it is simple. We've put an abstraction layer between us and our data governance objectives. This makes it a lot easier to engage stakeholders.

In the next post, I'll build on what is set out here and provide a simple model comprising inputs, processes and outputs which will address pretty much all data governance workloads.

Talking points:

If the model above isn't complete, what's missing and why does it matter?
Do you see benefit in focusing on data based on what it is, not where it is? Conversely, do you see problems in focusing on where the data is stored rather than what the data is?
How important is stakeholder engagement in your data governance project? Why?
What does 'maximising the volume of work not done' mean to you?
