Are we worrying about the wrong Data Security challenges?

Douglas Kinloch

Published Jul 16, 2020

I regularly read articles on the latest worries about Data Security and Data Integrity (not necessarily the same thing) and while these are all key, I do wonder if they are actually the weakest links about which the CDO/CIO/CISOs should be thinking?

Thinking on my roles over the last decade I have worked in various companies to deliver technologies to make certain tasks easier; be it the code protection of individual executables, the delivery of secure, easy to use Payment or DRM SDKs or the reduction of effort in the deployment and integration of Advanced Data Models by 10,000%.

Most large organisations are (understandably, and usually correctly) relying on many security layers protecting their data within their systems. There is a huge and very profitable industry that has grown up around software and hardware technologies designed to protect the integrity of data and the information stored on servers, on-premise and in the cloud. There is a reasonable argument that given the focus and sheer magnitude of cloud providers, these may even be more secure than a company’s own servers, housed in their own buildings.

But we still see breaches. Un-protected AWS buckets, forgotten servers, Apps rushed to production & poorly pen-tested (or not tested at all) and APIs assumed to be secure but with no authentication in place. Technology to solve these issues exist and have existed across multiple markets for many years. Some of them even work.

We’re now moving into a new world of Data, the growth in this market has meant that Data Scientists are among the most sought-after (and highly paid) “software developers” of any kind. Ask many corporate CEOs (or Health Ministers) today what is one of their key assets and they will all answer “Data, and the Data/AI/ML Models that consume it”.

And yet the individual unique applications that are created, for that is what a model is, are delivered to the cloud or to IT systems with little or no thought to security.

Compiled languages like C++ can be easily de-compiled and run-time compiled languages like Java are easy prey to even the simplest high school hackers using tools downloaded from the web. Code written in these languages can be analysed like a film editor can run a sequence frame by frame, and like a film editor, hackers can cut and paste new malicious code into the application or remove other code as required.

While in the very best cases the data centres have barbed wire around them, their hardware and software firewalls are in place and the encryption keys are agreed, all protected by multi-factor authentication. While user management KYC and the rest are all part of the story, the APIs to the outside world are secured, has anyone thought to secure the source code of the models being delivered to production?

I mean, why go to the bother of subverting the data when you can compromise a model before or during deployment in order to subvert the information you need or deliver their payload?

What price AI/ML Model Integrity?

Douglas Kinloch 4y

Back in the summer of 2020 I wrote this piece on the security of AI/ML and the integrity of modes in use by corporations. I've just listened to the #fintechconnect key notes on AI in Finance and security seemed to be somewhat low on anyone's radar. Financial Services are not alone in this but as we move to a Data Driven world, they may be an issue to be be explored here!

To view or add a comment, sign in

Are we worrying about the wrong Data Security challenges?

Douglas Kinloch

More articles by Douglas Kinloch

Others also viewed

Threat & Self Defence for your data: In-depth guide on data security

API Security for the Data Layer

Safeguarding Digital Fortresses: A Comprehensive Exploration of Vulnerabilities and Security Measures in Data Warehouses, Data Lakes, and Databases

Addressing the Intricacies of Big Data Security: Challenges and Solutions

Computing on encrypted data for better security – Part 1

Azure security best practices

Aren't we already Graph'ing?

Securing the Communication between Nodes of Distributed Data Grid and Distributed Task Grid (Across the Internet in a SuperCluster)

🔐 Demystifying Dataverse Security: Best Practices for a Safer Power Platform

Curated Endpoint Data Strategy

Explore content categories

More articles by Douglas Kinloch

Digital ID? How much does a night-club attendant need to know about you?

Technology needs not to be "new" to succeed?

2020 in the rearview mirror

Location, location, location in a pandemic?

Core Body Temperature, so what?

Why so many sensors?

Why are Waire Devices worn on the upper arm?

Are we beginning to see the Big Tech move on Financial Services?

So how do you, as a small tech company, engage with large corporations, without hitting the rocks?

Do you get Deja Vu all over again?