Small (data) is beautiful

Companies are investing in big data projects with the expectation that aggregating data into one big central data repository will automatically allow them to find solutions to all of their business problems. They merge exiting easy-to-use data sets, maybe add new ones, and develop an entire taxonomy with limited access rights around this central data repository.

And now what? What do you do with this data? Will your data scientists dissect the data, but won’t know how relevant it is since they are disconnected from business operations? Or will your business analysts be trained on how to access the data first, but then will only scratch the surface since they may have confirmation bias focusing on data that they know and agree with and ignoring everything else?

Maybe it would be better to start small. Before embracing big investments in big data, it would be worthwhile to find out if your organization can handle small data. Do you have the analytical talent to examine and interrogate a small data set with let’s say half a million records, which can be even managed in a spreadsheet? Can your analysts find relationships across different dimensions and across time that may impact your company’s revenues and profits? Are they able to distinguish correlations from causations? Can they recognize patterns in the data? Are they unbiased enough to identify the unknown unknowns and create hypotheses for further testing? Are they skilled to apply basic statistics and advanced regression, simulation, clustering, and other techniques to test these hypotheses? And are they willing to reiterate all the steps if the analysis and modeling results don’t confirm their initial assumptions?

If you don’t have the talent to do all this, maybe big data is not for you … at least not until your analytics teams can fully understand (small) data and its implications on business results. Otherwise your big data project may be like teenage sex as written by The Register: “everyone's talking about it, only a few know how to do it, they all think everyone else is at it and so pretend they are too”.

To view or add a comment, sign in

More articles by Jack Lampka

  • Change management is dead. Long live change management!

    I never liked the term “change management”. I have often experienced change management programs as something imposed by…

    4 Comments
  • GenAI doesn’t take away jobs, it creates them.

    The myth that chatbots are replacing people has been debunked a while ago. Although CEOs and founders prefer to tell…

    2 Comments
  • New book: "AI - The human factor"

    70% of AI projects fail, not because of lack of computing power, huge data centers, or latest Large Language Models…

    12 Comments
  • The myth about AI replacing people

    “Over 14 million jobs worldwide have already been lost directly due to AI-driven technologies.” This and similar…

    5 Comments
  • What is AGI and why you should ignore it

    The discussion about Artificial General Intelligence (AGI) resurfaced recently after Yann LeCun left Meta. LeCun is a…

    11 Comments
  • Regulate AI chatbots!

    Adam Raine, a 16-year-old from California, was encouraged by ChatGPT to kill himself. Adam's parents have filed a…

    3 Comments
  • How data leaders shape business success with AI

    70% of AI projects fail … and based on recent analyses, this failure rate may be even higher. It’s not because of lack…

  • AI hype is over. Let’s get real with AI.

    Let’s get real with AI … “real” as in implementing AI solutions that reduce expenses or increase revenues. Here are top…

  • AI-first fails: Personal experience

    Just two weeks ago I wrote about how human-first AI beats AI-first across three aspects: technology, employees, and…

    2 Comments
  • What do businesses gain from the AI race? NOTHING!

    Are you impressed by the global AI race? You probably have seen the executive orders on AI, AI action plans, and…

    3 Comments

Others also viewed

Explore content categories