Not All Data Problems Demand Machine Learning: Beyond the Hype!
Recent years have seen explosive growth in the use and discussion of artificial intelligence (AI) and machine learning (ML) in both academic and industrial contexts. These are, of course, transformational technologies. They appear across industries - healthcare, finance, marketing, logistics - and they provide insights and tools for pattern discovery, predictive analytics, and the reduction of manual work. But amid the excitement and momentum, there is an emerging risk that the line between technical capability and analytical judgment is getting blurred. This post aims to break down a common misconception: is every data-driven problem a machine learning problem?
The truth is that most real-world business problems can be addressed without fancy ML models. In other words, not everything is a nail, and nail guns are cumbersome and slow in some situations. Faced with a new problem, particularly in consulting or internal analytics teams, many early-career data professionals instinctively frame it as a question of which machine learning algorithm to use. Questions such as "Shall we go with Random Forest or Gradient Boosting?" or "Is this something deep learning can improve?" frequently emerge before the scope or nature of the problem has been understood. This technique-first perspective, though well-intentioned, skips an essential first step: problem definition and contextual understanding.
A more mature approach starts with asking better questions. What is the client (or stakeholder) actually asking for? What is the goal: to forecast, to discover, to understand, to act? These are the questions that should drive method choices, not the reflexive reach for a machine learning pipeline. Many business problems are essentially descriptive or diagnostic. For instance, a marketing team might want to understand why a campaign performed poorly in a certain region, or a hospital administrator might be looking for bottlenecks in patient flow. These are problems where the 'why' outweighs the 'what', and solutions can frequently be found through structured data analysis, simple statistical summaries, or elementary data visualisation.
This principle is increasingly supported in industry practice. A 2020 McKinsey examination of the state of AI adoption found that investments in AI were rising across industries, but that most successful applications were narrow, well-defined problems where the business value was evident. Moreover, many of the use cases discussed did not involve complex ML models. Instead, dashboards, rules-based systems, and business intelligence tools created real and direct results. This makes a very important point: the value of data science lies not in how complex the models are, but in providing decision support and actionable inference.
Let's consider a few examples. In one company, an operational delay was traced to variation in order processing time between regions. Instead of creating a predictive model, the analyst summarized and graphed the data in Power BI and quickly found that one regional team was manually handling some requests because its SOPs were several months out of date. The answer was managerial and procedural, not algorithmic. In a second example, a retail company wanted to analyse its customers' feedback to learn what the common complaints were. Rather than implementing a complete NLP pipeline with sentiment analysis, the team used basic regular expressions and keyword clustering in Excel to classify issues and detect themes. These simple methods resulted in quicker deployment, less maintenance, and immediate impact.
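The keyword-clustering idea in the second example can be sketched in a few lines of Python. This is a minimal illustration, not the retailer's actual implementation: the theme names and keyword patterns below are hypothetical stand-ins for whatever a real team would derive from its own feedback data.

```python
import re
from collections import Counter

# Hypothetical keyword buckets for complaint themes (illustrative only)
THEMES = {
    "delivery": re.compile(r"\b(late|delay(ed)?|shipping|courier)\b", re.I),
    "quality": re.compile(r"\b(broken|defect(ive)?|damaged|faulty)\b", re.I),
    "billing": re.compile(r"\b(refund|charged?|invoice|overbilled)\b", re.I),
}

def classify(feedback: str) -> list[str]:
    """Return every theme whose keywords appear in the feedback text."""
    return [theme for theme, pattern in THEMES.items() if pattern.search(feedback)]

comments = [
    "My parcel arrived two weeks late and the box was damaged.",
    "I was charged twice and still waiting for a refund.",
]

# Tally how often each theme shows up across all comments
counts = Counter(theme for c in comments for theme in classify(c))
print(counts.most_common())
```

The same logic maps directly onto Excel formulas or filters; the point is that a transparent keyword table is auditable by the business team, whereas an NLP pipeline is not.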
One reason simple solutions tend to outperform complicated ones in business settings is that they are easily interpretable. Executives and domain specialists are more apt to act on information they can comprehend. A linear regression with a transparent coefficient is often more useful than a black-box ensemble model with 90% accuracy but low explainability. Simpler solutions also use fewer resources: they minimize computational overhead, data preparation time, and model retraining requirements. This matters in small and medium-sized enterprises (SMEs), where infrastructure and specialized skills might be scarce.
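To make the interpretability point concrete, here is a minimal sketch of what a "transparent coefficient" looks like in practice. The numbers are toy data invented for illustration; the fit uses ordinary least squares via NumPy's `lstsq`.

```python
import numpy as np

# Toy data (illustrative only): weekly ad spend in $k vs. units sold
spend = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
sales = np.array([12.0, 19.0, 31.0, 42.0, 48.0])

# Ordinary least squares: design matrix with an intercept column
X = np.column_stack([np.ones_like(spend), spend])
intercept, slope = np.linalg.lstsq(X, sales, rcond=None)[0]

# The coefficient reads directly as a business statement
print(f"Each extra $1k of spend is associated with ~{slope:.1f} more units sold.")
```

An executive can act on "each extra $1k of spend is associated with roughly ten more units sold"; there is no equivalent one-sentence reading of a gradient-boosted ensemble's output.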
Simplicity also promotes autonomy. If stakeholders can use dashboards or Excel tools without returning to the data team for every question, a culture of data empowerment takes hold. Complex ML pipelines tend to need continuous support, maintenance, and explanation, particularly when the business context changes or model drift occurs. From a long-term sustainability perspective, it is more effective to implement tools that business users can operate and maintain themselves.
Obviously, this is not to say that machine learning lacks value. There are many applications where ML is a necessity. High-dimensional classification problems, image recognition, network traffic anomaly detection, customer personalization engines, and medical diagnosis have all benefited greatly from ML developments. But the choice to apply ML must be based on need, rather than fashion.
The analytical tool must fit the type of problem. A commonly used framework distinguishes between descriptive, diagnostic, predictive, and prescriptive analytics. Descriptive analytics, e.g. dashboards and summary statistics, helps us understand what happened. Diagnostic analytics uncovers why it happened, often through exploratory data analysis and correlation studies. Predictive analytics, which generally entails ML models, forecasts what will probably happen, whereas prescriptive analytics advises what to do. Plunging into predictive or prescriptive models before the problem is adequately understood at the descriptive or diagnostic level is methodologically flawed.
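The descriptive tier of that framework often needs nothing beyond summary statistics. As a sketch, here is the order-processing example from earlier expressed as a few lines of standard-library Python; the regions and timings are made up for illustration.

```python
from collections import defaultdict
from statistics import mean, median

# Hypothetical order-processing times in hours, tagged by region
orders = [
    ("north", 4.2), ("north", 3.9), ("north", 5.1),
    ("south", 9.8), ("south", 11.2), ("south", 10.5),
]

# Group timings by region
by_region = defaultdict(list)
for region, hours in orders:
    by_region[region].append(hours)

# Descriptive analytics: summarize what happened, per region
for region, times in sorted(by_region.items()):
    print(f"{region}: mean={mean(times):.1f}h  median={median(times):.1f}h  n={len(times)}")
```

A summary like this already exposes the south region's outlier behaviour; only if the descriptive view raised further questions would a diagnostic or predictive step be justified.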
Scholarly research supports this incremental strategy. In a Harvard Business Review article, Thomas Davenport and Jeanne Harris make the case for a stepwise model of analytic maturity, in which organizations need to be proficient in foundational data management and reporting before launching machine learning or AI programs. They warn that rolling out sophisticated models too early can result in implementation failure and loss of stakeholder confidence.
There is also a psychological factor in this conversation. Many professionals, particularly in the early years of their careers, feel an overwhelming need to prove technical competency. This frequently manifests as overengineered designs. Although mastery of algorithms is a valuable asset, true expertise is the exercise of judgment: knowing when complexity is necessary and when it is not. Experienced professionals are marked not by the sophistication of their tools but by their clarity of mind, practical thinking, and focus on business goals.
In summary, machine learning is a priceless part of the contemporary analytics toolkit, but it is not the go-to solution for every problem. Data science is, after all, a problem-solving discipline, and good problem solvers fit the solution to the setting. They start by listening, move on to exploration, and only add complexity if simpler options do not suffice. As more organizations scale up their data initiatives, we need to foster a culture that celebrates impact over complexity and clarity over intricacy. By not falling back on machine learning too easily, we create room for solutions that are not only elegant and interpretable but also truly useful.
Mahesh's insights are spot on. To elaborate, just taking one example here, leveraging well-crafted Python code utilizing established, open-source libraries can yield significant benefits. However, when it comes to integrating Generative AI (Gen AI) into existing architectures, it's crucial to carefully evaluate its applicability to specific use cases. While Gen AI can undoubtedly enhance certain aspects, it's essential to avoid forcing its adoption as a replacement for proven, reliable technologies without thorough justification. Instead, the incorporation of Large Language Models (LLMs) or other Gen AI components should be guided by a thoughtful technical design that prioritizes suitability and efficacy for the intended purpose, rather than mere trend-following.
ML also requires far more data to reach a conclusion than traditional statistical analysis does. That is the price of a universal solution that can adapt to a multitude of inputs. If, on the other hand, one applies some Mark I Natural Intelligence to figure out the nature of the problem, a tailored method can deliver results with a handful of data points, compared to what it takes to train any model.
Appreciate the reminder to step back and think business-first.
It’s easy to get caught up in the allure of complex algorithms, but simplicity often brings better results. Have you ever worked on a project where a simple approach outshone an AI-driven model?
Absolutely aligned with this perspective. In my experience, some of the most impactful solutions have come from clear data understanding and simple tools—not complex models. It’s essential to move from a “model-first” mindset to a “problem-first” mindset. Machine learning is powerful, but it should be used where it truly adds value, not just for the sake of using it. Great read and an important reminder for all data professionals.