Adding a few thoughts on Data Mining & Big Data

Adding a few thoughts on Data Mining & Big Data

Disclaimers : Am a Digital Product Manager - having worked in E-commerce, Fin-Tech, Telecom (Data + OTT), Consumer Durable Retail so my insights shall be based on B2C industries & perspectives. Lastly am no expert at any tool - R, Python, SQL but have used it intermediately for Running the Product. So please read it more from a "User in Need" perspective.

Sharing an article which helped me collaborate my learning by @SanilSubhashChandraBose :

https://www.garudax.id/pulse/practical-guide-data-mining-e-commerce-business-subhash-chandra-bose/

In order to perform data mining - there are a few must haves :

  • Customer Behavioral Data (at least a year's trends are required) for all KPIs like Name, location, mobile number, Transaction Details, Date & Time for transactions, etc.
  • Tool to analyze - R, Python, SQL
  • Business Objectives/ Consumer Problems to derive e.g. Segmenting of customer data or projecting customer sales data
  • And lastly, an open mind to question the findings from domain knowledge perspective

Here am sharing a few techniques for Data Mining :

  • Supervised (Predictive) : When the exercise is based on a resulting/controlled variable. Z = F(X) + Y . E.g. Campaign analytics on what shall be the possible outcome basis these changes
  1. Classification : Decision Tree, Rule Induction, Neural Network, Nearest Neighbor Classification
  2. Regression (Elastic Net) : Linear, Logistic, Polynomal, Stepwise, Ridge, Lasso
  3. Forecasting
  4. Predictive Modelling
  • Unsupervised (Descriptive) : When there is no controlled variable to follow & its an exercise based on absolute numerical values of the inputs. Z = F (X) . E.g. CUG, UAT Feedbacks collation or survey collections. Also collating social feedback metrics like likes, comments, playstore ratings, etc. is Descriptive.
  1. Clustering : PCA & feature selection
  2. Association
  3. Sequential Analysis
  • Diagnostic : This is used to analyse why this happened & possible reasons. Root cause Analysis,
  • Prescriptive : This is used to guide what should be done to prevent something from happening, it is in the form of process note/ set of recommendations.
  1. Monte Carlo Situation
  2. Pattern Identification & Alerts
  3. Optimizations

Which technique to use in which problem situation is the trick to master I believe :)

Also sharing a Predictive Analysis/Data Mining Ecosystem Diagram :

For every test - post defining objective, its important to calculate the sample size (n) & p-value for deciding the accepted area of optimum along with confidence level for accuracy (alpha).

To view or add a comment, sign in

More articles by Misha S.

  • Key Victories of Last Decade

    There are three I can clearly think - Rise of independent Female in every way without a touch of manipulation Youth…

    1 Comment
  • Post about AI & Fintech Policy Making

    🌟 Navigating the Policy Maze: How AI and Fintech Regulations Are Shaping the Future of Finance in 2025 – Expanded…

  • DeepTech & India - a viable story?

    "In the Valley, you find deeper bench strength and a stronger foundation. It's not just about hiring engineers or…

  • Product Management in 2025

    Continuing my series of articles on evolution of Product management over years : Product Management 2024 Product…

    1 Comment
  • Now to the Era of Agent Experience (AX)

    Absolutely. Here's the updated version of your LinkedIn article on Agent Experience (AX) with a new data-backed section…

  • Deep AI Agents: An Overview

    A deep AI agent refers to an autonomous system powered by deep learning (a subset of machine learning). These agents…

  • Why I created & nurture "ProductNurture" Community🌱🎨

    When times are hard - its our light within joined together which protects us Another scene which explains my emotions…

    4 Comments
  • AI Update Product Managers [One Place Solution]

    Key AI Large Impacting Applications & their LLMs : OpenAI ChatGPT Microsoft Google Gemini Meta AI using Llama 3 version…

    2 Comments
  • Reasons for Middle Management Foundering

    Continuing my previous Article on Middle Management, am trying to understand possible reasons for Top Leadership which…

  • Healthy Attributes of Middle Manager PMs

    Trust me this is way trickier than you suppose! 🏄🎢 And this one is the most crucial health indicator for your team's…

Others also viewed

Explore content categories