Data Analysis & Statistical Insights without Pivot Tables, Data Cleansing or Math – Part 1

Laura Krylov, MBA, PMP

Published Apr 7, 2025

The productivity gains from using AI for statistical and data analysis are staggering. About a year ago, we (RFP Ninja) had to write so many instructions just to get ChatGPT to read raw tabular data sets correctly every time that we gave up and used explicit code to ensure accuracy. This past weekend, I wanted to see how much ChatGPT had improved. I started with something small, and when I realized what it could do, I kept pushing the limits.

Eventually, I recreated, and actually improved on, some work I did a few years ago that used 100,000+ rows of machine data, 25 columns wide, as the source. The file was so big it used to choke my computer when I manipulated pivot tables. GPT 4.0 sliced through the data in minutes and gave me deeper analysis than I produced the first time, when I worked on it for a week. If this kind of work were still part of my daily workload, I’d probably still be down that rabbit hole, loading data sets and asking ChatGPT to help me gain insights.

If you’re analyzing data sets on a regular basis, the best approach is to set up your own GPT to do the work. Creating a GPT is like creating a personal AI assistant with its own standing instructions, and it takes just a few minutes to set up. This way, you can keep refining the instructions as you learn what works. A separate article talks about how to create GPTs.

Here's a quick synopsis of what ChatGPT 4.0 was able to do with the first small test.

Statistical Differences

This example shows that you don’t need to remember much of anything from Stats 101 to be able to get statistical insights using ChatGPT.

This is a little data set of open rates for different outreach email options. You can argue that the sample size isn’t big enough to draw any conclusions, but it’s still a valid test to see how ChatGPT performs.

➡️ GPT Prompt: Look at this table and tell me if there is a statistical difference between any of these results for the percent opened. <<I then pasted the table above into the prompt window>>

Normally, to get this answer, I would have to compare every pair of tests one at a time, which is tedious.

🖥️ ChatGPT 4.0 Response

The statistical comparison shows that there's only one pair with a significant difference (p-value < 0.05):

Test 1 vs. Test 2 (p-value = 0.038)

All other comparisons did not show statistically significant differences. This means that, apart from Test 1 versus Test 2, there isn't enough evidence to conclude that the differences in open rates between the other tests are statistically significant.
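For readers who want to see what "comparing all the pairs" involves, here is a minimal Python sketch. The article does not say which test ChatGPT actually ran, so this assumes a pooled two-proportion z-test, which is one common choice for comparing open rates. The counts in `results` are hypothetical placeholders, not the data from my table, and the function names are made up for illustration.

```python
from itertools import combinations
from math import erfc, sqrt

# Hypothetical (emails sent, emails opened) counts for illustration only.
# These are NOT the numbers from the article's table.
results = {
    "Test 1": (200, 30),
    "Test 2": (200, 48),
    "Test 3": (200, 38),
    "Test 4": (200, 41),
}


def two_proportion_p_value(sent_a, opened_a, sent_b, opened_b):
    """Two-sided p-value from a pooled two-proportion z-test (an assumed choice of test)."""
    rate_a = opened_a / sent_a
    rate_b = opened_b / sent_b
    pooled = (opened_a + opened_b) / (sent_a + sent_b)
    std_err = sqrt(pooled * (1 - pooled) * (1 / sent_a + 1 / sent_b))
    if std_err == 0:
        return 1.0  # both groups opened nothing or everything; no difference to test
    z = (rate_a - rate_b) / std_err
    # erfc(|z| / sqrt(2)) is the two-sided tail area of the standard normal.
    return erfc(abs(z) / sqrt(2))


def compare_all_pairs(data, alpha=0.05):
    """Run the test on every pair and flag p-values below alpha."""
    rows = []
    for (name_a, (n_a, k_a)), (name_b, (n_b, k_b)) in combinations(data.items(), 2):
        p = two_proportion_p_value(n_a, k_a, n_b, k_b)
        rows.append((name_a, name_b, p, p < alpha))
    return rows


for name_a, name_b, p, significant in compare_all_pairs(results):
    flag = "significant" if significant else "not significant"
    print(f"{name_a} vs. {name_b}: p-value = {p:.3f} ({flag})")
```

With four tests there are six pairs to check, which is the tedious part that ChatGPT handled from a single prompt.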

The great thing about LLMs is that if you want more info, you just ask; there’s no need to Google p-values.

➡️ Additional Prompt: Explain why the p value of .038 shows the difference is significant

🖥️ ChatGPT 4.0 Response

A p-value of 0.038 means there's a 3.8% chance of seeing this level of difference between Test 1 and Test 2 purely by random chance if there was actually no real difference between them.

Since this value (0.038) is below the common threshold (0.05, or 5%), we say it's statistically significant. In other words, it's unlikely to have occurred just by chance, suggesting that the difference in the percent opened between Test 1 and Test 2 is probably real rather than random variation.

p-value < 0.05 → statistically significant (real difference likely)
p-value ≥ 0.05 → not statistically significant (difference could easily be random)
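To make the "purely by random chance" idea concrete, here is a small simulation sketch in Python. It assumes there is no real difference between two emails (both share one pooled open rate), replays the experiment many times, and counts how often chance alone produces a gap at least as large as the observed one. The counts are hypothetical, not my actual data, and `simulated_p_value` is a made-up name for illustration.

```python
import random


def simulated_p_value(sent_a, opened_a, sent_b, opened_b, trials=2000, seed=0):
    """Estimate a two-sided p-value by simulating a world with no real difference."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    observed_gap = abs(opened_a / sent_a - opened_b / sent_b)
    # Under "no real difference", both emails share one underlying open rate.
    pooled_rate = (opened_a + opened_b) / (sent_a + sent_b)

    at_least_as_big = 0
    for _ in range(trials):
        # Simulate each recipient opening (or not) at the shared rate.
        sim_a = sum(rng.random() < pooled_rate for _ in range(sent_a))
        sim_b = sum(rng.random() < pooled_rate for _ in range(sent_b))
        gap = abs(sim_a / sent_a - sim_b / sent_b)
        # Small tolerance so floating-point rounding does not drop exact ties.
        if gap >= observed_gap - 1e-12:
            at_least_as_big += 1
    return at_least_as_big / trials


# Hypothetical counts for illustration only: 30 of 200 opened vs. 48 of 200 opened.
p = simulated_p_value(200, 30, 200, 48)
verdict = "statistically significant" if p < 0.05 else "not statistically significant"
print(f"Simulated p-value: {p:.3f} -> {verdict}")
```

The returned fraction is an estimate of the p-value: the smaller it is, the harder it is to explain the observed gap as random variation alone.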

Conclusion

ChatGPT didn't need any explanation of the data I was providing or of what kind of statistical test to perform. It took a data set and decided on its own how to provide insights. Imagine what it can do with some additional instructions.

So next, I dug up something with 100,000+ rows of machine data 25 columns wide and started asking questions. The summary of this exercise will be my next post…
