Data Analysis with Statistics and Visualization

So far this week, I’ve been diving into the statistical side of data analysis, which has been especially exciting given my love for numbers. I started with data visualization, focusing on the differences between bar charts and histograms and when each should be used. I also explored pie charts and their use cases, although I’ve noticed that some experts strongly dislike them and avoid using them altogether. I’m curious to hear where you stand on that. From there, I moved into more technical visualizations like line graphs and scatter plots. While studying line graphs, I learned about trendlines and how they help reveal relationships in the data. When data points cluster closely around the trendline, it suggests a positive correlation, while points that are more spread out indicate little to no correlation. However, this is not determined by sight alone. There is a statistical measure called R-squared that quantifies the strength of the relationship. I have not studied it in depth yet, but it produces a value between 0 and 1, where values closer to 1 indicate a stronger correlation. The interpretation of this value depends on the type of data being analyzed. I also reviewed the structure of graphs, specifically the independent variable on the x-axis and the dependent variable on the y-axis. One key takeaway stood out clearly. Correlation does not imply causation. Just because two variables move together does not mean that one causes the other. That is something I will carry forward as I continue studying data analysis. There is still a long week ahead, and I am looking forward to learning more. #DataAnalysis #LearningInPublic #Python #Statistics #Data

  • No alternative text description for this image

To view or add a comment, sign in

Explore content categories