Why I wrote 50 lines of code when import sklearn takes three.

I’ve been building Linear Regression from scratch to build real intuition for how these algorithms work. Here’s the realization that hit me: the learning rate is not absolute; it is relative to the scale of your data.

Because I was using raw salary data ($100k+), the gradients were massive. Multiplying a massive gradient by 0.01 still produced a huge step size, causing the algorithm to overshoot the minimum entirely.

This is exactly why libraries like Scikit-Learn emphasize preprocessing pipelines. Without normalizing or standardizing your features, Gradient Descent is fighting the scale of your own data.

Abstraction is great for productivity, but implementation is essential for intuition.

#MachineLearning #Algorithms #DataScience #ArtificialIntelligence #Python #ScikitLearn
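A minimal numeric sketch of the failure mode described above (the salary figures and target values here are made up for illustration, not the author's actual dataset): with a feature left at raw ~$100k scale, a learning rate of 0.01 diverges within a few steps, while the same learning rate converges once the feature is standardized.

```python
import numpy as np

# Made-up data: raw salaries (~1e5 scale) as the feature, arbitrary target.
X = np.array([60_000., 80_000., 100_000., 120_000., 140_000.])
y = np.array([1.2, 1.5, 1.9, 2.3, 2.6])

def step(w, b, X, y, lr):
    """One gradient-descent step on MSE for the model y_hat = w*X + b."""
    err = (w * X + b) - y
    return w - lr * 2 * np.mean(err * X), b - lr * 2 * np.mean(err)

# Raw feature: the gradient w.r.t. w carries a factor of X (~1e5),
# so even lr=0.01 takes giant steps and the iterates blow up.
w_raw, b_raw = 0.0, 0.0
for _ in range(5):
    w_raw, b_raw = step(w_raw, b_raw, X, y, lr=0.01)
# |w_raw| is now astronomically large: overshooting, not learning.

# Standardize the feature and the very same learning rate converges.
Xs = (X - X.mean()) / X.std()
w_std, b_std = 0.0, 0.0
for _ in range(500):
    w_std, b_std = step(w_std, b_std, Xs, y, lr=0.01)
# w_std and b_std settle near the least-squares solution on the scaled data.
```

The stable learning-rate range depends on the curvature of the loss, which grows with the square of the feature scale, so no single fixed learning rate works across raw and standardized versions of the same data.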
Interesting! I’d appreciate it if you could tell me how you started learning these algorithms from scratch. There are so many resources that I want to learn but get really confused about where to start.
On top of that, you’ll know what to tweak when your model isn’t performing well. Once you’ve learned how an algorithm works behind the scenes, that understanding will guide you from there on.
Interesting 🤔 I mean, if you don’t normalise it, the step would be big, because step = -learning_rate × gradient. But shouldn’t that balance out, given that the path itself would also be non-normalized? 🤔
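A rough sketch of why the two effects don't cancel, under the assumption of a squared-error loss (the data here is synthetic, purely for illustration): scaling a feature by c shrinks the optimal weight only by 1/c, but multiplies the curvature of the loss by c², so the largest stable learning rate shrinks by 1/c².

```python
import numpy as np

# Synthetic data on a "nice" scale.
rng = np.random.default_rng(0)
x = rng.normal(size=100)
y = 3.0 * x + rng.normal(scale=0.1, size=100)

# Original scale: least-squares weight and the MSE curvature in w.
w1 = np.mean(x * y) / np.mean(x**2)
curv1 = 2 * np.mean(x**2)        # d^2(MSE)/dw^2

# Same data with the feature inflated 1000x (like raw salaries).
xk = 1000.0 * x
wk = np.mean(xk * y) / np.mean(xk**2)
curvk = 2 * np.mean(xk**2)

print(w1 / wk)        # ~1000: the target weight only shrank linearly
print(curvk / curv1)  # ~1e6: the curvature grew quadratically
# Plain gradient descent is stable only for lr < 2/curvature, so a fixed
# lr that worked on the original scale overshoots after the feature grows.
```

So the bigger steps and the longer path scale at different rates, which is why the imbalance shows up as divergence rather than just a rescaled trajectory.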
Abstractions are not always helpful, especially when it comes to understanding low-level details!
To get a better understanding of regression, learn about Gaussian processes and the Cholesky decomposition.
interesting
Do the circles represent confidence intervals/credible regions in any sense? I often come across similar-looking contour plots generated from MCMC output, and those represent credible regions.