Uses of Partial Correlation

Uses of Partial Correlation

There are many ways to accomplish our goal to develop a simplest predictive model. A common and easiest approach is to remove all non-significant predictors (high p-values) from the full model while keeping an eye on the increasing R-square and the reducing RMSE (Root Mean Square Error). However, the problem arises when the significance of one predictor depends on the other predictors that are in the model. Highly correlated variables make it difficult to determine which predictor to be removed from the model to accomplish our purposes. Shedding light on this topic is the goal of this article. As always the immediate audience is me. 

This is where, in our story, we call upon the powers of our protagonist, the Partial Correlation. Partial Correlation is a measure of the association between two continuous variables while controlling for the effect of one or more covariates.

For Example - 

Exhibit 1 -

Partial Correlation will allow us to see correlations between each predictor and the response, after adjusting for other predictors. Here we are controlling for the effect of price on all variables. We see Baths, Square Feet, Miles to Resort, and Acres have a higher partial correlation with Price (Exhibit 1 - Partial Corr). While DoM, Beds, Cars, and Years Old have a lower partial correlation with the response. We also see DoM has the highest p-value (0.9249) as shown in Exhibit 2.

Exhibit 2 -

Let's begin by removing DoM variable from our model because of its high p-value and low partial correlation with price.

Exhibit 3 -

As we can see, upon removing DoM from our model our RMSE has reduced from 64.52 to 63.65(exhibit 3).

We will now remove the next variable with high p-value - Beds with p-value 0.5622 and then followed by Cars and Years Old. It's important to note that non-significant variables need to be removed one at a time because each removal will affect the significance of other variables. After removal of non-significant variables, a p-value of Square Feet reduces to 0.0240, statistically significant value. We are now left with those variables - Baths, Square Feet, Miles to Resort, Acres - that have a highest partial correlation with Price. 

This approach in removing variables from the model gives backbone and reasoning into our thought process in simplifying the model and in addition, this approach provides better visibility into how to modify a model when the significance of one predictor depends on the other predictors that are in the model. 

Analytics tool and Data Source: JMP

To view or add a comment, sign in

More articles by Vijay Patha

  • Tool#4 - Argument Boxes

    * This is an excerpt from the book Machine Learning Product Manager: 10 Tools to Jumpstart your Career* Product…

  • Tool #1 to succeed as a Machine Learning Product Manager

    * This is an excerpt from the book Machine Learning Product Manager: 10 Tools to Jumpstart your Career** By reading…

    2 Comments
  • Teamwork Myths

    When was the last time you paid attention to your breathing? Why would you? For most of us, breathing is natural and…

    1 Comment
  • Buy bitcoin? A case for and against

    Your friend might have paid off her student loans from the recent surge in the bitcoin's value. Like me, you might be…

  • Uncomplicated intro to Qubits

    How would solve a maze problem 20 years from now? Here is quick analogy comparing the classical and quantum computing…

  • Ordering Product Road Map

    Product road map should be ordered not just prioritized. Prioritization is one way to order a product road map.

  • Just got chi-squared!

    One of these graph is telling a false story! The question we are attempting to answer is, "Does marital status and/or…

  • Symphony of Variables Impact

    Nothing at work is more exciting to me than to bring data driven insights that can drive clarity into business…

  • Indicators of Multicollinearity

    As always the immediate audience for this article is myself. I don’t think there is no shortage of supply of…

    2 Comments

Others also viewed

Explore content categories