correlation – Row Coding

pandas columns correlation with statistical significance

November 29, 2023 by Tarik

To calculate all the p-values at once, you can use calculate_pvalues function (code below): df = pd.DataFrame({‘A’:[1,2,3], ‘B’:[2,5,3], ‘C’:[5,2,1], ‘D’:[‘text’,2,3] }) calculate_pvalues(df) The output is similar to the corr() (but with p-values): A B C A 0 0.7877 0.1789 B 0.7877 0 0.6088 C 0.1789 0.6088 0 Details: Column D is automatically ignored as it … Read more

Remove highly correlated variables

November 19, 2023 by Tarik

cor shows only NA or 1 for correlations – Why?

November 15, 2023 by Tarik

How to plot a correlation matrix into a graph?

November 9, 2023 by Tarik

Correlated features and classification accuracy

September 24, 2023 by Tarik

Correlated features do not affect classification accuracy per se. The problem in realistic situations is that we have a finite number of training examples with which to train a classifier. For a fixed number of training examples, increasing the number of features typically increases classification accuracy to a point but as the number of features … Read more

How to visualize correlation matrix as a schemaball in Matlab

August 31, 2023 by Tarik

Kinda finished I guess.. code can be found here at github. Documentation is included in the file. The yellow/magenta color (for positive/negative correlation) is configurable, as well as the fontsize of the labels and the angles at which the labels are plotted, so you can get fancy if you want and not distribute them evenly … Read more

Dealing with missing values for correlations calculation

August 9, 2023 by Tarik

Python pandas returns empty correlation matrix

August 2, 2023 by Tarik

As Jeff mentioned in the comments, the problem resulted from my columns having the object dtype. For future reference, even if the object looks numeric, check the dtype and make sure it is numeric (e.g. do foo.astype(float)) before computing the correlation matrix.

Calculate correlation with cor(), only for numerical columns

July 11, 2023 by Tarik