Calculating Correlation Values for Categorical Data

To find the correlation values for the fields in our data set, the Pearson Correlation Coefficient (PCC) was used. This requires that the data in both fields be quantitative. But what if we want to calculate the correlation between two fields where one is numerical and the other categorical, or where both are categorical?
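For reference, the standard textbook definition of the Pearson coefficient for n paired observations is given below. The symbols x_i, y_i and the bars for the field means are notation assumed here; they do not appear in the original text.

```latex
r = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}
         {\sqrt{\sum_{i=1}^{n} (x_i - \bar{x})^2}\,\sqrt{\sum_{i=1}^{n} (y_i - \bar{y})^2}}
```

Both fields must be numeric for the means and differences in this expression to be defined, which is why categorical fields need one of the alternatives discussed next.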
The Point Biserial Coefficient (PBC) is a special case of the Pearson Correlation Coefficient: it is mathematically equivalent to the PCC computed with one field coded as 0 and 1. It is used when one field has quantitative data and the other is categorical with only two possible values (dichotomous), for example gender. To calculate the PBC, the two values of the dichotomous field are coded as 0 and 1, and the quantitative data is divided into two groups accordingly. The distribution of the quantitative data within each group can then be used to show how well the two fields are correlated.
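The equivalence between the two coefficients can be checked with a minimal sketch. It assumes the dichotomous field is already coded as 0 and 1; the function names and the sample data are illustrative and not taken from the original data set.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def point_biserial(groups, values):
    """Point biserial coefficient from the group means.

    groups: list of 0/1 codes for the dichotomous field.
    values: list of quantitative values, same length as groups.
    """
    ones = [v for g, v in zip(groups, values) if g == 1]
    zeros = [v for g, v in zip(groups, values) if g == 0]
    n1, n0 = len(ones), len(zeros)
    n = n1 + n0
    mean_all = sum(values) / n
    # Population standard deviation of all values (divides by n, not n - 1).
    s_n = math.sqrt(sum((v - mean_all) ** 2 for v in values) / n)
    mean1 = sum(ones) / n1
    mean0 = sum(zeros) / n0
    return (mean1 - mean0) / s_n * math.sqrt(n1 * n0 / n ** 2)


# Illustrative data: a 0/1 coded field and a quantitative field.
groups = [0, 0, 0, 1, 1, 1]
values = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]

r_pb = point_biserial(groups, values)
r = pearson(groups, values)
print(round(r_pb, 4), round(r, 4))
```

Both calls give the same value (up to floating-point rounding), which is the sense in which the PBC is a special case of the PCC.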
Spearman’s Rank Order Coefficient is a method of estimating correlation between data that is ordinal; importantly, the values must have a meaningful order so that they can be ranked. It checks how well the relationship between the two fields can be described using a monotonic function.
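One common way to compute Spearman's coefficient is to replace each value with its rank and then apply the Pearson formula to the ranks. The sketch below follows that approach, giving tied values the average of their ranks; the function names and sample data are illustrative assumptions.

```python
import math


def average_ranks(values):
    """Return 1-based ranks, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        # Extend j to cover the run of values tied with position i.
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def spearman(x, y):
    """Spearman's rank coefficient: Pearson applied to the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Illustrative data: y grows with x, but not in a straight line.
x = [1, 2, 3, 4, 5]
y = [1, 8, 27, 64, 125]
print(spearman(x, y))
```

Because y always increases when x increases, the relationship is perfectly monotonic and the coefficient is 1 even though the relationship is not linear.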
Another method is the Chi-squared test. This requires the data to be classified into categories and the frequency of each combination of categories worked out in a table. From this table, the association between the two fields can be assessed using the Chi-squared test, which works on any pair of nominal or categorical fields.
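The frequency table and the test statistic can be sketched as follows. The field values and function names are made up for illustration. Cramér's V, a standard way of scaling the statistic to a 0 to 1 strength measure, is an addition not mentioned in the original text. The sketch stops at the statistic and its degrees of freedom; obtaining a p-value would additionally require the Chi-squared distribution, which is not implemented here.

```python
import math
from collections import Counter


def chi_squared(a, b):
    """Chi-squared statistic and degrees of freedom for two categorical lists."""
    n = len(a)
    observed = Counter(zip(a, b))   # frequency table of (a, b) combinations
    row_totals = Counter(a)
    col_totals = Counter(b)
    stat = 0.0
    for r, r_total in row_totals.items():
        for c, c_total in col_totals.items():
            # Frequency expected if the two fields were independent.
            expected = r_total * c_total / n
            stat += (observed[(r, c)] - expected) ** 2 / expected
    dof = (len(row_totals) - 1) * (len(col_totals) - 1)
    return stat, dof


def cramers_v(a, b):
    """Cramér's V: the Chi-squared statistic scaled to the range 0 to 1."""
    stat, _ = chi_squared(a, b)
    k = min(len(set(a)), len(set(b))) - 1
    return math.sqrt(stat / (len(a) * k))


# Illustrative data: 20 records with two categorical fields.
field_a = ["m"] * 10 + ["f"] * 10
field_b = ["yes"] * 8 + ["no"] * 2 + ["yes"] * 2 + ["no"] * 8

stat, dof = chi_squared(field_a, field_b)
print(round(stat, 4), dof, round(cramers_v(field_a, field_b), 4))
```

A statistic of 0 would mean the observed frequencies match exactly what independence predicts; larger values indicate a stronger departure from independence.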
