Preview

Chapter 7 - K neighbours

Satisfactory Essays
Open Document
Open Document
520 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
Chapter 7 - K neighbours
7.1
a. How would this customer be classified?
A. This customer would be classified as not accepting the personal loan offer. According to the KNN_Output there appears to be overfitting due to the discrepancies in the classification matrix for training (Class 0 = 0% error, Class 1 = 0% error, Overall = 0% error), and validation error (Class 0 = 4.2% error, Class 1 = 55.85% error, and Overall = 9.1% error).

b. What is a choice of k that balances between overfitting and ignoring the predictor information?
A. A choice of k that balances between overfitting and ignoring the predictor would be k = 6. The value is chosen because it minimizes the % validation error. After testing various k levels. According to the validation error log for different k the best k points to 6, where %error training is 7.4% and validation % error is 8.75%.

c. Show the classification matrix for the validation data that results from using the best k.

d. Classify the customer using the best k
A. According to the best k the customer would not be inclined to accept the personal loan.
e. Re-partition the data, this time into training, validation, and test sets (50%: 30%: 20%). Apply the k-NN method with the k chosen above, compare the classification matrix of the test set with that of the training and validation sets. Comment on the differences and their reason.
A. Based on the training, validation, and test matrices we can see a steady increase in the percentage errors. There does not appear to be overfitting due to the minimal error discrepancies among all three matrices, from the training to the validation error there is a 5.69% difference, and from validation to test error there is a 14.05% error difference. Based on the lift chart, the model appears to make a difference even though the loan acceptance has a 82% error rate for the test classification matrix.
9.3
i. Compare the tree generated by the CT with the one generated by the RT. Are they

You May Also Find These Documents Helpful

  • Satisfactory Essays

    lab3c chem11

    • 314 Words
    • 2 Pages

    4. It is important to the class that accurate results are obtained because a class data graph will be made after all the data is recorded.…

    • 314 Words
    • 2 Pages
    Satisfactory Essays
  • Satisfactory Essays

    HW 2

    • 577 Words
    • 3 Pages

    (c) What potential problems are there for the method proposed in (b)? How can you improve it?…

    • 577 Words
    • 3 Pages
    Satisfactory Essays
  • Better Essays

    Economics and Book Online

    • 1059 Words
    • 4 Pages

    b. Show the relevant choices for this student. What determines which of these options the student will choose?…

    • 1059 Words
    • 4 Pages
    Better Essays
  • Good Essays

    Supervised-Deciding whether to issue a loan to an applicant based on demographic and financial data (with reference to a database of similar data on prior customers).…

    • 362 Words
    • 2 Pages
    Good Essays
  • Powerful Essays

    Understanding Fico Scores

    • 2191 Words
    • 9 Pages

    The research in this report was taken from a few different sources. The primary research was conducted by distributing a survey to the general public. The survey was designed to help us understand how much people actually know about their score. However, due to limited time and resources the survey was completed by only 20 people. The information provided by the survey was still useful despite the limitation on sample size. The secondary research was taken from websites, books, and training materials from the lending industry.…

    • 2191 Words
    • 9 Pages
    Powerful Essays
  • Satisfactory Essays

    d. Choose one of the variables in your dataset and classify it according to the levels of measurement. Explain how you know.…

    • 343 Words
    • 2 Pages
    Satisfactory Essays
  • Satisfactory Essays

    3505 M2 Fall 2014 Soltn

    • 3355 Words
    • 15 Pages

    e. Apply RAROC to the data on the above loan. Calculate each component of RAROC.…

    • 3355 Words
    • 15 Pages
    Satisfactory Essays
  • Satisfactory Essays

    C) We can find K by using the regression line method and time series data or cross sectional data.…

    • 381 Words
    • 2 Pages
    Satisfactory Essays
  • Satisfactory Essays

    A) What is the percent error for each of the four phenotypes between the expected kernel numbers (calculated from the Punnett square) and the observed kernel numbers?…

    • 350 Words
    • 3 Pages
    Satisfactory Essays
  • Good Essays

    B. Which unknowns are you confident that you correctly identified? What specific test was crucial in this confidence?…

    • 535 Words
    • 3 Pages
    Good Essays
  • Satisfactory Essays

    ece 6001

    • 509 Words
    • 5 Pages

    A photon counter connected to the output of a fiber detects the number of photons,…

    • 509 Words
    • 5 Pages
    Satisfactory Essays
  • Good Essays

    There are 50 credit customers who were selected for the data collection on five variables such as location, income, size, years, and credit balance. In order to understand more about their customer, AJ DAVIS must use graphical, numerical summary to be able to interpret and better expand their business in the future.…

    • 1166 Words
    • 5 Pages
    Good Essays
  • Good Essays

    Lawson Case

    • 878 Words
    • 4 Pages

    There are two chief participants in this case study, Paul Mackay and Jackie Patrick. Mackay, a sole proprietor of Lawsons (a general merchandising retail site in Riverdale, Ontario), has approached the Commercial Bank of Ontario in order to acquire an additional $194, 000 bank loan and a $26,000 line of Credit. Patrick, a first time loans officer, has been appointed to Mackay’s request. As such although apprehensive to finish her first loan, she must take into consideration the difficulties of this particular case.…

    • 878 Words
    • 4 Pages
    Good Essays
  • Satisfactory Essays

    Be Our Guest

    • 365 Words
    • 2 Pages

    Lenders are securing their funds which a high interest rate because Be Our Guest, Inc. had past due receivables. Also the lenders described in the textbook were only willing to lend 70 percent of the amount collected from customers as they don’t have a guarantee that the company’s customers will ever pay their bills.…

    • 365 Words
    • 2 Pages
    Satisfactory Essays
  • Good Essays

    f. Explain whether you believe the information in requirement d or e provides the most useful data for evaluating the potential for misstatements. Explain why.…

    • 265 Words
    • 2 Pages
    Good Essays