The data set has 425 cases and 15 variables pertaining to past and current customers who have borrowed from a bank for various reasons. The data set contains customer-related information such as financial standing, the reason for the loan, employment, demographic information, and the outcome or dependent variable for credit standing. Also classifying each case as good or bad, based on the institution’s experience.
Scrape, clean, and manipulate the data so that it is usable.
Create a list of the top five predictors to identify potentially risky customers. (e.g. financial standing, employment, etc.)
Compile a report as a written document of at least 1000 words and your Excel sheet (which is the Credit Risk Excel Sheet)