0% found this document useful (0 votes)

703 views13 pages

Bank Loan Case Study

Uploaded by

230 Sahithi sri

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

703 views13 pages

Bank Loan Case Study

Uploaded by

230 Sahithi sri

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

BANK LOAN CASE

STUDY

By: VISHAL SONI

1
PROJECT DESCRIPTION
This project aims at analyzing the risk appetite of banks. When the
company receives a loan application, the company must decide for
loan approval based on the applicant’s profile. Two types of risks are
associated with the bank’s decision:

• If the applicant is likely to repay the loan, then not approving the
loan results in a loss of business to the company.
• If the applicant is not likely to repay the loan, i.e., he/she is likely to
default, then approving the loan may lead to a financial loss for the
company.

The data given contains the information about the loan application at
the time of applying for the loan. It contains two types of scenarios:

• The client with payment difficulties: he/she had late payment more
than X days on at least one of the first Y instalments of the loan in
our sample.
• All other cases: All other cases when the payment is paid on time.

Based on the scenarios a detailed analysis must be conducted and

insights needs to be drawn to help bank identify the pattern which
may be used for taking actions such as denying the loan, reducing the
amount of loan, lending (too risky applicants) at a higher interest
rate, etc. This will ensure that the consumers capable of repaying the
loan are not rejected.

2
TECH STACK USED

Microsoft®Excel®2016

Purpose – All the analysis has been performed

in excel. This tool is also used to create graphical
representation of the results and to understand
the result set better.

3
APPROACH
I have used COUNTA function to count the total rows in each
column. After that I have found the percentage of null values in each
column using the formula 1- (Total Row Counts for each columns /
Total Row Counts). After that I have removed all the columns having
null value percentages more than 30%. For column having less than
30% null value percentages I have done mean, median and mode
imputations for the missing values for columns having null value
percentages less than 30%. I have also found the outliers using
interquartile range method considering relevant columns. After going
through each column description, I have kept only relevant columns
to bring out the insights. The columns having days are converted in
to years by simply dividing the days by 365.Click on the below link
to open the excel file. The excel file contain all the analysis.

https://docs.google.com/spreadsheets/d/1wAmpXp5r76j-
fIqqpf6z6AIerqhQqjjN/edit?usp=share_link&ouid=1148888472243
03662552&rtpof=true&sd=true
4
OUTLIERS

In the above XY plotter we can see that for the target variable 1 there are income which
are beyond the limit. There are applicants who are drawing an income of around 11 crores
whereas majority of applicants are drawing income in lacs only. For analysis refer the
sheet outliers for AMT_TOTAL_INCOME in the above link.

5
In the sheet outliers for CNT_CHILDREN there are outliers for the target column 0 and as well
as 1. The XY Plotter for 0 shows 19 children which is highly unusual these days. The XY plotter
for 1 shows more than 7 children.

In the sheet outliers for Days Employed there are outliers for both target column 0 and 1.
The XY plotter shows there are applicants being employed for 1000 years from the day of
application which is clearly an anomaly.

6
DATA
IMBALANCE

In the excel file attached above the sheet Data

imbalance shows the ratio of total applicants with
payment difficulties (1) to the total applicants with
installments being paid on time (0) to be 11.39.
That is out of total applications of 3075011, 92%
applicants paid installments on time thus makes the
majority class and the rest of the 8% of applicants
had payment difficulties thus makes the minority
class.

7
UNIVARIAITE
ANALYSIS

Univariate Analysis refers to the analysis of data that contains only one variable. It
does not deal with causes or relationships and the main purpose of the analysis is to
describe the data and find patterns that exist within it. The above graph is an example
of univariate analysis which depicts simply the count of applicants for the variable
AMT_CREDIT grouped in different credit bins. Majority of the applicants were offered
loans in the credit range of 9 Lacs and above.

8
UNIVARIAITE
SEGMENTED ANALYSIS

Univariate Analysis refers to the analysis of data that contains only one
variable. Segmented analysis here means that the data variable is analyzed in
subsets. The above graph is an example of univariate segmented analysis which
depicts simply the count of segmented applicants (0 & 1) for the variable
AMT_TOTAL_INCOME grouped in different income bins. As evident from
the graph there are very few targets 1 applicant who draw an income of more
than 50 Lacs and above which can be the reason for the difficulties in the
payments. Also, maximum applicants (0,1) draw an income between 1.25 Lacs
to 1.5 Lacs but there are applicants which are having payment difficulties
despite belonging to the same income range.

9
BIVARIAITE
ANALYSIS

Bivariate Analysis refers to the analysis of data that contains only two
variables. The analysis of this type of data deals with causes and
relationships and the analysis is done to find out the relationship among
the two variables. The above graph is an example of bivariate analysis
which depicts the relation between AMT_CREDIT and
AMT_TOTAL_INCOME. As evident from the graph applicants
drawing higher income were offered higher loan amount. Thus, these
two variables follow a directionally proportional relation.

10
CORRELATIONS FOR APPLICANTS WITH PAYMENT MADE ON TIME

** The Analysis can be found on the above attached link on page 4 on sheet “Correlation for Target 0” in excel file
Bank Loan Case Study.

The heat map in the The color scheme used for The most relevant correlations
above slide shows the the heat map in the above can be seen between the
correlations between slide is green to white which variables are:
 AMT_TOTAL_INCOME to
the different variables indicates the strongest AMT_CREDIT
for the target (0) that is correlations are in green and  DAYS_EMPLOYED to
DAYS_BIRTH
applicants with no the weakest correlations
 REGION_POPULATION_RELA
payment difficulties. being in whites. TIVE to AMT_INCOME_TOTAL

11
CORRELATIONS FOR APPLICANTS WITH PAYMENT DIFFICULTIES

** The Analysis can be found on the above attached link on page 4 on sheet “Correlation for Target 1” in excel file
Bank Loan Case Study.

12
CONCLUSION
This project helps in handling the large datasets. How
exploratory data analysis can be applied to large datasets.
When dealing with the large datasets it is also important
to select only those columns which are extremely useful
to our analysis. Finding correlations columns can become
very convenient while dealing with large datasets as it
saves time selecting which columns should be considered
for analysis. The project also helps in understanding the
various terminologies used in the banking domain. The
insight drawn from the project are as follows:

Applicants drawing higher income were offered

higher loan amount by the bank.
Majority of applicants drawn an income range
between 1.25 Lacs – 1.5 Lacs, also the defaults drawn
income between the same range.
Majority of applicants were offered loans in the credit
range of 9 Lacs and above.

Bank Loan Case Study Report
No ratings yet
Bank Loan Case Study Report
23 pages
Trainity Data Analytics Training Project 6
No ratings yet
Trainity Data Analytics Training Project 6
22 pages
Bank Loan Case Study
No ratings yet
Bank Loan Case Study
34 pages
Operation Analytics and Investigating Metric Spike
No ratings yet
Operation Analytics and Investigating Metric Spike
13 pages
Hiring Process Analytics Sujata
No ratings yet
Hiring Process Analytics Sujata
13 pages
Hiring Data Insights for MNCs
100% (1)
Hiring Data Insights for MNCs
8 pages
Operational Analytics Insights
No ratings yet
Operational Analytics Insights
18 pages
Impack of Car Features
No ratings yet
Impack of Car Features
19 pages
Car Feature Impact Analysis
No ratings yet
Car Feature Impact Analysis
9 pages
IMDB Movie Analysis
No ratings yet
IMDB Movie Analysis
17 pages
Hiring Data Insights for Analysts
No ratings yet
Hiring Data Insights for Analysts
11 pages
Bank Loan Case Study
No ratings yet
Bank Loan Case Study
13 pages
Operational Analytics and Investigating Metric Spike
No ratings yet
Operational Analytics and Investigating Metric Spike
9 pages
Data Analytics Projects Portfolio
No ratings yet
Data Analytics Projects Portfolio
126 pages
Operation Analytics and Investigating Metric Spike
No ratings yet
Operation Analytics and Investigating Metric Spike
11 pages
DA Portfolio Project
No ratings yet
DA Portfolio Project
16 pages
Project 4 Imdb Movie Analysis
No ratings yet
Project 4 Imdb Movie Analysis
17 pages
Operation Analytics and Investigating Metric Spike
50% (2)
Operation Analytics and Investigating Metric Spike
14 pages
Hiring Process Analytics Project 4 On Statistics
100% (1)
Hiring Process Analytics Project 4 On Statistics
6 pages
Call Volume Trend Analysis Report
No ratings yet
Call Volume Trend Analysis Report
11 pages
Movie Industry Data Insights
No ratings yet
Movie Industry Data Insights
4 pages
Instagram User Insights & Metrics
No ratings yet
Instagram User Insights & Metrics
6 pages
Trainity Data Analytics Trainee Task 9 - ADVAIT CHAVAN - Data Analysis Portfolio
No ratings yet
Trainity Data Analytics Trainee Task 9 - ADVAIT CHAVAN - Data Analysis Portfolio
151 pages
Tableau Assignment
100% (4)
Tableau Assignment
7 pages
Imdb Movie Analysis - Project 5
No ratings yet
Imdb Movie Analysis - Project 5
13 pages
Analyzing The Impact of Car Features On Price and Profitability
No ratings yet
Analyzing The Impact of Car Features On Price and Profitability
12 pages
Assignment 1
No ratings yet
Assignment 1
6 pages
Operation & Metric Analytics Guide
No ratings yet
Operation & Metric Analytics Guide
32 pages
IMDB Movie Analysis Report
No ratings yet
IMDB Movie Analysis Report
11 pages
Autos Automobile.. EDA Project by Anjali Sinha
No ratings yet
Autos Automobile.. EDA Project by Anjali Sinha
26 pages
Trainity Insta Analytics Report
No ratings yet
Trainity Insta Analytics Report
3 pages
Vicky Gupta: Data Scientist
No ratings yet
Vicky Gupta: Data Scientist
1 page
Introduction To SQL Test Your Understanding
100% (1)
Introduction To SQL Test Your Understanding
71 pages
Project SQL
No ratings yet
Project SQL
5 pages
Great Lakes Extraa - Learn Project Business Report - 2-Kavish-Rathod
No ratings yet
Great Lakes Extraa - Learn Project Business Report - 2-Kavish-Rathod
22 pages
IMDB Movie Analysis
No ratings yet
IMDB Movie Analysis
7 pages
Python Data Analysis Tasks
No ratings yet
Python Data Analysis Tasks
9 pages
Crime Analysis
No ratings yet
Crime Analysis
13 pages
Assignment 3
No ratings yet
Assignment 3
6 pages
13 - Histograms and The Normal Distribution - pcs-1
No ratings yet
13 - Histograms and The Normal Distribution - pcs-1
28 pages
SQL Project
No ratings yet
SQL Project
15 pages
Relational Database Design & SQL Exercises
No ratings yet
Relational Database Design & SQL Exercises
11 pages
Excel Data Analysis Project-6
No ratings yet
Excel Data Analysis Project-6
1 page
Project - Data Mining: Bank - Marketing - Part1 - Data - CSV
No ratings yet
Project - Data Mining: Bank - Marketing - Part1 - Data - CSV
4 pages
Sales Analysis for Business Analysts
No ratings yet
Sales Analysis for Business Analysts
5 pages
Montgomery Fleet Equipment Inventory FA PART 2 END
No ratings yet
Montgomery Fleet Equipment Inventory FA PART 2 END
5 pages
2 ASSIGNMENT 2 (Beginning Superstore)
0% (1)
2 ASSIGNMENT 2 (Beginning Superstore)
1 page
Tableau Assignment
100% (1)
Tableau Assignment
7 pages
Myntra Data Analyst Interview Questions
No ratings yet
Myntra Data Analyst Interview Questions
34 pages
Additional Project Problem Statement - FIFA Data Analysis
No ratings yet
Additional Project Problem Statement - FIFA Data Analysis
2 pages
Uber HYD COE Business Analyst JD - Analytics & Data Science-1 PDF
No ratings yet
Uber HYD COE Business Analyst JD - Analytics & Data Science-1 PDF
3 pages
Assignment 2
No ratings yet
Assignment 2
10 pages
The Cricket Winner Prediction With Applications of ML and Data Analytics
No ratings yet
The Cricket Winner Prediction With Applications of ML and Data Analytics
18 pages
Diwali Sales Analysis EDA 1696347982
No ratings yet
Diwali Sales Analysis EDA 1696347982
8 pages
Trainity Project 2
No ratings yet
Trainity Project 2
9 pages
Data Analysis for Business Ops
100% (2)
Data Analysis for Business Ops
24 pages
Upgrad Placement Report - May
No ratings yet
Upgrad Placement Report - May
8 pages
Merit List-16-05-2024
No ratings yet
Merit List-16-05-2024
23 pages
Excel Question B.com Hons
No ratings yet
Excel Question B.com Hons
4 pages
Bank Loan Data Analysis Study
No ratings yet
Bank Loan Data Analysis Study
11 pages
The Impact of Tangible and Intagible Assets On The Smes Success. The Albanian Case
No ratings yet
The Impact of Tangible and Intagible Assets On The Smes Success. The Albanian Case
12 pages
7591950+ +paper+ +Strategies+in+Decision+Making
No ratings yet
7591950+ +paper+ +Strategies+in+Decision+Making
7 pages
Iogi2018,+06 +Ichsan+Ramadhan+Mokodompit +OK
No ratings yet
Iogi2018,+06 +Ichsan+Ramadhan+Mokodompit +OK
11 pages
European Studies Master Thesis Topics
100% (3)
European Studies Master Thesis Topics
7 pages
Factors Influencing Quality of Construction Projects in Cambodia
No ratings yet
Factors Influencing Quality of Construction Projects in Cambodia
11 pages
Business Statistics 2nd Edition J. K. Sharma Instant Download
100% (8)
Business Statistics 2nd Edition J. K. Sharma Instant Download
81 pages
Explainable Artificial Intelligence Approaches
No ratings yet
Explainable Artificial Intelligence Approaches
14 pages
84th Economic & Social Dev Conference Proceedings
No ratings yet
84th Economic & Social Dev Conference Proceedings
254 pages
Volume 34 Issue 1 Paper 2
No ratings yet
Volume 34 Issue 1 Paper 2
36 pages
Python Data Visualization Guide
No ratings yet
Python Data Visualization Guide
21 pages
Connecting Social Intelligence With Social Media Usage A Study at A University
100% (1)
Connecting Social Intelligence With Social Media Usage A Study at A University
5 pages
Portfolio Models-Introduction: I I I J I I J Ij I II I II I I
No ratings yet
Portfolio Models-Introduction: I I I J I I J Ij I II I II I I
23 pages
Factors Affecting Consumer Buying Decision Towards Choosing A Smartphone Among Young Adults
No ratings yet
Factors Affecting Consumer Buying Decision Towards Choosing A Smartphone Among Young Adults
13 pages
Essential Statistics For The Behavioral Sciences 1st Edition Privitera Fast Access
No ratings yet
Essential Statistics For The Behavioral Sciences 1st Edition Privitera Fast Access
314 pages
PSYCH ASSESSMENT REVIEWER COHENSummarizedbyKIAMERCADO 1
No ratings yet
PSYCH ASSESSMENT REVIEWER COHENSummarizedbyKIAMERCADO 1
55 pages
Viral Pandey Bankruptcy Prediction
No ratings yet
Viral Pandey Bankruptcy Prediction
7 pages
BBA-upto-6th-Sem.-batch-2022-onwards (1) BBA
No ratings yet
BBA-upto-6th-Sem.-batch-2022-onwards (1) BBA
67 pages
Journal of Research and Innovation in Education
No ratings yet
Journal of Research and Innovation in Education
18 pages
Workflow Diagram
No ratings yet
Workflow Diagram
2 pages
Assignment 2 - Data Management
No ratings yet
Assignment 2 - Data Management
68 pages
Cloud Computing Course Structure
No ratings yet
Cloud Computing Course Structure
21 pages
Machine Learning Activity-Based Costing: Conceptual Test
No ratings yet
Machine Learning Activity-Based Costing: Conceptual Test
45 pages
Index Analysis of The Causes of Vehicular Traffic Congestion in South-Eastern Nigeria
No ratings yet
Index Analysis of The Causes of Vehicular Traffic Congestion in South-Eastern Nigeria
10 pages
Specimen 2019 (IAL) MS - Unit 6 Edexcel Biology A-Level
No ratings yet
Specimen 2019 (IAL) MS - Unit 6 Edexcel Biology A-Level
6 pages
Support Vector Machines Problem Statement
No ratings yet
Support Vector Machines Problem Statement
27 pages
Psychological Hardiness & Homesickness
No ratings yet
Psychological Hardiness & Homesickness
22 pages
The Use of ICT in Educational Organizations A Quan
No ratings yet
The Use of ICT in Educational Organizations A Quan
11 pages
Hnu B215 Biostatistics For Health Sciences
No ratings yet
Hnu B215 Biostatistics For Health Sciences
13 pages
Mathematics LO - 2022
No ratings yet
Mathematics LO - 2022
44 pages
Panel Data Analysis Guide
No ratings yet
Panel Data Analysis Guide
22 pages

Bank Loan Case Study

Uploaded by

Bank Loan Case Study

Uploaded by

BANK LOAN CASE

By: VISHAL SONI

Based on the scenarios a detailed analysis must be conducted and

Purpose – All the analysis has been performed

In the excel file attached above the sheet Data

Applicants drawing higher income were offered

You might also like