SlideShare a Scribd company logo
1 of 26
Download to read offline
IBM SPSS presentation Amsterdam, 11th November 2014 
Drs. Ing. J.A.C.M. Smit (Jan) 
Director of 
STATSCONsult, based in Drunen NL 
11/11/2014 
STATSCONsult, Logistic Regression, IBM SPSS presentation
STATSCONsult 
Support, marketing and Sales of software products for statistical analyses 
Courses in Statistics 
Consultancy in Data Analyses 
Jan Smit worked for SPSS from 1984 until 1989. 
11/11/2014 
STATSCONsult, Logistic Regression, IBM SPSS presentation
STATSCONsult Consultancy 
SPSS Intro courses 
SPSS assistance in data analyses 
SPSS advanced courses 
SPSS Risk Analyses (including Weight of Evidence) 
11/11/2014 
STATSCONsult, Logistic Regression, IBM SPSS presentation
Examples of Logistic Regression 
We wish to model the likelihood of an event that is likely to happen which depends on a number of factors (predictors): 
◦To predict whether a patient has (or will have) a given disease 
◦Prediction of a customer's propensity to purchase an appliance (TV) 
◦Prediction of passing an exam 
◦Prediction of paying back a loan in full 
◦Risk analyses is done with Logistic Regression 
11/11/2014 
STATSCONsult, Logistic Regression, IBM SPSS presentation
What are the assumptions of using Logistic Regression? 
The predictors are not too much highly multiple correlated (multicollinearity) 
A continuous predictor should have a monotone descending (ascending) probability of the dependent variable in the data 
We obtain a (model + error); residuals (=error) should not dominate 
Model should be interpretable, easy to use and be useful for forecasting 
11/11/2014 
STATSCONsult, Logistic Regression, IBM SPSS presentation
Logistic regression, application 
We analyse the effect of a number of independent predictors (x1, x2, .. xn) on a dependent variable Y, where Y in [0,1] 
Covariates are predictors, for which we wish to correct (such as age) 
Predictors can be continuous, nominal or ordinal 
◦Independent variables can be continuous, Age 
◦Ordinal or Nominal, Level of Education, 
11/11/2014 
STATSCONsult, Logistic Regression, IBM SPSS presentation
Data 
1500 observations 
We wish to have a model of Previous Default (Y=1) 
From now on, we say Previous Default (risk to pay-off bankloan) =“Risk” 
Interpret the model, use the model for prediction. 
Based on the predictors (Age, .. ,Household Income) 
548 observations have Risks (Y=1) in our data set 
Here 90% of observations are used for the model; the remaining observations are used for prediction. 
11/11/2014 
STATSCONsult, Logistic Regression, IBM SPSS presentation
What are my odds? 
We cannot use regression, though we use all the theory of linear regression. 
In Logistic regression our model is: 
◦log(P(y=1)/P(y=0) )= a + b1*x1 + b2*x2 + ..+ bn*xn 
◦Linear regression : Y= a + b1*x1 + b2*x2 + ..+ bn*xn (nearly the same) 
◦Odds : P(y=1), P(y=0) and P(y=1)/P(y=0) ; my odds are 2 to 1, meaning P(y=1)/P(y=0)=2 
Log(odds) makes statistics possible : 
◦P=2/3: odds ratio= 2; log(2)=0,69 
◦P=1/2: odds ratio=1; log(1)=0 
A coefficient is the change in the log odds, when other factors are fixed. 
Sometimes I have the Odds against, or Odds on, or Odds even. 
11/11/2014 
STATSCONsult, Logistic Regression, IBM SPSS presentation
Bankloan data 
We wish to model the chance of paying back in full a bank loan. When Risk =1, the loan was not in the end returned to the bank. 
Y : Risk{1=yes, 0=no} 
X : A number of factors that may affect Y 
Age in years age 
Level of education ed 
Years with current employer employ 
Years at current address address 
Household income in thousands income 
Debt to income ratio (x100) debtinc 
Credit card debt in thousands creddebt 
Other debt in thousands othdebt 
11/11/2014 
STATSCONsult, Logistic Regression, IBM SPSS presentation
Make groups via visual binning 
11/11/2014 
STATSCONsult, Logistic Regression, IBM SPSS presentation
Odd ratios of risk decreases with higher values for age 
11/11/2014 
STATSCONsult, Logistic Regression, IBM SPSS presentation
Dependency of Age on Risk 
11/11/2014 
STATSCONsult, Logistic Regression, IBM SPSS presentation
AND IN FORMULAS 
We read : Log Odds = log(Risk=1 / Risk=0) = Constant + B * age = 
1,250 – 0,055*age 
For age=20 : LogOdds = 1,25-1,1= 0,15 
For age=30 : LogOdds = 1,25- 1,65 = -0,4 
For age=40 : LogOdds = 1,25 – 2,2 = -0,95 
If age= 22,7 the LogOdds=0 
According to model : 
For age=20 : OddsRatio = exp(0,15) =1,15 
For age=30 : OddsRatio = exp(-0,4) = 0,67 
For age=40 : OddsRatio = exp(-0,95)= 0,4 
Probability : 
For age=20 : P(yes)= 0,54 
For age=30 : P(Y=1) = 0,40 
For age=40 : P(Y=10 = 0,28 
We conclude Age can be used as an predictor for Risk (as Sig-P < 0,05) 
11/11/2014 
STATSCONsult, Logistic Regression, IBM SPSS presentation
Usage of dialog in IBM SPSS: 
11/11/2014 
STATSCONsult, Logistic Regression, IBM SPSS presentation
Specify categorical predictors 
11/11/2014 
STATSCONsult, Logistic Regression, IBM SPSS presentation
Output (2) from initial stage (all main effects) The -2 log likelihood is leading. 
11/11/2014 
STATSCONsult, Logistic Regression, IBM SPSS presentation
Output of predictors and effect on Risk 
11/11/2014 
STATSCONsult, Logistic Regression, IBM SPSS presentation
What predictors can we use in the model to estimate “Risk” 
If the Sig.-p< 0,05 for a predictor, we may conclude that this predictor has an effect on the depended variable (a significance effect). 
If the Sig.-p>0,05 for a predictor, we may conclude that we are uncertain that this predictor has an effect on the depended variable. 
Watch out for pit falls (remove a variable that has no effect, and re-estimate). 
11/11/2014 
STATSCONsult, Logistic Regression, IBM SPSS presentation
Modelling 
By using Backwards (LR), at each step we re-estimate the model, leaving out a non- significant predictor: 
After this step : only the variables: age, employ, debtinc, creddebt are significant 
Note that correlations of predictors may affect the order of inclusion in model (employ and address are highly correlated) 
11/11/2014 
STATSCONsult, Logistic Regression, IBM SPSS presentation
After Backward Deletion 
11/11/2014 
STATSCONsult, Logistic Regression, IBM SPSS presentation
Interpretation of the model 
If the coefficient of a predictor < 0 the odd ratio decreases for larger values. 
Large coefficients (positive or negative) are more important (go with large Wald statistics and small Sig.-p values). 
Here people with 
1.short period at current employer (Change) and 
2.high Credit Card Debts (Expenders) and 
3.high values of Debts to Income ratio (Have Fun) and 
4.low ages (Young) 
show high risk. 
11/11/2014 
STATSCONsult, Logistic Regression, IBM SPSS presentation
Classification on the data in model; If we adjust the cut value to a lower p (0,5) the Predicted Yes column values becomes lower. 
11/11/2014 
STATSCONsult, Logistic Regression, IBM SPSS presentation
Model expression 
The model is: 
Log(Risk=1/ Risk=0) = -0,133 (constant) 
- 0,213 * employ (from 0 to 50) 
+ 0,483 * creddebt (from 0 to 36) 
+ 0,102 * debtinc (from 0 to 40) 
- 0,040 * age (from 18 to 60) 
If this expression > 0, the probability > 0,5 
11/11/2014 
STATSCONsult, Logistic Regression, IBM SPSS presentation
Prediction 
Prediction is rather good (102 out of 133) 
Make use of the model and apply this to the remaining observations that are not included in the model. 
65+ 15 were formally classified as “No Risk” 
65 +16 are selected in model as “No Risk” 
We are able to change the cut off value of 0,5 
11/11/2014 
STATSCONsult, Logistic Regression, IBM SPSS presentation
Comparison Classification Trees and Logistic Regression 
If the number of variables is high the result of LR still is simple; CT output will become large and complex. 
CT finds interactions, segments, with highest P. With LR segments are determined with high probability. 
11/11/2014 
STATSCONsult, Logistic Regression, IBM SPSS presentation
Vragen 
Jan Smit 
jan.smit@statsconsult.nl 
+31 416 378 125 
http://www.statsconsult.nl/ 
11/11/2014 
STATSCONsult, Logistic Regression, IBM SPSS presentation

More Related Content

Viewers also liked

04. logistic regression ( 로지스틱 회귀 )
04. logistic regression ( 로지스틱 회귀 )04. logistic regression ( 로지스틱 회귀 )
04. logistic regression ( 로지스틱 회귀 )Jeonghun Yoon
 
Logistic Regression: Behind the Scenes
Logistic Regression: Behind the ScenesLogistic Regression: Behind the Scenes
Logistic Regression: Behind the ScenesChris White
 
VSSML16 L2. Ensembles and Logistic Regression
VSSML16 L2. Ensembles and Logistic RegressionVSSML16 L2. Ensembles and Logistic Regression
VSSML16 L2. Ensembles and Logistic RegressionBigML, Inc
 
Stepwise Logistic Regression - Lecture for Students /Faculty of Mathematics a...
Stepwise Logistic Regression - Lecture for Students /Faculty of Mathematics a...Stepwise Logistic Regression - Lecture for Students /Faculty of Mathematics a...
Stepwise Logistic Regression - Lecture for Students /Faculty of Mathematics a...Alexander Efremov
 
Logistic regression (blyth 2006) (simplified)
Logistic regression (blyth 2006) (simplified)Logistic regression (blyth 2006) (simplified)
Logistic regression (blyth 2006) (simplified)MikeBlyth
 
Logistic Regression in Sports Research
Logistic Regression in Sports ResearchLogistic Regression in Sports Research
Logistic Regression in Sports ResearchJ P Verma
 
Fault prediction using logistic regression (Python)
Fault prediction using logistic regression (Python)Fault prediction using logistic regression (Python)
Fault prediction using logistic regression (Python)Binayak Dutta
 
Intro to Classification: Logistic Regression & SVM
Intro to Classification: Logistic Regression & SVMIntro to Classification: Logistic Regression & SVM
Intro to Classification: Logistic Regression & SVMNYC Predictive Analytics
 
Week 7 spss 2 2013
Week 7 spss 2 2013Week 7 spss 2 2013
Week 7 spss 2 2013wawaaa789
 
Probability Concepts Applications
Probability Concepts  ApplicationsProbability Concepts  Applications
Probability Concepts Applicationsguest44b78
 
Statistical Package for Social Science (SPSS)
Statistical Package for Social Science (SPSS)Statistical Package for Social Science (SPSS)
Statistical Package for Social Science (SPSS)sspink
 
Basic Probability
Basic Probability Basic Probability
Basic Probability kaurab
 
Probability powerpoint
Probability powerpointProbability powerpoint
Probability powerpointTiffany Deegan
 
Basic Concept Of Probability
Basic Concept Of ProbabilityBasic Concept Of Probability
Basic Concept Of Probabilityguest45a926
 
Probability Powerpoint
Probability PowerpointProbability Powerpoint
Probability Powerpointspike2904
 

Viewers also liked (19)

04. logistic regression ( 로지스틱 회귀 )
04. logistic regression ( 로지스틱 회귀 )04. logistic regression ( 로지스틱 회귀 )
04. logistic regression ( 로지스틱 회귀 )
 
Probabilty.
Probabilty.Probabilty.
Probabilty.
 
Logistic Regression: Behind the Scenes
Logistic Regression: Behind the ScenesLogistic Regression: Behind the Scenes
Logistic Regression: Behind the Scenes
 
VSSML16 L2. Ensembles and Logistic Regression
VSSML16 L2. Ensembles and Logistic RegressionVSSML16 L2. Ensembles and Logistic Regression
VSSML16 L2. Ensembles and Logistic Regression
 
Binary Logistic Regression Example
Binary Logistic Regression ExampleBinary Logistic Regression Example
Binary Logistic Regression Example
 
Stepwise Logistic Regression - Lecture for Students /Faculty of Mathematics a...
Stepwise Logistic Regression - Lecture for Students /Faculty of Mathematics a...Stepwise Logistic Regression - Lecture for Students /Faculty of Mathematics a...
Stepwise Logistic Regression - Lecture for Students /Faculty of Mathematics a...
 
Logistic regression (blyth 2006) (simplified)
Logistic regression (blyth 2006) (simplified)Logistic regression (blyth 2006) (simplified)
Logistic regression (blyth 2006) (simplified)
 
Logistic Regression in Sports Research
Logistic Regression in Sports ResearchLogistic Regression in Sports Research
Logistic Regression in Sports Research
 
Fault prediction using logistic regression (Python)
Fault prediction using logistic regression (Python)Fault prediction using logistic regression (Python)
Fault prediction using logistic regression (Python)
 
Multilevel Binary Logistic Regression
Multilevel Binary Logistic RegressionMultilevel Binary Logistic Regression
Multilevel Binary Logistic Regression
 
Intro to Classification: Logistic Regression & SVM
Intro to Classification: Logistic Regression & SVMIntro to Classification: Logistic Regression & SVM
Intro to Classification: Logistic Regression & SVM
 
Week 7 spss 2 2013
Week 7 spss 2 2013Week 7 spss 2 2013
Week 7 spss 2 2013
 
Probability Concepts Applications
Probability Concepts  ApplicationsProbability Concepts  Applications
Probability Concepts Applications
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
 
Statistical Package for Social Science (SPSS)
Statistical Package for Social Science (SPSS)Statistical Package for Social Science (SPSS)
Statistical Package for Social Science (SPSS)
 
Basic Probability
Basic Probability Basic Probability
Basic Probability
 
Probability powerpoint
Probability powerpointProbability powerpoint
Probability powerpoint
 
Basic Concept Of Probability
Basic Concept Of ProbabilityBasic Concept Of Probability
Basic Concept Of Probability
 
Probability Powerpoint
Probability PowerpointProbability Powerpoint
Probability Powerpoint
 

Similar to Haal meer IBM SPSS Statistics 11.11.14 Voorspellen aan de hand van logistische regressie STATSCONsult

Terry Johns: Uncertainty - understanding the impact and the importance of rec...
Terry Johns: Uncertainty - understanding the impact and the importance of rec...Terry Johns: Uncertainty - understanding the impact and the importance of rec...
Terry Johns: Uncertainty - understanding the impact and the importance of rec...Association for Project Management
 
Creating Risk Profile Graphs
Creating  Risk  Profile  GraphsCreating  Risk  Profile  Graphs
Creating Risk Profile Graphspuspaltamuli
 
CHAPTER 11 LOGISTIC REGRESSION.pptx
CHAPTER 11 LOGISTIC REGRESSION.pptxCHAPTER 11 LOGISTIC REGRESSION.pptx
CHAPTER 11 LOGISTIC REGRESSION.pptxUmaDeviAnanth
 
Estimation of parameters.pptxxxxxxxxxxxx
Estimation of parameters.pptxxxxxxxxxxxxEstimation of parameters.pptxxxxxxxxxxxx
Estimation of parameters.pptxxxxxxxxxxxxAliceRivera13
 
CABT SHS Statistics & Probability - Estimation of Parameters (intro)
CABT SHS Statistics & Probability -  Estimation of Parameters (intro)CABT SHS Statistics & Probability -  Estimation of Parameters (intro)
CABT SHS Statistics & Probability - Estimation of Parameters (intro)Gilbert Joseph Abueg
 
Summer 07-mfin7011-tang1922
Summer 07-mfin7011-tang1922Summer 07-mfin7011-tang1922
Summer 07-mfin7011-tang1922stone55
 
Machine learning algorithms and business use cases
Machine learning algorithms and business use casesMachine learning algorithms and business use cases
Machine learning algorithms and business use casesSridhar Ratakonda
 
BOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACRO
BOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACROBOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACRO
BOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACROAnthony Kilili
 
Logistic-Regression-Webinar.pdf
Logistic-Regression-Webinar.pdfLogistic-Regression-Webinar.pdf
Logistic-Regression-Webinar.pdfVishaliKalra2
 
Hadoop World 2011: Building Relational Event History Model with Hadoop - Josh...
Hadoop World 2011: Building Relational Event History Model with Hadoop - Josh...Hadoop World 2011: Building Relational Event History Model with Hadoop - Josh...
Hadoop World 2011: Building Relational Event History Model with Hadoop - Josh...Cloudera, Inc.
 
Quantifying an association to predict future events chapt
Quantifying an association to predict future events chaptQuantifying an association to predict future events chapt
Quantifying an association to predict future events chaptMARK547399
 
Consumer credit-risk3440
Consumer credit-risk3440Consumer credit-risk3440
Consumer credit-risk3440stone55
 
Intro to Quant Trading Strategies (Lecture 9 of 10)
Intro to Quant Trading Strategies (Lecture 9 of 10)Intro to Quant Trading Strategies (Lecture 9 of 10)
Intro to Quant Trading Strategies (Lecture 9 of 10)Adrian Aley
 
Eco550 Assignment 1
Eco550 Assignment 1Eco550 Assignment 1
Eco550 Assignment 1Lisa Kennedy
 
A PRACTICAL POWERFUL ROBUST AND INTERPRETABLE FAMILY OF CORRELATION COEFFICIE...
A PRACTICAL POWERFUL ROBUST AND INTERPRETABLE FAMILY OF CORRELATION COEFFICIE...A PRACTICAL POWERFUL ROBUST AND INTERPRETABLE FAMILY OF CORRELATION COEFFICIE...
A PRACTICAL POWERFUL ROBUST AND INTERPRETABLE FAMILY OF CORRELATION COEFFICIE...Savas Papadopoulos, Ph.D
 
Essay on-data-analysis
Essay on-data-analysisEssay on-data-analysis
Essay on-data-analysisRaman Kannan
 
Kpi – Key Performance
Kpi – Key PerformanceKpi – Key Performance
Kpi – Key Performancequantanalytics
 

Similar to Haal meer IBM SPSS Statistics 11.11.14 Voorspellen aan de hand van logistische regressie STATSCONsult (20)

Terry Johns: Uncertainty - understanding the impact and the importance of rec...
Terry Johns: Uncertainty - understanding the impact and the importance of rec...Terry Johns: Uncertainty - understanding the impact and the importance of rec...
Terry Johns: Uncertainty - understanding the impact and the importance of rec...
 
Creating Risk Profile Graphs
Creating  Risk  Profile  GraphsCreating  Risk  Profile  Graphs
Creating Risk Profile Graphs
 
CHAPTER 11 LOGISTIC REGRESSION.pptx
CHAPTER 11 LOGISTIC REGRESSION.pptxCHAPTER 11 LOGISTIC REGRESSION.pptx
CHAPTER 11 LOGISTIC REGRESSION.pptx
 
Estimation of parameters.pptxxxxxxxxxxxx
Estimation of parameters.pptxxxxxxxxxxxxEstimation of parameters.pptxxxxxxxxxxxx
Estimation of parameters.pptxxxxxxxxxxxx
 
CABT SHS Statistics & Probability - Estimation of Parameters (intro)
CABT SHS Statistics & Probability -  Estimation of Parameters (intro)CABT SHS Statistics & Probability -  Estimation of Parameters (intro)
CABT SHS Statistics & Probability - Estimation of Parameters (intro)
 
Summer 07-mfin7011-tang1922
Summer 07-mfin7011-tang1922Summer 07-mfin7011-tang1922
Summer 07-mfin7011-tang1922
 
Lecture 1
Lecture 1Lecture 1
Lecture 1
 
Bivariate Regression
Bivariate RegressionBivariate Regression
Bivariate Regression
 
Machine learning algorithms and business use cases
Machine learning algorithms and business use casesMachine learning algorithms and business use cases
Machine learning algorithms and business use cases
 
Econometrics
EconometricsEconometrics
Econometrics
 
BOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACRO
BOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACROBOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACRO
BOOTSTRAPPING TO EVALUATE RESPONSE MODELS: A SAS® MACRO
 
Logistic-Regression-Webinar.pdf
Logistic-Regression-Webinar.pdfLogistic-Regression-Webinar.pdf
Logistic-Regression-Webinar.pdf
 
Hadoop World 2011: Building Relational Event History Model with Hadoop - Josh...
Hadoop World 2011: Building Relational Event History Model with Hadoop - Josh...Hadoop World 2011: Building Relational Event History Model with Hadoop - Josh...
Hadoop World 2011: Building Relational Event History Model with Hadoop - Josh...
 
Quantifying an association to predict future events chapt
Quantifying an association to predict future events chaptQuantifying an association to predict future events chapt
Quantifying an association to predict future events chapt
 
Consumer credit-risk3440
Consumer credit-risk3440Consumer credit-risk3440
Consumer credit-risk3440
 
Intro to Quant Trading Strategies (Lecture 9 of 10)
Intro to Quant Trading Strategies (Lecture 9 of 10)Intro to Quant Trading Strategies (Lecture 9 of 10)
Intro to Quant Trading Strategies (Lecture 9 of 10)
 
Eco550 Assignment 1
Eco550 Assignment 1Eco550 Assignment 1
Eco550 Assignment 1
 
A PRACTICAL POWERFUL ROBUST AND INTERPRETABLE FAMILY OF CORRELATION COEFFICIE...
A PRACTICAL POWERFUL ROBUST AND INTERPRETABLE FAMILY OF CORRELATION COEFFICIE...A PRACTICAL POWERFUL ROBUST AND INTERPRETABLE FAMILY OF CORRELATION COEFFICIE...
A PRACTICAL POWERFUL ROBUST AND INTERPRETABLE FAMILY OF CORRELATION COEFFICIE...
 
Essay on-data-analysis
Essay on-data-analysisEssay on-data-analysis
Essay on-data-analysis
 
Kpi – Key Performance
Kpi – Key PerformanceKpi – Key Performance
Kpi – Key Performance
 

More from Daniel Westzaan

Haal meer uit IBM SPSS Statistics 11.11.14 Laila Fettah - IBM
Haal meer uit IBM SPSS Statistics 11.11.14 Laila Fettah - IBMHaal meer uit IBM SPSS Statistics 11.11.14 Laila Fettah - IBM
Haal meer uit IBM SPSS Statistics 11.11.14 Laila Fettah - IBMDaniel Westzaan
 
Haal meer uit IBM SPSS Statistics 11.11.14 Profileren met beslissingsbomen ...
Haal meer uit IBM SPSS Statistics 11.11.14   Profileren met beslissingsbomen ...Haal meer uit IBM SPSS Statistics 11.11.14   Profileren met beslissingsbomen ...
Haal meer uit IBM SPSS Statistics 11.11.14 Profileren met beslissingsbomen ...Daniel Westzaan
 
PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014
PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014
PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014Daniel Westzaan
 
BA Summit 2014 Imtech ICT Workshop Tips & Tricks; maak optimaal gebruik van I...
BA Summit 2014 Imtech ICT Workshop Tips & Tricks; maak optimaal gebruik van I...BA Summit 2014 Imtech ICT Workshop Tips & Tricks; maak optimaal gebruik van I...
BA Summit 2014 Imtech ICT Workshop Tips & Tricks; maak optimaal gebruik van I...Daniel Westzaan
 
BA Summit 2014 Predictive maintenance: Met big data het lek dichten
BA Summit 2014  Predictive maintenance: Met big data het lek dichtenBA Summit 2014  Predictive maintenance: Met big data het lek dichten
BA Summit 2014 Predictive maintenance: Met big data het lek dichtenDaniel Westzaan
 
BA Summit 2014 Social media analytics volgers en vrienden zeggen niet alles
BA Summit 2014 Social media analytics volgers en vrienden zeggen niet allesBA Summit 2014 Social media analytics volgers en vrienden zeggen niet alles
BA Summit 2014 Social media analytics volgers en vrienden zeggen niet allesDaniel Westzaan
 
BA Summit 2014 Werk efficiënter met IBM SPSS- en IBM Cognos-oplossingen speci...
BA Summit 2014 Werk efficiënter met IBM SPSS- en IBM Cognos-oplossingen speci...BA Summit 2014 Werk efficiënter met IBM SPSS- en IBM Cognos-oplossingen speci...
BA Summit 2014 Werk efficiënter met IBM SPSS- en IBM Cognos-oplossingen speci...Daniel Westzaan
 
BA Summit 2014 Ontdek de nieuwe mogelijkheden van IBM SPSS Modeler 16.0
BA Summit 2014 Ontdek de nieuwe mogelijkheden van IBM SPSS Modeler 16.0BA Summit 2014 Ontdek de nieuwe mogelijkheden van IBM SPSS Modeler 16.0
BA Summit 2014 Ontdek de nieuwe mogelijkheden van IBM SPSS Modeler 16.0Daniel Westzaan
 
BA Summit 2014 BMW brengt klantbeleving naar een hogere versnelling met Predi...
BA Summit 2014 BMW brengt klantbeleving naar een hogere versnelling met Predi...BA Summit 2014 BMW brengt klantbeleving naar een hogere versnelling met Predi...
BA Summit 2014 BMW brengt klantbeleving naar een hogere versnelling met Predi...Daniel Westzaan
 
BA Summit 2014 Haal heldere inzichten uit complexe data met visualisatie
BA Summit 2014 Haal heldere inzichten uit complexe data met visualisatie BA Summit 2014 Haal heldere inzichten uit complexe data met visualisatie
BA Summit 2014 Haal heldere inzichten uit complexe data met visualisatie Daniel Westzaan
 
BA Summit 2014 Maak kennis met de revolutionaire analytics van IBM Watson
BA Summit 2014 Maak kennis met de revolutionaire analytics van IBM WatsonBA Summit 2014 Maak kennis met de revolutionaire analytics van IBM Watson
BA Summit 2014 Maak kennis met de revolutionaire analytics van IBM WatsonDaniel Westzaan
 
BA Summit 2014 Vergroot uw inzicht: kijk vooruit met business intelligence en...
BA Summit 2014 Vergroot uw inzicht: kijk vooruit met business intelligence en...BA Summit 2014 Vergroot uw inzicht: kijk vooruit met business intelligence en...
BA Summit 2014 Vergroot uw inzicht: kijk vooruit met business intelligence en...Daniel Westzaan
 
BA Summit 2014 Stuur uw ondernemingsstrategie met financiële en operationele ...
BA Summit 2014 Stuur uw ondernemingsstrategie met financiële en operationele ...BA Summit 2014 Stuur uw ondernemingsstrategie met financiële en operationele ...
BA Summit 2014 Stuur uw ondernemingsstrategie met financiële en operationele ...Daniel Westzaan
 
Ba Summit 2014 Betere planning en forecasting met predictive analytics
Ba Summit 2014   Betere planning en forecasting met predictive analyticsBa Summit 2014   Betere planning en forecasting met predictive analytics
Ba Summit 2014 Betere planning en forecasting met predictive analyticsDaniel Westzaan
 

More from Daniel Westzaan (14)

Haal meer uit IBM SPSS Statistics 11.11.14 Laila Fettah - IBM
Haal meer uit IBM SPSS Statistics 11.11.14 Laila Fettah - IBMHaal meer uit IBM SPSS Statistics 11.11.14 Laila Fettah - IBM
Haal meer uit IBM SPSS Statistics 11.11.14 Laila Fettah - IBM
 
Haal meer uit IBM SPSS Statistics 11.11.14 Profileren met beslissingsbomen ...
Haal meer uit IBM SPSS Statistics 11.11.14   Profileren met beslissingsbomen ...Haal meer uit IBM SPSS Statistics 11.11.14   Profileren met beslissingsbomen ...
Haal meer uit IBM SPSS Statistics 11.11.14 Profileren met beslissingsbomen ...
 
PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014
PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014
PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014
 
BA Summit 2014 Imtech ICT Workshop Tips & Tricks; maak optimaal gebruik van I...
BA Summit 2014 Imtech ICT Workshop Tips & Tricks; maak optimaal gebruik van I...BA Summit 2014 Imtech ICT Workshop Tips & Tricks; maak optimaal gebruik van I...
BA Summit 2014 Imtech ICT Workshop Tips & Tricks; maak optimaal gebruik van I...
 
BA Summit 2014 Predictive maintenance: Met big data het lek dichten
BA Summit 2014  Predictive maintenance: Met big data het lek dichtenBA Summit 2014  Predictive maintenance: Met big data het lek dichten
BA Summit 2014 Predictive maintenance: Met big data het lek dichten
 
BA Summit 2014 Social media analytics volgers en vrienden zeggen niet alles
BA Summit 2014 Social media analytics volgers en vrienden zeggen niet allesBA Summit 2014 Social media analytics volgers en vrienden zeggen niet alles
BA Summit 2014 Social media analytics volgers en vrienden zeggen niet alles
 
BA Summit 2014 Werk efficiënter met IBM SPSS- en IBM Cognos-oplossingen speci...
BA Summit 2014 Werk efficiënter met IBM SPSS- en IBM Cognos-oplossingen speci...BA Summit 2014 Werk efficiënter met IBM SPSS- en IBM Cognos-oplossingen speci...
BA Summit 2014 Werk efficiënter met IBM SPSS- en IBM Cognos-oplossingen speci...
 
BA Summit 2014 Ontdek de nieuwe mogelijkheden van IBM SPSS Modeler 16.0
BA Summit 2014 Ontdek de nieuwe mogelijkheden van IBM SPSS Modeler 16.0BA Summit 2014 Ontdek de nieuwe mogelijkheden van IBM SPSS Modeler 16.0
BA Summit 2014 Ontdek de nieuwe mogelijkheden van IBM SPSS Modeler 16.0
 
BA Summit 2014 BMW brengt klantbeleving naar een hogere versnelling met Predi...
BA Summit 2014 BMW brengt klantbeleving naar een hogere versnelling met Predi...BA Summit 2014 BMW brengt klantbeleving naar een hogere versnelling met Predi...
BA Summit 2014 BMW brengt klantbeleving naar een hogere versnelling met Predi...
 
BA Summit 2014 Haal heldere inzichten uit complexe data met visualisatie
BA Summit 2014 Haal heldere inzichten uit complexe data met visualisatie BA Summit 2014 Haal heldere inzichten uit complexe data met visualisatie
BA Summit 2014 Haal heldere inzichten uit complexe data met visualisatie
 
BA Summit 2014 Maak kennis met de revolutionaire analytics van IBM Watson
BA Summit 2014 Maak kennis met de revolutionaire analytics van IBM WatsonBA Summit 2014 Maak kennis met de revolutionaire analytics van IBM Watson
BA Summit 2014 Maak kennis met de revolutionaire analytics van IBM Watson
 
BA Summit 2014 Vergroot uw inzicht: kijk vooruit met business intelligence en...
BA Summit 2014 Vergroot uw inzicht: kijk vooruit met business intelligence en...BA Summit 2014 Vergroot uw inzicht: kijk vooruit met business intelligence en...
BA Summit 2014 Vergroot uw inzicht: kijk vooruit met business intelligence en...
 
BA Summit 2014 Stuur uw ondernemingsstrategie met financiële en operationele ...
BA Summit 2014 Stuur uw ondernemingsstrategie met financiële en operationele ...BA Summit 2014 Stuur uw ondernemingsstrategie met financiële en operationele ...
BA Summit 2014 Stuur uw ondernemingsstrategie met financiële en operationele ...
 
Ba Summit 2014 Betere planning en forecasting met predictive analytics
Ba Summit 2014   Betere planning en forecasting met predictive analyticsBa Summit 2014   Betere planning en forecasting met predictive analytics
Ba Summit 2014 Betere planning en forecasting met predictive analytics
 

Recently uploaded

Spark3's new memory model/management
Spark3's new memory model/managementSpark3's new memory model/management
Spark3's new memory model/managementakshesh doshi
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxFurkanTasci3
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
Data Science Project: Advancements in Fetal Health Classification
Data Science Project: Advancements in Fetal Health ClassificationData Science Project: Advancements in Fetal Health Classification
Data Science Project: Advancements in Fetal Health ClassificationBoston Institute of Analytics
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
Call Girls In Noida City Center Metro 24/7✡️9711147426✡️ Escorts Service
Call Girls In Noida City Center Metro 24/7✡️9711147426✡️ Escorts ServiceCall Girls In Noida City Center Metro 24/7✡️9711147426✡️ Escorts Service
Call Girls In Noida City Center Metro 24/7✡️9711147426✡️ Escorts Servicejennyeacort
 

Recently uploaded (20)

Spark3's new memory model/management
Spark3's new memory model/managementSpark3's new memory model/management
Spark3's new memory model/management
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptx
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
Data Science Project: Advancements in Fetal Health Classification
Data Science Project: Advancements in Fetal Health ClassificationData Science Project: Advancements in Fetal Health Classification
Data Science Project: Advancements in Fetal Health Classification
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
Call Girls In Noida City Center Metro 24/7✡️9711147426✡️ Escorts Service
Call Girls In Noida City Center Metro 24/7✡️9711147426✡️ Escorts ServiceCall Girls In Noida City Center Metro 24/7✡️9711147426✡️ Escorts Service
Call Girls In Noida City Center Metro 24/7✡️9711147426✡️ Escorts Service
 

Haal meer IBM SPSS Statistics 11.11.14 Voorspellen aan de hand van logistische regressie STATSCONsult

  • 1. IBM SPSS presentation Amsterdam, 11th November 2014 Drs. Ing. J.A.C.M. Smit (Jan) Director of STATSCONsult, based in Drunen NL 11/11/2014 STATSCONsult, Logistic Regression, IBM SPSS presentation
  • 2. STATSCONsult Support, marketing and Sales of software products for statistical analyses Courses in Statistics Consultancy in Data Analyses Jan Smit worked for SPSS from 1984 until 1989. 11/11/2014 STATSCONsult, Logistic Regression, IBM SPSS presentation
  • 3. STATSCONsult Consultancy SPSS Intro courses SPSS assistance in data analyses SPSS advanced courses SPSS Risk Analyses (including Weight of Evidence) 11/11/2014 STATSCONsult, Logistic Regression, IBM SPSS presentation
  • 4. Examples of Logistic Regression We wish to model the likelihood of an event that is likely to happen which depends on a number of factors (predictors): ◦To predict whether a patient has (or will have) a given disease ◦Prediction of a customer's propensity to purchase an appliance (TV) ◦Prediction of passing an exam ◦Prediction of paying back a loan in full ◦Risk analyses is done with Logistic Regression 11/11/2014 STATSCONsult, Logistic Regression, IBM SPSS presentation
  • 5. What are the assumptions of using Logistic Regression? The predictors are not too much highly multiple correlated (multicollinearity) A continuous predictor should have a monotone descending (ascending) probability of the dependent variable in the data We obtain a (model + error); residuals (=error) should not dominate Model should be interpretable, easy to use and be useful for forecasting 11/11/2014 STATSCONsult, Logistic Regression, IBM SPSS presentation
  • 6. Logistic regression, application We analyse the effect of a number of independent predictors (x1, x2, .. xn) on a dependent variable Y, where Y in [0,1] Covariates are predictors, for which we wish to correct (such as age) Predictors can be continuous, nominal or ordinal ◦Independent variables can be continuous, Age ◦Ordinal or Nominal, Level of Education, 11/11/2014 STATSCONsult, Logistic Regression, IBM SPSS presentation
  • 7. Data 1500 observations We wish to have a model of Previous Default (Y=1) From now on, we say Previous Default (risk to pay-off bankloan) =“Risk” Interpret the model, use the model for prediction. Based on the predictors (Age, .. ,Household Income) 548 observations have Risks (Y=1) in our data set Here 90% of observations are used for the model; the remaining observations are used for prediction. 11/11/2014 STATSCONsult, Logistic Regression, IBM SPSS presentation
  • 8. What are my odds? We cannot use regression, though we use all the theory of linear regression. In Logistic regression our model is: ◦log(P(y=1)/P(y=0) )= a + b1*x1 + b2*x2 + ..+ bn*xn ◦Linear regression : Y= a + b1*x1 + b2*x2 + ..+ bn*xn (nearly the same) ◦Odds : P(y=1), P(y=0) and P(y=1)/P(y=0) ; my odds are 2 to 1, meaning P(y=1)/P(y=0)=2 Log(odds) makes statistics possible : ◦P=2/3: odds ratio= 2; log(2)=0,69 ◦P=1/2: odds ratio=1; log(1)=0 A coefficient is the change in the log odds, when other factors are fixed. Sometimes I have the Odds against, or Odds on, or Odds even. 11/11/2014 STATSCONsult, Logistic Regression, IBM SPSS presentation
  • 9. Bankloan data We wish to model the chance of paying back in full a bank loan. When Risk =1, the loan was not in the end returned to the bank. Y : Risk{1=yes, 0=no} X : A number of factors that may affect Y Age in years age Level of education ed Years with current employer employ Years at current address address Household income in thousands income Debt to income ratio (x100) debtinc Credit card debt in thousands creddebt Other debt in thousands othdebt 11/11/2014 STATSCONsult, Logistic Regression, IBM SPSS presentation
  • 10. Make groups via visual binning 11/11/2014 STATSCONsult, Logistic Regression, IBM SPSS presentation
  • 11. Odd ratios of risk decreases with higher values for age 11/11/2014 STATSCONsult, Logistic Regression, IBM SPSS presentation
  • 12. Dependency of Age on Risk 11/11/2014 STATSCONsult, Logistic Regression, IBM SPSS presentation
  • 13. AND IN FORMULAS We read : Log Odds = log(Risk=1 / Risk=0) = Constant + B * age = 1,250 – 0,055*age For age=20 : LogOdds = 1,25-1,1= 0,15 For age=30 : LogOdds = 1,25- 1,65 = -0,4 For age=40 : LogOdds = 1,25 – 2,2 = -0,95 If age= 22,7 the LogOdds=0 According to model : For age=20 : OddsRatio = exp(0,15) =1,15 For age=30 : OddsRatio = exp(-0,4) = 0,67 For age=40 : OddsRatio = exp(-0,95)= 0,4 Probability : For age=20 : P(yes)= 0,54 For age=30 : P(Y=1) = 0,40 For age=40 : P(Y=10 = 0,28 We conclude Age can be used as an predictor for Risk (as Sig-P < 0,05) 11/11/2014 STATSCONsult, Logistic Regression, IBM SPSS presentation
  • 14. Usage of dialog in IBM SPSS: 11/11/2014 STATSCONsult, Logistic Regression, IBM SPSS presentation
  • 15. Specify categorical predictors 11/11/2014 STATSCONsult, Logistic Regression, IBM SPSS presentation
  • 16. Output (2) from initial stage (all main effects) The -2 log likelihood is leading. 11/11/2014 STATSCONsult, Logistic Regression, IBM SPSS presentation
  • 17. Output of predictors and effect on Risk 11/11/2014 STATSCONsult, Logistic Regression, IBM SPSS presentation
  • 18. What predictors can we use in the model to estimate “Risk” If the Sig.-p< 0,05 for a predictor, we may conclude that this predictor has an effect on the depended variable (a significance effect). If the Sig.-p>0,05 for a predictor, we may conclude that we are uncertain that this predictor has an effect on the depended variable. Watch out for pit falls (remove a variable that has no effect, and re-estimate). 11/11/2014 STATSCONsult, Logistic Regression, IBM SPSS presentation
  • 19. Modelling By using Backwards (LR), at each step we re-estimate the model, leaving out a non- significant predictor: After this step : only the variables: age, employ, debtinc, creddebt are significant Note that correlations of predictors may affect the order of inclusion in model (employ and address are highly correlated) 11/11/2014 STATSCONsult, Logistic Regression, IBM SPSS presentation
  • 20. After Backward Deletion 11/11/2014 STATSCONsult, Logistic Regression, IBM SPSS presentation
  • 21. Interpretation of the model If the coefficient of a predictor < 0 the odd ratio decreases for larger values. Large coefficients (positive or negative) are more important (go with large Wald statistics and small Sig.-p values). Here people with 1.short period at current employer (Change) and 2.high Credit Card Debts (Expenders) and 3.high values of Debts to Income ratio (Have Fun) and 4.low ages (Young) show high risk. 11/11/2014 STATSCONsult, Logistic Regression, IBM SPSS presentation
  • 22. Classification on the data in model; If we adjust the cut value to a lower p (0,5) the Predicted Yes column values becomes lower. 11/11/2014 STATSCONsult, Logistic Regression, IBM SPSS presentation
  • 23. Model expression The model is: Log(Risk=1/ Risk=0) = -0,133 (constant) - 0,213 * employ (from 0 to 50) + 0,483 * creddebt (from 0 to 36) + 0,102 * debtinc (from 0 to 40) - 0,040 * age (from 18 to 60) If this expression > 0, the probability > 0,5 11/11/2014 STATSCONsult, Logistic Regression, IBM SPSS presentation
  • 24. Prediction Prediction is rather good (102 out of 133) Make use of the model and apply this to the remaining observations that are not included in the model. 65+ 15 were formally classified as “No Risk” 65 +16 are selected in model as “No Risk” We are able to change the cut off value of 0,5 11/11/2014 STATSCONsult, Logistic Regression, IBM SPSS presentation
  • 25. Comparison Classification Trees and Logistic Regression If the number of variables is high the result of LR still is simple; CT output will become large and complex. CT finds interactions, segments, with highest P. With LR segments are determined with high probability. 11/11/2014 STATSCONsult, Logistic Regression, IBM SPSS presentation
  • 26. Vragen Jan Smit jan.smit@statsconsult.nl +31 416 378 125 http://www.statsconsult.nl/ 11/11/2014 STATSCONsult, Logistic Regression, IBM SPSS presentation