SlideShare a Scribd company logo
1 of 24
Download to read offline
Quantifying Model Uncertainty with AI
Jos Gheerardyn, CEO Yields.io, Dec 2018
jos.gheerardyn@yields.io
www.yields.io
© Yields NV
Why we simplify models
© Yields NV
Why we validate models
© Yields NV
The need for quantificationnrofmodels
time
●  100 - 3000 models
●  Median duration of a single
model validation is >4 weeks**
●  Tiering (quant. & qual.)
○  materiality
○  risk exposure
○  regulatory impact
●  Qualitative assessments are
fairly stable over time
●  Quantitative assessments can
change quickly and allow for
accurate risk management
procedures
The number of models in financial institutions
increases with 10 – 20 % yearly*
* See http://www.mckinsey.com/business-functions/risk/our-insights/the-evolution-of-model-risk-management
** See https://www2.deloitte.com/content/dam/Deloitte/dk/Documents/financial-services/deloitte-nl-global-model-practice-survey.pdf
© Yields NV
Different types of uncertainty
Definition*
●  X is the quantity of interest we want to model
●  xi are states that are possible outcomes of X
●  P is the model
●  P is the set of available models
Risk: We know the probability of each outcome xi
Uncertainty: We do not know the probability of each outcome xi
●  Model risk: Probability measure on P
●  Model uncertainty: We do now know the probabilities on P
More precisely: Model ambiguity means several specifications for probabilities on P**
* See Knight, F. (1921) Risk, uncertainty and profit, Boston: Houghton Mifflin.
** See Epstein, L.G. (1999) A definition of uncertainty aversion, Review of Economic Studies, 65, 579–608.
© Yields NV
Two paradigms: Model averaging vs worst-case*
Bayesian model averaging
●  Prior on model parameters p(θi|Pi)
●  Prior weights on models p(Pi)
Posterior probability on model Pi
With the likelihood of the observed data under Pi being
Computing model-dependent quantities via
Example: option pricing**
Problem: How to choose the prior distributions over models?
p(Pi | x) =
p(x | Pi )p(Pi )
p(x | Pi )p(Pi )
i
∑
p(x | Pi ) = p(x |ϑi, Pi )p(ϑi |
Ei
∫ Pi )dϑi
E[ f (X)| x]= E[ f (X)| Pi, x]
i
∑ p(Pi | x)
* See Cont R. (2006) Model uncertainty and its impact on the pricing of derivative instruments. Mathematical Finance, Wiley
** See Bannör K. and Scherer M (2014) Model Risk and Uncertainty, Springer International Publishing
© Yields NV
Two paradigms: Model averaging vs worst-case*
Worst case
An agent facing uncertainty maximizes his expected utility defined as the worst-case over
all available models
Example: Ellsberg paradox (ambiguity aversion)
●  Urn A: 50/50
●  Urn B: assume subjective probability of distribution
is 40/60
The agent will go for A
max X∈AminPi ∈P EPi
[U(X)]
* See Gilboa & Schmeidler (1989) Maxmin expected utility with non-unique prior, Journal of mathematical economics
© Yields NV
Two paradigms: Model averaging vs worst-case*
Both methods have some key challenges in common
●  How do we choose the candidate models?
●  How to sample relevant parameters?
Machine learning can complement expert opinion to automatically generate candidate
models and sample model parameters.
Similar to scenario generation in specification based testing of software.
© Yields NV
Neural networks
NN Topology Neuron
wij
i
∑ xi + w0 j tanh(vj −θj )
© Yields NV
NN in action
© Yields NV
Autoencoder
●  Minimize reconstruction
error |x - x`|
●  Linear autoencoder = PCA*
●  Fast
●  Non-linear activation
Used for dimensional reduction
and outlier detection
* See e.g. https://www.cs.toronto.edu/~urtasun/courses/CSC411/14_pca.pdf
© Yields NV
Anomaly detection with autoencoder
1.  Train an autoencoder on the data and compute the
Mahalanobis distance histogram
2.  Fit a heavy-tailed distribution to the histogram to
determine a cut-off parameter. Samples with
Mahalanobis distance above this value are considered
anomalies
3.  The resulting vol surfaces are indeed outliers as can be
seen via the z-scores of their parameters
M = (x −µ)T
S−1
(x −µ)
© Yields NV
Anomaly detection with unlabelled data
13
The bottleneck of the autoencoder can be used to reduce the dimensionality of the
problem (to e.g. plot the dataset in 2D). The red dots are the detected outliers.
© Yields NV
Generative adversarial networks*
●  Contains a generator and a discriminator
●  Generative model can serve as a source for test data
* See Goodfellow I. et al, Generative adversarial networks, https://arxiv.org/abs/1406.2661
© Yields NV
Sampling parameters
Using autoencoder to partition data
PD model (logistic regression) calibrated on Lending Club Loan Data*
900k rows, 75 columns, discrete and continuous
Random train / test split: AUC = .896
Training: 99% inner points
AUC on test set = .873
Training: 99.9% inner points
AUC on test set = .845
* See https://www.kaggle.com/wendykan/lending-club-loan-data
© Yields NV
Sampling parameters – ctu’ed
Use GAN’s to generate realistic datasets
●  As an alternative to e.g. bootstrapping
●  To deal with sparse data
Example: Model performance analysis
1.  Generate labeled datasets (1000)
2.  Calibrate model on training set
3.  Measure out of sample performance
AUC -2 log (L)
Bootstrap .87 - .92 .143 - .176
GAN .84 - .93 .120 - .179
© Yields NV
Sampling models
Hyperparameter “de-tuning”
ML in general and (deep) neural network algorithms have many degrees of freedom
●  Number of layers, number of nodes and connections
●  Activation functions
●  Learning rates
●  Etc.
Grid Random SMAC* Genetic
programming**
* See https://www.cs.ubc.ca/~hutter/papers/10-TR-SMAC.pdf
** See https://github.com/EpistasisLab/tpot
© Yields NV
Documenting neural networks
“NN are black-box or at least hard to understand”
Similar problem in quantum field theory
was solved by developing a beautiful
graphical language
Feynman diagrams
© Yields NV
Documenting neural networks ctu’ed
Krizhevsky-style Tensorboard - style
© Yields NV
Model uncertainty in deep learning*
●  Why does my model work?
E.g. dropouts: avoids over-fitting and
improves performance, but why?
●  What does my model know?
I.e. understanding the degree of
certainty in the model
E.g. train model to recognize dog
breeds and present a cat
E.g.2 train model on Dutch mortgages
and present a Belgian client
training testing
* See http://mlg.eng.cam.ac.uk/yarin/thesis/thesis.pdf
© Yields NV
Why does dropout work?
●  Place prior distribution on the weights p(w)
●  Given dataset (x: input, y: label), the posterior is p(w|x,y)
●  Define simple distribtion qM(w)
●  Approximate posterior by qM via minimization of the KL divergence
This is approximate variational inference
One can prove that
Now take qM(w) = M * diag Bernouilli
Sampling from qM(w) is randomly putting columns in M to zero = randomly setting nodes to
zero = dropout
Hence, dropout is approximately integrating over model parameters
KL(qM (w)| p(w | x, y)) ≈ − qM (w)log p(y | x,w)dw + KL(qM (w)| p(w))∫
loss L2 regularization
© Yields NV
What does my model know?
With a Bayesian NN we can compute the uncertainty on the output by looking at the
second moment
© Yields NV
Recap
●  We have introduced classical concepts of model uncertainty and model risk
●  We discussed a Bayesian approach (risk; model averaging) and maxmin approaches
(uncertainty; robust expected utility)
●  We can use ML to generate candidate models and sample candidate model
parameters automatically
●  We can use Bayesian neural networks to integrate over model parameters (dropouts)
and measure uncertainty
© Yields NV
Thank you!
Yields NV
Parvis Sainte-Gudule 5,
1000 Brussels,
Belgium
+32 479 527 261
info@yields.io
Yields.io Ltd
8 Northumberland Ave,
Westminster, London
WC2N 5BY, UK
+44 7584 068899
info@yields.io

More Related Content

Similar to Model Uncertainty

Using Risk Analysis and Simulation in Project Management
Using Risk Analysis and Simulation in Project ManagementUsing Risk Analysis and Simulation in Project Management
Using Risk Analysis and Simulation in Project ManagementMike Tulkoff
 
Machine Learning part 3 - Introduction to data science
Machine Learning part 3 - Introduction to data science Machine Learning part 3 - Introduction to data science
Machine Learning part 3 - Introduction to data science Frank Kienle
 
205250 crystall ball
205250 crystall ball205250 crystall ball
205250 crystall ballp6academy
 
Monitoring data quality by Jos Gheerardyn of Yields.io
Monitoring data quality by Jos Gheerardyn of Yields.ioMonitoring data quality by Jos Gheerardyn of Yields.io
Monitoring data quality by Jos Gheerardyn of Yields.ioDataops Ghent Meetup
 
Machine Learning Approach.pptx
Machine Learning Approach.pptxMachine Learning Approach.pptx
Machine Learning Approach.pptxCYPatrickKwee
 
Credit Card Fraud Detection Using Machine Learning & Data Science
Credit Card Fraud Detection Using Machine Learning & Data ScienceCredit Card Fraud Detection Using Machine Learning & Data Science
Credit Card Fraud Detection Using Machine Learning & Data ScienceIRJET Journal
 
Credit Card Fraud Detection Using Machine Learning & Data Science
Credit Card Fraud Detection Using Machine Learning & Data ScienceCredit Card Fraud Detection Using Machine Learning & Data Science
Credit Card Fraud Detection Using Machine Learning & Data ScienceIRJET Journal
 
Risk_Management_Final_Report
Risk_Management_Final_ReportRisk_Management_Final_Report
Risk_Management_Final_ReportRohan Sanas
 
Modelling the expected loss of bodily injury claims using gradient boosting
Modelling the expected loss of bodily injury claims using gradient boostingModelling the expected loss of bodily injury claims using gradient boosting
Modelling the expected loss of bodily injury claims using gradient boostingGregg Barrett
 
The GREDOR project. Redesigning the decision chain for managing distribution ...
The GREDOR project. Redesigning the decision chain for managing distribution ...The GREDOR project. Redesigning the decision chain for managing distribution ...
The GREDOR project. Redesigning the decision chain for managing distribution ...Université de Liège (ULg)
 
Statistical Learning on Credit Data
Statistical Learning on Credit DataStatistical Learning on Credit Data
Statistical Learning on Credit DataFiras Obeid
 
Machine learning- key concepts
Machine learning- key conceptsMachine learning- key concepts
Machine learning- key conceptsAmir Ziai
 
Neural Network Model
Neural Network ModelNeural Network Model
Neural Network ModelEric Esajian
 
OpLossModels_A2015
OpLossModels_A2015OpLossModels_A2015
OpLossModels_A2015WenSui Liu
 
AIAA-Aviation-2015-Mehmani
AIAA-Aviation-2015-MehmaniAIAA-Aviation-2015-Mehmani
AIAA-Aviation-2015-MehmaniOptiModel
 
Operations Management VTU BE Mechanical 2015 Solved paper
Operations Management VTU BE Mechanical 2015 Solved paperOperations Management VTU BE Mechanical 2015 Solved paper
Operations Management VTU BE Mechanical 2015 Solved paperSomashekar S.M
 

Similar to Model Uncertainty (20)

Using Risk Analysis and Simulation in Project Management
Using Risk Analysis and Simulation in Project ManagementUsing Risk Analysis and Simulation in Project Management
Using Risk Analysis and Simulation in Project Management
 
Machine Learning part 3 - Introduction to data science
Machine Learning part 3 - Introduction to data science Machine Learning part 3 - Introduction to data science
Machine Learning part 3 - Introduction to data science
 
205250 crystall ball
205250 crystall ball205250 crystall ball
205250 crystall ball
 
Ch13 slides
Ch13 slidesCh13 slides
Ch13 slides
 
Monitoring data quality by Jos Gheerardyn of Yields.io
Monitoring data quality by Jos Gheerardyn of Yields.ioMonitoring data quality by Jos Gheerardyn of Yields.io
Monitoring data quality by Jos Gheerardyn of Yields.io
 
Machine Learning Approach.pptx
Machine Learning Approach.pptxMachine Learning Approach.pptx
Machine Learning Approach.pptx
 
Credit Card Fraud Detection Using Machine Learning & Data Science
Credit Card Fraud Detection Using Machine Learning & Data ScienceCredit Card Fraud Detection Using Machine Learning & Data Science
Credit Card Fraud Detection Using Machine Learning & Data Science
 
Credit Card Fraud Detection Using Machine Learning & Data Science
Credit Card Fraud Detection Using Machine Learning & Data ScienceCredit Card Fraud Detection Using Machine Learning & Data Science
Credit Card Fraud Detection Using Machine Learning & Data Science
 
Risk_Management_Final_Report
Risk_Management_Final_ReportRisk_Management_Final_Report
Risk_Management_Final_Report
 
Modelling the expected loss of bodily injury claims using gradient boosting
Modelling the expected loss of bodily injury claims using gradient boostingModelling the expected loss of bodily injury claims using gradient boosting
Modelling the expected loss of bodily injury claims using gradient boosting
 
The GREDOR project. Redesigning the decision chain for managing distribution ...
The GREDOR project. Redesigning the decision chain for managing distribution ...The GREDOR project. Redesigning the decision chain for managing distribution ...
The GREDOR project. Redesigning the decision chain for managing distribution ...
 
Statistical Learning on Credit Data
Statistical Learning on Credit DataStatistical Learning on Credit Data
Statistical Learning on Credit Data
 
Machine learning- key concepts
Machine learning- key conceptsMachine learning- key concepts
Machine learning- key concepts
 
WHY: Climbing the causality ladder to understand and project the energy deman...
WHY: Climbing the causality ladder to understand and project the energy deman...WHY: Climbing the causality ladder to understand and project the energy deman...
WHY: Climbing the causality ladder to understand and project the energy deman...
 
Lo4b(arif)
Lo4b(arif)Lo4b(arif)
Lo4b(arif)
 
Neural Network Model
Neural Network ModelNeural Network Model
Neural Network Model
 
OpLossModels_A2015
OpLossModels_A2015OpLossModels_A2015
OpLossModels_A2015
 
AIAA-Aviation-2015-Mehmani
AIAA-Aviation-2015-MehmaniAIAA-Aviation-2015-Mehmani
AIAA-Aviation-2015-Mehmani
 
Lecture 1.pptx
Lecture 1.pptxLecture 1.pptx
Lecture 1.pptx
 
Operations Management VTU BE Mechanical 2015 Solved paper
Operations Management VTU BE Mechanical 2015 Solved paperOperations Management VTU BE Mechanical 2015 Solved paper
Operations Management VTU BE Mechanical 2015 Solved paper
 

Recently uploaded

VIP Call Girls LB Nagar ( Hyderabad ) Phone 8250192130 | ₹5k To 25k With Room...
VIP Call Girls LB Nagar ( Hyderabad ) Phone 8250192130 | ₹5k To 25k With Room...VIP Call Girls LB Nagar ( Hyderabad ) Phone 8250192130 | ₹5k To 25k With Room...
VIP Call Girls LB Nagar ( Hyderabad ) Phone 8250192130 | ₹5k To 25k With Room...Suhani Kapoor
 
fca-bsps-decision-letter-redacted (1).pdf
fca-bsps-decision-letter-redacted (1).pdffca-bsps-decision-letter-redacted (1).pdf
fca-bsps-decision-letter-redacted (1).pdfHenry Tapper
 
Monthly Market Risk Update: April 2024 [SlideShare]
Monthly Market Risk Update: April 2024 [SlideShare]Monthly Market Risk Update: April 2024 [SlideShare]
Monthly Market Risk Update: April 2024 [SlideShare]Commonwealth
 
VIP Kolkata Call Girl Serampore 👉 8250192130 Available With Room
VIP Kolkata Call Girl Serampore 👉 8250192130  Available With RoomVIP Kolkata Call Girl Serampore 👉 8250192130  Available With Room
VIP Kolkata Call Girl Serampore 👉 8250192130 Available With Roomdivyansh0kumar0
 
government_intervention_in_business_ownership[1].pdf
government_intervention_in_business_ownership[1].pdfgovernment_intervention_in_business_ownership[1].pdf
government_intervention_in_business_ownership[1].pdfshaunmashale756
 
Classical Theory of Macroeconomics by Adam Smith
Classical Theory of Macroeconomics by Adam SmithClassical Theory of Macroeconomics by Adam Smith
Classical Theory of Macroeconomics by Adam SmithAdamYassin2
 
High Class Call Girls Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
High Class Call Girls Nagpur Grishma Call 7001035870 Meet With Nagpur EscortsHigh Class Call Girls Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
High Class Call Girls Nagpur Grishma Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
VIP High Class Call Girls Saharanpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Saharanpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Saharanpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Saharanpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
(办理学位证)加拿大萨省大学毕业证成绩单原版一比一
(办理学位证)加拿大萨省大学毕业证成绩单原版一比一(办理学位证)加拿大萨省大学毕业证成绩单原版一比一
(办理学位证)加拿大萨省大学毕业证成绩单原版一比一S SDS
 
Q3 2024 Earnings Conference Call and Webcast Slides
Q3 2024 Earnings Conference Call and Webcast SlidesQ3 2024 Earnings Conference Call and Webcast Slides
Q3 2024 Earnings Conference Call and Webcast SlidesMarketing847413
 
VIP Call Girls Service Begumpet Hyderabad Call +91-8250192130
VIP Call Girls Service Begumpet Hyderabad Call +91-8250192130VIP Call Girls Service Begumpet Hyderabad Call +91-8250192130
VIP Call Girls Service Begumpet Hyderabad Call +91-8250192130Suhani Kapoor
 
Financial institutions facilitate financing, economic transactions, issue fun...
Financial institutions facilitate financing, economic transactions, issue fun...Financial institutions facilitate financing, economic transactions, issue fun...
Financial institutions facilitate financing, economic transactions, issue fun...Avanish Goel
 
Quantitative Analysis of Retail Sector Companies
Quantitative Analysis of Retail Sector CompaniesQuantitative Analysis of Retail Sector Companies
Quantitative Analysis of Retail Sector Companiesprashantbhati354
 
Call Girls In Yusuf Sarai Women Seeking Men 9654467111
Call Girls In Yusuf Sarai Women Seeking Men 9654467111Call Girls In Yusuf Sarai Women Seeking Men 9654467111
Call Girls In Yusuf Sarai Women Seeking Men 9654467111Sapana Sha
 
Lundin Gold April 2024 Corporate Presentation v4.pdf
Lundin Gold April 2024 Corporate Presentation v4.pdfLundin Gold April 2024 Corporate Presentation v4.pdf
Lundin Gold April 2024 Corporate Presentation v4.pdfAdnet Communications
 
call girls in Nand Nagri (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in  Nand Nagri (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in  Nand Nagri (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Nand Nagri (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
letter-from-the-chair-to-the-fca-relating-to-british-steel-pensions-scheme-15...
letter-from-the-chair-to-the-fca-relating-to-british-steel-pensions-scheme-15...letter-from-the-chair-to-the-fca-relating-to-british-steel-pensions-scheme-15...
letter-from-the-chair-to-the-fca-relating-to-british-steel-pensions-scheme-15...Henry Tapper
 
OAT_RI_Ep19 WeighingTheRisks_Apr24_TheYellowMetal.pptx
OAT_RI_Ep19 WeighingTheRisks_Apr24_TheYellowMetal.pptxOAT_RI_Ep19 WeighingTheRisks_Apr24_TheYellowMetal.pptx
OAT_RI_Ep19 WeighingTheRisks_Apr24_TheYellowMetal.pptxhiddenlevers
 
High Class Call Girls Nashik Maya 7001305949 Independent Escort Service Nashik
High Class Call Girls Nashik Maya 7001305949 Independent Escort Service NashikHigh Class Call Girls Nashik Maya 7001305949 Independent Escort Service Nashik
High Class Call Girls Nashik Maya 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 

Recently uploaded (20)

VIP Call Girls LB Nagar ( Hyderabad ) Phone 8250192130 | ₹5k To 25k With Room...
VIP Call Girls LB Nagar ( Hyderabad ) Phone 8250192130 | ₹5k To 25k With Room...VIP Call Girls LB Nagar ( Hyderabad ) Phone 8250192130 | ₹5k To 25k With Room...
VIP Call Girls LB Nagar ( Hyderabad ) Phone 8250192130 | ₹5k To 25k With Room...
 
fca-bsps-decision-letter-redacted (1).pdf
fca-bsps-decision-letter-redacted (1).pdffca-bsps-decision-letter-redacted (1).pdf
fca-bsps-decision-letter-redacted (1).pdf
 
Monthly Market Risk Update: April 2024 [SlideShare]
Monthly Market Risk Update: April 2024 [SlideShare]Monthly Market Risk Update: April 2024 [SlideShare]
Monthly Market Risk Update: April 2024 [SlideShare]
 
VIP Kolkata Call Girl Serampore 👉 8250192130 Available With Room
VIP Kolkata Call Girl Serampore 👉 8250192130  Available With RoomVIP Kolkata Call Girl Serampore 👉 8250192130  Available With Room
VIP Kolkata Call Girl Serampore 👉 8250192130 Available With Room
 
government_intervention_in_business_ownership[1].pdf
government_intervention_in_business_ownership[1].pdfgovernment_intervention_in_business_ownership[1].pdf
government_intervention_in_business_ownership[1].pdf
 
Classical Theory of Macroeconomics by Adam Smith
Classical Theory of Macroeconomics by Adam SmithClassical Theory of Macroeconomics by Adam Smith
Classical Theory of Macroeconomics by Adam Smith
 
High Class Call Girls Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
High Class Call Girls Nagpur Grishma Call 7001035870 Meet With Nagpur EscortsHigh Class Call Girls Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
High Class Call Girls Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
 
VIP High Class Call Girls Saharanpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Saharanpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Saharanpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Saharanpur Anushka 8250192130 Independent Escort Se...
 
🔝+919953056974 🔝young Delhi Escort service Pusa Road
🔝+919953056974 🔝young Delhi Escort service Pusa Road🔝+919953056974 🔝young Delhi Escort service Pusa Road
🔝+919953056974 🔝young Delhi Escort service Pusa Road
 
(办理学位证)加拿大萨省大学毕业证成绩单原版一比一
(办理学位证)加拿大萨省大学毕业证成绩单原版一比一(办理学位证)加拿大萨省大学毕业证成绩单原版一比一
(办理学位证)加拿大萨省大学毕业证成绩单原版一比一
 
Q3 2024 Earnings Conference Call and Webcast Slides
Q3 2024 Earnings Conference Call and Webcast SlidesQ3 2024 Earnings Conference Call and Webcast Slides
Q3 2024 Earnings Conference Call and Webcast Slides
 
VIP Call Girls Service Begumpet Hyderabad Call +91-8250192130
VIP Call Girls Service Begumpet Hyderabad Call +91-8250192130VIP Call Girls Service Begumpet Hyderabad Call +91-8250192130
VIP Call Girls Service Begumpet Hyderabad Call +91-8250192130
 
Financial institutions facilitate financing, economic transactions, issue fun...
Financial institutions facilitate financing, economic transactions, issue fun...Financial institutions facilitate financing, economic transactions, issue fun...
Financial institutions facilitate financing, economic transactions, issue fun...
 
Quantitative Analysis of Retail Sector Companies
Quantitative Analysis of Retail Sector CompaniesQuantitative Analysis of Retail Sector Companies
Quantitative Analysis of Retail Sector Companies
 
Call Girls In Yusuf Sarai Women Seeking Men 9654467111
Call Girls In Yusuf Sarai Women Seeking Men 9654467111Call Girls In Yusuf Sarai Women Seeking Men 9654467111
Call Girls In Yusuf Sarai Women Seeking Men 9654467111
 
Lundin Gold April 2024 Corporate Presentation v4.pdf
Lundin Gold April 2024 Corporate Presentation v4.pdfLundin Gold April 2024 Corporate Presentation v4.pdf
Lundin Gold April 2024 Corporate Presentation v4.pdf
 
call girls in Nand Nagri (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in  Nand Nagri (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in  Nand Nagri (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Nand Nagri (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
letter-from-the-chair-to-the-fca-relating-to-british-steel-pensions-scheme-15...
letter-from-the-chair-to-the-fca-relating-to-british-steel-pensions-scheme-15...letter-from-the-chair-to-the-fca-relating-to-british-steel-pensions-scheme-15...
letter-from-the-chair-to-the-fca-relating-to-british-steel-pensions-scheme-15...
 
OAT_RI_Ep19 WeighingTheRisks_Apr24_TheYellowMetal.pptx
OAT_RI_Ep19 WeighingTheRisks_Apr24_TheYellowMetal.pptxOAT_RI_Ep19 WeighingTheRisks_Apr24_TheYellowMetal.pptx
OAT_RI_Ep19 WeighingTheRisks_Apr24_TheYellowMetal.pptx
 
High Class Call Girls Nashik Maya 7001305949 Independent Escort Service Nashik
High Class Call Girls Nashik Maya 7001305949 Independent Escort Service NashikHigh Class Call Girls Nashik Maya 7001305949 Independent Escort Service Nashik
High Class Call Girls Nashik Maya 7001305949 Independent Escort Service Nashik
 

Model Uncertainty

  • 1. Quantifying Model Uncertainty with AI Jos Gheerardyn, CEO Yields.io, Dec 2018 jos.gheerardyn@yields.io www.yields.io
  • 2. © Yields NV Why we simplify models
  • 3. © Yields NV Why we validate models
  • 4. © Yields NV The need for quantificationnrofmodels time ●  100 - 3000 models ●  Median duration of a single model validation is >4 weeks** ●  Tiering (quant. & qual.) ○  materiality ○  risk exposure ○  regulatory impact ●  Qualitative assessments are fairly stable over time ●  Quantitative assessments can change quickly and allow for accurate risk management procedures The number of models in financial institutions increases with 10 – 20 % yearly* * See http://www.mckinsey.com/business-functions/risk/our-insights/the-evolution-of-model-risk-management ** See https://www2.deloitte.com/content/dam/Deloitte/dk/Documents/financial-services/deloitte-nl-global-model-practice-survey.pdf
  • 5. © Yields NV Different types of uncertainty Definition* ●  X is the quantity of interest we want to model ●  xi are states that are possible outcomes of X ●  P is the model ●  P is the set of available models Risk: We know the probability of each outcome xi Uncertainty: We do not know the probability of each outcome xi ●  Model risk: Probability measure on P ●  Model uncertainty: We do now know the probabilities on P More precisely: Model ambiguity means several specifications for probabilities on P** * See Knight, F. (1921) Risk, uncertainty and profit, Boston: Houghton Mifflin. ** See Epstein, L.G. (1999) A definition of uncertainty aversion, Review of Economic Studies, 65, 579–608.
  • 6. © Yields NV Two paradigms: Model averaging vs worst-case* Bayesian model averaging ●  Prior on model parameters p(θi|Pi) ●  Prior weights on models p(Pi) Posterior probability on model Pi With the likelihood of the observed data under Pi being Computing model-dependent quantities via Example: option pricing** Problem: How to choose the prior distributions over models? p(Pi | x) = p(x | Pi )p(Pi ) p(x | Pi )p(Pi ) i ∑ p(x | Pi ) = p(x |ϑi, Pi )p(ϑi | Ei ∫ Pi )dϑi E[ f (X)| x]= E[ f (X)| Pi, x] i ∑ p(Pi | x) * See Cont R. (2006) Model uncertainty and its impact on the pricing of derivative instruments. Mathematical Finance, Wiley ** See Bannör K. and Scherer M (2014) Model Risk and Uncertainty, Springer International Publishing
  • 7. © Yields NV Two paradigms: Model averaging vs worst-case* Worst case An agent facing uncertainty maximizes his expected utility defined as the worst-case over all available models Example: Ellsberg paradox (ambiguity aversion) ●  Urn A: 50/50 ●  Urn B: assume subjective probability of distribution is 40/60 The agent will go for A max X∈AminPi ∈P EPi [U(X)] * See Gilboa & Schmeidler (1989) Maxmin expected utility with non-unique prior, Journal of mathematical economics
  • 8. © Yields NV Two paradigms: Model averaging vs worst-case* Both methods have some key challenges in common ●  How do we choose the candidate models? ●  How to sample relevant parameters? Machine learning can complement expert opinion to automatically generate candidate models and sample model parameters. Similar to scenario generation in specification based testing of software.
  • 9. © Yields NV Neural networks NN Topology Neuron wij i ∑ xi + w0 j tanh(vj −θj )
  • 10. © Yields NV NN in action
  • 11. © Yields NV Autoencoder ●  Minimize reconstruction error |x - x`| ●  Linear autoencoder = PCA* ●  Fast ●  Non-linear activation Used for dimensional reduction and outlier detection * See e.g. https://www.cs.toronto.edu/~urtasun/courses/CSC411/14_pca.pdf
  • 12. © Yields NV Anomaly detection with autoencoder 1.  Train an autoencoder on the data and compute the Mahalanobis distance histogram 2.  Fit a heavy-tailed distribution to the histogram to determine a cut-off parameter. Samples with Mahalanobis distance above this value are considered anomalies 3.  The resulting vol surfaces are indeed outliers as can be seen via the z-scores of their parameters M = (x −µ)T S−1 (x −µ)
  • 13. © Yields NV Anomaly detection with unlabelled data 13 The bottleneck of the autoencoder can be used to reduce the dimensionality of the problem (to e.g. plot the dataset in 2D). The red dots are the detected outliers.
  • 14. © Yields NV Generative adversarial networks* ●  Contains a generator and a discriminator ●  Generative model can serve as a source for test data * See Goodfellow I. et al, Generative adversarial networks, https://arxiv.org/abs/1406.2661
  • 15. © Yields NV Sampling parameters Using autoencoder to partition data PD model (logistic regression) calibrated on Lending Club Loan Data* 900k rows, 75 columns, discrete and continuous Random train / test split: AUC = .896 Training: 99% inner points AUC on test set = .873 Training: 99.9% inner points AUC on test set = .845 * See https://www.kaggle.com/wendykan/lending-club-loan-data
  • 16. © Yields NV Sampling parameters – ctu’ed Use GAN’s to generate realistic datasets ●  As an alternative to e.g. bootstrapping ●  To deal with sparse data Example: Model performance analysis 1.  Generate labeled datasets (1000) 2.  Calibrate model on training set 3.  Measure out of sample performance AUC -2 log (L) Bootstrap .87 - .92 .143 - .176 GAN .84 - .93 .120 - .179
  • 17. © Yields NV Sampling models Hyperparameter “de-tuning” ML in general and (deep) neural network algorithms have many degrees of freedom ●  Number of layers, number of nodes and connections ●  Activation functions ●  Learning rates ●  Etc. Grid Random SMAC* Genetic programming** * See https://www.cs.ubc.ca/~hutter/papers/10-TR-SMAC.pdf ** See https://github.com/EpistasisLab/tpot
  • 18. © Yields NV Documenting neural networks “NN are black-box or at least hard to understand” Similar problem in quantum field theory was solved by developing a beautiful graphical language Feynman diagrams
  • 19. © Yields NV Documenting neural networks ctu’ed Krizhevsky-style Tensorboard - style
  • 20. © Yields NV Model uncertainty in deep learning* ●  Why does my model work? E.g. dropouts: avoids over-fitting and improves performance, but why? ●  What does my model know? I.e. understanding the degree of certainty in the model E.g. train model to recognize dog breeds and present a cat E.g.2 train model on Dutch mortgages and present a Belgian client training testing * See http://mlg.eng.cam.ac.uk/yarin/thesis/thesis.pdf
  • 21. © Yields NV Why does dropout work? ●  Place prior distribution on the weights p(w) ●  Given dataset (x: input, y: label), the posterior is p(w|x,y) ●  Define simple distribtion qM(w) ●  Approximate posterior by qM via minimization of the KL divergence This is approximate variational inference One can prove that Now take qM(w) = M * diag Bernouilli Sampling from qM(w) is randomly putting columns in M to zero = randomly setting nodes to zero = dropout Hence, dropout is approximately integrating over model parameters KL(qM (w)| p(w | x, y)) ≈ − qM (w)log p(y | x,w)dw + KL(qM (w)| p(w))∫ loss L2 regularization
  • 22. © Yields NV What does my model know? With a Bayesian NN we can compute the uncertainty on the output by looking at the second moment
  • 23. © Yields NV Recap ●  We have introduced classical concepts of model uncertainty and model risk ●  We discussed a Bayesian approach (risk; model averaging) and maxmin approaches (uncertainty; robust expected utility) ●  We can use ML to generate candidate models and sample candidate model parameters automatically ●  We can use Bayesian neural networks to integrate over model parameters (dropouts) and measure uncertainty
  • 24. © Yields NV Thank you! Yields NV Parvis Sainte-Gudule 5, 1000 Brussels, Belgium +32 479 527 261 info@yields.io Yields.io Ltd 8 Northumberland Ave, Westminster, London WC2N 5BY, UK +44 7584 068899 info@yields.io