SlideShare a Scribd company logo
1 of 30
GET COMPETITIVE WITH
DRIVERLESS AI
Marios Michailidis
NOVEMBER 7, 2017
Background
H2O.ai
Machine Intelligence
• Competitive data scientist
• PhD in ensemble methods at UCL
• Former kaggle #1
How to perceive Driverless AI
• It is an AI that creates AI
• Creates machine learning models given:
 Some input data
 A target variable
 An objective
 Some allocated computing power (CPU or GPU)
H2O.ai
Machine Intelligence
Will there be a default?
Minimize prediction error
6 CPU cores
Predictions
Model interpretability
Insight
Feature engineering
How does DAI become competitive
• Mostly with exhaustive feature engineering
• Using and (tuning) Xgboost models
• Ensemble
H2O.ai
Machine Intelligence
Tuning Xgboost
• Initialize xgboost with modest parameters and
small learning rate, but 10,000 potential trees.
• Cross-validation is used to find optimal
maximum depth of the trees.
• Then early stopping is used to get no. of trees
• Commence feature engineering
• Revisit parameters in the end
H2O.ai
Machine Intelligence
Find best maximum depth
Best number of trees
Feature engineering Revisit parameters
Ensembling
• After Feature engineering, based on the
resources allocated and accuracy, it takes place.
• Up to 40 different xgboost models are build
• Different combinations of :
• Maximum depths
• Tree-growing policies (loss or depth)
• Maximum leaves
• Simple average of all models
H2O.ai
Machine Intelligence
Why Ensembling (1) - Data
• 3,000ish teams
• 133 anonymized columns , numerical or
categorical
• 115 K rows, binary target (accelerate approval)
• DAI scores top 2%
• Had taken my team almost 3 weeks to get there
(we finished 3rd eventually)
H2O.ai
Machine Intelligence
Why Ensembling (2) - Impact
H2O.ai
Machine Intelligence
After-model options
Best features found
Performance through time
Ensemble impact
Why Ensembling (3) - Results
H2O.ai
Machine Intelligence
Top 2%
with
ensemble
Around
Top 4%
without
Empowering DAI (1) - Data
H2O.ai
Machine Intelligence
• Popular competition (1700ish teams) in 2013
• Only 9 columns (8 unique).
• high cardinality – thousands of unique values.
• 90K rows combined for train and test.
• Scope: determine an employee's access needs.
• Metric to maximize was AUC (or Area Under
Curve).
Empowering DAI (2) - Scoring
H2O.ai
Machine Intelligence
Empowering DAI (3) – Initial results
H2O.ai
Machine Intelligence
Empowering DAI (4) – train
predictions
H2O.ai
Machine Intelligence
• Helps to understand how good the model is
• Where there might be deficiencies
x0 x1 x2 x3 y
0.94 0.27 0.80 0.34 1
0.02 0.22 0.17 0.84 0
0.83 0.11 0.23 0.42 1
0.74 0.26 0.03 0.41 0
0.08 0.29 0.76 0.37 0
0.71 0.76 0.43 0.95 1
0.08 0.72 0.97 0.04 0
0.84 0.79 0.89 0.05 1
0.94 0.27 0.80 0.34 1
0.02 0.22 0.17 0.84 0
0.83 0.11 0.23 0.42 1
0.74 0.26 0.03 0.41 0
0.08 0.29 0.76 0.37 0
0.71 0.76 0.43 0.95 1
0.08 0.72 0.97 0.04 0
0.84 0.79 0.89 0.05 1
Empowering DAI (5.1) - KFold
H2O.ai
Machine Intelligence
x0 x1 x2 x3 y
0.94 0.27 0.80 0.34 1
0.02 0.22 0.17 0.84 0
0.83 0.11 0.23 0.42 1
0.74 0.26 0.03 0.41 0
0.08 0.29 0.76 0.37 0
0.71 0.76 0.43 0.95 1
0.08 0.72 0.97 0.04 0
0.84 0.79 0.89 0.05 1
0.94 0.27 0.80 0.34 1
0.02 0.22 0.17 0.84 0
0.83 0.11 0.23 0.42 1
0.74 0.26 0.03 0.41 0
0.08 0.29 0.76 0.37 0
0.71 0.76 0.43 0.95 1
0.08 0.72 0.97 0.04 0
0.84 0.79 0.89 0.05 1
K=4
pred
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
Empowering DAI (5.2) - KFold
H2O.ai
Machine Intelligence
x0 x1 x2 x3 y
0.94 0.27 0.80 0.34 1
0.02 0.22 0.17 0.84 0
0.83 0.11 0.23 0.42 1
0.74 0.26 0.03 0.41 0
0.08 0.29 0.76 0.37 0
0.71 0.76 0.43 0.95 1
0.08 0.72 0.97 0.04 0
0.84 0.79 0.89 0.05 1
0.94 0.27 0.80 0.34 1
0.02 0.22 0.17 0.84 0
0.83 0.11 0.23 0.42 1
0.74 0.26 0.03 0.41 0
0.08 0.29 0.76 0.37 0
0.71 0.76 0.43 0.95 1
0.08 0.72 0.97 0.04 0
0.84 0.79 0.89 0.05 1
pred
0.96
0.03
0.00
0.00
0.00
0.00
0.00
0.00
Fold : 1
Empowering DAI (5.3) - KFold
H2O.ai
Machine Intelligence
x0 x1 x2 x3 y
0.94 0.27 0.80 0.34 1
0.02 0.22 0.17 0.84 0
0.83 0.11 0.23 0.42 1
0.74 0.26 0.03 0.41 0
0.08 0.29 0.76 0.37 0
0.71 0.76 0.43 0.95 1
0.08 0.72 0.97 0.04 0
0.84 0.79 0.89 0.05 1
0.94 0.27 0.80 0.34 1
0.02 0.22 0.17 0.84 0
0.83 0.11 0.23 0.42 1
0.74 0.26 0.03 0.41 0
0.08 0.29 0.76 0.37 0
0.71 0.76 0.43 0.95 1
0.08 0.72 0.97 0.04 0
0.84 0.79 0.89 0.05 1
Train
Predict pred
0.96
0.03
0.90
0.12
0.00
0.00
0.00
0.00
Fold : 2
Empowering DAI (5.4) - KFold
H2O.ai
Machine Intelligence
x0 x1 x2 x3 y
0.94 0.27 0.80 0.34 1
0.02 0.22 0.17 0.84 0
0.83 0.11 0.23 0.42 1
0.74 0.26 0.03 0.41 0
0.08 0.29 0.76 0.37 0
0.71 0.76 0.43 0.95 1
0.08 0.72 0.97 0.04 0
0.84 0.79 0.89 0.05 1
0.94 0.27 0.80 0.34 1
0.02 0.22 0.17 0.84 0
0.83 0.11 0.23 0.42 1
0.74 0.26 0.03 0.41 0
0.08 0.29 0.76 0.37 0
0.71 0.76 0.43 0.95 1
0.08 0.72 0.97 0.04 0
0.84 0.79 0.89 0.05 1
Train
Predict
Fold : 3
pred
0.96
0.03
0.90
0.12
0.03
0.77
0.00
0.00
Empowering DAI (5.5) - KFold
H2O.ai
Machine Intelligence
x0 x1 x2 x3 y
0.94 0.27 0.80 0.34 1
0.02 0.22 0.17 0.84 0
0.83 0.11 0.23 0.42 1
0.74 0.26 0.03 0.41 0
0.08 0.29 0.76 0.37 0
0.71 0.76 0.43 0.95 1
0.08 0.72 0.97 0.04 0
0.84 0.79 0.89 0.05 1
0.94 0.27 0.80 0.34 1
0.02 0.22 0.17 0.84 0
0.83 0.11 0.23 0.42 1
0.74 0.26 0.03 0.41 0
0.08 0.29 0.76 0.37 0
0.71 0.76 0.43 0.95 1
0.08 0.72 0.97 0.04 0
0.84 0.79 0.89 0.05 1
Train
Predict
Fold : 4
pred
0.96
0.03
0.90
0.12
0.03
0.77
0.18
0.91
Empowering DAI (5.6) - KFold
H2O.ai
Machine Intelligence
x0 x1 x2 x3 y
0.94 0.27 0.80 0.34 1
0.02 0.22 0.17 0.84 0
0.83 0.11 0.23 0.42 1
0.74 0.26 0.03 0.41 0
0.08 0.29 0.76 0.37 0
0.71 0.76 0.43 0.95 1
0.08 0.72 0.97 0.04 0
0.84 0.79 0.89 0.05 1
0.94 0.27 0.80 0.34 1
0.02 0.22 0.17 0.84 0
0.83 0.11 0.23 0.42 1
0.74 0.26 0.03 0.41 0
0.08 0.29 0.76 0.37 0
0.71 0.76 0.43 0.95 1
0.08 0.72 0.97 0.04 0
0.84 0.79 0.89 0.05 1
Train
Predict
Fold : 4
pred
0.96
0.03
0.90
0.12
0.03
0.77
0.18
0.91
test
0.43
0.03
0.90
0.12
0.03
0.77
0.18
0.91
Empowering DAI (5.7) - KFold
H2O.ai
Machine Intelligence
x0 x1 x2 x3 y
0.94 0.27 0.80 0.34 1
0.02 0.22 0.17 0.84 0
0.83 0.11 0.23 0.42 1
0.74 0.26 0.03 0.41 0
0.08 0.29 0.76 0.37 0
0.71 0.76 0.43 0.95 1
0.08 0.72 0.97 0.04 0
0.84 0.79 0.89 0.05 1
0.94 0.27 0.80 0.34 1
0.02 0.22 0.17 0.84 0
0.83 0.11 0.23 0.42 1
0.74 0.26 0.03 0.41 0
0.08 0.29 0.76 0.37 0
0.71 0.76 0.43 0.95 1
0.08 0.72 0.97 0.04 0
0.84 0.79 0.89 0.05 1
Train
Predict
Fold : 4
pred
0.96
0.03
0.90
0.12
0.03
0.77
0.18
0.91
pred
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
test
0.43
0.03
0.90
0.12
0.03
0.77
0.18
0.91
Empowering DAI (5.8) - KFold
H2O.ai
Machine Intelligence
Empowering DAI (6) – Get features
H2O.ai
Machine Intelligence
• Download the feature engineering of DAI
• 55 features derived (out of the initial 9)
• Target column in training data
Empowering DAI (7) – Value of FE
H2O.ai
Machine Intelligence
• Initial set of features is not very predictive
without transformations
• Features derived in DAI are very predictive
Initial Features auc gini
RESOURCE 0.501 0.26%
MGR_ID 0.460 -8.09%
ROLE_ROLLUP_1 0.445 -10.97%
ROLE_ROLLUP_2 0.515 3.04%
ROLE_DEPTNAME 0.534 6.84%
ROLE_TITLE 0.521 4.18%
ROLE_FAMILY_DESC 0.528 5.66%
ROLE_FAMILY 0.495 -0.98%
DAI features auc gini
37_CV_TE_MGR_ID… 0.840 67.9%
18_CV_TE_MGR_ID… 0.819 63.9%
13_CV_TE_MGR_ID… 0.805 61.1%
9_CV_TE_MGR_ID_… 0.796 59.2%
50_WoE_ROLE_DEP… 0.779 55.8%
49_WoE_MGR_ID_R… 0.779 55.7%
45_WoE_MGR_ID_R… 0.774 54.7%
0_CV_TE_MGR_ID_… 0.766 53.2%
8_WoE_MGR_ID_RO… 0.765 53.1%
43_WoE_MGR_ID_R… 0.765 53.0%
Empowering DAI (8) - Stacking
H2O.ai
Machine Intelligence
Models built on DAI FE Test LB
Lightgbm with gbdt 0.909
Lightgbm with dart 0.909
Extra Trees 0.910
Random Forest 0.907
Logistic Regression 0.898
Lightgbm Rmse 0.906
Lightgbm Huber 0.900
Xgboost 0.908
DAI 0.909
DAIderiveddata
Stacking
From 0.90933
To 0.91045
Empowering DAI (9) – Plus counts
H2O.ai
Machine Intelligence
DAIderiveddata
Stacking
From 0.91045
To 0.914
DAI is production-ready
It ignores information about test
data in its learning…Kagglers don’t!
Knowing distribution of test data
helps make better predictions.
For example how frequent a
category is
Models built on DAI FE Test LB
Lightgbm with gbdt 0.909
Lightgbm with dart 0.909
Extra Trees 0.910
Random Forest 0.907
Logistic Regression 0.898
Lightgbm Rmse 0.906
Lightgbm Huber 0.900
Xgboost 0.908
DAI 0.909
Lightgbm plus counts 0.913
Models built on DAI FE Test LB
Lightgbm with gbdt 0.909
Lightgbm with dart 0.909
Extra Trees 0.910
Random Forest 0.907
Logistic Regression 0.898
Lightgbm rmse 0.906
Lightgbm Huber 0.900
Xgboost 0.908
DAI 0.909
Lightgbm plus counts 0.913
Logistic plus dummies 0.907
Empowering DAI (10) – Plus OHE
H2O.ai
Machine Intelligence
DAIderiveddata
Stacking
From 0.914
To 0.9158
Logistic model does not perform
as good. Because best features
were found using tree methods
Dummy Variables or One-Hot
Encoding can improve results for
linear models.
Empowering DAI (11) - Timeline
H2O.ai
Machine Intelligence
Predictions from DAI| Rank 79 | 0.9093
Plus Stacking 9 models| Rank 73 | 0.91045
Test counts | Rank 38 | 0.9139
Dummies | Rank
20 | 0.9158
0.906 0.908 0.91 0.912 0.914 0.916 0.918
2 hours
4 hours
5 hours
6 hours
AUC IN TEST DATA
HOURSIN
AUC IN TEST DATA VERSUS TIME
Further Improvement
• Let it run more time.
• More DAI datasets. The genetic algorithm may come
up with (slightly) different features every time
• Check predictions, search for areas were DAI might
not have done as well as you
• Add deep learning models or other algorithmic
families
• Add your own features
• Add your own models and do stacking using the
Kfold paradigm
H2O.ai
Machine Intelligence
Final words
• Can DAI beat me in predictive modelling competitions?
• In time, (probably) yes
• In depth and creativity, (probably) no
• Can I improve my score with DAI?
• Yes, I can use the features in my models
• Yes, I can use the predictions of stacking
• Yes, I can use the interpretability module or other tools
to get insight about potential additions/pitfalls
• Yes, While DAI is running I can focus on other things ,
like checking visualizations and/or exploring the data.
H2O.ai
Machine Intelligence
H2O.ai
Machine Intelligence

More Related Content

What's hot

Training Large-scale Ad Ranking Models in Spark
Training Large-scale Ad Ranking Models in SparkTraining Large-scale Ad Ranking Models in Spark
Training Large-scale Ad Ranking Models in SparkPatrick Pletscher
 
How to use Apache TVM to optimize your ML models
How to use Apache TVM to optimize your ML modelsHow to use Apache TVM to optimize your ML models
How to use Apache TVM to optimize your ML modelsDatabricks
 
MLConf 2016 SigOpt Talk by Scott Clark
MLConf 2016 SigOpt Talk by Scott ClarkMLConf 2016 SigOpt Talk by Scott Clark
MLConf 2016 SigOpt Talk by Scott ClarkSigOpt
 
運用CNTK 實作深度學習物件辨識 Deep Learning based Object Detection with Microsoft Cogniti...
運用CNTK 實作深度學習物件辨識 Deep Learning based Object Detection with Microsoft Cogniti...運用CNTK 實作深度學習物件辨識 Deep Learning based Object Detection with Microsoft Cogniti...
運用CNTK 實作深度學習物件辨識 Deep Learning based Object Detection with Microsoft Cogniti...Herman Wu
 
Understanding computer vision with Deep Learning
Understanding computer vision with Deep LearningUnderstanding computer vision with Deep Learning
Understanding computer vision with Deep LearningShubhWadekar
 
PandasUDFs: One Weird Trick to Scaled Ensembles
PandasUDFs: One Weird Trick to Scaled EnsemblesPandasUDFs: One Weird Trick to Scaled Ensembles
PandasUDFs: One Weird Trick to Scaled EnsemblesDatabricks
 
"Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ...
"Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ..."Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ...
"Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ...Edge AI and Vision Alliance
 
Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16
Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16
Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16MLconf
 
Combining Machine Learning Frameworks with Apache Spark
Combining Machine Learning Frameworks with Apache SparkCombining Machine Learning Frameworks with Apache Spark
Combining Machine Learning Frameworks with Apache SparkDatabricks
 
Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...
Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...
Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...inside-BigData.com
 
Machine Learning using Apache Spark MLlib
Machine Learning using Apache Spark MLlibMachine Learning using Apache Spark MLlib
Machine Learning using Apache Spark MLlibIMC Institute
 
Applying your Convolutional Neural Networks
Applying your Convolutional Neural NetworksApplying your Convolutional Neural Networks
Applying your Convolutional Neural NetworksDatabricks
 
Hadoop France meetup Feb2016 : recommendations with spark
Hadoop France meetup  Feb2016 : recommendations with sparkHadoop France meetup  Feb2016 : recommendations with spark
Hadoop France meetup Feb2016 : recommendations with sparkModern Data Stack France
 
Introduction of Feature Hashing
Introduction of Feature HashingIntroduction of Feature Hashing
Introduction of Feature HashingWush Wu
 
Enabling Composition in Distributed Reinforcement Learning with Ray RLlib wit...
Enabling Composition in Distributed Reinforcement Learning with Ray RLlib wit...Enabling Composition in Distributed Reinforcement Learning with Ray RLlib wit...
Enabling Composition in Distributed Reinforcement Learning with Ray RLlib wit...Databricks
 
Cross Device Ad Targeting at Scale
Cross Device Ad Targeting at ScaleCross Device Ad Targeting at Scale
Cross Device Ad Targeting at ScaleTrieu Nguyen
 
JavaOne 2016: Code Generation with JavaCompiler for Fun, Speed and Business P...
JavaOne 2016: Code Generation with JavaCompiler for Fun, Speed and Business P...JavaOne 2016: Code Generation with JavaCompiler for Fun, Speed and Business P...
JavaOne 2016: Code Generation with JavaCompiler for Fun, Speed and Business P...Juan Cruz Nores
 
Writing Continuous Applications with Structured Streaming Python APIs in Apac...
Writing Continuous Applications with Structured Streaming Python APIs in Apac...Writing Continuous Applications with Structured Streaming Python APIs in Apac...
Writing Continuous Applications with Structured Streaming Python APIs in Apac...Databricks
 
Time Series Analysis for Network Secruity
Time Series Analysis for Network SecruityTime Series Analysis for Network Secruity
Time Series Analysis for Network Secruitymrphilroth
 

What's hot (20)

Training Large-scale Ad Ranking Models in Spark
Training Large-scale Ad Ranking Models in SparkTraining Large-scale Ad Ranking Models in Spark
Training Large-scale Ad Ranking Models in Spark
 
How to use Apache TVM to optimize your ML models
How to use Apache TVM to optimize your ML modelsHow to use Apache TVM to optimize your ML models
How to use Apache TVM to optimize your ML models
 
MLConf 2016 SigOpt Talk by Scott Clark
MLConf 2016 SigOpt Talk by Scott ClarkMLConf 2016 SigOpt Talk by Scott Clark
MLConf 2016 SigOpt Talk by Scott Clark
 
20160908 hivemall meetup
20160908 hivemall meetup20160908 hivemall meetup
20160908 hivemall meetup
 
運用CNTK 實作深度學習物件辨識 Deep Learning based Object Detection with Microsoft Cogniti...
運用CNTK 實作深度學習物件辨識 Deep Learning based Object Detection with Microsoft Cogniti...運用CNTK 實作深度學習物件辨識 Deep Learning based Object Detection with Microsoft Cogniti...
運用CNTK 實作深度學習物件辨識 Deep Learning based Object Detection with Microsoft Cogniti...
 
Understanding computer vision with Deep Learning
Understanding computer vision with Deep LearningUnderstanding computer vision with Deep Learning
Understanding computer vision with Deep Learning
 
PandasUDFs: One Weird Trick to Scaled Ensembles
PandasUDFs: One Weird Trick to Scaled EnsemblesPandasUDFs: One Weird Trick to Scaled Ensembles
PandasUDFs: One Weird Trick to Scaled Ensembles
 
"Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ...
"Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ..."Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ...
"Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ...
 
Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16
Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16
Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16
 
Combining Machine Learning Frameworks with Apache Spark
Combining Machine Learning Frameworks with Apache SparkCombining Machine Learning Frameworks with Apache Spark
Combining Machine Learning Frameworks with Apache Spark
 
Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...
Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...
Efficient Model Selection for Deep Neural Networks on Massively Parallel Proc...
 
Machine Learning using Apache Spark MLlib
Machine Learning using Apache Spark MLlibMachine Learning using Apache Spark MLlib
Machine Learning using Apache Spark MLlib
 
Applying your Convolutional Neural Networks
Applying your Convolutional Neural NetworksApplying your Convolutional Neural Networks
Applying your Convolutional Neural Networks
 
Hadoop France meetup Feb2016 : recommendations with spark
Hadoop France meetup  Feb2016 : recommendations with sparkHadoop France meetup  Feb2016 : recommendations with spark
Hadoop France meetup Feb2016 : recommendations with spark
 
Introduction of Feature Hashing
Introduction of Feature HashingIntroduction of Feature Hashing
Introduction of Feature Hashing
 
Enabling Composition in Distributed Reinforcement Learning with Ray RLlib wit...
Enabling Composition in Distributed Reinforcement Learning with Ray RLlib wit...Enabling Composition in Distributed Reinforcement Learning with Ray RLlib wit...
Enabling Composition in Distributed Reinforcement Learning with Ray RLlib wit...
 
Cross Device Ad Targeting at Scale
Cross Device Ad Targeting at ScaleCross Device Ad Targeting at Scale
Cross Device Ad Targeting at Scale
 
JavaOne 2016: Code Generation with JavaCompiler for Fun, Speed and Business P...
JavaOne 2016: Code Generation with JavaCompiler for Fun, Speed and Business P...JavaOne 2016: Code Generation with JavaCompiler for Fun, Speed and Business P...
JavaOne 2016: Code Generation with JavaCompiler for Fun, Speed and Business P...
 
Writing Continuous Applications with Structured Streaming Python APIs in Apac...
Writing Continuous Applications with Structured Streaming Python APIs in Apac...Writing Continuous Applications with Structured Streaming Python APIs in Apac...
Writing Continuous Applications with Structured Streaming Python APIs in Apac...
 
Time Series Analysis for Network Secruity
Time Series Analysis for Network SecruityTime Series Analysis for Network Secruity
Time Series Analysis for Network Secruity
 

Viewers also liked

GPU Accelerated Machine Learning
GPU Accelerated Machine LearningGPU Accelerated Machine Learning
GPU Accelerated Machine LearningSri Ambati
 
Intro to Machine Learning for GPUs
Intro to Machine Learning for GPUsIntro to Machine Learning for GPUs
Intro to Machine Learning for GPUsSri Ambati
 
Using H2O AutoML for Kaggle Competitions
Using H2O AutoML for Kaggle CompetitionsUsing H2O AutoML for Kaggle Competitions
Using H2O AutoML for Kaggle CompetitionsSri Ambati
 
Scalable Automatic Machine Learning in H2O
Scalable Automatic Machine Learning in H2OScalable Automatic Machine Learning in H2O
Scalable Automatic Machine Learning in H2OSri Ambati
 
H2O.ai's Driverless AI
H2O.ai's Driverless AIH2O.ai's Driverless AI
H2O.ai's Driverless AISri Ambati
 
スタディサプリを支えるデータ分析基盤 ~設計の勘所と利活用事例~
スタディサプリを支えるデータ分析基盤 ~設計の勘所と利活用事例~スタディサプリを支えるデータ分析基盤 ~設計の勘所と利活用事例~
スタディサプリを支えるデータ分析基盤 ~設計の勘所と利活用事例~Tetsuo Yamabe
 

Viewers also liked (7)

GPU Accelerated Machine Learning
GPU Accelerated Machine LearningGPU Accelerated Machine Learning
GPU Accelerated Machine Learning
 
Intro to Machine Learning for GPUs
Intro to Machine Learning for GPUsIntro to Machine Learning for GPUs
Intro to Machine Learning for GPUs
 
Using H2O AutoML for Kaggle Competitions
Using H2O AutoML for Kaggle CompetitionsUsing H2O AutoML for Kaggle Competitions
Using H2O AutoML for Kaggle Competitions
 
Scalable Automatic Machine Learning in H2O
Scalable Automatic Machine Learning in H2OScalable Automatic Machine Learning in H2O
Scalable Automatic Machine Learning in H2O
 
H2O.ai's Driverless AI
H2O.ai's Driverless AIH2O.ai's Driverless AI
H2O.ai's Driverless AI
 
AI Development with H2O.ai
AI Development with H2O.aiAI Development with H2O.ai
AI Development with H2O.ai
 
スタディサプリを支えるデータ分析基盤 ~設計の勘所と利活用事例~
スタディサプリを支えるデータ分析基盤 ~設計の勘所と利活用事例~スタディサプリを支えるデータ分析基盤 ~設計の勘所と利活用事例~
スタディサプリを支えるデータ分析基盤 ~設計の勘所と利活用事例~
 

Similar to DRIVERLESS AI TITLE

Introduction & Hands-on with H2O Driverless AI
Introduction & Hands-on with H2O Driverless AIIntroduction & Hands-on with H2O Driverless AI
Introduction & Hands-on with H2O Driverless AISri Ambati
 
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...The Art of Intelligence – A Practical Introduction Machine Learning for Orac...
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...Lucas Jellema
 
realestate and MySQL devops melbourne
realestate and MySQL devops melbournerealestate and MySQL devops melbourne
realestate and MySQL devops melbournemysqldbahelp
 
The Art of Intelligence – Introduction Machine Learning for Oracle profession...
The Art of Intelligence – Introduction Machine Learning for Oracle profession...The Art of Intelligence – Introduction Machine Learning for Oracle profession...
The Art of Intelligence – Introduction Machine Learning for Oracle profession...Lucas Jellema
 
Distributed GLM with H2O - Atlanta Meetup
Distributed GLM with H2O - Atlanta MeetupDistributed GLM with H2O - Atlanta Meetup
Distributed GLM with H2O - Atlanta MeetupSri Ambati
 
Sql server scalability fundamentals
Sql server scalability fundamentalsSql server scalability fundamentals
Sql server scalability fundamentalsChris Adkin
 
[DBA]_HiramFleitas_SQL_PASS_Summit_2017_Summary
[DBA]_HiramFleitas_SQL_PASS_Summit_2017_Summary[DBA]_HiramFleitas_SQL_PASS_Summit_2017_Summary
[DBA]_HiramFleitas_SQL_PASS_Summit_2017_SummaryHiram Fleitas León
 
Roll grinding Six Sigma project
Roll grinding Six Sigma projectRoll grinding Six Sigma project
Roll grinding Six Sigma projectTariq Aziz
 
Dr Elephant: LinkedIn's Self-Service System for Detecting and Treating Hadoop...
Dr Elephant: LinkedIn's Self-Service System for Detecting and Treating Hadoop...Dr Elephant: LinkedIn's Self-Service System for Detecting and Treating Hadoop...
Dr Elephant: LinkedIn's Self-Service System for Detecting and Treating Hadoop...DataWorks Summit
 
STAR CCM GLOBAL CONFERENCE UBERCLOUD
STAR CCM GLOBAL CONFERENCE UBERCLOUDSTAR CCM GLOBAL CONFERENCE UBERCLOUD
STAR CCM GLOBAL CONFERENCE UBERCLOUDThomas Francis
 
Autonomic Resource Provisioning for Cloud-Based Software
Autonomic Resource Provisioning for Cloud-Based SoftwareAutonomic Resource Provisioning for Cloud-Based Software
Autonomic Resource Provisioning for Cloud-Based SoftwarePooyan Jamshidi
 
Spark Summit Dublin 2017 - MemSQL - Real-Time Image Recognition
Spark Summit Dublin 2017 - MemSQL - Real-Time Image RecognitionSpark Summit Dublin 2017 - MemSQL - Real-Time Image Recognition
Spark Summit Dublin 2017 - MemSQL - Real-Time Image RecognitionSingleStore
 
ClusterPresentation
ClusterPresentationClusterPresentation
ClusterPresentationWill Dixon
 
All (that i know) about exadata external
All (that i know) about exadata externalAll (that i know) about exadata external
All (that i know) about exadata externalPrasad Chitta
 
RS in the context of Big Data-v4
RS in the context of Big Data-v4RS in the context of Big Data-v4
RS in the context of Big Data-v4Khadija Atiya
 
AWS Summit 2013 | Singapore - Understanding the Total Cost of (Non) Ownership...
AWS Summit 2013 | Singapore - Understanding the Total Cost of (Non) Ownership...AWS Summit 2013 | Singapore - Understanding the Total Cost of (Non) Ownership...
AWS Summit 2013 | Singapore - Understanding the Total Cost of (Non) Ownership...Amazon Web Services
 
Marios Michailidis & Mathias Muller, H2O.ai - Time Series with H2O Driverless...
Marios Michailidis & Mathias Muller, H2O.ai - Time Series with H2O Driverless...Marios Michailidis & Mathias Muller, H2O.ai - Time Series with H2O Driverless...
Marios Michailidis & Mathias Muller, H2O.ai - Time Series with H2O Driverless...Sri Ambati
 
CPLEX Optimization Studio, Modeling, Theory, Best Practices and Case Studies
CPLEX Optimization Studio, Modeling, Theory, Best Practices and Case StudiesCPLEX Optimization Studio, Modeling, Theory, Best Practices and Case Studies
CPLEX Optimization Studio, Modeling, Theory, Best Practices and Case Studiesoptimizatiodirectdirect
 

Similar to DRIVERLESS AI TITLE (20)

Introduction & Hands-on with H2O Driverless AI
Introduction & Hands-on with H2O Driverless AIIntroduction & Hands-on with H2O Driverless AI
Introduction & Hands-on with H2O Driverless AI
 
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...The Art of Intelligence – A Practical Introduction Machine Learning for Orac...
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...
 
realestate and MySQL devops melbourne
realestate and MySQL devops melbournerealestate and MySQL devops melbourne
realestate and MySQL devops melbourne
 
The Art of Intelligence – Introduction Machine Learning for Oracle profession...
The Art of Intelligence – Introduction Machine Learning for Oracle profession...The Art of Intelligence – Introduction Machine Learning for Oracle profession...
The Art of Intelligence – Introduction Machine Learning for Oracle profession...
 
Distributed GLM with H2O - Atlanta Meetup
Distributed GLM with H2O - Atlanta MeetupDistributed GLM with H2O - Atlanta Meetup
Distributed GLM with H2O - Atlanta Meetup
 
Sql server scalability fundamentals
Sql server scalability fundamentalsSql server scalability fundamentals
Sql server scalability fundamentals
 
[DBA]_HiramFleitas_SQL_PASS_Summit_2017_Summary
[DBA]_HiramFleitas_SQL_PASS_Summit_2017_Summary[DBA]_HiramFleitas_SQL_PASS_Summit_2017_Summary
[DBA]_HiramFleitas_SQL_PASS_Summit_2017_Summary
 
Roll grinding Six Sigma project
Roll grinding Six Sigma projectRoll grinding Six Sigma project
Roll grinding Six Sigma project
 
Dr Elephant: LinkedIn's Self-Service System for Detecting and Treating Hadoop...
Dr Elephant: LinkedIn's Self-Service System for Detecting and Treating Hadoop...Dr Elephant: LinkedIn's Self-Service System for Detecting and Treating Hadoop...
Dr Elephant: LinkedIn's Self-Service System for Detecting and Treating Hadoop...
 
STAR CCM GLOBAL CONFERENCE UBERCLOUD
STAR CCM GLOBAL CONFERENCE UBERCLOUDSTAR CCM GLOBAL CONFERENCE UBERCLOUD
STAR CCM GLOBAL CONFERENCE UBERCLOUD
 
Autonomic Resource Provisioning for Cloud-Based Software
Autonomic Resource Provisioning for Cloud-Based SoftwareAutonomic Resource Provisioning for Cloud-Based Software
Autonomic Resource Provisioning for Cloud-Based Software
 
Spark Summit Dublin 2017 - MemSQL - Real-Time Image Recognition
Spark Summit Dublin 2017 - MemSQL - Real-Time Image RecognitionSpark Summit Dublin 2017 - MemSQL - Real-Time Image Recognition
Spark Summit Dublin 2017 - MemSQL - Real-Time Image Recognition
 
ClusterPresentation
ClusterPresentationClusterPresentation
ClusterPresentation
 
All (that i know) about exadata external
All (that i know) about exadata externalAll (that i know) about exadata external
All (that i know) about exadata external
 
RS in the context of Big Data-v4
RS in the context of Big Data-v4RS in the context of Big Data-v4
RS in the context of Big Data-v4
 
AWS Summit 2013 | Singapore - Understanding the Total Cost of (Non) Ownership...
AWS Summit 2013 | Singapore - Understanding the Total Cost of (Non) Ownership...AWS Summit 2013 | Singapore - Understanding the Total Cost of (Non) Ownership...
AWS Summit 2013 | Singapore - Understanding the Total Cost of (Non) Ownership...
 
Marios Michailidis & Mathias Muller, H2O.ai - Time Series with H2O Driverless...
Marios Michailidis & Mathias Muller, H2O.ai - Time Series with H2O Driverless...Marios Michailidis & Mathias Muller, H2O.ai - Time Series with H2O Driverless...
Marios Michailidis & Mathias Muller, H2O.ai - Time Series with H2O Driverless...
 
CPLEX Optimization Studio, Modeling, Theory, Best Practices and Case Studies
CPLEX Optimization Studio, Modeling, Theory, Best Practices and Case StudiesCPLEX Optimization Studio, Modeling, Theory, Best Practices and Case Studies
CPLEX Optimization Studio, Modeling, Theory, Best Practices and Case Studies
 
Tree building 2
Tree building 2Tree building 2
Tree building 2
 
Large scalecplex
Large scalecplexLarge scalecplex
Large scalecplex
 

More from Sri Ambati

H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
Generative AI Masterclass - Model Risk Management.pptx
Generative AI Masterclass - Model Risk Management.pptxGenerative AI Masterclass - Model Risk Management.pptx
Generative AI Masterclass - Model Risk Management.pptxSri Ambati
 
AI and the Future of Software Development: A Sneak Peek
AI and the Future of Software Development: A Sneak Peek AI and the Future of Software Development: A Sneak Peek
AI and the Future of Software Development: A Sneak Peek Sri Ambati
 
LLMOps: Match report from the top of the 5th
LLMOps: Match report from the top of the 5thLLMOps: Match report from the top of the 5th
LLMOps: Match report from the top of the 5thSri Ambati
 
Building, Evaluating, and Optimizing your RAG App for Production
Building, Evaluating, and Optimizing your RAG App for ProductionBuilding, Evaluating, and Optimizing your RAG App for Production
Building, Evaluating, and Optimizing your RAG App for ProductionSri Ambati
 
Building LLM Solutions using Open Source and Closed Source Solutions in Coher...
Building LLM Solutions using Open Source and Closed Source Solutions in Coher...Building LLM Solutions using Open Source and Closed Source Solutions in Coher...
Building LLM Solutions using Open Source and Closed Source Solutions in Coher...Sri Ambati
 
Risk Management for LLMs
Risk Management for LLMsRisk Management for LLMs
Risk Management for LLMsSri Ambati
 
Open-Source AI: Community is the Way
Open-Source AI: Community is the WayOpen-Source AI: Community is the Way
Open-Source AI: Community is the WaySri Ambati
 
Building Custom GenAI Apps at H2O
Building Custom GenAI Apps at H2OBuilding Custom GenAI Apps at H2O
Building Custom GenAI Apps at H2OSri Ambati
 
Applied Gen AI for the Finance Vertical
Applied Gen AI for the Finance Vertical Applied Gen AI for the Finance Vertical
Applied Gen AI for the Finance Vertical Sri Ambati
 
Cutting Edge Tricks from LLM Papers
Cutting Edge Tricks from LLM PapersCutting Edge Tricks from LLM Papers
Cutting Edge Tricks from LLM PapersSri Ambati
 
Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...
Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...
Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...Sri Ambati
 
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...Sri Ambati
 
KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...
KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...
KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...Sri Ambati
 
LLM Interpretability
LLM Interpretability LLM Interpretability
LLM Interpretability Sri Ambati
 
Never Reply to an Email Again
Never Reply to an Email AgainNever Reply to an Email Again
Never Reply to an Email AgainSri Ambati
 
Introducción al Aprendizaje Automatico con H2O-3 (1)
Introducción al Aprendizaje Automatico con H2O-3 (1)Introducción al Aprendizaje Automatico con H2O-3 (1)
Introducción al Aprendizaje Automatico con H2O-3 (1)Sri Ambati
 
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...Sri Ambati
 
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...Sri Ambati
 
AI Foundations Course Module 1 - An AI Transformation Journey
AI Foundations Course Module 1 - An AI Transformation JourneyAI Foundations Course Module 1 - An AI Transformation Journey
AI Foundations Course Module 1 - An AI Transformation JourneySri Ambati
 

More from Sri Ambati (20)

H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
Generative AI Masterclass - Model Risk Management.pptx
Generative AI Masterclass - Model Risk Management.pptxGenerative AI Masterclass - Model Risk Management.pptx
Generative AI Masterclass - Model Risk Management.pptx
 
AI and the Future of Software Development: A Sneak Peek
AI and the Future of Software Development: A Sneak Peek AI and the Future of Software Development: A Sneak Peek
AI and the Future of Software Development: A Sneak Peek
 
LLMOps: Match report from the top of the 5th
LLMOps: Match report from the top of the 5thLLMOps: Match report from the top of the 5th
LLMOps: Match report from the top of the 5th
 
Building, Evaluating, and Optimizing your RAG App for Production
Building, Evaluating, and Optimizing your RAG App for ProductionBuilding, Evaluating, and Optimizing your RAG App for Production
Building, Evaluating, and Optimizing your RAG App for Production
 
Building LLM Solutions using Open Source and Closed Source Solutions in Coher...
Building LLM Solutions using Open Source and Closed Source Solutions in Coher...Building LLM Solutions using Open Source and Closed Source Solutions in Coher...
Building LLM Solutions using Open Source and Closed Source Solutions in Coher...
 
Risk Management for LLMs
Risk Management for LLMsRisk Management for LLMs
Risk Management for LLMs
 
Open-Source AI: Community is the Way
Open-Source AI: Community is the WayOpen-Source AI: Community is the Way
Open-Source AI: Community is the Way
 
Building Custom GenAI Apps at H2O
Building Custom GenAI Apps at H2OBuilding Custom GenAI Apps at H2O
Building Custom GenAI Apps at H2O
 
Applied Gen AI for the Finance Vertical
Applied Gen AI for the Finance Vertical Applied Gen AI for the Finance Vertical
Applied Gen AI for the Finance Vertical
 
Cutting Edge Tricks from LLM Papers
Cutting Edge Tricks from LLM PapersCutting Edge Tricks from LLM Papers
Cutting Edge Tricks from LLM Papers
 
Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...
Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...
Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...
 
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...
 
KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...
KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...
KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...
 
LLM Interpretability
LLM Interpretability LLM Interpretability
LLM Interpretability
 
Never Reply to an Email Again
Never Reply to an Email AgainNever Reply to an Email Again
Never Reply to an Email Again
 
Introducción al Aprendizaje Automatico con H2O-3 (1)
Introducción al Aprendizaje Automatico con H2O-3 (1)Introducción al Aprendizaje Automatico con H2O-3 (1)
Introducción al Aprendizaje Automatico con H2O-3 (1)
 
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...
 
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...
 
AI Foundations Course Module 1 - An AI Transformation Journey
AI Foundations Course Module 1 - An AI Transformation JourneyAI Foundations Course Module 1 - An AI Transformation Journey
AI Foundations Course Module 1 - An AI Transformation Journey
 

Recently uploaded

Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 

Recently uploaded (20)

Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 

DRIVERLESS AI TITLE

  • 1. GET COMPETITIVE WITH DRIVERLESS AI Marios Michailidis NOVEMBER 7, 2017
  • 2. Background H2O.ai Machine Intelligence • Competitive data scientist • PhD in ensemble methods at UCL • Former kaggle #1
  • 3. How to perceive Driverless AI • It is an AI that creates AI • Creates machine learning models given:  Some input data  A target variable  An objective  Some allocated computing power (CPU or GPU) H2O.ai Machine Intelligence Will there be a default? Minimize prediction error 6 CPU cores Predictions Model interpretability Insight Feature engineering
  • 4. How does DAI become competitive • Mostly with exhaustive feature engineering • Using and (tuning) Xgboost models • Ensemble H2O.ai Machine Intelligence
  • 5. Tuning Xgboost • Initialize xgboost with modest parameters and small learning rate, but 10,000 potential trees. • Cross-validation is used to find optimal maximum depth of the trees. • Then early stopping is used to get no. of trees • Commence feature engineering • Revisit parameters in the end H2O.ai Machine Intelligence Find best maximum depth Best number of trees Feature engineering Revisit parameters
  • 6. Ensembling • After Feature engineering, based on the resources allocated and accuracy, it takes place. • Up to 40 different xgboost models are build • Different combinations of : • Maximum depths • Tree-growing policies (loss or depth) • Maximum leaves • Simple average of all models H2O.ai Machine Intelligence
  • 7. Why Ensembling (1) - Data • 3,000ish teams • 133 anonymized columns , numerical or categorical • 115 K rows, binary target (accelerate approval) • DAI scores top 2% • Had taken my team almost 3 weeks to get there (we finished 3rd eventually) H2O.ai Machine Intelligence
  • 8. Why Ensembling (2) - Impact H2O.ai Machine Intelligence After-model options Best features found Performance through time Ensemble impact
  • 9. Why Ensembling (3) - Results H2O.ai Machine Intelligence Top 2% with ensemble Around Top 4% without
  • 10. Empowering DAI (1) - Data H2O.ai Machine Intelligence • Popular competition (1700ish teams) in 2013 • Only 9 columns (8 unique). • high cardinality – thousands of unique values. • 90K rows combined for train and test. • Scope: determine an employee's access needs. • Metric to maximize was AUC (or Area Under Curve).
  • 11. Empowering DAI (2) - Scoring H2O.ai Machine Intelligence
  • 12. Empowering DAI (3) – Initial results H2O.ai Machine Intelligence
  • 13. Empowering DAI (4) – train predictions H2O.ai Machine Intelligence • Helps to understand how good the model is • Where there might be deficiencies
  • 14. x0 x1 x2 x3 y 0.94 0.27 0.80 0.34 1 0.02 0.22 0.17 0.84 0 0.83 0.11 0.23 0.42 1 0.74 0.26 0.03 0.41 0 0.08 0.29 0.76 0.37 0 0.71 0.76 0.43 0.95 1 0.08 0.72 0.97 0.04 0 0.84 0.79 0.89 0.05 1 0.94 0.27 0.80 0.34 1 0.02 0.22 0.17 0.84 0 0.83 0.11 0.23 0.42 1 0.74 0.26 0.03 0.41 0 0.08 0.29 0.76 0.37 0 0.71 0.76 0.43 0.95 1 0.08 0.72 0.97 0.04 0 0.84 0.79 0.89 0.05 1 Empowering DAI (5.1) - KFold H2O.ai Machine Intelligence
  • 15. x0 x1 x2 x3 y 0.94 0.27 0.80 0.34 1 0.02 0.22 0.17 0.84 0 0.83 0.11 0.23 0.42 1 0.74 0.26 0.03 0.41 0 0.08 0.29 0.76 0.37 0 0.71 0.76 0.43 0.95 1 0.08 0.72 0.97 0.04 0 0.84 0.79 0.89 0.05 1 0.94 0.27 0.80 0.34 1 0.02 0.22 0.17 0.84 0 0.83 0.11 0.23 0.42 1 0.74 0.26 0.03 0.41 0 0.08 0.29 0.76 0.37 0 0.71 0.76 0.43 0.95 1 0.08 0.72 0.97 0.04 0 0.84 0.79 0.89 0.05 1 K=4 pred 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 Empowering DAI (5.2) - KFold H2O.ai Machine Intelligence
  • 16. x0 x1 x2 x3 y 0.94 0.27 0.80 0.34 1 0.02 0.22 0.17 0.84 0 0.83 0.11 0.23 0.42 1 0.74 0.26 0.03 0.41 0 0.08 0.29 0.76 0.37 0 0.71 0.76 0.43 0.95 1 0.08 0.72 0.97 0.04 0 0.84 0.79 0.89 0.05 1 0.94 0.27 0.80 0.34 1 0.02 0.22 0.17 0.84 0 0.83 0.11 0.23 0.42 1 0.74 0.26 0.03 0.41 0 0.08 0.29 0.76 0.37 0 0.71 0.76 0.43 0.95 1 0.08 0.72 0.97 0.04 0 0.84 0.79 0.89 0.05 1 pred 0.96 0.03 0.00 0.00 0.00 0.00 0.00 0.00 Fold : 1 Empowering DAI (5.3) - KFold H2O.ai Machine Intelligence
  • 17. x0 x1 x2 x3 y 0.94 0.27 0.80 0.34 1 0.02 0.22 0.17 0.84 0 0.83 0.11 0.23 0.42 1 0.74 0.26 0.03 0.41 0 0.08 0.29 0.76 0.37 0 0.71 0.76 0.43 0.95 1 0.08 0.72 0.97 0.04 0 0.84 0.79 0.89 0.05 1 0.94 0.27 0.80 0.34 1 0.02 0.22 0.17 0.84 0 0.83 0.11 0.23 0.42 1 0.74 0.26 0.03 0.41 0 0.08 0.29 0.76 0.37 0 0.71 0.76 0.43 0.95 1 0.08 0.72 0.97 0.04 0 0.84 0.79 0.89 0.05 1 Train Predict pred 0.96 0.03 0.90 0.12 0.00 0.00 0.00 0.00 Fold : 2 Empowering DAI (5.4) - KFold H2O.ai Machine Intelligence
  • 18. x0 x1 x2 x3 y 0.94 0.27 0.80 0.34 1 0.02 0.22 0.17 0.84 0 0.83 0.11 0.23 0.42 1 0.74 0.26 0.03 0.41 0 0.08 0.29 0.76 0.37 0 0.71 0.76 0.43 0.95 1 0.08 0.72 0.97 0.04 0 0.84 0.79 0.89 0.05 1 0.94 0.27 0.80 0.34 1 0.02 0.22 0.17 0.84 0 0.83 0.11 0.23 0.42 1 0.74 0.26 0.03 0.41 0 0.08 0.29 0.76 0.37 0 0.71 0.76 0.43 0.95 1 0.08 0.72 0.97 0.04 0 0.84 0.79 0.89 0.05 1 Train Predict Fold : 3 pred 0.96 0.03 0.90 0.12 0.03 0.77 0.00 0.00 Empowering DAI (5.5) - KFold H2O.ai Machine Intelligence
  • 19. x0 x1 x2 x3 y 0.94 0.27 0.80 0.34 1 0.02 0.22 0.17 0.84 0 0.83 0.11 0.23 0.42 1 0.74 0.26 0.03 0.41 0 0.08 0.29 0.76 0.37 0 0.71 0.76 0.43 0.95 1 0.08 0.72 0.97 0.04 0 0.84 0.79 0.89 0.05 1 0.94 0.27 0.80 0.34 1 0.02 0.22 0.17 0.84 0 0.83 0.11 0.23 0.42 1 0.74 0.26 0.03 0.41 0 0.08 0.29 0.76 0.37 0 0.71 0.76 0.43 0.95 1 0.08 0.72 0.97 0.04 0 0.84 0.79 0.89 0.05 1 Train Predict Fold : 4 pred 0.96 0.03 0.90 0.12 0.03 0.77 0.18 0.91 Empowering DAI (5.6) - KFold H2O.ai Machine Intelligence
  • 20. x0 x1 x2 x3 y 0.94 0.27 0.80 0.34 1 0.02 0.22 0.17 0.84 0 0.83 0.11 0.23 0.42 1 0.74 0.26 0.03 0.41 0 0.08 0.29 0.76 0.37 0 0.71 0.76 0.43 0.95 1 0.08 0.72 0.97 0.04 0 0.84 0.79 0.89 0.05 1 0.94 0.27 0.80 0.34 1 0.02 0.22 0.17 0.84 0 0.83 0.11 0.23 0.42 1 0.74 0.26 0.03 0.41 0 0.08 0.29 0.76 0.37 0 0.71 0.76 0.43 0.95 1 0.08 0.72 0.97 0.04 0 0.84 0.79 0.89 0.05 1 Train Predict Fold : 4 pred 0.96 0.03 0.90 0.12 0.03 0.77 0.18 0.91 test 0.43 0.03 0.90 0.12 0.03 0.77 0.18 0.91 Empowering DAI (5.7) - KFold H2O.ai Machine Intelligence
  • 21. x0 x1 x2 x3 y 0.94 0.27 0.80 0.34 1 0.02 0.22 0.17 0.84 0 0.83 0.11 0.23 0.42 1 0.74 0.26 0.03 0.41 0 0.08 0.29 0.76 0.37 0 0.71 0.76 0.43 0.95 1 0.08 0.72 0.97 0.04 0 0.84 0.79 0.89 0.05 1 0.94 0.27 0.80 0.34 1 0.02 0.22 0.17 0.84 0 0.83 0.11 0.23 0.42 1 0.74 0.26 0.03 0.41 0 0.08 0.29 0.76 0.37 0 0.71 0.76 0.43 0.95 1 0.08 0.72 0.97 0.04 0 0.84 0.79 0.89 0.05 1 Train Predict Fold : 4 pred 0.96 0.03 0.90 0.12 0.03 0.77 0.18 0.91 pred 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 test 0.43 0.03 0.90 0.12 0.03 0.77 0.18 0.91 Empowering DAI (5.8) - KFold H2O.ai Machine Intelligence
  • 22. Empowering DAI (6) – Get features H2O.ai Machine Intelligence • Download the feature engineering of DAI • 55 features derived (out of the initial 9) • Target column in training data
  • 23. Empowering DAI (7) – Value of FE H2O.ai Machine Intelligence • Initial set of features is not very predictive without transformations • Features derived in DAI are very predictive Initial Features auc gini RESOURCE 0.501 0.26% MGR_ID 0.460 -8.09% ROLE_ROLLUP_1 0.445 -10.97% ROLE_ROLLUP_2 0.515 3.04% ROLE_DEPTNAME 0.534 6.84% ROLE_TITLE 0.521 4.18% ROLE_FAMILY_DESC 0.528 5.66% ROLE_FAMILY 0.495 -0.98% DAI features auc gini 37_CV_TE_MGR_ID… 0.840 67.9% 18_CV_TE_MGR_ID… 0.819 63.9% 13_CV_TE_MGR_ID… 0.805 61.1% 9_CV_TE_MGR_ID_… 0.796 59.2% 50_WoE_ROLE_DEP… 0.779 55.8% 49_WoE_MGR_ID_R… 0.779 55.7% 45_WoE_MGR_ID_R… 0.774 54.7% 0_CV_TE_MGR_ID_… 0.766 53.2% 8_WoE_MGR_ID_RO… 0.765 53.1% 43_WoE_MGR_ID_R… 0.765 53.0%
  • 24. Empowering DAI (8) - Stacking H2O.ai Machine Intelligence Models built on DAI FE Test LB Lightgbm with gbdt 0.909 Lightgbm with dart 0.909 Extra Trees 0.910 Random Forest 0.907 Logistic Regression 0.898 Lightgbm Rmse 0.906 Lightgbm Huber 0.900 Xgboost 0.908 DAI 0.909 DAIderiveddata Stacking From 0.90933 To 0.91045
  • 25. Empowering DAI (9) – Plus counts H2O.ai Machine Intelligence DAIderiveddata Stacking From 0.91045 To 0.914 DAI is production-ready It ignores information about test data in its learning…Kagglers don’t! Knowing distribution of test data helps make better predictions. For example how frequent a category is Models built on DAI FE Test LB Lightgbm with gbdt 0.909 Lightgbm with dart 0.909 Extra Trees 0.910 Random Forest 0.907 Logistic Regression 0.898 Lightgbm Rmse 0.906 Lightgbm Huber 0.900 Xgboost 0.908 DAI 0.909 Lightgbm plus counts 0.913
  • 26. Models built on DAI FE Test LB Lightgbm with gbdt 0.909 Lightgbm with dart 0.909 Extra Trees 0.910 Random Forest 0.907 Logistic Regression 0.898 Lightgbm rmse 0.906 Lightgbm Huber 0.900 Xgboost 0.908 DAI 0.909 Lightgbm plus counts 0.913 Logistic plus dummies 0.907 Empowering DAI (10) – Plus OHE H2O.ai Machine Intelligence DAIderiveddata Stacking From 0.914 To 0.9158 Logistic model does not perform as good. Because best features were found using tree methods Dummy Variables or One-Hot Encoding can improve results for linear models.
  • 27. Empowering DAI (11) - Timeline H2O.ai Machine Intelligence Predictions from DAI| Rank 79 | 0.9093 Plus Stacking 9 models| Rank 73 | 0.91045 Test counts | Rank 38 | 0.9139 Dummies | Rank 20 | 0.9158 0.906 0.908 0.91 0.912 0.914 0.916 0.918 2 hours 4 hours 5 hours 6 hours AUC IN TEST DATA HOURSIN AUC IN TEST DATA VERSUS TIME
  • 28. Further Improvement • Let it run more time. • More DAI datasets. The genetic algorithm may come up with (slightly) different features every time • Check predictions, search for areas were DAI might not have done as well as you • Add deep learning models or other algorithmic families • Add your own features • Add your own models and do stacking using the Kfold paradigm H2O.ai Machine Intelligence
  • 29. Final words • Can DAI beat me in predictive modelling competitions? • In time, (probably) yes • In depth and creativity, (probably) no • Can I improve my score with DAI? • Yes, I can use the features in my models • Yes, I can use the predictions of stacking • Yes, I can use the interpretability module or other tools to get insight about potential additions/pitfalls • Yes, While DAI is running I can focus on other things , like checking visualizations and/or exploring the data. H2O.ai Machine Intelligence

Editor's Notes

  1. 1
  2. 3
  3. 4
  4. 5
  5. 6
  6. 7
  7. 8
  8. 9
  9. 10
  10. 11
  11. 12
  12. 13
  13. 22
  14. 23
  15. 24
  16. 25
  17. 26
  18. 27
  19. 28
  20. 29
  21. 30