SlideShare a Scribd company logo
1 of 32
Download to read offline
The basics of statistical hypothesis
testing in E-commerce.
By Anatoly Vuets
Agenda
• Why do we use (we should use) statistical hypothesis testing in e-commerce?
• Statistical test: how does it work and its main parameters
• Key features for e-commerce
Why do we need statistical
testing in e-commerce?
We need the right decisions
• A/B tests
• Ad-hoc analyses
• Building models
We need the right decisions
• A question: which of these groups makes more profit?
• What is missing here?
We need the right decisions
• A/B test: which version is better?
Statistical test: let’s recall
the basics!
• Random variable (discrete or continuous)
• Probability distribution function (PMF(x), PDF(x))
• Mean M or μ
• Standard deviation SD or σ
Basics of statistics
Basics of statistics: standard
distribution
Statistical test: uncertainty.
...
...
...
...
..................
True metrics value
Statistical population
Sample Possible samples
...
Observed value Other possible values
(distribution)
Uncertainty
We want to conclude about the statistical population based on single sample that we have
observed
Statistical population Observed sample Possible samples
Why is this important?
Distribution of metrics estimate
Statistical test: basic idea
and main parameters.
• We want to test a statement (typically existence of an effect).
• We have a set of observations (sample) from which we conclude the statement.
• Scenario, in which the statement is TRUE is called alternative hypothesis H1.
• Scenario, in which the statement is FALSE is called null hypothesis H0.
• Estimate the probability to observe the sample we have under H0.
• If the probability is high enough - we conclude that H1 can not be accepted. In the opposite
case, we accept H1.
Idea
... H0/H1𝗧(S)
H0: C = 5% H1: C > 5%
Statistical test
H0 H1
H0 Correct
P: 1 - α
Error T1
P: α
H1 Error T2
P: β
Correct
P: 1 - β
Test T(s)
Truth
• Error T1 - accept H1 when H0 is true.
• Error T2 -accept H0 when H1 is true.
• We would like to have a perfect test (α = 0, β = 0).
However as we shall see later, this is impossible in
practice. Because of this, test design and result
interpretation are crucial for proper decision
making.
Statistical test parameters
A detector can be considered as a binary classifier: passenger does not have (H0) or has metal
objects (H1) (weapon etc.)
The detector has a sensitivity knob (decision boundary).
If the sensitivity is low detector falsely detects metal in α = 5% of cases, but skips metal in β =
67% of cases.
If the sensitivity is high - it falsely detects metal in α = 50%, but skips in β = 0.3% of cases.
Intermediate sensitivity values allow choosing the trade-off between skipping a passenger
who has hidden metal objects (increases probability of an incident) and the service speed
(additional airport costs and lower passenger satisfaction).
Statistical test parameters: metal
detector in airport
Statistical tests based on data achieved from an A/B test can be treated as a classifier which is
supposed to tell whether conversion rate increased (H1) or remained the same (H0).
Question: which trade-off between α and β would you choose?
Statistical test parameters:
increasing web-page conversion rate
• H0: C = 5%, H1: C > 5%
• T(s) = c/n, n = 3600
• significance level = 5%
• P(T|H0) - ?
Theory:
Simulation:
bootstrap
How does statistical test work:
distribution P(T|H0)
How does statistical test works:
significance level and decision boundary
• H0: C = 5%, H1: C > 5%
• T(s) = c/n, n = 3600
• significance level = 5%
• P(T|H1) - ?
Hypothesis H1 consists of
infinite number of
hypotheses: C = 5.1%, C =
5.2% … Which one should
we consider?
• H1: С = 5.5%
(+ 10%, minimum expected boost)
How does statistical test works:
distribution P(T|H1)
How does statistical test work:
significance level vs power
How does statistical test work:
significance level vs power
Important features of statistical
testing in e-commerce
Growth dynamics of metrics
Significance level vs power trade-off
improvement: sample size
Significance level vs power trade-off
improvement: effect size
Question: what should we do if we choose α = 10% but got p.value = 12%?
Uncertainty of p-value
• Key parameters of the statistical test are significance level and power that correspond to the
probability of false detection and probability to miss effect.
• Increased test power can be achieved in two ways: by increasing sample size or by increasing
effect size
• Keep in mind that p-value is a random statistic! It is important to account for its uncertainty.
• Mind that some metrics (like conversion from registration to buyer) may take significant time
to measure
• Anomalies in data may dramatically impact test results
Summary
Conclusions
• In e-commerce, test power is often of the most importance (probability not to miss effect)
• In the case of high-traffic business: the required trade-off between significance level and
power can be easily achieved by increasing the sample size.
• In the case of low-traffic business: focus on features which:
1) are cheap, easy to implement and not risky, or
2) have potentially big effects.
Thank you for your attention!

More Related Content

What's hot

Erp for mining industry
Erp for mining industryErp for mining industry
Erp for mining industryJaimin Dave
 
UBER-Current Strategy, Competition Analysis and Global Expansion
UBER-Current Strategy, Competition Analysis and Global ExpansionUBER-Current Strategy, Competition Analysis and Global Expansion
UBER-Current Strategy, Competition Analysis and Global ExpansionShaminder Saini
 
Supply chain Management of P&G
Supply chain Management of P&G Supply chain Management of P&G
Supply chain Management of P&G Fatima Rani
 
Vinculum E-Retail Product Suite for Retailers and E-commerce Vendors
Vinculum E-Retail Product Suite for Retailers and E-commerce VendorsVinculum E-Retail Product Suite for Retailers and E-commerce Vendors
Vinculum E-Retail Product Suite for Retailers and E-commerce VendorsSiddhartha Tripathi
 
Research paper review on car pooling using android operating system a step t...
Research paper review on car pooling using  android operating system a step t...Research paper review on car pooling using  android operating system a step t...
Research paper review on car pooling using android operating system a step t...Akshay Shelake
 
Artificial intelligence (ai) and its impact to business
Artificial intelligence (ai) and its impact to businessArtificial intelligence (ai) and its impact to business
Artificial intelligence (ai) and its impact to businesspaul young cpa, cga
 
Daraz.pk presentation
Daraz.pk presentationDaraz.pk presentation
Daraz.pk presentationSara Amjad
 
Styles To Frame Digital Marketing In Automobile Industry
 Styles To Frame Digital Marketing In Automobile Industry Styles To Frame Digital Marketing In Automobile Industry
Styles To Frame Digital Marketing In Automobile IndustryOmnePresent
 
AI in Marketing: Guest lecture at Bournemouth university
AI in Marketing: Guest lecture at Bournemouth university  AI in Marketing: Guest lecture at Bournemouth university
AI in Marketing: Guest lecture at Bournemouth university Zoodikers
 
Ai in Digital Marketing
Ai in Digital MarketingAi in Digital Marketing
Ai in Digital MarketingDmitri Zotov
 
Redbus.in - Market Positioning
Redbus.in - Market PositioningRedbus.in - Market Positioning
Redbus.in - Market PositioningRitesh Hati
 
트위터 마케팅 활용 및 사례
트위터 마케팅 활용 및 사례트위터 마케팅 활용 및 사례
트위터 마케팅 활용 및 사례DMC미디어
 
myntra Online Shopping Store
myntra Online Shopping Store myntra Online Shopping Store
myntra Online Shopping Store Shreya Singh
 
BBC WorldWide IT Strategy
BBC WorldWide IT StrategyBBC WorldWide IT Strategy
BBC WorldWide IT StrategyGourav Nagar
 
Amazon supply chain
Amazon supply chainAmazon supply chain
Amazon supply chainParth Thakar
 
A CASE STUDY ALIBABA.COM
A CASE STUDY ALIBABA.COMA CASE STUDY ALIBABA.COM
A CASE STUDY ALIBABA.COMIshworKhatiwada
 

What's hot (20)

Erp for mining industry
Erp for mining industryErp for mining industry
Erp for mining industry
 
UBER-Current Strategy, Competition Analysis and Global Expansion
UBER-Current Strategy, Competition Analysis and Global ExpansionUBER-Current Strategy, Competition Analysis and Global Expansion
UBER-Current Strategy, Competition Analysis and Global Expansion
 
AMAZON-CASE STUDY
AMAZON-CASE STUDYAMAZON-CASE STUDY
AMAZON-CASE STUDY
 
Supply chain Management of P&G
Supply chain Management of P&G Supply chain Management of P&G
Supply chain Management of P&G
 
Vinculum E-Retail Product Suite for Retailers and E-commerce Vendors
Vinculum E-Retail Product Suite for Retailers and E-commerce VendorsVinculum E-Retail Product Suite for Retailers and E-commerce Vendors
Vinculum E-Retail Product Suite for Retailers and E-commerce Vendors
 
Social Media and American Express: Case Studies
Social Media and American Express: Case StudiesSocial Media and American Express: Case Studies
Social Media and American Express: Case Studies
 
Research paper review on car pooling using android operating system a step t...
Research paper review on car pooling using  android operating system a step t...Research paper review on car pooling using  android operating system a step t...
Research paper review on car pooling using android operating system a step t...
 
Artificial intelligence (ai) and its impact to business
Artificial intelligence (ai) and its impact to businessArtificial intelligence (ai) and its impact to business
Artificial intelligence (ai) and its impact to business
 
Daraz.pk presentation
Daraz.pk presentationDaraz.pk presentation
Daraz.pk presentation
 
Styles To Frame Digital Marketing In Automobile Industry
 Styles To Frame Digital Marketing In Automobile Industry Styles To Frame Digital Marketing In Automobile Industry
Styles To Frame Digital Marketing In Automobile Industry
 
AI in Marketing: Guest lecture at Bournemouth university
AI in Marketing: Guest lecture at Bournemouth university  AI in Marketing: Guest lecture at Bournemouth university
AI in Marketing: Guest lecture at Bournemouth university
 
Ai in Digital Marketing
Ai in Digital MarketingAi in Digital Marketing
Ai in Digital Marketing
 
Amazon
AmazonAmazon
Amazon
 
Redbus.in - Market Positioning
Redbus.in - Market PositioningRedbus.in - Market Positioning
Redbus.in - Market Positioning
 
Amazon.com
Amazon.comAmazon.com
Amazon.com
 
트위터 마케팅 활용 및 사례
트위터 마케팅 활용 및 사례트위터 마케팅 활용 및 사례
트위터 마케팅 활용 및 사례
 
myntra Online Shopping Store
myntra Online Shopping Store myntra Online Shopping Store
myntra Online Shopping Store
 
BBC WorldWide IT Strategy
BBC WorldWide IT StrategyBBC WorldWide IT Strategy
BBC WorldWide IT Strategy
 
Amazon supply chain
Amazon supply chainAmazon supply chain
Amazon supply chain
 
A CASE STUDY ALIBABA.COM
A CASE STUDY ALIBABA.COMA CASE STUDY ALIBABA.COM
A CASE STUDY ALIBABA.COM
 

Similar to Statistical hypothesis testing in e commerce

Elementary Data Analysis with MS Excel_Day-5
Elementary Data Analysis with MS Excel_Day-5Elementary Data Analysis with MS Excel_Day-5
Elementary Data Analysis with MS Excel_Day-5Redwan Ferdous
 
Introduction To Data Science Using R
Introduction To Data Science Using RIntroduction To Data Science Using R
Introduction To Data Science Using RANURAG SINGH
 
Intro to data science
Intro to data scienceIntro to data science
Intro to data scienceANURAG SINGH
 
How Significant is Statistically Significant? The case of Audio Music Similar...
How Significant is Statistically Significant? The case of Audio Music Similar...How Significant is Statistically Significant? The case of Audio Music Similar...
How Significant is Statistically Significant? The case of Audio Music Similar...Julián Urbano
 
A05 Continuous One Variable Stat Tests
A05 Continuous One Variable Stat TestsA05 Continuous One Variable Stat Tests
A05 Continuous One Variable Stat TestsLeanleaders.org
 
A05 Continuous One Variable Stat Tests
A05 Continuous One Variable Stat TestsA05 Continuous One Variable Stat Tests
A05 Continuous One Variable Stat TestsLeanleaders.org
 
1192012 155942 f023_=_statistical_inference
1192012 155942 f023_=_statistical_inference1192012 155942 f023_=_statistical_inference
1192012 155942 f023_=_statistical_inferenceDev Pandey
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2
 
Project two guidelines and rubric.html competencyin this pr
Project two guidelines and rubric.html competencyin this prProject two guidelines and rubric.html competencyin this pr
Project two guidelines and rubric.html competencyin this prPOLY33
 
Hypothesis Testing: Proportions (Compare 1:Standard)
Hypothesis Testing: Proportions (Compare 1:Standard)Hypothesis Testing: Proportions (Compare 1:Standard)
Hypothesis Testing: Proportions (Compare 1:Standard)Matt Hansen
 
hypothesis teesting
 hypothesis teesting hypothesis teesting
hypothesis teestingkpgandhi
 
Chi square analysis-for_attribute_data_(01-14-06)
Chi square analysis-for_attribute_data_(01-14-06)Chi square analysis-for_attribute_data_(01-14-06)
Chi square analysis-for_attribute_data_(01-14-06)Daniel Augustine
 
Business Research Methods Unit V
Business Research Methods Unit VBusiness Research Methods Unit V
Business Research Methods Unit VKartikeya Singh
 
Vital QMS Process Validation Statistics - OMTEC 2018
Vital QMS Process Validation Statistics - OMTEC 2018Vital QMS Process Validation Statistics - OMTEC 2018
Vital QMS Process Validation Statistics - OMTEC 2018April Bright
 
ISSTA'16 Summer School: Intro to Statistics
ISSTA'16 Summer School: Intro to StatisticsISSTA'16 Summer School: Intro to Statistics
ISSTA'16 Summer School: Intro to StatisticsAndrea Arcuri
 
Calculating a Sample Size
Calculating a Sample SizeCalculating a Sample Size
Calculating a Sample SizeMatt Hansen
 
What is the Independent Samples T Test Method of Analysis and How Can it Bene...
What is the Independent Samples T Test Method of Analysis and How Can it Bene...What is the Independent Samples T Test Method of Analysis and How Can it Bene...
What is the Independent Samples T Test Method of Analysis and How Can it Bene...Smarten Augmented Analytics
 

Similar to Statistical hypothesis testing in e commerce (20)

ABTest-20231020.pptx
ABTest-20231020.pptxABTest-20231020.pptx
ABTest-20231020.pptx
 
Elementary Data Analysis with MS Excel_Day-5
Elementary Data Analysis with MS Excel_Day-5Elementary Data Analysis with MS Excel_Day-5
Elementary Data Analysis with MS Excel_Day-5
 
Introduction To Data Science Using R
Introduction To Data Science Using RIntroduction To Data Science Using R
Introduction To Data Science Using R
 
Intro to data science
Intro to data scienceIntro to data science
Intro to data science
 
How Significant is Statistically Significant? The case of Audio Music Similar...
How Significant is Statistically Significant? The case of Audio Music Similar...How Significant is Statistically Significant? The case of Audio Music Similar...
How Significant is Statistically Significant? The case of Audio Music Similar...
 
A05 Continuous One Variable Stat Tests
A05 Continuous One Variable Stat TestsA05 Continuous One Variable Stat Tests
A05 Continuous One Variable Stat Tests
 
A05 Continuous One Variable Stat Tests
A05 Continuous One Variable Stat TestsA05 Continuous One Variable Stat Tests
A05 Continuous One Variable Stat Tests
 
1192012 155942 f023_=_statistical_inference
1192012 155942 f023_=_statistical_inference1192012 155942 f023_=_statistical_inference
1192012 155942 f023_=_statistical_inference
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
Project two guidelines and rubric.html competencyin this pr
Project two guidelines and rubric.html competencyin this prProject two guidelines and rubric.html competencyin this pr
Project two guidelines and rubric.html competencyin this pr
 
Hypothesis Testing: Proportions (Compare 1:Standard)
Hypothesis Testing: Proportions (Compare 1:Standard)Hypothesis Testing: Proportions (Compare 1:Standard)
Hypothesis Testing: Proportions (Compare 1:Standard)
 
hypothesis teesting
 hypothesis teesting hypothesis teesting
hypothesis teesting
 
Chi square analysis-for_attribute_data_(01-14-06)
Chi square analysis-for_attribute_data_(01-14-06)Chi square analysis-for_attribute_data_(01-14-06)
Chi square analysis-for_attribute_data_(01-14-06)
 
Business Research Methods Unit V
Business Research Methods Unit VBusiness Research Methods Unit V
Business Research Methods Unit V
 
Hypothsis testing
Hypothsis testingHypothsis testing
Hypothsis testing
 
Meetup_FGVA_Uplift @ Dataiku
Meetup_FGVA_Uplift @ DataikuMeetup_FGVA_Uplift @ Dataiku
Meetup_FGVA_Uplift @ Dataiku
 
Vital QMS Process Validation Statistics - OMTEC 2018
Vital QMS Process Validation Statistics - OMTEC 2018Vital QMS Process Validation Statistics - OMTEC 2018
Vital QMS Process Validation Statistics - OMTEC 2018
 
ISSTA'16 Summer School: Intro to Statistics
ISSTA'16 Summer School: Intro to StatisticsISSTA'16 Summer School: Intro to Statistics
ISSTA'16 Summer School: Intro to Statistics
 
Calculating a Sample Size
Calculating a Sample SizeCalculating a Sample Size
Calculating a Sample Size
 
What is the Independent Samples T Test Method of Analysis and How Can it Bene...
What is the Independent Samples T Test Method of Analysis and How Can it Bene...What is the Independent Samples T Test Method of Analysis and How Can it Bene...
What is the Independent Samples T Test Method of Analysis and How Can it Bene...
 

Recently uploaded

WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )
WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )
WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )Pooja Nehwal
 
SBFT Tool Competition 2024 -- Python Test Case Generation Track
SBFT Tool Competition 2024 -- Python Test Case Generation TrackSBFT Tool Competition 2024 -- Python Test Case Generation Track
SBFT Tool Competition 2024 -- Python Test Case Generation TrackSebastiano Panichella
 
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdfCTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdfhenrik385807
 
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...henrik385807
 
Call Girls in Sarojini Nagar Market Delhi 💯 Call Us 🔝8264348440🔝
Call Girls in Sarojini Nagar Market Delhi 💯 Call Us 🔝8264348440🔝Call Girls in Sarojini Nagar Market Delhi 💯 Call Us 🔝8264348440🔝
Call Girls in Sarojini Nagar Market Delhi 💯 Call Us 🔝8264348440🔝soniya singh
 
Call Girls in Rohini Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Rohini Delhi 💯Call Us 🔝8264348440🔝Call Girls in Rohini Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Rohini Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...Pooja Nehwal
 
Open Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdf
Open Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdfOpen Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdf
Open Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdfhenrik385807
 
Microsoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AIMicrosoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AITatiana Gurgel
 
Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...
Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...
Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...NETWAYS
 
George Lever - eCommerce Day Chile 2024
George Lever -  eCommerce Day Chile 2024George Lever -  eCommerce Day Chile 2024
George Lever - eCommerce Day Chile 2024eCommerce Institute
 
Simulation-based Testing of Unmanned Aerial Vehicles with Aerialist
Simulation-based Testing of Unmanned Aerial Vehicles with AerialistSimulation-based Testing of Unmanned Aerial Vehicles with Aerialist
Simulation-based Testing of Unmanned Aerial Vehicles with AerialistSebastiano Panichella
 
Motivation and Theory Maslow and Murray pdf
Motivation and Theory Maslow and Murray pdfMotivation and Theory Maslow and Murray pdf
Motivation and Theory Maslow and Murray pdfakankshagupta7348026
 
Work Remotely with Confluence ACE 2.pptx
Work Remotely with Confluence ACE 2.pptxWork Remotely with Confluence ACE 2.pptx
Work Remotely with Confluence ACE 2.pptxmavinoikein
 
OSCamp Kubernetes 2024 | Zero-Touch OS-Infrastruktur für Container und Kubern...
OSCamp Kubernetes 2024 | Zero-Touch OS-Infrastruktur für Container und Kubern...OSCamp Kubernetes 2024 | Zero-Touch OS-Infrastruktur für Container und Kubern...
OSCamp Kubernetes 2024 | Zero-Touch OS-Infrastruktur für Container und Kubern...NETWAYS
 
The 3rd Intl. Workshop on NL-based Software Engineering
The 3rd Intl. Workshop on NL-based Software EngineeringThe 3rd Intl. Workshop on NL-based Software Engineering
The 3rd Intl. Workshop on NL-based Software EngineeringSebastiano Panichella
 
Russian Call Girls in Kolkata Vaishnavi 🤌 8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Vaishnavi 🤌  8250192130 🚀 Vip Call Girls KolkataRussian Call Girls in Kolkata Vaishnavi 🤌  8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Vaishnavi 🤌 8250192130 🚀 Vip Call Girls Kolkataanamikaraghav4
 
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779Delhi Call girls
 
OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...
OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...
OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...NETWAYS
 
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024eCommerce Institute
 

Recently uploaded (20)

WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )
WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )
WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )
 
SBFT Tool Competition 2024 -- Python Test Case Generation Track
SBFT Tool Competition 2024 -- Python Test Case Generation TrackSBFT Tool Competition 2024 -- Python Test Case Generation Track
SBFT Tool Competition 2024 -- Python Test Case Generation Track
 
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdfCTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
 
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
 
Call Girls in Sarojini Nagar Market Delhi 💯 Call Us 🔝8264348440🔝
Call Girls in Sarojini Nagar Market Delhi 💯 Call Us 🔝8264348440🔝Call Girls in Sarojini Nagar Market Delhi 💯 Call Us 🔝8264348440🔝
Call Girls in Sarojini Nagar Market Delhi 💯 Call Us 🔝8264348440🔝
 
Call Girls in Rohini Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Rohini Delhi 💯Call Us 🔝8264348440🔝Call Girls in Rohini Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Rohini Delhi 💯Call Us 🔝8264348440🔝
 
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
 
Open Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdf
Open Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdfOpen Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdf
Open Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdf
 
Microsoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AIMicrosoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AI
 
Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...
Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...
Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...
 
George Lever - eCommerce Day Chile 2024
George Lever -  eCommerce Day Chile 2024George Lever -  eCommerce Day Chile 2024
George Lever - eCommerce Day Chile 2024
 
Simulation-based Testing of Unmanned Aerial Vehicles with Aerialist
Simulation-based Testing of Unmanned Aerial Vehicles with AerialistSimulation-based Testing of Unmanned Aerial Vehicles with Aerialist
Simulation-based Testing of Unmanned Aerial Vehicles with Aerialist
 
Motivation and Theory Maslow and Murray pdf
Motivation and Theory Maslow and Murray pdfMotivation and Theory Maslow and Murray pdf
Motivation and Theory Maslow and Murray pdf
 
Work Remotely with Confluence ACE 2.pptx
Work Remotely with Confluence ACE 2.pptxWork Remotely with Confluence ACE 2.pptx
Work Remotely with Confluence ACE 2.pptx
 
OSCamp Kubernetes 2024 | Zero-Touch OS-Infrastruktur für Container und Kubern...
OSCamp Kubernetes 2024 | Zero-Touch OS-Infrastruktur für Container und Kubern...OSCamp Kubernetes 2024 | Zero-Touch OS-Infrastruktur für Container und Kubern...
OSCamp Kubernetes 2024 | Zero-Touch OS-Infrastruktur für Container und Kubern...
 
The 3rd Intl. Workshop on NL-based Software Engineering
The 3rd Intl. Workshop on NL-based Software EngineeringThe 3rd Intl. Workshop on NL-based Software Engineering
The 3rd Intl. Workshop on NL-based Software Engineering
 
Russian Call Girls in Kolkata Vaishnavi 🤌 8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Vaishnavi 🤌  8250192130 🚀 Vip Call Girls KolkataRussian Call Girls in Kolkata Vaishnavi 🤌  8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Vaishnavi 🤌 8250192130 🚀 Vip Call Girls Kolkata
 
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
 
OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...
OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...
OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...
 
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
 

Statistical hypothesis testing in e commerce

  • 1. The basics of statistical hypothesis testing in E-commerce. By Anatoly Vuets
  • 2. Agenda • Why do we use (we should use) statistical hypothesis testing in e-commerce? • Statistical test: how does it work and its main parameters • Key features for e-commerce
  • 3. Why do we need statistical testing in e-commerce?
  • 4. We need the right decisions • A/B tests • Ad-hoc analyses • Building models
  • 5. We need the right decisions • A question: which of these groups makes more profit? • What is missing here?
  • 6. We need the right decisions • A/B test: which version is better?
  • 7. Statistical test: let’s recall the basics!
  • 8. • Random variable (discrete or continuous) • Probability distribution function (PMF(x), PDF(x)) • Mean M or μ • Standard deviation SD or σ Basics of statistics
  • 9. Basics of statistics: standard distribution
  • 11. ... ... ... ... .................. True metrics value Statistical population Sample Possible samples ... Observed value Other possible values (distribution) Uncertainty
  • 12. We want to conclude about the statistical population based on single sample that we have observed Statistical population Observed sample Possible samples Why is this important?
  • 14. Statistical test: basic idea and main parameters.
  • 15. • We want to test a statement (typically existence of an effect). • We have a set of observations (sample) from which we conclude the statement. • Scenario, in which the statement is TRUE is called alternative hypothesis H1. • Scenario, in which the statement is FALSE is called null hypothesis H0. • Estimate the probability to observe the sample we have under H0. • If the probability is high enough - we conclude that H1 can not be accepted. In the opposite case, we accept H1. Idea
  • 16. ... H0/H1𝗧(S) H0: C = 5% H1: C > 5% Statistical test
  • 17. H0 H1 H0 Correct P: 1 - α Error T1 P: α H1 Error T2 P: β Correct P: 1 - β Test T(s) Truth • Error T1 - accept H1 when H0 is true. • Error T2 -accept H0 when H1 is true. • We would like to have a perfect test (α = 0, β = 0). However as we shall see later, this is impossible in practice. Because of this, test design and result interpretation are crucial for proper decision making. Statistical test parameters
  • 18. A detector can be considered as a binary classifier: passenger does not have (H0) or has metal objects (H1) (weapon etc.) The detector has a sensitivity knob (decision boundary). If the sensitivity is low detector falsely detects metal in α = 5% of cases, but skips metal in β = 67% of cases. If the sensitivity is high - it falsely detects metal in α = 50%, but skips in β = 0.3% of cases. Intermediate sensitivity values allow choosing the trade-off between skipping a passenger who has hidden metal objects (increases probability of an incident) and the service speed (additional airport costs and lower passenger satisfaction). Statistical test parameters: metal detector in airport
  • 19. Statistical tests based on data achieved from an A/B test can be treated as a classifier which is supposed to tell whether conversion rate increased (H1) or remained the same (H0). Question: which trade-off between α and β would you choose? Statistical test parameters: increasing web-page conversion rate
  • 20. • H0: C = 5%, H1: C > 5% • T(s) = c/n, n = 3600 • significance level = 5% • P(T|H0) - ? Theory: Simulation: bootstrap How does statistical test work: distribution P(T|H0)
  • 21. How does statistical test works: significance level and decision boundary
  • 22. • H0: C = 5%, H1: C > 5% • T(s) = c/n, n = 3600 • significance level = 5% • P(T|H1) - ? Hypothesis H1 consists of infinite number of hypotheses: C = 5.1%, C = 5.2% … Which one should we consider? • H1: С = 5.5% (+ 10%, minimum expected boost) How does statistical test works: distribution P(T|H1)
  • 23. How does statistical test work: significance level vs power
  • 24. How does statistical test work: significance level vs power
  • 25. Important features of statistical testing in e-commerce
  • 27. Significance level vs power trade-off improvement: sample size
  • 28. Significance level vs power trade-off improvement: effect size
  • 29. Question: what should we do if we choose α = 10% but got p.value = 12%? Uncertainty of p-value
  • 30. • Key parameters of the statistical test are significance level and power that correspond to the probability of false detection and probability to miss effect. • Increased test power can be achieved in two ways: by increasing sample size or by increasing effect size • Keep in mind that p-value is a random statistic! It is important to account for its uncertainty. • Mind that some metrics (like conversion from registration to buyer) may take significant time to measure • Anomalies in data may dramatically impact test results Summary
  • 31. Conclusions • In e-commerce, test power is often of the most importance (probability not to miss effect) • In the case of high-traffic business: the required trade-off between significance level and power can be easily achieved by increasing the sample size. • In the case of low-traffic business: focus on features which: 1) are cheap, easy to implement and not risky, or 2) have potentially big effects.
  • 32. Thank you for your attention!