SlideShare a Scribd company logo
1 of 12
Download to read offline
Probability Distribution for Non-math People
People who don’t have advanced mathematical knowledge still
needs to understanding the AI/BI, machine learning, data
analysis results so they could use these results
Author: Dr. Guang Yang
email: guangyang@btconnect.com
Head on Probability and Distribution
• How much confidence you could gain from results from BI/
AI, analytical tools, and machine learning. How the
confidence changes with environmental changes.
• Probability distribution is fundamental to understand
statistics, AI/BI, machine learning, exploration analysis
results for probabilistic/soft classification or regression
• Understand probability and distribution as an observer
without going mathematical knowledge and details
• The basic concepts: probability, conditional probability, join
probability and their distribution
Example for Explain Probability Distribution
Glasgow office has 30 male and 10 female staffs. There are 3 male managers
and 4 female managers among them.
We could define 2 events according to the characters: event1: staff gender: male,
female; event 2: staff role: manager; worker. Image a visitor knock the door, what
are the probabilities ?
probability: if a male staff to open the door: P(gender=male)=0.75) and P(gender !
=male)=0.25;
conditional probability: if a female opened door, what chance she was a manager
P(role=manger|gender=female) = 0.4
join probability: what a chance if the person who open door is a female manager:
P(gender=female, role=manager)=0.4 x 0.25 = 0.1
probability distribution: for some reason, work from home, etc.. the office never
full, what are probabilities of male staff in 18, 19, 20 … with 30 staffs in office,
respectively.
Generally Looking at Probabilities
• Understand events and their outcomes: event gender has outcome
{male, female}, event role has outcome {manager, worker}; beware
event domain or scope of space
• Experiment: count the number of repeating event trail outcome
separately
• Computing probabilities using the counted numbers against
corresponding event domains.
• Define the relationships in terms of conditional, join, and both
• Conditional probability, known first event outcome, what a chance
of second event outcome.
• Join probability: want to know what chance if two event outcomes
come together
Probability Distribution Introduction
• Event is a set of outcomes of an experiment to which probability is
assigned
• Probability distribution is a description of random phenomenon in
terms of the probabilities of events
• Have a probability, want to know how the probability changes with
the changes of environment or parameters.
• Choose appropriated probability distribution for calculation
according to the characters of outcomes and events.
• The most popular type of distribution is identical independent
distribution — i.i.d
• There are hundreds of probability distributions, but 15 are common
distribution, and their relationships show next slide.
Understand Probability Distribution Parameters
• Identify random variables and distribution parameters.
• Common distribution parameters include mean, variance, and size
of domain.
• Mean is a weighted average of possible values that random variable
can take. It is also called Expected value of the random variable.
• Variance measures the spread or variability of distribution. It
indicates the likely range of variability among the mean. Its square
root called standard deviation
• Identify the source of the distribution parameters, i. e. from sample
space or from population.
Relationships of probability distributions
Author: Sean Owen
Choosing Probability Distributions—1
• Bernoulli and Uniform both are single trail, Bernoulli has
two outcomes with one probability as p (not necessary 0.5),
another 1-p. Uniform has n outcomes, and each outcome
has probability 1/n
• Binormal distribution: repeat trials of Bernoulli distributions
and trails are independent to each other.
• Hypergeometric distribution: similar to Binormal distribution
except trails are NOT independent each other, such as pick
up a colour ball from urns without replacement.
• Poisson distribution: binary outcomes, probability p is small,
and trails n is large. λ <—np such as the river had been
flooded 3 times within 100 years ( λ=3)
Choosing Probability Distribution — 2
• Geometric distribution: solves “how many failed until a
success” comparing to Binormal distribution which solves
“how many successes”
• Negative binormal distribution is a simple generalisation to
geometric distribution with r success instead 1 success; i.e.
“how many failures until r successes”
• Exponential distribution: a continuous distribution, typically to
solve “how long until an event” in comparison with Poisson
“how many events per time”
• Weibull distribution: a generalisation of exponential
distribution to describe time-to-failure. The rate λ of failure is
vary, whereas exponential distribution rate is constant.
Choosing Probability Distribution — 3
• Normal (Gaussian) distribution: a most important continuous
distribution. It is also call Bell shape distribution. The
distributions of the sums of other distributions follows
(approximately) the normal distribution (central limit theorem)
• Log-normal distribution: it takes values whose logarithm is
normally distributed, or exponentiation of a normally distributed
value.
• t-distribution: reasoning about the means of normal distribution,
and approaches the normal distribution as its parameters
changes.
• chi-squared distribution: a distribution of the sum of squares of
normal -distributed values. chi-squared test is the sum of
squires of differences, which supposed to be normal distribution.
Choosing Probability Distribution — 4
• Gamma distribution: a two parameters family continue
distribution, the generalisation of both the exponential and
ch-squared distributions, model continuous variables that
are always positive and have skewed distributions. it
could be used to model waiting time until next n event
occur. Conjugate prior to a couple distributions in machine
learning.
• Beta distribution: a family of continuous probability
distribution defined on the interval [0, 1], used to model
the behaviour of random variable limited to intervals of
finite length in a wide variety of disciplines. It also for the
conjugate prior
Conclusion
• Many people won’t do data analysis or machine learning, but
they may use the analytic or machine learning results. They
may need to communicate with data scientist and analyst.
• In many cases, probability distributions are basis for data
visualisation. To understand chart or diagram, the basic
concepts should be understood
• Many classifications, such as maximum likelihood estimation
(MLE) are based on probability distribution principles.

More Related Content

What's hot

Sampling Distribution and Simulation in R
Sampling Distribution and Simulation in RSampling Distribution and Simulation in R
Sampling Distribution and Simulation in RPremier Publishers
 
Statistical inference 2
Statistical inference 2Statistical inference 2
Statistical inference 2safi Ullah
 
Types of Statistics Descriptive and Inferential Statistics
Types of Statistics Descriptive and Inferential StatisticsTypes of Statistics Descriptive and Inferential Statistics
Types of Statistics Descriptive and Inferential StatisticsDr. Amjad Ali Arain
 
Descriptive Statistics
Descriptive StatisticsDescriptive Statistics
Descriptive StatisticsBhagya Silva
 
Statistical treatment of data
Statistical treatment of dataStatistical treatment of data
Statistical treatment of datasenseiDelfin
 
Introduction to Statistics and Statistical Inference
Introduction to Statistics and Statistical InferenceIntroduction to Statistics and Statistical Inference
Introduction to Statistics and Statistical InferenceJezhabeth Villegas
 
Statistical Treatment
Statistical TreatmentStatistical Treatment
Statistical TreatmentDaryl Tabogoc
 
Chap06 sampling and sampling distributions
Chap06 sampling and sampling distributionsChap06 sampling and sampling distributions
Chap06 sampling and sampling distributionsJudianto Nugroho
 
Measures of variability
Measures of variabilityMeasures of variability
Measures of variabilityJed Abolencia
 
CABT SHS Statistics & Probability - Estimation of Parameters (intro)
CABT SHS Statistics & Probability -  Estimation of Parameters (intro)CABT SHS Statistics & Probability -  Estimation of Parameters (intro)
CABT SHS Statistics & Probability - Estimation of Parameters (intro)Gilbert Joseph Abueg
 
Sampling distribution
Sampling distributionSampling distribution
Sampling distributionDanu Saputra
 
CABT SHS Statistics & Probability - Sampling Distribution of Means
CABT SHS Statistics & Probability - Sampling Distribution of MeansCABT SHS Statistics & Probability - Sampling Distribution of Means
CABT SHS Statistics & Probability - Sampling Distribution of MeansGilbert Joseph Abueg
 
The Chi-Square Statistic: Tests for Goodness of Fit and Independence
The Chi-Square Statistic: Tests for Goodness of Fit and IndependenceThe Chi-Square Statistic: Tests for Goodness of Fit and Independence
The Chi-Square Statistic: Tests for Goodness of Fit and Independencejasondroesch
 
Stratified Random Sampling - Problems
Stratified Random Sampling -  ProblemsStratified Random Sampling -  Problems
Stratified Random Sampling - ProblemsSundar B N
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statisticsmercy rani
 
Sampling, measurement, and stats(2013)
Sampling, measurement, and stats(2013)Sampling, measurement, and stats(2013)
Sampling, measurement, and stats(2013)BarryCRNA
 

What's hot (20)

Sampling distribution
Sampling distributionSampling distribution
Sampling distribution
 
Sampling Distribution and Simulation in R
Sampling Distribution and Simulation in RSampling Distribution and Simulation in R
Sampling Distribution and Simulation in R
 
Sampling distribution
Sampling distributionSampling distribution
Sampling distribution
 
Statistical inference 2
Statistical inference 2Statistical inference 2
Statistical inference 2
 
Types of Statistics Descriptive and Inferential Statistics
Types of Statistics Descriptive and Inferential StatisticsTypes of Statistics Descriptive and Inferential Statistics
Types of Statistics Descriptive and Inferential Statistics
 
Descriptive Statistics
Descriptive StatisticsDescriptive Statistics
Descriptive Statistics
 
Statistical treatment of data
Statistical treatment of dataStatistical treatment of data
Statistical treatment of data
 
Introduction to Statistics and Statistical Inference
Introduction to Statistics and Statistical InferenceIntroduction to Statistics and Statistical Inference
Introduction to Statistics and Statistical Inference
 
Statistical Treatment
Statistical TreatmentStatistical Treatment
Statistical Treatment
 
Sampling & Sampling Distribtutions
Sampling & Sampling DistribtutionsSampling & Sampling Distribtutions
Sampling & Sampling Distribtutions
 
Chap06 sampling and sampling distributions
Chap06 sampling and sampling distributionsChap06 sampling and sampling distributions
Chap06 sampling and sampling distributions
 
Measures of variability
Measures of variabilityMeasures of variability
Measures of variability
 
CABT SHS Statistics & Probability - Estimation of Parameters (intro)
CABT SHS Statistics & Probability -  Estimation of Parameters (intro)CABT SHS Statistics & Probability -  Estimation of Parameters (intro)
CABT SHS Statistics & Probability - Estimation of Parameters (intro)
 
Sampling distribution
Sampling distributionSampling distribution
Sampling distribution
 
CABT SHS Statistics & Probability - Sampling Distribution of Means
CABT SHS Statistics & Probability - Sampling Distribution of MeansCABT SHS Statistics & Probability - Sampling Distribution of Means
CABT SHS Statistics & Probability - Sampling Distribution of Means
 
The Chi-Square Statistic: Tests for Goodness of Fit and Independence
The Chi-Square Statistic: Tests for Goodness of Fit and IndependenceThe Chi-Square Statistic: Tests for Goodness of Fit and Independence
The Chi-Square Statistic: Tests for Goodness of Fit and Independence
 
Stratified Random Sampling - Problems
Stratified Random Sampling -  ProblemsStratified Random Sampling -  Problems
Stratified Random Sampling - Problems
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Probability Distribution
Probability DistributionProbability Distribution
Probability Distribution
 
Sampling, measurement, and stats(2013)
Sampling, measurement, and stats(2013)Sampling, measurement, and stats(2013)
Sampling, measurement, and stats(2013)
 

Similar to Probability introduction for non-math people

G4 PROBABLITY.pptx
G4 PROBABLITY.pptxG4 PROBABLITY.pptx
G4 PROBABLITY.pptxSmitKajbaje1
 
Sampling distribution by Dr. Ruchi Jain
Sampling distribution by Dr. Ruchi JainSampling distribution by Dr. Ruchi Jain
Sampling distribution by Dr. Ruchi JainRuchiJainRuchiJain
 
Discrete and continuous probability models
Discrete and continuous probability modelsDiscrete and continuous probability models
Discrete and continuous probability modelsAkshay Kumar Mishra
 
Ch5-quantitative-data analysis.pptx
Ch5-quantitative-data analysis.pptxCh5-quantitative-data analysis.pptx
Ch5-quantitative-data analysis.pptxzerihunnana
 
Statistical Methods in Research
Statistical Methods in ResearchStatistical Methods in Research
Statistical Methods in ResearchManoj Sharma
 
COM 201_Inferential Statistics_18032022.pptx
COM 201_Inferential Statistics_18032022.pptxCOM 201_Inferential Statistics_18032022.pptx
COM 201_Inferential Statistics_18032022.pptxAkinsolaAyomidotun
 
Presentation research- chapter 10-11 istiqlal
Presentation research- chapter 10-11 istiqlalPresentation research- chapter 10-11 istiqlal
Presentation research- chapter 10-11 istiqlalIstiqlalEid
 
Research method ch07 statistical methods 1
Research method ch07 statistical methods 1Research method ch07 statistical methods 1
Research method ch07 statistical methods 1naranbatn
 
Quantitative analysis
Quantitative analysisQuantitative analysis
Quantitative analysisRajesh Mishra
 
Probability distribution 10
Probability distribution 10Probability distribution 10
Probability distribution 10Sundar B N
 
Statr sessions 9 to 10
Statr sessions 9 to 10Statr sessions 9 to 10
Statr sessions 9 to 10Ruru Chowdhury
 
ADVANCED STATISTICS PREVIOUS YEAR QUESTIONS.docx
ADVANCED STATISTICS PREVIOUS YEAR QUESTIONS.docxADVANCED STATISTICS PREVIOUS YEAR QUESTIONS.docx
ADVANCED STATISTICS PREVIOUS YEAR QUESTIONS.docxTaskiaSarkar
 
Review of Chapters 1-5.ppt
Review of Chapters 1-5.pptReview of Chapters 1-5.ppt
Review of Chapters 1-5.pptNobelFFarrar
 
Estimation and hypothesis
Estimation and hypothesisEstimation and hypothesis
Estimation and hypothesisJunaid Ijaz
 

Similar to Probability introduction for non-math people (20)

PA_EPGDM_2_2023.pptx
PA_EPGDM_2_2023.pptxPA_EPGDM_2_2023.pptx
PA_EPGDM_2_2023.pptx
 
G4 PROBABLITY.pptx
G4 PROBABLITY.pptxG4 PROBABLITY.pptx
G4 PROBABLITY.pptx
 
Sampling distribution by Dr. Ruchi Jain
Sampling distribution by Dr. Ruchi JainSampling distribution by Dr. Ruchi Jain
Sampling distribution by Dr. Ruchi Jain
 
Discrete and continuous probability models
Discrete and continuous probability modelsDiscrete and continuous probability models
Discrete and continuous probability models
 
Ch5-quantitative-data analysis.pptx
Ch5-quantitative-data analysis.pptxCh5-quantitative-data analysis.pptx
Ch5-quantitative-data analysis.pptx
 
Statistical Methods in Research
Statistical Methods in ResearchStatistical Methods in Research
Statistical Methods in Research
 
Res701 research methodology lecture 7 8-devaprakasam
Res701 research methodology lecture 7 8-devaprakasamRes701 research methodology lecture 7 8-devaprakasam
Res701 research methodology lecture 7 8-devaprakasam
 
COM 201_Inferential Statistics_18032022.pptx
COM 201_Inferential Statistics_18032022.pptxCOM 201_Inferential Statistics_18032022.pptx
COM 201_Inferential Statistics_18032022.pptx
 
Basic statistics
Basic statisticsBasic statistics
Basic statistics
 
Presentation research- chapter 10-11 istiqlal
Presentation research- chapter 10-11 istiqlalPresentation research- chapter 10-11 istiqlal
Presentation research- chapter 10-11 istiqlal
 
day9.ppt
day9.pptday9.ppt
day9.ppt
 
Research method ch07 statistical methods 1
Research method ch07 statistical methods 1Research method ch07 statistical methods 1
Research method ch07 statistical methods 1
 
FandTtests.ppt
FandTtests.pptFandTtests.ppt
FandTtests.ppt
 
Presentation1
Presentation1Presentation1
Presentation1
 
Quantitative analysis
Quantitative analysisQuantitative analysis
Quantitative analysis
 
Probability distribution 10
Probability distribution 10Probability distribution 10
Probability distribution 10
 
Statr sessions 9 to 10
Statr sessions 9 to 10Statr sessions 9 to 10
Statr sessions 9 to 10
 
ADVANCED STATISTICS PREVIOUS YEAR QUESTIONS.docx
ADVANCED STATISTICS PREVIOUS YEAR QUESTIONS.docxADVANCED STATISTICS PREVIOUS YEAR QUESTIONS.docx
ADVANCED STATISTICS PREVIOUS YEAR QUESTIONS.docx
 
Review of Chapters 1-5.ppt
Review of Chapters 1-5.pptReview of Chapters 1-5.ppt
Review of Chapters 1-5.ppt
 
Estimation and hypothesis
Estimation and hypothesisEstimation and hypothesis
Estimation and hypothesis
 

Recently uploaded

Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...shambhavirathore45
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...shivangimorya083
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 

Recently uploaded (20)

Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 

Probability introduction for non-math people

  • 1. Probability Distribution for Non-math People People who don’t have advanced mathematical knowledge still needs to understanding the AI/BI, machine learning, data analysis results so they could use these results Author: Dr. Guang Yang email: guangyang@btconnect.com
  • 2. Head on Probability and Distribution • How much confidence you could gain from results from BI/ AI, analytical tools, and machine learning. How the confidence changes with environmental changes. • Probability distribution is fundamental to understand statistics, AI/BI, machine learning, exploration analysis results for probabilistic/soft classification or regression • Understand probability and distribution as an observer without going mathematical knowledge and details • The basic concepts: probability, conditional probability, join probability and their distribution
  • 3. Example for Explain Probability Distribution Glasgow office has 30 male and 10 female staffs. There are 3 male managers and 4 female managers among them. We could define 2 events according to the characters: event1: staff gender: male, female; event 2: staff role: manager; worker. Image a visitor knock the door, what are the probabilities ? probability: if a male staff to open the door: P(gender=male)=0.75) and P(gender ! =male)=0.25; conditional probability: if a female opened door, what chance she was a manager P(role=manger|gender=female) = 0.4 join probability: what a chance if the person who open door is a female manager: P(gender=female, role=manager)=0.4 x 0.25 = 0.1 probability distribution: for some reason, work from home, etc.. the office never full, what are probabilities of male staff in 18, 19, 20 … with 30 staffs in office, respectively.
  • 4. Generally Looking at Probabilities • Understand events and their outcomes: event gender has outcome {male, female}, event role has outcome {manager, worker}; beware event domain or scope of space • Experiment: count the number of repeating event trail outcome separately • Computing probabilities using the counted numbers against corresponding event domains. • Define the relationships in terms of conditional, join, and both • Conditional probability, known first event outcome, what a chance of second event outcome. • Join probability: want to know what chance if two event outcomes come together
  • 5. Probability Distribution Introduction • Event is a set of outcomes of an experiment to which probability is assigned • Probability distribution is a description of random phenomenon in terms of the probabilities of events • Have a probability, want to know how the probability changes with the changes of environment or parameters. • Choose appropriated probability distribution for calculation according to the characters of outcomes and events. • The most popular type of distribution is identical independent distribution — i.i.d • There are hundreds of probability distributions, but 15 are common distribution, and their relationships show next slide.
  • 6. Understand Probability Distribution Parameters • Identify random variables and distribution parameters. • Common distribution parameters include mean, variance, and size of domain. • Mean is a weighted average of possible values that random variable can take. It is also called Expected value of the random variable. • Variance measures the spread or variability of distribution. It indicates the likely range of variability among the mean. Its square root called standard deviation • Identify the source of the distribution parameters, i. e. from sample space or from population.
  • 7. Relationships of probability distributions Author: Sean Owen
  • 8. Choosing Probability Distributions—1 • Bernoulli and Uniform both are single trail, Bernoulli has two outcomes with one probability as p (not necessary 0.5), another 1-p. Uniform has n outcomes, and each outcome has probability 1/n • Binormal distribution: repeat trials of Bernoulli distributions and trails are independent to each other. • Hypergeometric distribution: similar to Binormal distribution except trails are NOT independent each other, such as pick up a colour ball from urns without replacement. • Poisson distribution: binary outcomes, probability p is small, and trails n is large. λ <—np such as the river had been flooded 3 times within 100 years ( λ=3)
  • 9. Choosing Probability Distribution — 2 • Geometric distribution: solves “how many failed until a success” comparing to Binormal distribution which solves “how many successes” • Negative binormal distribution is a simple generalisation to geometric distribution with r success instead 1 success; i.e. “how many failures until r successes” • Exponential distribution: a continuous distribution, typically to solve “how long until an event” in comparison with Poisson “how many events per time” • Weibull distribution: a generalisation of exponential distribution to describe time-to-failure. The rate λ of failure is vary, whereas exponential distribution rate is constant.
  • 10. Choosing Probability Distribution — 3 • Normal (Gaussian) distribution: a most important continuous distribution. It is also call Bell shape distribution. The distributions of the sums of other distributions follows (approximately) the normal distribution (central limit theorem) • Log-normal distribution: it takes values whose logarithm is normally distributed, or exponentiation of a normally distributed value. • t-distribution: reasoning about the means of normal distribution, and approaches the normal distribution as its parameters changes. • chi-squared distribution: a distribution of the sum of squares of normal -distributed values. chi-squared test is the sum of squires of differences, which supposed to be normal distribution.
  • 11. Choosing Probability Distribution — 4 • Gamma distribution: a two parameters family continue distribution, the generalisation of both the exponential and ch-squared distributions, model continuous variables that are always positive and have skewed distributions. it could be used to model waiting time until next n event occur. Conjugate prior to a couple distributions in machine learning. • Beta distribution: a family of continuous probability distribution defined on the interval [0, 1], used to model the behaviour of random variable limited to intervals of finite length in a wide variety of disciplines. It also for the conjugate prior
  • 12. Conclusion • Many people won’t do data analysis or machine learning, but they may use the analytic or machine learning results. They may need to communicate with data scientist and analyst. • In many cases, probability distributions are basis for data visualisation. To understand chart or diagram, the basic concepts should be understood • Many classifications, such as maximum likelihood estimation (MLE) are based on probability distribution principles.