STASTICS :: Lecture 07

STAM101 :: Lecture 07 :: Poisson Distributions - properties, Normal Distributions- properties

Theoretical distributions are

Poisson

Normal Distribution_ Empirical Rule

Normal Distribution Qualitative sense of normal distributions

Standard Normal Distribution and the Empirical Rule

Discrete Probability distribution
Bernoulli distribution
A random variable x takes two values 0 and 1, with probabilities q and p ie., p(x=1) = p and p(x=0)=q, q-1-p is called a Bernoulli variate and is said to be Bernoulli distribution where p and q are probability of success and failure. It was given by Swiss mathematician James Bernoulli (1654-1705)
Example

Tossing a coin(head or tail)
Germination of seed(germinate or not)

Binomial distribution
Binomial distribution was discovered by James Bernoulli (1654-1705). Let a random experiment be performed repeatedly and the occurrence of an event in a trial be called as success and its non-occurrence is failure. Consider a set of n independent trails (n being finite), in which the probability p of success in any trail is constant for each trial. Then q=1-p is the probability of failure in any trail.
The probability of x success and consequently n-x failures in n independent trails. But x successes in n trails can occur in ncx ways. Probability for each of these ways is pxqn-x.
P(sss…ff…fsf…f)=p(s)p(s)….p(f)p(f)….
= p,p…q,q…
= (p,p…p)(q,q…q)
(x times) (n-x times)
Hence the probability of x success in n trials is given by
ncx pxqn-x
Definition
A random variable x is said to follow binomial distribution if it assumes non-negative values and its probability mass function is given by
P(X=x) =p(x) =
ncx pxqn-x ,          x=0,1,2…n
q=1-p
0,                     otherwise
The two independent constants n and p in the distribution are known as the parameters of the distribution.
Condition for Binomial distribution
We get the binomial distribution under the following experimentation conditions

The number of trial n is finite
The trials are independent of each other.
The probability of success p is constant for each trial.
Each trial must result in a success or failure.
The events are discrete events.

Properties

If p and q are equal, the given binomial distribution will be symmetrical. If p and q are not equal, the distribution will be skewed distribution.
Mean = E(x) = np
Variance =V(x) = npq (mean>variance)

Application

Quality control measures and sampling process in industries to classify items as defectives or non-defective.
Medical applications such as success or failure, cure or no-cure.

Example 1
Eight coins are tossed simultaneously. Find the probability of getting atleast six heads.
Solution
Here number of trials, n = 8, p denotes the probability of getting a head.
\ and
If the random variable X denotes the number of heads, then the probability of a success in n trials is given by
P(X = x) = ncx px qn-x , x = 0 , 1, 2, ..., n

Probability of getting atleast six heads is given by
P(x ³ 6) = P(x = 6) + P(x = 7) + P(x = 8)

Example 2 Ten coins are tossed simultaneously. Find the probability of getting (i) atleast seven heads (ii) exactly seven heads (iii) atmost seven heads
Solution
p = Probability of getting a head = 2
q = Probability of not getting a head =
The probability of getting x heads throwing 10 coins simultaneously is given by
P(X = x) = nCx px qn-x. , x = 0, 1, 2, ..., n

i) Probability of getting atleast seven heads
P(x ³ 7) = P (x = 7) + P(x = 8) + P (x = 9) + P (x =10)

ii) Probability of getting exactly 7 heads

iii) Probability of getting almost 7 heads
P(x £ 7) = 1 – P(x > 7)
= 1 symbol {P(x = 8) + P (x = 9) + P(x = 10)}

Example 3:20 wrist watches in a box of 100 are defective. If 10 watches are selected at random, find the probability that (i) 10 are defective (ii) 10 are good (iii) at least one watch is defective (iv) at most 3 are defective.
Solution
20 out of 100 wrist watches are defective
Probability of defective wrist watch, p

Since 10 watches are selected at random, n =10
P(X = x) = nCx px qn-x, x = 0, 1, 2, ..., 10

i) Probability of selecting 10 defective watches
P( x =10) =
ii) Probability of selecting 10 good watches (i.e. no defective)
P(x = 0) =
=

iii) Probability of selecting at least one defective watch
P(x ³ 1) = 1 – P(x < 1)
= 1 – P(x = 0)
= 1 -
=1-
iv) Probability of selecting at most 3 defective watches
P (x 3) = P (x = 0) + P(x =1) + P(x = 2) + P(x = 3)
=

= 1. (0.107) + 10 (0.026) + 45 (0.0062) + 120 (0.0016)
= 0.859 (approx)

Poisson distribution
The Poisson distribution, named after Simeon Denis Poisson (1781-1840). Poisson distribution is a discrete distribution. It describes random events that occurs rarely over a unit of time or space.
It differs from the binomial distribution in the sense that we count the number of success and number of failures, while in Poisson distribution, the average number of success in given unit of time or space.

The Poisson Distribution

Definition
The probability that exactly x events will occur in a given time is as follows
P(x) = , x=0,1,2…
called as probability mass function of Poisson distribution.
where λ is the average number of occurrences per unit of time
λ = np
Condition for Poisson distribution
Poisson distribution is the limiting case of binomial distribution under the following assumptions.

The number of trials n should be indefinitely large ie., n->∞
The probability of success p for each trial is indefinitely small.
np= λ, should be finite where λ is constant.

Properties

Poisson distribution is defined by single parameter λ.
Mean = λ
Variance = λ. Mean and Variance are equal.

Application

It is used in quality control statistics to count the number of defects of an item.
In biology, to count the number of bacteria.
In determining the number of deaths in a district in a given period, by rare disease.
The number of error per page in typed material.
The number of plants infected with a particular disease in a plot of field.
Number of weeds in particular species in different plots of a field.

Example 4: Suppose on an average 1 house in 1000 in a certain district has a fire during a year. If there are 2000 houses in that district, what is the probability that exactly 5 houses will have a fire during the year? [given that e-2 = 0.13534]
Solution:
Mean, = np , n = 2000 and p =

                 l=2
The Poisson distribution is


                         = 0.036
Example 5
If 2% of electric bulbs manufactured by a certain company are defective. Find the probability that in a sample of 200 bulbs i) less than 2 bulbs ii) more than 3 bulbs are defective.[e-4 = 0.0183]
Solution
The probability of a defective bulb
Given that n = 200 since p is small and n is large
We use the Poisson distribution
mean, m = np = 200 ´ 0.02 = 4
Now, Poisson Probability function,
i) Probability of less than 2 bulbs are defective
= P(X<2)
= P(x = 0) + P(x = 1)
= e- 4 + e- 4 (4)
= e- 4 (1 + 4) = 0.0183 ´ 5
= 0.0915
ii) Probability of getting more than 3 defective bulbs
P(x > 3) = 1- P(x £ 3)
= 1- {P(x = 0) + P(x =1) + P(x=2) + P(x=3)}

= 1- {0.0183 ´ (1 + 4 + 8 + 10.67)}
= 0.567
Normal distribution
Continuous Probability distribution is normal distribution. It is also known as error law or Normal law or Laplacian law or Gaussian distribution. Many of the sampling distribution like student-t, f distribution and χ2 distribution.
Definition
A continuous random variable x is said to be a normal distribution with parameters µ and σ2, if the density function is given by the probability law
f(x)=; -¥ < x < ¥, -¥ < m < ¥, s >0
Note
The mean and standard deviation are called the parameters of Normal distribution. The normal distribution is expressed by X N(, 2)
Condition of Normal Distribution
i) Normal distribution is a limiting form of the binomial distribution under the following conditions.
a) n, the number of trials is indefinitely large ie., nand
b) Neither p nor q is very small.
ii) Normal distribution can also be obtained as a limiting form of Poisson distribution with parameter m
iii) Constants of normal distribution are mean = , variation =2, Standard deviation = .

Normal probability curve
The curve representing the normal distribution is called the normal probability curve. The curve is symmetrical about the mean (), bell-shaped and the two tails on the right and left sides of the mean extends to the infinity. The shape of the curve is shown in the following figure.

- x = 

Properties of normal distribution
1. The normal curve is bell shaped and is symmetric at x = .
2. Mean, median, and mode of the distribution are coincide
i.e., Mean = Median = Mode = 
3. It has only one mode at x = (i.e., unimodal)
4. The points of inflection are at x = 
5. The maximum ordinate occurs at x = and its value is =
6. Area Property P(- < < + ) = 0.6826
P(- 2< < + 2) = 0.9544
P(- 3< < + 3) = 0.9973

Standard Normal distribution
Let X be random variable which follows normal distribution with mean and variance 2 .The standard normal variate is defined as which follows standard normal distribution with mean 0 and standard deviation 1 i.e., Z N(0,1). The standard normal distribution is given by ; -< z< 
The advantage of the above function is that it doesn’t contain any parameter. This enables us to compute the area under the normal probability curve.

Note
Property of

Example 6: In a normal distribution whose mean is 12 and standard deviation is 2. Find the probability for the interval from x = 9.6 to x = 13.8
Solution
Given that Z~ N (12, 4)

= P(-1.2 ≤ Z ≤ 0)+P(0 ≤ Z ≤ 0.9)
= P(0≤ Z ≤ 1.2)+P(0 ≤ Z ≤ 0.9) [by using symmetric property]
=0.3849 +0.3159
=0.7008
When it is converted to percentage (ie) 70% of the observations are covered between 9.6 to 13.8.
Example 7: For a normal distribution whose mean is 2 and standard deviation 3. Find the value of the variate such that the probability of the variate from the mean to the value is 0.4115
Solution:
Given that Z~ N (2, 9)
To find X1:
We have P (2 ≤ Z ≤X1) =0.4115

P (0 ≤ Z ≤ Z1) =0.4115 where
[From the normal table where 0.4115 lies is rthe value of Z1]
Form the normal table we have Z1=1.35

Þ3(1.35)+2=X1
=X1=6.05
(i.e) 41 % of the observation converged between 2 and 6.05

Download this lecture as PDF here