left-icon

Statistics Using Excel Succinctly®
by Charles Zaiontz

Previous
Chapter

of
A
A
A

CHAPTER 4

Distributions

Distributions


Discrete Distributions

The (probability) frequency function f, also called the probability density function (abbreviated pdf), of a discrete random variable x is defined so that for any value t in the domain of the random variable:

ft=P(x=t)

where P(x=t) = the probability that x assumes the value t.

The corresponding (cumulative) distribution function Fx is defined by

Ft= u≤tf(u)

for any value t in the domain of the random variable x.

For any discrete random variable defined over the domain S with frequency function f and distribution function F

0 ≤ft≤1 for all t in S

t∈Sf(t)=1

Ft= P(x≤t)

Pt1<x≤t2=Ft2-F(t1)

A frequency function can be expressed as a table or a bar chart, as described in the following example.

Example: Find the distribution function for the frequency function given in columns A and B in Figure 22. Also show the graph of the frequency and distribution functions.

Table defining the frequency and distribution functions

  1. Table defining the frequency and distribution functions

Given the frequency function f(x) defined in the range B4:B11 of Figure 22, we can calculate the distribution function F(x), in the range C4:C11, by putting the formula =B4 in cell C4 and the formula =B5+C4 in cell C5 and then copying this formula into cells C6 to C11 (e.g., by highlighting the range C5:C11 and pressing Ctrl-D).

From the table in Figure 22, we can create the charts in Figure 23.

  1. Charts of frequency and distribution functions

Continuous Distributions

While for a discrete random variable x, the probability that x assumes a value between a and b (exclusive) is given by a<x<bf(x), the frequency function f of a continuous random variable can assume an infinite number of values (even in a finite interval) and so we can’t simply sum up the values in the ordinary way. For continuous variables, the equivalent formulation is that the probability that x assumes a value between a and b is given by Areaa<x<b f(x), i.e. the area under the graph of y=f(x) bounded by the x-axis and the lines x=a and x=b.

Area under the curve y=fx

  1. Area under the curve y=fx

For a continuous random variable x, f is a frequency function, also called the probability density function (pdf) provided:

f is the frequency function, more commonly called the probability density function for a particular random variable provided the area of the region indicated in Figure 24 represents the probability that x assumes a value between a and b inclusively. f only takes non-negative values and the area between the curve y=f(x) and the x-axis is 1.

The corresponding (cumulative) distribution function F is defined by

Fx= Areat<xft

For any continuous random variable with distribution function F

Fb= P(x<b)

F(b)-F(a)= P(a<x<b)

Note that the probability that x takes any particular value a is not f(a). In fact for any specific value a, the probability that x takes the value a is considered to be 0.

Essentially the area under a curve is a way of summing when dealing with an infinite range of values in a continuum. For those of you familiar with calculus, Areaa<x<bfx=abfxdx.

Excel Distribution Functions

The following table provides a summary of the various distribution functions provided by Excel. The entries labelled PDF/CDF specify the probability density function (where the last argument for the function is FALSE) as well as the (left tailed) cumulative distribution function (where the last argument of the function is TRUE.

Table 6: Excel Distribution Functions

Distribution

PDF/CDF

Inverse

Right Tailed

Test

Beta

BETA.DIST

BETA.INV

Binomial

BINOM.DIST

BINOM.INV

Chi Square

CHISQ.DIST

CHISQ.INV

CHISQ.DIST.RT

CHISQ.INV.RT

CHISQ.TEST

Exponential

EXPON.DIST

F

F.DIST

F.INV

F.DIST.RT

F.INV.RT

F.TEST

Gamma

GAMMA.DIST

GAMMA.INV

Hyper-geometric

HYPGEOM.DIST

Lognormal

LOGNORM.DIST

LOGNORM.INV

Negative Binomial

NEGBINOM.DIST

Normal

NORM.DIST

NORM.INV

Poisson

POISSON.DIST

Standard Normal

NORM.S.DIST

NORM.S.INV

Z.TEST

Student’s t

T.DIST

T.INV

T.DIST.RT

T.DIST.2T

T.INV.2T

T.TEST

Weibull

WEIBULL.DIST

The functions listed in Table 6 are not available for versions of Excel prior to Excel 2010. For earlier releases of Excel, a similar collection of functions is available, as summarized in Table 7 Here the column labelled PDF indicates whether the function in the CDF column (cumulative distribution function) also provides the probability density function (pdf) when the last argument in the function is FALSE (instead of TRUE).

Table 7: Distribution Functions for versions of Excel prior to Excel 2010

Distribution

CDF

PDF

Inverse

Test

Beta

BETADIST

BETAINV

Binomial

BINOMDIST

CRITBINOM

Chi Square

CHIDIST

CHIINV

CHITEST

Exponential

EXPONDIST

yes

F

FDIST

FINV

FTEST

Gamma

GAMMADIST

GAMMAINV

Hypergeometric

HYPGEOMDIST

Lognormal

LOGNORMDIST

LOGINV

Negative Binomial

NEGBINOMDIST

Normal

NORMDIST

yes

NORMINV

Poisson

POISSON

yes

Standard Normal

NORMSDIST

yes

NORMSINV

ZTEST

Student’s T

TDIST

TINV

TTEST

Weibull

WEIBULL

yes

Scroll To Top
Disclaimer
DISCLAIMER: Web reader is currently in beta. Please report any issues through our support system. PDF and Kindle format files are also available for download.

Previous

Next



You are one step away from downloading ebooks from the Succinctly® series premier collection!
A confirmation has been sent to your email address. Please check and confirm your email subscription to complete the download.