A Distribution of the First Order Statistic When the Sample Size is Random

East Tennessee State University
Digital Commons @ East Tennessee State University

Electronic Theses and Dissertations, Student Works

5-2017

Vincent Z. Forgo
East Tennessee State University

Follow this and additional works at: https://dc.etsu.edu/etd

Part of the Statistical Theory Commons

Recommended Citation
Forgo, Vincent Z. Mr, "A Distribution of the First Order Statistic When the Sample Size is Random" (2017). Electronic Theses and Dissertations. Paper 3181. https://dc.etsu.edu/etd/3181

This Thesis - unrestricted is brought to you for free and open access by the Student Works at Digital Commons @ East Tennessee State University. It has been accepted for inclusion in Electronic Theses and Dissertations by an authorized administrator of Digital Commons @ East Tennessee State University. For more information, please contact [email protected].

A Distribution of the First Order Statistic when the Sample Size is Random

A thesis

presented to

the faculty of the Department of Mathematics

East Tennessee State University

In partial fulfillment

of the requirements for the degree

Master of Science in Mathematical Sciences

Vincent Forgo

May 2017

Bob Price, Ph.D., Chair

Nicole Lewis, Ph.D.

JeanMarie Hendrickson, Ph.D.

Keywords: first order statistic, random sample size, cumulative distribution function, probability density function, expectation, variance, percentile

ABSTRACT

A Distribution of the First Order Statistic when the Sample Size is Random

Vincent Forgo

Statistical distributions, also known as probability distributions, are used to model random experiments. A continuous probability distribution is described by its probability density function (pdf) and cumulative distribution function (cdf). Probability distributions are widely used in engineering, actuarial science, computer science, biological science, physics, and other applied areas of study. Statistics are used to draw conclusions about a population through probability models. Sample statistics such as the minimum, first quartile, median, third quartile, and maximum, referred to as the five-number summary, are examples of order statistics. The minimum and maximum observations are important in extreme value theory. This paper focuses on the probability distribution of the minimum observation, also known as the first order statistic, when the sample size is random.

ACKNOWLEDGMENTS

I am grateful for the great support from the faculty of the Department of Mathematics and Statistics at East Tennessee State University. I appreciate the tremendous work of the committee members, Dr. Robert Price (committee and department chair), Dr. Nicole Lewis, and Dr. JeanMarie Hendrickson, in making this paper possible. I am especially grateful to Dr. Robert Price for how instrumental he has been throughout this paper.

TABLE OF CONTENTS

ABSTRACT
ACKNOWLEDGMENTS
1 INTRODUCTION
2 FIXED SAMPLE SIZE
3 TRUNCATED POISSON MIXTURE
    3.1 Uniform - Truncated Poisson Mixture Distribution
    3.2 Exponential - Truncated Poisson Mixture Distribution
4 TRUNCATED BINOMIAL MIXTURE
    4.1 Uniform-Truncated Binomial Mixture Distribution
    4.2 Exponential-Truncated Binomial Mixture Distribution
5 TRUNCATED GEOMETRIC MIXTURE
    5.1 Uniform-Truncated Geometric Mixture
    5.2 Exponential-Truncated Geometric Mixture
6 CONCLUSION
REFERENCES
VITA

1 INTRODUCTION

The concept of order statistics is familiar in finance and insurance (risk assessment). The order statistics of a random sample $X_1, X_2, \ldots, X_n$ are the sample values arranged in increasing order, $X_{(1)} \le X_{(2)} \le \cdots \le X_{(n)}$. One example arises in actuarial science with joint life insurance, where the policy pays out when the first of the two spouses dies. In this problem we want to know the distribution of the time of that payment, which is the minimum of the two life spans. Another application of order statistics is a machine that runs on 15 batteries and shuts off when the seventh battery dies. We may want to know the distribution of $X_{(7)}$, that is, the distribution of the lifetime of the seventh battery to fail.

In order statistics the sample variables are taken to be independent and identically distributed (iid). The cumulative distribution function (cdf) of the $n$th order statistic (the maximum) is given as

$$F_{X_{(n)}}(x) = P\{\text{all } X_i \le x\} = [P(X \le x)]^n = [F_X(x)]^n .$$

This implies that the cdf of this random variable is $[F_X(x)]^n$. The cdf of the first order statistic, or the minimum, is

$$F_{X_{(1)}}(x) = 1 - [P(X_i > x)]^n = 1 - [1 - F_X(x)]^n .$$

The general formula for the cdf of the $k$th order statistic is given by

$$F_{X_{(k)}}(x) = \sum_{j=k}^{n} \binom{n}{j} [F(x)]^{j} [1 - F(x)]^{n-j} .$$

Order statistics are among the most essential functions of a set of random variables studied in probability and statistics. There is natural interest in studying the highs and lows of a sequence, and order statistics help in understanding the concentration of probability in a distribution [4]. It is important to note that although the variables in the sample are independent and identically distributed, the order statistics themselves are neither independent nor identically distributed, because of the ordering imposed on them. Since the order statistics are arranged from smallest to largest, there is a minimum and a maximum order statistic. The $n$th (maximum) order statistic has a pdf of

$$f_{X_{(n)}}(x) = n[F(x)]^{n-1} f(x)$$

and the first order statistic (minimum) has a pdf of

$$f_{X_{(1)}}(x) = n[1 - F(x)]^{n-1} f(x).$$
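The binomial-sum formula for the cdf of the $k$th order statistic can be checked numerically. The sketch below is only an illustration: the function name `order_stat_cdf` and the values F = 0.3 and n = 15 are assumptions chosen for the example, not quantities from the thesis.

```python
from math import comb

def order_stat_cdf(F, n, k):
    """cdf of the k-th order statistic, evaluated where the parent cdf equals F."""
    return sum(comb(n, j) * F**j * (1.0 - F)**(n - j) for j in range(k, n + 1))

F, n = 0.3, 15  # illustrative values: parent cdf F(x) = 0.3, sample of 15

# k = 1 reduces to 1 - (1 - F)^n and k = n reduces to F^n
print(order_stat_cdf(F, n, 1), 1.0 - (1.0 - F)**n)
print(order_stat_cdf(F, n, n), F**n)

# probability that the seventh smallest lifetime is at most x
print(order_stat_cdf(F, n, 7))
```

Setting k = 1 or k = n recovers the closed forms for the minimum and the maximum, which is a convenient sanity check on the general sum.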

2 FIXED SAMPLE SIZE

Now let $X_1, \ldots, X_n$ be iid continuous random variables, i.e., all the random variables have the same probability distribution and are mutually independent, with $X_i \sim \text{Unif}(0, 1)$, and let $Z = \min(X_1, \ldots, X_n)$. If $n$ is fixed then

$$P(Z \le z) = 1 - P(Z > z) = 1 - [1 - F_X(z)]^n = 1 - [1 - z]^n, \quad 0 \le z \le 1,$$

where $F_X(z)$ is the cdf of the uniform distribution. We start from $1 - P(Z > z)$ because the minimum exceeds $z$ exactly when every observation exceeds $z$, so by independence $P(Z > z) = [P(X_i > z)]^n$. The pdf is obtained by taking the derivative of the cdf, which gives $f(z) = n[1 - z]^{n-1}$. The cdf and pdf are shown in Figure 1; both curves become steeper as $n$ gets large.

[Figure 1, panel (a): Cumulative distribution function plots for different sample sizes. Panel (b): Probability density function plots for different sample sizes. Horizontal axis from 0 to 1; curves for n = 5 (yellow), n = 10 (red), n = 15 (green), n = 20 (brown), n = 25 (pink).]

Figure 1: The cdf and pdf of the smallest order statistic when the underlying distribution is uniform.
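As a rough check of the fixed-sample-size result, the closed-form cdf $1 - (1 - z)^n$ can be compared with a simulated proportion. This is a minimal sketch: the function names, the seed, the number of replications, and the evaluation point z = 0.1 are all illustrative assumptions.

```python
import random

def min_uniform_cdf(z, n):
    """Closed-form cdf of the minimum of n iid Unif(0, 1) variables."""
    return 1.0 - (1.0 - z)**n

def min_uniform_pdf(z, n):
    """Closed-form pdf of the minimum of n iid Unif(0, 1) variables."""
    return n * (1.0 - z)**(n - 1)

def empirical_cdf_of_min(z, n, reps=20000, seed=1):
    """Share of simulated minima that fall at or below z."""
    rng = random.Random(seed)
    hits = sum(min(rng.random() for _ in range(n)) <= z for _ in range(reps))
    return hits / reps

for n in (5, 10, 25):
    print(n, min_uniform_cdf(0.1, n), empirical_cdf_of_min(0.1, n))
```

The simulated proportions should agree with the formula up to Monte Carlo error, and the cdf at a fixed z grows with n, which matches the steepening described for Figure 1.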

3 TRUNCATED POISSON MIXTURE

The Poisson distribution is a discrete probability distribution used for counts of events that occur randomly in a given interval of time (or space) [3]. Its probability mass function (pmf) is

$$P(N = n) = \frac{\lambda^n e^{-\lambda}}{n!}, \quad n = 0, 1, 2, 3, \ldots; \ \lambda > 0,$$

with expectation $E(N) = \lambda$ and variance $V(N) = \lambda$. The probability generating function of the Poisson distribution is $G(t) = e^{\lambda(t-1)}$ and the moment generating function (mgf) is $M(t) = e^{\lambda(e^{t}-1)}$, where $t$ is the argument of the generating function.

The Poisson distribution is sometimes truncated so that the random variable takes only values greater than zero. The truncated Poisson is a discrete probability distribution used to describe events that occur per unit time when a count of zero cannot occur. In this case the support starts at 1 rather than 0. This distribution is called the truncated Poisson distribution or the zero-truncated Poisson distribution. The pmf of the zero-truncated Poisson is

$$P(N = n) = \frac{\lambda^n e^{-\lambda}}{(1 - e^{-\lambda})\, n!}, \quad n = 1, 2, \ldots,$$

with an expectation of $E(N) = \dfrac{\lambda}{1 - e^{-\lambda}}$ and a variance of $V(N) = \dfrac{\lambda\,[1 - (\lambda + 1)e^{-\lambda}]}{(1 - e^{-\lambda})^{2}}$.

If the random variables $X_i$ follow a continuous probability distribution and $Z \mid N = \min(X_1, \ldots, X_N)$, then we can find a distribution for the first order statistic $X_{(1)}$ when the sample size is fixed or random. The rest of this section gives a general formula for the cdf and pdf of the minimum when the $X_i$ follow any continuous probability distribution (uniform, exponential, etc.) and the sample size $N$ is random.

If $N$ is a random sample size that follows a truncated Poisson distribution, then for any continuous distribution of $X_i$ we can find the cdf of the minimum by using the generalised formula for $P(Z > z)$, i.e.,

$$P(Z > z) = \sum_{n=1}^{\infty} [1 - F_X(z)]^{n}\, \frac{\lambda^n e^{-\lambda}}{(1 - e^{-\lambda})\, n!} = \frac{e^{-\lambda}}{1 - e^{-\lambda}} \left[ e^{(1 - F_X(z))\lambda} - 1 \right].$$

The general cdf is

$$F(z) = 1 - \frac{e^{-\lambda}}{1 - e^{-\lambda}} \left[ e^{(1 - F_X(z))\lambda} - 1 \right]$$

with pdf

$$f(z) = \frac{\lambda e^{-\lambda}}{1 - e^{-\lambda}}\, f_X(z)\, e^{(1 - F_X(z))\lambda}, \quad \lambda > 0,$$

where $F_X(z)$ and $f_X(z)$ are the cdf and pdf of the continuous random variable $X_i$ respectively. If $X_i$ follows a continuous distribution whose cdf has no closed form, such as the normal distribution, then we can use R functions like pnorm and dnorm to evaluate the cdf and pdf respectively, i.e., $F_X(z)$ = pnorm(z) and $f_X(z)$ = dnorm(z).
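The general truncated Poisson mixture formulas can be evaluated for any parent distribution once its cdf and pdf are available. The sketch below uses Python's standard-library `statistics.NormalDist` in place of the R functions pnorm and dnorm; the function names `tp_mixture_cdf` and `tp_mixture_pdf` and the choice of a standard normal parent with λ = 5 are assumptions for illustration.

```python
from math import exp
from statistics import NormalDist

def tp_mixture_cdf(z, lam, parent_cdf):
    """cdf of the minimum when N is zero-truncated Poisson(lam)."""
    c = exp(-lam) / (1.0 - exp(-lam))
    return 1.0 - c * (exp((1.0 - parent_cdf(z)) * lam) - 1.0)

def tp_mixture_pdf(z, lam, parent_cdf, parent_pdf):
    """pdf of the minimum when N is zero-truncated Poisson(lam)."""
    c = lam * exp(-lam) / (1.0 - exp(-lam))
    return c * parent_pdf(z) * exp((1.0 - parent_cdf(z)) * lam)

std = NormalDist()  # standard normal parent (assumed for this example)
lam = 5.0

for z in (-2.0, -1.0, 0.0, 1.0):
    print(z, tp_mixture_cdf(z, lam, std.cdf), tp_mixture_pdf(z, lam, std.cdf, std.pdf))
```

Passing the parent cdf and pdf as functions keeps the same two formulas usable for uniform, exponential, or normal parents without rewriting them.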

3.1 Uniform - Truncated Poisson Mixture Distribution

If the random variables $X_i$ follow a continuous probability distribution and $Z = \min(X_1, \ldots, X_N)$, then we can find a distribution for the first order statistic $X_{(1)}$. In this section we focus on the distribution of the first order statistic with an underlying uniform distribution and a random sample size that follows a truncated Poisson distribution. If $N$ is random then

$$P(Z > z) = \sum_{n=1}^{\infty} P(Z > z \mid N = n)\, P(N = n), \quad 0 < z < 1,$$

where $P(Z > z \mid N = n)$ is the conditional distribution and $P(N = n)$ is the marginal distribution of the sample size. The idea of $N$ being random is explored throughout this paper. Our new distribution is based on the first order statistic with $N$ following a truncated Poisson. Let us consider $X_i \sim \text{Unif}(0, 1)$ and $Z \mid N = \min\{X_1, X_2, \ldots, X_N\}$, where $N$ is the sample size and is random with distribution

$$P(N = n) = \frac{\lambda^n e^{-\lambda}}{(1 - e^{-\lambda})\, n!}, \quad n = 1, 2, 3, \ldots .$$

Then

$$\begin{aligned}
P(Z > z) &= \sum_{n=1}^{\infty} (1 - z)^n\, \frac{\lambda^n e^{-\lambda}}{(1 - e^{-\lambda})\, n!} \\
&= \sum_{n=0}^{\infty} (1 - z)^n\, \frac{\lambda^n e^{-\lambda}}{(1 - e^{-\lambda})\, n!} - (1 - z)^0\, \frac{\lambda^0 e^{-\lambda}}{(1 - e^{-\lambda})\, 0!} \\
&= \frac{e^{-\lambda}}{1 - e^{-\lambda}} \sum_{n=0}^{\infty} \frac{((1 - z)\lambda)^n}{n!} - \frac{e^{-\lambda}}{1 - e^{-\lambda}} .
\end{aligned}$$

Using the series $\sum_{n=0}^{\infty} \frac{x^n}{n!} = e^{x}$ we can simplify the above equation as

$$P(Z > z) = \frac{e^{-\lambda}}{1 - e^{-\lambda}} \left[ e^{(1 - z)\lambda} - 1 \right].$$

Hence the cumulative distribution function (cdf) is given as

$$F(z) = 1 - \frac{e^{-\lambda}}{1 - e^{-\lambda}} \left[ e^{(1 - z)\lambda} - 1 \right],$$

where $0 < z < 1$, $\lambda > 0$.
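The mixture construction can be mimicked by simulation: draw a sample size from the zero-truncated Poisson, draw that many uniforms, and keep the minimum. The sketch below is a minimal illustration under stated assumptions: Poisson counts are generated with Knuth's multiplication method and zeros are rejected (one of several possible samplers, adequate for moderate λ), and the seed, λ = 5, and the replication count are arbitrary.

```python
import random
from math import exp

def sample_poisson(lam, rng):
    """Knuth's multiplication method for a Poisson(lam) draw."""
    limit, k, prod = exp(-lam), 0, rng.random()
    while prod > limit:
        k += 1
        prod *= rng.random()
    return k

def sample_truncated_poisson(lam, rng):
    """Zero-truncated Poisson draw, obtained by rejecting zeros."""
    while True:
        n = sample_poisson(lam, rng)
        if n >= 1:
            return n

def ztp_uniform_cdf(z, lam):
    """Closed-form cdf of the minimum for the uniform-truncated Poisson mixture."""
    return 1.0 - exp(-lam) / (1.0 - exp(-lam)) * (exp((1.0 - z) * lam) - 1.0)

def simulate_minima(lam, reps=20000, seed=7):
    """Simulated minima of N uniforms, with N drawn afresh each time."""
    rng = random.Random(seed)
    return [min(rng.random() for _ in range(sample_truncated_poisson(lam, rng)))
            for _ in range(reps)]

draws = simulate_minima(5.0)
for z in (0.1, 0.2, 0.5):
    empirical = sum(d <= z for d in draws) / len(draws)
    print(z, empirical, ztp_uniform_cdf(z, 5.0))
```

Agreement between the two columns, up to Monte Carlo error, supports the closed-form cdf obtained from the exponential series.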

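The probability density function (pdf) can be derived by taking the derivative of the cdf with respect to $z$. The pdf of the first order statistic when the underlying distribution is uniform and the random sample size follows a truncated Poisson is

$$f(z) = \frac{\lambda e^{-\lambda}}{1 - e^{-\lambda}}\, e^{(1 - z)\lambda}, \quad 0 < z < 1, \ \lambda > 0.$$

It can be shown that $f(z)$ satisfies the condition of a pdf, i.e., $\int_0^1 f(z)\,dz = 1$:

$$\int_0^1 f(z)\,dz = \int_0^1 \frac{\lambda e^{-\lambda}}{1 - e^{-\lambda}}\, e^{(1 - z)\lambda}\,dz = \frac{\lambda e^{-\lambda}}{1 - e^{-\lambda}} \left[ -\frac{e^{(1 - z)\lambda}}{\lambda} \right]_0^1 = \frac{e^{-\lambda}}{1 - e^{-\lambda}} \left[ e^{\lambda} - 1 \right] = 1.$$

From the distribution generated, the expectation is given as

$$E(Z) = \int_0^1 \frac{\lambda e^{-\lambda}}{1 - e^{-\lambda}}\, z\, e^{(1 - z)\lambda}\,dz = \frac{e^{\lambda} - \lambda - 1}{\lambda (e^{\lambda} - 1)} .$$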

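The moment generating function (mgf) can be used to derive both the expectation and the variance. The mgf of the distribution is given as

$$M(t) = E(e^{tZ}) = \int_0^1 e^{tz} f(z)\,dz = \frac{\lambda e^{-\lambda}}{1 - e^{-\lambda}} \int_0^1 e^{tz}\, e^{(1 - z)\lambda}\,dz = \frac{\lambda e^{-\lambda}}{1 - e^{-\lambda}} \left[ \frac{e^{\lambda} - e^{t}}{\lambda - t} \right], \quad t \ne \lambda .$$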
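The normalisation, the expectation, and the mgf of the uniform-truncated Poisson mixture can each be checked by numerical integration. This is a sketch under stated assumptions: a composite Simpson rule written for the example, λ = 5, and t = 0.7 as an arbitrary mgf argument; the function names are hypothetical.

```python
from math import exp

def pdf(z, lam):
    """pdf of the minimum: uniform parent, zero-truncated Poisson sample size."""
    return lam * exp(-lam) / (1.0 - exp(-lam)) * exp((1.0 - z) * lam)

def integrate(g, a=0.0, b=1.0, steps=2000):
    """Composite Simpson rule on [a, b]; steps must be even."""
    h = (b - a) / steps
    total = g(a) + g(b)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3.0

def mean_closed(lam):
    return (exp(lam) - lam - 1.0) / (lam * (exp(lam) - 1.0))

def mgf_closed(t, lam):
    return lam * exp(-lam) / (1.0 - exp(-lam)) * (exp(lam) - exp(t)) / (lam - t)

lam, t = 5.0, 0.7
print(integrate(lambda z: pdf(z, lam)))                               # total probability
print(integrate(lambda z: z * pdf(z, lam)), mean_closed(lam))         # E(Z)
print(integrate(lambda z: exp(t * z) * pdf(z, lam)), mgf_closed(t, lam))  # M(t)

# the slope of the mgf at t = 0 should reproduce the expectation
h = 1e-5
print((mgf_closed(h, lam) - mgf_closed(-h, lam)) / (2 * h))
```

The last line illustrates how the expectation follows from the mgf: its first derivative at zero equals $E(Z)$.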



[Figure 2, panel (a): Cumulative distribution function plot. Panel (b): Probability density function plot. Horizontal axis from 0 to 1.]

Figure 2: The cdf and pdf of the smallest order statistic when the underlying distribution is uniform and the random sample size follows a truncated Poisson with rate (λ) = 5.

From Figure 3, it is clear that the cdfs in the random and fixed cases tend to coincide as λ and n increase. As λ and n increase, the cdf rises steeply and turns sharply close to 1. This implies that the larger n and λ get, the steeper the curve becomes, and the fixed sample size (n) and the random sample size (N) tend to have the same cdf.

[Figure 3: cdf curves on a horizontal axis from 0 to 1 for λ = 10 (brown), n = 10 (red), λ = 50 (blue), n = 50 (green).]

Figure 3: Cumulative distribution function plots for different sample sizes (n) and different rates (λ) when the underlying distribution is uniform, for a fixed sample size and a random sample size that follows a truncated Poisson.

The percentile function is relevant in statistics because it gives the value below which a given percentage of the distribution falls. The percentile function can also be used to calculate the lower quartile ($Q_1$), median ($Q_2$) and upper quartile ($Q_3$). The percentile function of the first order statistic when the underlying distribution is uniform and the random sample size follows a truncated Poisson is obtained from

$$P = F(\mu) = \int_{-\infty}^{\mu} f(y)\,dy,$$

where $f(y)$ is the probability density function (pdf). For the uniform-truncated Poisson distribution, the pdf is $\frac{\lambda e^{-\lambda}}{1 - e^{-\lambda}}\, e^{(1 - z)\lambda}$, $0 < z < 1$, $\lambda > 0$. The percentile function is generated by

$$P = \int_0^{\mu} \frac{\lambda e^{-\lambda}}{1 - e^{-\lambda}}\, e^{(1 - z)\lambda}\,dz = \frac{e^{-\lambda}}{1 - e^{-\lambda}} \left( e^{\lambda} - e^{\lambda(1 - \mu)} \right), \quad \lambda > 0, \ 0 < \mu < 1.$$

From the above percentile equation, the 50th percentile (median) is calculated in terms of λ as

$$0.5 = \frac{e^{\lambda} - e^{\lambda(1 - \mu)}}{e^{\lambda} - 1} \ \Rightarrow\ \mu = \frac{\lambda - \log\!\left[ (1 - e^{\lambda}) \left( 0.5 + \dfrac{e^{\lambda}}{1 - e^{\lambda}} \right) \right]}{\lambda} = 1 - \frac{1}{\lambda} \log\!\left( \frac{1 + e^{\lambda}}{2} \right).$$
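Solving the percentile equation for µ at a general probability P gives a quantile function for the uniform-truncated Poisson mixture. The sketch below is an illustration: the names `ztp_uniform_cdf` and `ztp_uniform_quantile` and the value λ = 5 are assumptions, and the inversion simply rearranges $P = \frac{e^{\lambda} - e^{\lambda(1-\mu)}}{e^{\lambda} - 1}$.

```python
from math import exp, log

def ztp_uniform_cdf(z, lam):
    """cdf of the minimum: uniform parent, zero-truncated Poisson sample size."""
    return 1.0 - exp(-lam) / (1.0 - exp(-lam)) * (exp((1.0 - z) * lam) - 1.0)

def ztp_uniform_quantile(P, lam):
    """Value mu with F(mu) = P, from e^{lam(1 - mu)} = e^lam - P (e^lam - 1)."""
    return 1.0 - log(exp(lam) - P * (exp(lam) - 1.0)) / lam

lam = 5.0
for P in (0.25, 0.5, 0.75):
    mu = ztp_uniform_quantile(P, lam)
    # the cdf evaluated at the quantile should return P
    print(P, mu, ztp_uniform_cdf(mu, lam))
```

With P = 0.5 this reproduces the median expression, and the quartiles $Q_1$ and $Q_3$ follow from P = 0.25 and P = 0.75.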



3.2 Exponential - Truncated Poisson Mixture Distribution

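In this section, we focus on the distribution of the first order statistic with an underlying exponential distribution with rate µ and a random sample size that follows a truncated Poisson. If $Z \mid N = \min\{X_1, \ldots, X_N\}$, where $N$ is the sample size, which is random with distribution

$$P(N = n) = \frac{\lambda^n e^{-\lambda}}{(1 - e^{-\lambda})\, n!}, \quad n = 1, 2, \ldots,$$

we have

$$\begin{aligned}
P(Z > z) &= \sum_{n=1}^{\infty} (e^{-z\mu})^n\, \frac{\lambda^n e^{-\lambda}}{(1 - e^{-\lambda})\, n!} \\
&= \sum_{n=1}^{\infty} \frac{\lambda^n e^{-\lambda - zn\mu}}{(1 - e^{-\lambda})\, n!} \\
&= \sum_{n=0}^{\infty} \frac{\lambda^n e^{-\lambda - zn\mu}}{(1 - e^{-\lambda})\, n!} - \frac{e^{-\lambda}}{1 - e^{-\lambda}} \\
&= \frac{e^{-\lambda}}{1 - e^{-\lambda}} \left[ \sum_{n=0}^{\infty} \frac{\lambda^n e^{-zn\mu}}{n!} - 1 \right] \\
&= \frac{e^{-\lambda}}{1 - e^{-\lambda}} \left[ e^{\lambda e^{-z\mu}} - 1 \right].
\end{aligned}$$

The cdf of the first order statistic with an underlying exponential distribution and a random sample size that follows a truncated Poisson is

$$F(z) = 1 - \frac{e^{-\lambda}}{1 - e^{-\lambda}} \left[ e^{\lambda e^{-z\mu}} - 1 \right], \quad \lambda > 0, \ z > 0, \ \mu > 0,$$

and the pdf is

$$f(z) = \frac{d}{dz} \left\{ 1 - \frac{e^{-\lambda}}{1 - e^{-\lambda}} \left[ e^{\lambda e^{-z\mu}} - 1 \right] \right\} = \frac{\lambda\mu}{e^{\lambda} - 1}\, e^{-z\mu + \lambda e^{-z\mu}}, \quad \lambda > 0, \ z > 0, \ \mu > 0.$$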
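The exponential-truncated Poisson cdf and pdf can be cross-checked against each other by differentiating the cdf numerically. The sketch below is only illustrative: the function names are hypothetical, and λ = 0.5 with µ = 1 are chosen to match the parameter values quoted for Figure 4.

```python
from math import exp

def etp_cdf(z, lam, mu):
    """cdf of the minimum: exponential(mu) parent, zero-truncated Poisson(lam) size."""
    return 1.0 - exp(-lam) / (1.0 - exp(-lam)) * (exp(lam * exp(-z * mu)) - 1.0)

def etp_pdf(z, lam, mu):
    """pdf obtained by differentiating etp_cdf with respect to z."""
    return lam * mu / (exp(lam) - 1.0) * exp(-z * mu + lam * exp(-z * mu))

lam, mu, h = 0.5, 1.0, 1e-5
for z in (0.2, 0.8, 2.0):
    # central difference of the cdf, to compare with the closed-form pdf
    slope = (etp_cdf(z + h, lam, mu) - etp_cdf(z - h, lam, mu)) / (2 * h)
    print(z, etp_cdf(z, lam, mu), etp_pdf(z, lam, mu), slope)
```

The cdf starts at 0 when z = 0 and approaches 1 as z grows, as a cdf on the positive half-line should.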

[Figure 4, panel (a): Cumulative distribution function plot. Panel (b): Probability density function plot. Horizontal axis labelled "Index", from 0 to 10000.]

Figure 4: The cdf and pdf with an underlying exponential distribution and a random sample size that follows a truncated Poisson when rate (λ) = 0.5 and µ = 1.

4 TRUNCATED BINOMIAL MIXTURE

The truncated binomial distribution is a discrete probability distribution with probability mass function

$$P(N = n) = \frac{\binom{k}{n} p^{n} (1 - p)^{k-n}}{1 - (1 - p)^{k}}$$

for $n = 1, 2, \ldots, k$, where $k$ is the number of trials and $p$ is the probability of success, with expectation

$$E(N) = \frac{kp}{1 - (1 - p)^{k}}$$

and variance

$$V(N) = \frac{kp\left[ 1 - p - (1 - p + kp)(1 - p)^{k} \right]}{\left( 1 - (1 - p)^{k} \right)^{2}} .$$

The binomial distribution is often used to model the number of successes ($n$) among $k$ trials.

In this section of the paper, we focus on finding a general cdf and pdf when the random sample size follows a truncated binomial distribution and the random variable $X_i$ follows a continuous distribution such as the exponential or uniform. The general formula for $P(Z > z)$ is

$$P(Z > z) = \sum_{n=1}^{k} [1 - F_X(z)]^{n}\, \frac{\binom{k}{n} p^{n} (1 - p)^{k-n}}{1 - (1 - p)^{k}} = \frac{1}{1 - (1 - p)^{k}} \left[ (1 - F_X(z)\,p)^{k} - (1 - p)^{k} \right].$$

The general cdf is

$$F(z) = 1 - \frac{1}{1 - (1 - p)^{k}} \left[ (1 - F_X(z)\,p)^{k} - (1 - p)^{k} \right]$$

and the pdf is

$$f(z) = \frac{kp\,(1 - F_X(z)\,p)^{k-1} f_X(z)}{1 - (1 - p)^{k}}, \quad 0 < p \le 1,$$

where $F_X(z)$ and $f_X(z)$ are the cdf and pdf of the continuous random variable $X_i$ respectively.
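The closed form for $P(Z > z)$ under a truncated binomial sample size comes from the binomial theorem, and it can be compared with the direct finite sum. In the sketch below the function names and the values k = 5, p = 0.8 are illustrative assumptions; `Fx` stands for the parent cdf value $F_X(z)$ at some point z.

```python
from math import comb

def tb_pmf(n, k, p):
    """Zero-truncated binomial pmf on n = 1, ..., k."""
    return comb(k, n) * p**n * (1.0 - p)**(k - n) / (1.0 - (1.0 - p)**k)

def survival_by_sum(Fx, k, p):
    """P(Z > z) as the mixture sum over the sample size."""
    return sum((1.0 - Fx)**n * tb_pmf(n, k, p) for n in range(1, k + 1))

def survival_closed(Fx, k, p):
    """P(Z > z) after applying the binomial theorem."""
    return ((1.0 - Fx * p)**k - (1.0 - p)**k) / (1.0 - (1.0 - p)**k)

k, p = 5, 0.8
for Fx in (0.0, 0.3, 0.9, 1.0):
    print(Fx, survival_by_sum(Fx, k, p), survival_closed(Fx, k, p))

# the pmf should sum to one and have mean k p / (1 - (1 - p)^k)
print(sum(tb_pmf(n, k, p) for n in range(1, k + 1)))
print(sum(n * tb_pmf(n, k, p) for n in range(1, k + 1)), k * p / (1.0 - (1.0 - p)**k))
```

Because the sum is finite, the two expressions should agree to rounding error for every value of the parent cdf between 0 and 1.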

4.1 Uniform-Truncated Binomial Mixture Distribution

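In this section, we focus on finding the distribution of the first order statistic with an underlying uniform distribution and a random sample size that follows a truncated binomial distribution. If $Z \mid N = \min\{X_1, \ldots, X_N\}$, where $N$ is the sample size, which is random with distribution

$$P(N = n) = \frac{\binom{k}{n} p^{n} (1 - p)^{k-n}}{1 - (1 - p)^{k}}$$

for $n = 1, 2, \ldots, k$, this implies that

$$\begin{aligned}
P(Z > z) &= \sum_{n=1}^{k} (1 - z)^{n}\, \frac{\binom{k}{n} p^{n} (1 - p)^{k-n}}{1 - (1 - p)^{k}} \\
&= \frac{1}{1 - (1 - p)^{k}} \sum_{n=1}^{k} (1 - z)^{n} \binom{k}{n} p^{n} (1 - p)^{k-n} \\
&= \frac{1}{1 - (1 - p)^{k}} \left[ \sum_{n=0}^{k} (1 - z)^{n} \binom{k}{n} p^{n} (1 - p)^{k-n} - (1 - p)^{k} \right] \\
&= \frac{1}{1 - (1 - p)^{k}} \left[ \sum_{n=0}^{k} \binom{k}{n} ((1 - z)p)^{n} (1 - p)^{k-n} - (1 - p)^{k} \right] \\
&= \frac{1}{1 - (1 - p)^{k}} \left[ ((1 - z)p + (1 - p))^{k} - (1 - p)^{k} \right] \\
&= \frac{1}{1 - (1 - p)^{k}} \left[ (1 - zp)^{k} - (1 - p)^{k} \right].
\end{aligned}$$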

Thus the cdf of the first order statistic with an underlying uniform distribution and a random sample size that follows a truncated binomial distribution is

$$F(z) = 1 - \frac{1}{1 - (1 - p)^{k}} \left[ (1 - zp)^{k} - (1 - p)^{k} \right].$$

[Figure 5, panel (a): Cumulative distribution function plot with k = 5 and p = 0.8. Panel (b): Probability density function plot with k = 5 and p = 0.8. Horizontal axis from 0 to 1.]

Figure 5: The cdf and pdf with an underlying uniform distribution and a random sample size that follows a truncated binomial.

The probability density function is

$$f(z) = \frac{kp\,(1 - zp)^{k-1}}{1 - (1 - p)^{k}}, \quad 0 < z < 1,$$

and the expectation is

$$E(Z) = \int_0^1 z\, \frac{kp\,(1 - zp)^{k-1}}{1 - (1 - p)^{k}}\,dz = \frac{1 - (1 - p)^{k}(kp + 1)}{p\,(k + 1)\left( 1 - (1 - p)^{k} \right)} .$$
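One way to check the expectation is to condition on the sample size: the minimum of n iid Unif(0, 1) variables has mean 1/(n + 1), so $E(Z) = E[1/(N + 1)]$ under the truncated binomial. The sketch below assumes the closed form $E(Z) = \frac{1 - (1-p)^k (kp + 1)}{p (k + 1)(1 - (1-p)^k)}$, which includes the truncation constant $1 - (1-p)^k$ in the denominator; the function names and parameter values are illustrative.

```python
from math import comb

def tb_pmf(n, k, p):
    """Zero-truncated binomial pmf on n = 1, ..., k."""
    return comb(k, n) * p**n * (1.0 - p)**(k - n) / (1.0 - (1.0 - p)**k)

def mean_closed(k, p):
    """Closed-form E(Z), including the truncation constant (assumed form)."""
    return (1.0 - (1.0 - p)**k * (k * p + 1.0)) / (p * (k + 1) * (1.0 - (1.0 - p)**k))

def mean_by_conditioning(k, p):
    """E(Z) = sum over n of E(Z | N = n) P(N = n), with E(Z | N = n) = 1 / (n + 1)."""
    return sum(tb_pmf(n, k, p) / (n + 1) for n in range(1, k + 1))

for k, p in ((1, 0.3), (2, 0.5), (5, 0.8), (50, 0.9)):
    print(k, p, mean_closed(k, p), mean_by_conditioning(k, p))
```

For k = 1 the sample size is always 1, so both expressions should return 1/2, the mean of a single uniform variable.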

Figure 5a shows the cdf of an underlying uniform distribution and a random

sample size that follows truncated binomial distribution. Figure 5b is the pdf of an

underlying uniform distribution and a random sample size which follows a truncated

binomial distribution.

[Figure 6: cdf curves on a horizontal axis from 0 to 1 for k = 10, p = 0.8 (brown), n = 10 (red), k = 50, p = 0.9 (blue), n = 50 (green).]

Figure 6: Cumulative distribution function plots for different sample sizes (n) and different p and k when the underlying distribution is uniform, for a fixed sample size and a random sample size that follows a truncated binomial.

From Figure 6, it is clear that the cdfs in the random and fixed cases tend to coincide as p, k and n increase. As p, k and n increase, the cdf rises steeply and turns sharply close to 1. This implies that the larger p, k and n get, the steeper the curve becomes.

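The percentile function of the first order statistic with an underlying uniform distribution and a random sample size that follows a truncated binomial distribution is given as

$$P = \int_0^{\mu} \frac{kp\,(1 - zp)^{k-1}}{1 - (1 - p)^{k}}\,dz = \frac{(1 - \mu p)^{k} - 1}{(1 - p)^{k} - 1} .$$

From the percentile function, the 50th percentile ($\mu$), or the median, is calculated using the relation

$$P = \frac{(1 - p\mu)^{k} - 1}{(1 - p)^{k} - 1} .$$

Hence

$$\mu = \frac{1 - \left[ \left( 0.5 - \dfrac{1}{1 - (1 - p)^{k}} \right) \left( (1 - p)^{k} - 1 \right) \right]^{1/k}}{p} = \frac{1}{p} \left[ 1 - \left( \frac{1 + (1 - p)^{k}}{2} \right)^{1/k} \right],$$

with $0 < p \le 1$, $P = 0.5$ and $0 < \mu < 1$.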
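The percentile relation can be inverted for any probability P, not only the median. The sketch below is illustrative: the names `utb_cdf` and `utb_quantile` are hypothetical, and k = 5, p = 0.8 are the values quoted for Figure 5. The inversion rearranges $(1 - p\mu)^k = 1 + P\left((1-p)^k - 1\right)$.

```python
def utb_cdf(z, k, p):
    """cdf of the minimum: uniform parent, zero-truncated binomial(k, p) sample size."""
    return 1.0 - ((1.0 - z * p)**k - (1.0 - p)**k) / (1.0 - (1.0 - p)**k)

def utb_quantile(P, k, p):
    """Value mu with F(mu) = P."""
    return (1.0 - (1.0 + P * ((1.0 - p)**k - 1.0))**(1.0 / k)) / p

k, p = 5, 0.8
for P in (0.25, 0.5, 0.75):
    mu = utb_quantile(P, k, p)
    # evaluating the cdf at the quantile should give back P
    print(P, mu, utb_cdf(mu, k, p))
```

Setting P = 0.5 gives the median; the endpoints P = 0 and P = 1 map to 0 and 1, the limits of the support.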

4.2 Exponential-Truncated Binomial Mixture Distribution

In this section, we focus on finding the distribution of the first order statistic with an underlying exponential distribution with rate µ and a random sample size that follows a truncated binomial distribution. If $Z \mid N = \min\{X_1, \ldots, X_N\}$, where $N$ is the sample size, which is random with distribution

$$P(N = n) = \frac{\binom{k}{n} p^{n} (1 - p)^{k-n}}{1 - (1 - p)^{k}}$$

for $n = 1, 2, \ldots, k$, this implies that

$$\begin{aligned}
P(Z > z) &= \sum_{n=1}^{k} (e^{-z\mu})^{n}\, \frac{\binom{k}{n} p^{n} (1 - p)^{k-n}}{1 - (1 - p)^{k}} \\
&= \frac{1}{1 - (1 - p)^{k}} \sum_{n=1}^{k} (e^{-z\mu})^{n} \binom{k}{n} p^{n} (1 - p)^{k-n} \\
&= \frac{1}{1 - (1 - p)^{k}} \left[ \sum_{n=0}^{k} (e^{-z\mu})^{n} \binom{k}{n} p^{n} (1 - p)^{k-n} - (1 - p)^{k} \right] \\
&= \frac{(p e^{-z\mu} + 1 - p)^{k} - (1 - p)^{k}}{1 - (1 - p)^{k}} .
\end{aligned}$$

Thus the cdf of the first order statistic with an underlying exponential distribution and a random sample size that follows a truncated binomial distribution is

$$F(z) = 1 - \frac{(p e^{-z\mu} + 1 - p)^{k} - (1 - p)^{k}}{1 - (1 - p)^{k}}, \quad \mu > 0, \ 0 < p \le 1, \ z > 0,$$

with probability density function

$$f(z) = \frac{k \mu p\, e^{-z\mu} (p e^{-z\mu} - p + 1)^{k-1}}{1 - (1 - p)^{k}}, \quad \mu > 0, \ 0 < p \le 1, \ z > 0.$$

[Figure 7, panel (a): Cumulative distribution function plot when p = 0.2, k = 5 and µ = 1. Panel (b): Probability density function plot when p = 0.8, k = 5 and µ = 1. Horizontal axis labelled "Index", from 0 to 10000.]

Figure 7: The cdf and pdf of the smallest order statistic when the underlying distribution is exponential and the random sample size follows a truncated binomial.

5 TRUNCATED GEOMETRIC MIXTURE

In this section of the paper, the focus is on finding the distribution of the first order statistic when $X_i$ follows any continuous distribution and the random sample size follows a geometric distribution. The geometric distribution is a discrete probability distribution that models the number of failures before the first occurrence of a specific event, where the event occurs with probability $p$ on each trial. The pmf of the geometric distribution is

$$P(N = n) = p(1 - p)^{n}, \quad n = 0, 1, 2, \ldots$$

with expectation

$$E(N) = \frac{1 - p}{p}$$

and variance

$$V(N) = \frac{1 - p}{p^{2}}, \quad 0 < p \le 1.$$

The truncated geometric distribution is a modified form of the geometric distribution whose support starts at 1, with probability mass function (pmf)

$$P(N = n) = p(1 - p)^{n-1}, \quad n = 1, 2, \ldots$$

with expectation

$$E(N) = \frac{1}{p}$$

and variance

$$V(N) = \frac{1 - p}{p^{2}}, \quad 0 < p \le 1.$$

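If $N$ follows a truncated geometric distribution, then for any continuous distribution of $X_i$ the generalised formula for $P(Z > z)$ is

$$\begin{aligned}
P(Z > z) &= \sum_{n=1}^{\infty} P(Z > z \mid N = n)\, P(N = n) \\
&= \sum_{n=1}^{\infty} (1 - F_X(z))^{n}\, p(1 - p)^{n-1} \\
&= p(1 - p)^{-1} \left[ \sum_{n=0}^{\infty} ((1 - F_X(z))(1 - p))^{n} - 1 \right] \\
&= \frac{p}{1 - p} \left[ \frac{-1}{pF_X(z) - p - F_X(z)} - 1 \right].
\end{aligned}$$

The general cdf of the first order statistic with an underlying continuous distribution and a random sample size that follows a truncated geometric distribution is

$$F(z) = 1 - \frac{p}{1 - p} \left[ \frac{-1}{pF_X(z) - p - F_X(z)} - 1 \right]$$

and the pdf is

$$f(z) = \frac{p\, f_X(z)}{(pF_X(z) - p - F_X(z))^{2}}, \quad 0 < p < 1,$$

where $F_X(z)$ and $f_X(z)$ are the cdf and pdf of the continuous random variable $X_i$ respectively.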
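The geometric-series step can be checked by comparing the closed form of $P(Z > z)$ with a long partial sum of the mixture series. The sketch below is illustrative: the function names, p = 0.3, and the cutoff of 2000 terms are assumptions; `Fx` stands for the parent cdf value $F_X(z)$.

```python
def tg_survival_closed(Fx, p):
    """Closed form p/(1-p) * [ -1/(p Fx - p - Fx) - 1 ] for P(Z > z)."""
    return p / (1.0 - p) * (-1.0 / (p * Fx - p - Fx) - 1.0)

def tg_survival_series(Fx, p, terms=2000):
    """Partial sum of sum_{n>=1} (1 - Fx)^n p (1 - p)^(n-1)."""
    return sum((1.0 - Fx)**n * p * (1.0 - p)**(n - 1) for n in range(1, terms + 1))

p = 0.3
for Fx in (0.0, 0.25, 0.9, 1.0):
    # the closed form also equals p (1 - Fx) / (p + Fx - p Fx)
    simplified = p * (1.0 - Fx) / (p + Fx - p * Fx)
    print(Fx, tg_survival_series(Fx, p), tg_survival_closed(Fx, p), simplified)
```

The simplified expression in the loop is algebraically the same quantity and avoids the division by 1 − p, which can be convenient when p is close to 1.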

5.1 Uniform-Truncated Geometric Mixture

Consider the distribution of the first order statistic with an underlying uniform distribution, where $Z \mid N = \min\{X_1, \ldots, X_N\}$ and $N$ is the random sample size, which follows a truncated geometric distribution

$$P(N = n) = p(1 - p)^{n-1}, \quad n = 1, 2, \ldots .$$

This implies

$$\begin{aligned}
P(Z > z) &= \sum_{n=1}^{\infty} (1 - z)^{n}\, p(1 - p)^{n-1} \\
&= p(1 - p)^{-1} \left[ \sum_{n=0}^{\infty} ((1 - z)(1 - p))^{n} - 1 \right] \\
&= \frac{p}{1 - p} \left[ \frac{-1}{zp - p - z} - 1 \right].
\end{aligned}$$

The cdf of the first order statistic with an underlying uniform distribution and a random sample size that follows a truncated geometric distribution is

$$F(z) = 1 - \frac{p}{1 - p} \left[ \frac{-1}{zp - p - z} - 1 \right], \quad 0 < p < 1, \ 0 < z < 1,$$

with pdf

$$f(z) = \frac{p}{(pz - p - z)^{2}}, \quad 0 < p < 1, \ 0 < z < 1.$$

[Figure 8, panel (a): Cumulative distribution function plot. Panel (b): Probability density function plot. Horizontal axis from 0 to 1.]

Figure 8: The cdf and pdf with an underlying uniform distribution and a random sample size that follows a truncated geometric distribution when p = 0.01.

[Figure 9: cdf curves on a horizontal axis from 0 to 1 for p = 0.1 (brown), n = 10 (red), p = 0.001 (blue), n = 100 (green).]

Figure 9: Cumulative distribution function plots for different sample sizes (n) and different probabilities p when the underlying distribution is uniform, for a fixed sample size and a random sample size that follows a truncated geometric.

From Figure 9, it is clear that the cdfs in the random and fixed cases tend to take a similar shape as p decreases and n increases. As p decreases and n increases, the cdf rises steeply and turns sharply close to 1. This implies that the smaller p gets and the larger n gets, the closer the cdfs for the fixed and random sample sizes become.

5.2 Exponential-Truncated Geometric Mixture

Consider the distribution of the first order statistic with an underlying exponential distribution with rate µ, where $Z \mid N = \min\{X_1, \ldots, X_N\}$ and $N$ is the random sample size, which follows a truncated geometric distribution

$$P(N = n) = p(1 - p)^{n-1}, \quad n = 1, 2, \ldots .$$

Then

$$\begin{aligned}
P(Z > z) &= \sum_{n=1}^{\infty} (e^{-z\mu})^{n}\, p(1 - p)^{n-1} \\
&= p(1 - p)^{-1} \left[ \sum_{n=0}^{\infty} (e^{-z\mu}(1 - p))^{n} - 1 \right].
\end{aligned}$$

This implies

$$P(Z > z) = \frac{p\, e^{-z\mu}}{1 - e^{-z\mu}(1 - p)} .$$

Hence, the cdf of the first order statistic with an underlying exponential distribution and a random sample size that follows a truncated geometric distribution is

$$F(z) = 1 - \frac{p\, e^{-z\mu}}{1 - e^{-z\mu}(1 - p)}, \quad \text{rate } \mu > 0, \ 0 < p \le 1, \ z > 0,$$

with pdf

$$f(z) = \frac{p \mu\, e^{z\mu}}{(e^{z\mu} + p - 1)^{2}}, \quad \text{rate } \mu > 0, \ 0 < p \le 1, \ z > 0.$$
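The exponential-truncated geometric mixture can be simulated directly: draw N from the geometric distribution on 1, 2, ..., draw N exponential lifetimes, and keep the smallest. The sketch below is a minimal illustration; the inverse-transform sampler for N, the seed, and the values p = 0.2 and µ = 1 are assumptions made for the example.

```python
import random
from math import exp, log

def sample_truncated_geometric(p, rng):
    """Inverse-transform draw of N on {1, 2, ...} with P(N = n) = p (1 - p)^(n - 1)."""
    u = 1.0 - rng.random()          # u lies in (0, 1], so log(u) is defined
    return int(log(u) / log(1.0 - p)) + 1

def etg_cdf(z, p, mu):
    """Closed-form cdf of the minimum: exponential(mu) parent, geometric sample size."""
    return 1.0 - p * exp(-z * mu) / (1.0 - exp(-z * mu) * (1.0 - p))

def etg_pdf(z, p, mu):
    """Closed-form pdf of the minimum."""
    return p * mu * exp(z * mu) / (exp(z * mu) + p - 1.0)**2

def simulate_minima(p, mu, reps=20000, seed=11):
    """Simulated minima of N exponential(mu) draws, with N drawn afresh each time."""
    rng = random.Random(seed)
    return [min(rng.expovariate(mu) for _ in range(sample_truncated_geometric(p, rng)))
            for _ in range(reps)]

p, mu = 0.2, 1.0
draws = simulate_minima(p, mu)
for z in (0.1, 0.2, 1.0):
    empirical = sum(d <= z for d in draws) / len(draws)
    print(z, empirical, etg_cdf(z, p, mu))
```

A smaller p gives larger sample sizes on average, so the simulated minima concentrate nearer to zero, in line with the steep cdf described for p = 0.01.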

[Figure 10, panel (a): Cumulative distribution function plot. Panel (b): Probability density function plot. Horizontal axis labelled "Index", from 0 to 10000.]

Figure 10: The cdf and pdf of the smallest order statistic when the underlying distribution is exponential and the random sample size follows a truncated geometric with p = 0.01 and µ = 1.

6 CONCLUSION

This paper has focused on finding probability distributions of the first order statistic when the sample size is random. Each distribution is obtained by combining the conditional distribution of the minimum given the sample size with the marginal distribution of the sample size. In some instances, properties such as the expectation, variance and percentiles are calculated. The primary objective of this paper is to consider a random sample size and compare its behaviour to a fixed sample size in terms of their cumulative distribution functions (cdf). A comparison between the cdfs for fixed and random sample sizes is shown in Figures 3, 6 and 9. Figures 3 and 6 show that, as the sample size (n) increases in the fixed case, the cdf approaches one more quickly and becomes steeper. Figures 3 and 6 also show that as the sample size, λ, p and k increase, the cdfs in the fixed and random cases appear the same. In Figure 9, as n increases and p decreases, the cdfs in the fixed and random cases take a similar shape, become steeper and turn sharply close to one.

REFERENCES

[1] George Casella and Roger L. Berger, Statistical Inference, 2nd edition.

[2] Robert V. Hogg, Joseph W. McKean, and Allen T. Craig, Introduction to Mathematical Statistics, 7th edition, Pearson, 2013.

[3] Jonathan Marchini, The Poisson Distribution, Nov 2008.

[4] Anirban DasGupta, Finite Sample Theory of Order Statistics and Extremes, May 2011.

VITA

VINCENT FORGO

Education:
    B.S. Mathematics and Statistics, University of Cape Coast, Cape Coast, Ghana, 2009
    M.S. Applied Mathematics, Indiana University of Pennsylvania, Indiana, Pennsylvania, 2014
    M.S. Mathematical Sciences, East Tennessee State University, 2017

Professional Experience:
    Mathematics Teacher, S D A Senior High School, Bekwai, Ghana, 2009–2011
    Graduate Assistant, Indiana University of Pennsylvania, Indiana, Pennsylvania, 2012–2013
    Graduate Assistant, East Tennessee State University, Johnson City, Tennessee, 2014–2016

Projects:
    Vincent Forgo, "A Distribution of the First Order Statistic when the Sample Size is Random"