Probstat/notes/parameter estimation

This is part of probstat.

Previously we tried to estimate the population means and variances using the sample means and variances.  In this section, we shall see the justification for why what we did makes sense.

There are many ways to estimate parameters.

== Method of moments estimators ==

See also the Wikipedia article on the method of moments.

This is probably the simplest class of estimators.  However, method-of-moments estimators are often biased (as we shall show in the example below).

'''Definition:''' For a random variable <math>X</math>, <math>E[X^k]</math> is called the ''k''-th moment of <math>X</math>.  Note that the first moment is the mean <math>E[X]</math>.  The variance of a random variable depends on the first and the second moments.

If we want to estimate a parameter <math>\theta</math> using the method of moments, we start by writing the parameter as a function of the moments, i.e.,

<center>
<math>\theta = g(E[X],E[X^2],\ldots,E[X^r]).</math>
</center>

We then estimate the sample moments

<center>
<math>M_k = \frac{\sum_{i=1}^n X_i^k}{n}</math>
</center>

for <math>k=1,2,\ldots,r</math>.  Our estimate is thus

<center>
<math>\hat{\theta}=g(M_1,M_2,\ldots,M_r).</math>
</center>

'''EX1:''' We show how to estimate the variance with the method of moments.  Recall that the variance can be written as a function of the first two moments:

<center>
<math>Var(X) = g(E[X],E[X^2]) = E[X^2] - E[X]^2.</math>
</center>

We first estimate the first moment <math>M_1 = \frac{\sum_{i=1}^n X_i}{n}</math> and the second moment <math>M_2 = \frac{\sum_{i=1}^n X_i^2}{n}</math>.  The estimator is

<center>
<math>\hat{\theta} = M_2 - M_1^2 = \left(\frac{\sum_{i=1}^n X_i^2}{n}\right) - \left(\frac{\sum_{i=1}^n X_i}{n}\right)^2
= \frac{\sum_{i=1}^n (X_i-\bar{X})^2}{n}.</math>
</center>

Note that the estimator <math>\hat{\theta}</math> is biased, because <math>E[\hat{\theta}] = \frac{n-1}{n}Var(X)</math>.

As this example shows, other estimation techniques are usually preferred over the method of moments.
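
To see the bias concretely, the following small simulation (an illustrative sketch, not part of the original notes; the normal distribution, sample size, and number of trials are arbitrary choices) averages the method-of-moments estimator over many small samples and compares the result with <math>\frac{n-1}{n}Var(X)</math>.

<pre>
# Illustration only: average the method-of-moments variance estimator
# theta_hat = M_2 - M_1^2 over many small samples and compare with the
# theoretical value (n-1)/n * Var(X).  Distribution and sizes are
# arbitrary choices for this sketch.
import random

def mom_variance(xs):
    n = len(xs)
    m1 = sum(xs) / n                  # first sample moment M_1
    m2 = sum(x * x for x in xs) / n   # second sample moment M_2
    return m2 - m1 ** 2               # theta_hat = M_2 - M_1^2

random.seed(0)
n, trials = 5, 100000
true_var = 1.0                        # Var(X) for a standard normal
avg = sum(mom_variance([random.gauss(0.0, 1.0) for _ in range(n)])
          for _ in range(trials)) / trials

print(avg)                            # roughly 0.8
print((n - 1) / n * true_var)         # (n-1)/n * Var(X) = 0.8
</pre>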
  
 
== Maximum likelihood estimators ==
Suppose that we want to estimate a parameter <math>\theta</math> based on observations (or a sample) <math>X_1,X_2,\ldots,X_n</math>, and that we can find a joint distribution function

<center>
<math>f(x_1,x_2,x_3,\ldots,x_n|\theta)</math>
</center>

that gives the probability that we observe <math>(x_1,x_2,\ldots,x_n)</math> given a particular value of <math>\theta</math>.  The ''maximum likelihood estimator'' is the value <math>\hat{\theta}</math> for which <math>f(x_1,x_2,\ldots,x_n|\hat{\theta})</math> is maximized, i.e.,

<center>
<math>\hat{\theta} = \arg\max_\theta f(x_1,x_2,x_3,\ldots,x_n|\theta).</math>
</center>
=== Examples ===

'''EX1''': A box contains 5 balls, some blue and some white.  We randomly choose a ball from the box 3 times with replacement and get 2 blue balls and 1 white ball.  Estimate the number of blue balls in the box using the maximum likelihood method.

'''Solution:''' Let <math>\theta</math> denote the number of blue balls.  The possible values are 0, 1, 2, 3, 4, and 5.  For each value of <math>\theta</math>, the probability that we observe the outcome of getting 2 blue balls and 1 white ball is

<center>
<math>{3\choose 2}(\theta/5)^2(1-\theta/5)</math>.
</center>

We put the probabilities in the following table.

{| class="wikitable"
|-
! <math>\theta</math>
! 0
! 1
! 2
! 3
! 4
! 5
|-
| Probability
| 0
| 0.096
| 0.288
| 0.432
| 0.384
| 0
|}

Since the probability is maximized at <math>\theta=3</math>, the maximum likelihood estimate of the number of blue balls is 3.
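
The table can be reproduced directly.  The short sketch below (illustrative only) evaluates the likelihood <math>{3\choose 2}(\theta/5)^2(1-\theta/5)</math> for every candidate value of <math>\theta</math> and picks the maximizer.

<pre>
# Illustration only: evaluate the likelihood of observing 2 blue balls
# and 1 white ball for every candidate number of blue balls theta,
# then pick the theta with the largest likelihood.
from math import comb

likelihood = {theta: comb(3, 2) * (theta / 5) ** 2 * (1 - theta / 5)
              for theta in range(6)}

for theta, prob in likelihood.items():
    print(theta, round(prob, 3))            # 0, 0.096, 0.288, 0.432, 0.384, 0

print(max(likelihood, key=likelihood.get))  # 3, the maximum likelihood estimate
</pre>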

'''EX2''': Maximum likelihood estimator for a Bernoulli parameter

A coin has probability <math>p</math> of turning up heads.  We perform <math>n</math> independent trials and obtain random variables <math>X_1,X_2,\ldots,X_n</math> such that <math>X_i=1</math> if we get heads in the ''i''-th trial and <math>X_i=0</math> otherwise.  What is the maximum likelihood estimator for <math>p</math>?

'''Solution:'''  Note that <math>P\{X_i=1\} = p = 1 - P\{X_i=0\}</math>.  We rewrite the probability that the random variable <math>X_i</math> takes value <math>x_i</math>, for <math>x_i=0,1</math>, as

<center>
<math>P\{X_i = x_i\} = p^{x_i}(1-p)^{1-x_i}</math>.
</center>

('''Remark:''' The random variable is <math>X_i</math>.  We denote the actual value of the variable by <math>x_i</math>, i.e., with a small x.)

Therefore, the probability of obtaining the data <math>x_1,x_2,\ldots,x_n</math> given parameter <math>p</math> is

<center>
<math>f(x_1,x_2,\ldots,x_n|p)
= p^{x_1}(1-p)^{1-x_1}\cdot p^{x_2}(1-p)^{1-x_2} \cdots p^{x_n}(1-p)^{1-x_n}
= p^{\left(\sum_i x_i\right)}(1-p)^{\left(\sum_i (1-x_i)\right)}.
</math>
</center>

Since <math>f(x_1,x_2,\ldots,x_n|p)</math> is maximized exactly when <math>\ln f(x_1,x_2,\ldots,x_n|p)</math> is maximized, we can take the logarithm of the expression above and get

<center>
<math>\ln f(x_1,x_2,\ldots,x_n|p)
= \left(\sum_i x_i\right)\ln p + \left(\sum_i (1-x_i)\right)\ln(1-p)
= \left(\sum_i x_i\right)\ln p + \left(n - \sum_i x_i\right)\ln(1-p).
</math>
</center>

To find its maximum, we differentiate with respect to <math>p</math> and get

<center>
<math>\frac{d}{dp}\ln f(x_1,x_2,\ldots,x_n|p)
= \frac{\sum_i x_i}{p} - \frac{n - \sum_i x_i}{1-p}.
</math>
</center>

Setting <math>\frac{d}{dp}\ln f(x_1,x_2,\ldots,x_n|p) = 0</math> at <math>p=\hat{p}</math> and solving the equation, we get

<center>
<math>\frac{\sum_i x_i}{\hat{p}} = \frac{n - \sum_i x_i}{1-\hat{p}},
</math>
</center>

implying that

<center>
<math>\hat{p} = \frac{\sum_i x_i}{n},
</math>
</center>

which is the maximum likelihood estimator for <math>p</math>.
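
As a quick check of this closed form, the sketch below (illustrative only; the coin-flip data are made up) maximizes the log-likelihood over a grid of candidate values of <math>p</math> and compares the result with <math>\sum_i x_i / n</math>.

<pre>
# Illustration only: maximize the Bernoulli log-likelihood over a grid
# of candidate p values and compare with the closed form sum(x)/n.
from math import log

x = [1, 0, 1, 1, 0, 1, 0, 1, 1, 1]           # made-up coin-flip data
n, s = len(x), sum(x)

def log_likelihood(p):
    # ln f(x_1,...,x_n | p) = (sum x_i) ln p + (n - sum x_i) ln(1 - p)
    return s * log(p) + (n - s) * log(1 - p)

grid = [i / 1000 for i in range(1, 1000)]    # avoid p = 0 and p = 1
p_hat_grid = max(grid, key=log_likelihood)

print(p_hat_grid)                            # 0.7
print(s / n)                                 # closed-form MLE: 0.7
</pre>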

'''EX3''': Maximum likelihood estimator for the Poisson mean

: ''To be added...''
  
 
== Bayes estimators ==

In the Bayesian approach, the parameter (say <math>p</math>) is itself treated as a random variable with a prior distribution <math>f(p)</math>.  Let the observed data be <math>x_1,x_2,\ldots,x_n</math>.  We first compute the conditional (posterior) distribution

<center>
<math>
f(p|x_1,x_2,\ldots,x_n)
= \frac{f(x_1,x_2,\ldots,x_n,p)}{f(x_1,x_2,\ldots,x_n)}
= \frac{f(x_1,x_2,\ldots,x_n|p)f(p)}{f(x_1,x_2,\ldots,x_n)}.
</math>
</center>

With that distribution, the Bayes estimator is the posterior mean

<center>
<math>
E[p|X_1=x_1,X_2=x_2,\ldots,X_n=x_n].
</math>
</center>
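
As a concrete (hypothetical) illustration of this recipe for Bernoulli data: assuming, for this sketch only, a uniform prior <math>f(p)=1</math> on <math>[0,1]</math> and the same made-up coin flips as above, the code below approximates the posterior mean <math>E[p|x_1,\ldots,x_n]</math> by numerical integration on a grid.

<pre>
# Illustration only: Bayes estimator E[p | data] for Bernoulli
# observations, assuming (for this sketch) a uniform prior f(p) = 1
# on [0, 1], approximated by numerical integration on a grid.
x = [1, 0, 1, 1, 0, 1, 0, 1, 1, 1]              # made-up coin-flip data
n, s = len(x), sum(x)

def likelihood(p):
    # f(x_1,...,x_n | p) = p^(sum x_i) * (1-p)^(n - sum x_i)
    return p ** s * (1 - p) ** (n - s)

grid = [i / 10000 for i in range(10001)]
weights = [likelihood(p) * 1.0 for p in grid]   # likelihood times prior f(p) = 1
normalizer = sum(weights)                       # proportional to f(x_1,...,x_n)

posterior_mean = sum(p * w for p, w in zip(grid, weights)) / normalizer
print(posterior_mean)                           # about (s + 1) / (n + 2) = 0.667
</pre>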
