Template:Lognormal distribution confidence bounds
Confidence Bounds
The method used by the application in estimating the different types of confidence bounds for lognormally distributed data is presented in this section. Note that there are closed-form solutions for both the normal and lognormal reliability that can be obtained without the use of the Fisher information matrix. However, these closed-form solutions only apply to complete data. To achieve consistent application across all possible data types, Weibull++ always uses the Fisher matrix in computing confidence intervals. The complete derivations were presented in detail for a general function in Chapter 5. For a discussion on exact confidence bounds for the normal and lognormal, see Chapter 8.
Fisher Matrix Bounds
Bounds on the Parameters
The lower and upper bounds on the mean of the natural logarithms of the times-to-failure, [math]\displaystyle{ {\mu }' }[/math] , are estimated from:
- [math]\displaystyle{ \begin{align} & \mu _{U}^{\prime }= & {{\widehat{\mu }}^{\prime }}+{{K}_{\alpha }}\sqrt{Var({{\widehat{\mu }}^{\prime }})}\text{ (upper bound),} \\ & \mu _{L}^{\prime }= & {{\widehat{\mu }}^{\prime }}-{{K}_{\alpha }}\sqrt{Var({{\widehat{\mu }}^{\prime }})}\text{ (lower bound)}\text{.} \end{align} }[/math]
For the standard deviation, [math]\displaystyle{ {{\widehat{\sigma }}_{{{T}'}}} }[/math] , [math]\displaystyle{ \ln ({{\widehat{\sigma }}_{{{T}'}}}) }[/math] is treated as normally distributed, and the bounds are estimated from:
- [math]\displaystyle{ \begin{align} & {{\sigma }_{U}}= & {{\widehat{\sigma }}_{{{T}'}}}\cdot {{e}^{\tfrac{{{K}_{\alpha }}\sqrt{Var({{\widehat{\sigma }}_{{{T}'}}})}}{{{\widehat{\sigma }}_{{{T}'}}}}}}\text{ (upper bound),} \\ & {{\sigma }_{L}}= & \frac{{{\widehat{\sigma }}_{{{T}'}}}}{{{e}^{\tfrac{{{K}_{\alpha }}\sqrt{Var({{\widehat{\sigma }}_{{{T}'}}})}}{{{\widehat{\sigma }}_{{{T}'}}}}}}}\text{ (lower bound),} \end{align} }[/math]
where [math]\displaystyle{ {{K}_{\alpha }} }[/math] is defined by:
- [math]\displaystyle{ \alpha =\frac{1}{\sqrt{2\pi }}\int_{{{K}_{\alpha }}}^{\infty }{{e}^{-\tfrac{{{t}^{2}}}{2}}}dt=1-\Phi ({{K}_{\alpha }}) }[/math]
If [math]\displaystyle{ \delta }[/math] is the confidence level, then [math]\displaystyle{ \alpha =\tfrac{1-\delta }{2} }[/math] for the two-sided bounds and [math]\displaystyle{ \alpha =1-\delta }[/math] for the one-sided bounds.
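As a small illustration of the definition above, the following sketch computes [math]\displaystyle{ {{K}_{\alpha }} }[/math] from a confidence level by inverting the standard normal CDF. The function name `k_alpha` and the 90% level are illustrative choices, not part of the original text.

```python
from statistics import NormalDist

def k_alpha(delta, two_sided=True):
    """Return K_alpha such that 1 - Phi(K_alpha) = alpha for confidence level delta."""
    alpha = (1.0 - delta) / 2.0 if two_sided else 1.0 - delta
    # 1 - Phi(K) = alpha  <=>  K = Phi^{-1}(1 - alpha)
    return NormalDist().inv_cdf(1.0 - alpha)

k_two_sided = k_alpha(0.90, two_sided=True)    # alpha = 0.05
k_one_sided = k_alpha(0.90, two_sided=False)   # alpha = 0.10
print(f"two-sided: {k_two_sided:.4f}, one-sided: {k_one_sided:.4f}")
```

For the same confidence level, the two-sided value is larger than the one-sided value because each tail receives only half of the total risk.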
The variances and covariances of [math]\displaystyle{ {{\widehat{\mu }}^{\prime }} }[/math] and [math]\displaystyle{ {{\widehat{\sigma }}_{{{T}'}}} }[/math] are estimated as follows:
- [math]\displaystyle{ \left( \begin{matrix} \widehat{Var}\left( {{\widehat{\mu }}^{\prime }} \right) & \widehat{Cov}\left( {{\widehat{\mu }}^{\prime }},{{\widehat{\sigma }}_{{{T}'}}} \right) \\ \widehat{Cov}\left( {{\widehat{\mu }}^{\prime }},{{\widehat{\sigma }}_{{{T}'}}} \right) & \widehat{Var}\left( {{\widehat{\sigma }}_{{{T}'}}} \right) \\ \end{matrix} \right)=\left( \begin{matrix} -\tfrac{{{\partial }^{2}}\Lambda }{\partial {{({\mu }')}^{2}}} & -\tfrac{{{\partial }^{2}}\Lambda }{\partial {\mu }'\partial {{\sigma }_{{{T}'}}}} \\ {} & {} \\ -\tfrac{{{\partial }^{2}}\Lambda }{\partial {\mu }'\partial {{\sigma }_{{{T}'}}}} & -\tfrac{{{\partial }^{2}}\Lambda }{\partial \sigma _{{{T}'}}^{2}} \\ \end{matrix} \right)_{{\mu }'={{\widehat{\mu }}^{\prime }},{{\sigma }_{{{T}'}}}={{\widehat{\sigma }}_{{{T}'}}}}^{-1} }[/math]
where [math]\displaystyle{ \Lambda }[/math] is the log-likelihood function of the lognormal distribution.
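The parameter bounds above can be sketched in a few lines once the variances are available. This is a minimal illustration, not the Weibull++ implementation: the function name `parameter_bounds` is hypothetical, the variances are taken as inputs rather than computed from the Fisher matrix, and the numerical inputs are the estimates and variances quoted in Example 4 of this section with an arbitrarily chosen 90% two-sided level.

```python
import math
from statistics import NormalDist

def parameter_bounds(mu_hat, sigma_hat, var_mu, var_sigma, delta, two_sided=True):
    """Fisher-matrix style bounds on mu' (normal) and sigma_T' (log-normal approximation)."""
    alpha = (1.0 - delta) / 2.0 if two_sided else 1.0 - delta
    k = NormalDist().inv_cdf(1.0 - alpha)

    # mu' is treated as normally distributed: symmetric bounds
    half_width = k * math.sqrt(var_mu)
    mu_bounds = (mu_hat - half_width, mu_hat + half_width)

    # ln(sigma) is treated as normally distributed: multiplicative bounds
    factor = math.exp(k * math.sqrt(var_sigma) / sigma_hat)
    sigma_bounds = (sigma_hat / factor, sigma_hat * factor)
    return mu_bounds, sigma_bounds

# Inputs quoted in Example 4; the 90% level is an illustrative assumption
mu_bounds, sigma_bounds = parameter_bounds(3.516, 0.849, 0.0515, 0.0258, delta=0.90)
print("mu' bounds:   ", mu_bounds)
print("sigma bounds: ", sigma_bounds)
```

Because the bounds on the standard deviation are built on the log scale, they stay positive and are symmetric around the estimate in a multiplicative sense (lower bound times upper bound equals the squared estimate).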
Bounds on Reliability
The reliability of the lognormal distribution is:
- [math]\displaystyle{ \hat{R}({T}';{\mu }',{{\sigma }_{{{T}'}}})=\int_{{{T}'}}^{\infty }\frac{1}{{{\widehat{\sigma }}_{{{T}'}}}\sqrt{2\pi }}{{e}^{-\tfrac{1}{2}{{\left( \tfrac{t-{{\widehat{\mu }}^{\prime }}}{{{\widehat{\sigma }}_{{{T}'}}}} \right)}^{2}}}}dt }[/math]
Let [math]\displaystyle{ \widehat{z}(t;{{\hat{\mu }}^{\prime }},{{\hat{\sigma }}_{{{T}'}}})=\tfrac{t-{{\widehat{\mu }}^{\prime }}}{{{\widehat{\sigma }}_{{{T}'}}}}, }[/math] then [math]\displaystyle{ \tfrac{d\widehat{z}}{dt}=\tfrac{1}{{{\widehat{\sigma }}_{{{T}'}}}}. }[/math] For [math]\displaystyle{ t={T}' }[/math] , [math]\displaystyle{ \widehat{z}=\tfrac{{T}'-{{\widehat{\mu }}^{\prime }}}{{{\widehat{\sigma }}_{{{T}'}}}} }[/math] , and for [math]\displaystyle{ t=\infty , }[/math] [math]\displaystyle{ \widehat{z}=\infty . }[/math] The above equation then becomes:
- [math]\displaystyle{ \hat{R}(\widehat{z})=\int_{\widehat{z}({T}')}^{\infty }\frac{1}{\sqrt{2\pi }}{{e}^{-\tfrac{1}{2}{{z}^{2}}}}dz }[/math]
The bounds on [math]\displaystyle{ z }[/math] are estimated from:
- [math]\displaystyle{ \begin{align} & {{z}_{U}}= & \widehat{z}+{{K}_{\alpha }}\sqrt{Var(\widehat{z})} \\ & {{z}_{L}}= & \widehat{z}-{{K}_{\alpha }}\sqrt{Var(\widehat{z})} \end{align} }[/math]
- where:
- [math]\displaystyle{ \begin{align} & Var(\widehat{z})= & \left( \frac{\partial z}{\partial {\mu }'} \right)_{{{\widehat{\mu }}^{\prime }}}^{2}Var({{\widehat{\mu }}^{\prime }})+\left( \frac{\partial z}{\partial {{\sigma }_{{{T}'}}}} \right)_{{{\widehat{\sigma }}_{{{T}'}}}}^{2}Var({{\widehat{\sigma }}_{{{T}'}}}) \\ & & +2{{\left( \frac{\partial z}{\partial {\mu }'} \right)}_{{{\widehat{\mu }}^{\prime }}}}{{\left( \frac{\partial z}{\partial {{\sigma }_{{{T}'}}}} \right)}_{{{\widehat{\sigma }}_{{{T}'}}}}}Cov\left( {{\widehat{\mu }}^{\prime }},{{\widehat{\sigma }}_{{{T}'}}} \right) \end{align} }[/math]
- or:
- [math]\displaystyle{ Var(\widehat{z})=\frac{1}{\widehat{\sigma }_{{{T}'}}^{2}}\left[ Var({{\widehat{\mu }}^{\prime }})+{{\widehat{z}}^{2}}Var({{\widehat{\sigma }}_{{{T}'}}})+2\cdot \widehat{z}\cdot Cov\left( {{\widehat{\mu }}^{\prime }},{{\widehat{\sigma }}_{{{T}'}}} \right) \right] }[/math]
The upper and lower bounds on reliability are:
- [math]\displaystyle{ \begin{align} & {{R}_{U}}= & \int_{{{z}_{L}}}^{\infty }\frac{1}{\sqrt{2\pi }}{{e}^{-\tfrac{1}{2}{{z}^{2}}}}dz\text{ (Upper bound)} \\ & {{R}_{L}}= & \int_{{{z}_{U}}}^{\infty }\frac{1}{\sqrt{2\pi }}{{e}^{-\tfrac{1}{2}{{z}^{2}}}}dz\text{ (Lower bound)} \end{align} }[/math]
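The steps above can be sketched as follows. The function name `reliability_bounds` is hypothetical. The inputs use the parameter estimates quoted in Example 5 of this section; the variances are an assumption, taken from the standard complete-data result [math]\displaystyle{ Var({{\widehat{\mu }}^{\prime }})=\widehat{\sigma }_{{{T}'}}^{2}/N }[/math] , [math]\displaystyle{ Var({{\widehat{\sigma }}_{{{T}'}}})=\widehat{\sigma }_{{{T}'}}^{2}/(2N) }[/math] with zero covariance, and the time is entered in original units so that [math]\displaystyle{ {T}'=\ln (t) }[/math] .

```python
import math
from statistics import NormalDist

def reliability_bounds(t, mu_hat, sigma_hat, var_mu, var_sigma, cov, delta, two_sided=True):
    """Return (R_L, R_hat, R_U) for time t given in original (not log) units."""
    std_normal = NormalDist()
    alpha = (1.0 - delta) / 2.0 if two_sided else 1.0 - delta
    k = std_normal.inv_cdf(1.0 - alpha)

    z_hat = (math.log(t) - mu_hat) / sigma_hat
    var_z = (var_mu + z_hat ** 2 * var_sigma + 2.0 * z_hat * cov) / sigma_hat ** 2
    half_width = k * math.sqrt(var_z)
    z_lower, z_upper = z_hat - half_width, z_hat + half_width

    r_hat = 1.0 - std_normal.cdf(z_hat)
    r_upper = 1.0 - std_normal.cdf(z_lower)   # smaller z gives larger reliability
    r_lower = 1.0 - std_normal.cdf(z_upper)
    return r_lower, r_hat, r_upper

# Estimates quoted in Example 5; variances assume complete data with N = 5
mu_hat, sigma_hat, n = 4.2926, 0.32361, 5
r_lower, r_hat, r_upper = reliability_bounds(
    65.0, mu_hat, sigma_hat, sigma_hat ** 2 / n, sigma_hat ** 2 / (2 * n), 0.0, delta=0.75
)
print(f"R_L={r_lower:.4f}  R_hat={r_hat:.4f}  R_U={r_upper:.4f}")
```

Note that the lower bound on [math]\displaystyle{ z }[/math] produces the upper bound on reliability, and vice versa, because reliability decreases as [math]\displaystyle{ z }[/math] increases.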
Bounds on Time
The bounds around time for a given lognormal percentile, or unreliability, are estimated by first solving the reliability equation with respect to time, as follows:
- [math]\displaystyle{ {T}'({{\widehat{\mu }}^{\prime }},{{\widehat{\sigma }}_{{{T}'}}})={{\widehat{\mu }}^{\prime }}+z\cdot {{\widehat{\sigma }}_{{{T}'}}} }[/math]
- where:
- [math]\displaystyle{ z={{\Phi }^{-1}}\left[ F({T}') \right] }[/math]
- and:
- [math]\displaystyle{ \Phi (z)=\frac{1}{\sqrt{2\pi }}\int_{-\infty }^{z({T}')}{{e}^{-\tfrac{1}{2}{{z}^{2}}}}dz }[/math]
The next step is to calculate the variance of [math]\displaystyle{ {T}'({{\widehat{\mu }}^{\prime }},{{\widehat{\sigma }}_{{{T}'}}}): }[/math]
- [math]\displaystyle{ \begin{align} & Var({{{\hat{T}}}^{\prime }})= & {{\left( \frac{\partial {T}'}{\partial {\mu }'} \right)}^{2}}Var({{\widehat{\mu }}^{\prime }})+{{\left( \frac{\partial {T}'}{\partial {{\sigma }_{{{T}'}}}} \right)}^{2}}Var({{\widehat{\sigma }}_{{{T}'}}}) \\ & & +2\left( \frac{\partial {T}'}{\partial {\mu }'} \right)\left( \frac{\partial {T}'}{\partial {{\sigma }_{{{T}'}}}} \right)Cov\left( {{\widehat{\mu }}^{\prime }},{{\widehat{\sigma }}_{{{T}'}}} \right) \\ & & \\ & Var({{{\hat{T}}}^{\prime }})= & Var({{\widehat{\mu }}^{\prime }})+{{\widehat{z}}^{2}}Var({{\widehat{\sigma }}_{{{T}'}}})+2\cdot \widehat{z}\cdot Cov\left( {{\widehat{\mu }}^{\prime }},{{\widehat{\sigma }}_{{{T}'}}} \right) \end{align} }[/math]
The upper and lower bounds are then found by:
- [math]\displaystyle{ \begin{align} & T_{U}^{\prime }= & \ln {{T}_{U}}={{{\hat{T}}}^{\prime }}+{{K}_{\alpha }}\sqrt{Var({{{\hat{T}}}^{\prime }})} \\ & T_{L}^{\prime }= & \ln {{T}_{L}}={{{\hat{T}}}^{\prime }}-{{K}_{\alpha }}\sqrt{Var({{{\hat{T}}}^{\prime }})} \end{align} }[/math]
Solving for [math]\displaystyle{ {{T}_{U}} }[/math] and [math]\displaystyle{ {{T}_{L}} }[/math] we get:
- [math]\displaystyle{ \begin{align} & {{T}_{U}}= & {{e}^{T_{U}^{\prime }}}\text{ (upper bound),} \\ & {{T}_{L}}= & {{e}^{T_{L}^{\prime }}}\text{ (lower bound)}\text{.} \end{align} }[/math]
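A minimal sketch of the time-bound calculation follows. The function name `time_bounds` is hypothetical. The estimates are those quoted in Example 5 of this section, and the variances are an assumption based on the standard complete-data result ( [math]\displaystyle{ \widehat{\sigma }_{{{T}'}}^{2}/N }[/math] and [math]\displaystyle{ \widehat{\sigma }_{{{T}'}}^{2}/(2N) }[/math] , zero covariance).

```python
import math
from statistics import NormalDist

def time_bounds(unreliability, mu_hat, sigma_hat, var_mu, var_sigma, cov, delta, two_sided=True):
    """Return (T_L, T_hat, T_U) in original time units for a given unreliability F."""
    std_normal = NormalDist()
    alpha = (1.0 - delta) / 2.0 if two_sided else 1.0 - delta
    k = std_normal.inv_cdf(1.0 - alpha)

    z = std_normal.inv_cdf(unreliability)        # z = Phi^{-1}(F)
    t_prime = mu_hat + z * sigma_hat             # estimate of ln(T)
    var_t_prime = var_mu + z ** 2 * var_sigma + 2.0 * z * cov
    half_width = k * math.sqrt(var_t_prime)

    # Bounds are formed on ln(T) and then transformed back with exp()
    return (math.exp(t_prime - half_width), math.exp(t_prime), math.exp(t_prime + half_width))

# Estimates quoted in Example 5; variances assume complete data with N = 5
mu_hat, sigma_hat, n = 4.2926, 0.32361, 5
t_lower, t_hat, t_upper = time_bounds(
    0.20, mu_hat, sigma_hat, sigma_hat ** 2 / n, sigma_hat ** 2 / (2 * n), 0.0, delta=0.75
)
print(f"T_L={t_lower:.3f}  T_hat={t_hat:.3f}  T_U={t_upper:.3f}")
```

Since the interval is symmetric on the log scale, the resulting bounds in time units are asymmetric around the point estimate.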
Example 4
Using the data of Example 2 and assuming a lognormal distribution, estimate the parameters using the MLE method.
Solution to Example 4
In this example we have only complete data. Thus, the partials reduce to:
- [math]\displaystyle{ \begin{align} & \frac{\partial \Lambda }{\partial {\mu }'}= & \frac{1}{\sigma _{{{T}'}}^{2}}\cdot \underset{i=1}{\overset{14}{\mathop \sum }}\,\left( \ln ({{T}_{i}})-{\mu }' \right)=0 \\ & \frac{\partial \Lambda }{\partial {{\sigma }_{{{T}'}}}}= & \underset{i=1}{\overset{14}{\mathop \sum }}\,\left( \frac{{{\left( \ln ({{T}_{i}})-{\mu }' \right)}^{2}}}{\sigma _{{{T}'}}^{3}}-\frac{1}{{{\sigma }_{{{T}'}}}} \right)=0 \end{align} }[/math]
Substituting the values of [math]\displaystyle{ {{T}_{i}} }[/math] and solving the above system simultaneously, we get:
- [math]\displaystyle{ \begin{align} & {{{\hat{\sigma }}}_{{{T}'}}}= & 0.849 \\ & {{{\hat{\mu }}}^{\prime }}= & 3.516 \end{align} }[/math]
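For complete data, setting the two partial derivatives to zero also admits a closed-form solution. The expressions below are the standard maximum likelihood result for a normal sample, applied here to the logarithms of the times; they are offered as a cross-check and are not stated in the original example. Here [math]\displaystyle{ N=14 }[/math] .
- [math]\displaystyle{ \begin{align} & {{{\hat{\mu }}}^{\prime }}= & \frac{1}{N}\underset{i=1}{\overset{N}{\mathop \sum }}\,\ln ({{T}_{i}}) \\ & {{{\hat{\sigma }}}_{{{T}'}}}= & \sqrt{\frac{1}{N}\underset{i=1}{\overset{N}{\mathop \sum }}\,{{\left( \ln ({{T}_{i}})-{{{\hat{\mu }}}^{\prime }} \right)}^{2}}} \end{align} }[/math]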
Using Eqns. (mean) and (sdv) we get:
- [math]\displaystyle{ \overline{T}=\hat{\mu }=48.25\text{ hours} }[/math]
- and:
- [math]\displaystyle{ {{\hat{\sigma }}_{T}}=49.61\text{ hours}. }[/math]
The variance/covariance matrix is given by:
- [math]\displaystyle{ \left[ \begin{matrix} \widehat{Var}\left( {{{\hat{\mu }}}^{\prime }} \right)=0.0515 & {} & \widehat{Cov}\left( {{{\hat{\mu }}}^{\prime }},{{{\hat{\sigma }}}_{{{T}'}}} \right)=0.0000 \\ {} & {} & {} \\ \widehat{Cov}\left( {{{\hat{\mu }}}^{\prime }},{{{\hat{\sigma }}}_{{{T}'}}} \right)=0.0000 & {} & \widehat{Var}\left( {{{\hat{\sigma }}}_{{{T}'}}} \right)=0.0258 \\ \end{matrix} \right] }[/math]
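The figures in this example can be cross-checked with a short script. This is a sketch under two assumptions: that Eqns. (mean) and (sdv) are the standard lognormal mean and standard deviation formulas, and that for complete data the inverse Fisher matrix reduces to the diagonal form [math]\displaystyle{ \widehat{\sigma }_{{{T}'}}^{2}/N }[/math] and [math]\displaystyle{ \widehat{\sigma }_{{{T}'}}^{2}/(2N) }[/math] . Because the inputs are the rounded estimates quoted above, the results agree with the quoted values only to within rounding.

```python
import math

n = 14                 # number of complete observations (upper limit of the sums above)
mu_prime = 3.516       # estimated mean of ln(T)
sigma_prime = 0.849    # estimated standard deviation of ln(T)

# Standard lognormal mean and standard deviation in original time units (assumed formulas)
mean_t = math.exp(mu_prime + sigma_prime ** 2 / 2.0)
std_t = math.sqrt((math.exp(sigma_prime ** 2) - 1.0) * math.exp(2.0 * mu_prime + sigma_prime ** 2))

# Complete-data variances of the estimates; the covariance is zero in this case
var_mu = sigma_prime ** 2 / n
var_sigma = sigma_prime ** 2 / (2.0 * n)

print(f"mean = {mean_t:.2f} h, std = {std_t:.2f} h")
print(f"Var(mu') = {var_mu:.4f}, Var(sigma) = {var_sigma:.4f}")
```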
Note About Bias
See the discussion regarding bias with the normal distribution in Chapter 8 for information regarding parameter bias in the lognormal distribution.
Likelihood Ratio Confidence Bounds
Bounds on Parameters
As covered in Chapter 5, the likelihood confidence bounds are calculated by finding values for [math]\displaystyle{ {{\theta }_{1}} }[/math] and [math]\displaystyle{ {{\theta }_{2}} }[/math] that satisfy:
- [math]\displaystyle{ -2\cdot \text{ln}\left( \frac{L({{\theta }_{1}},{{\theta }_{2}})}{L({{\widehat{\theta }}_{1}},{{\widehat{\theta }}_{2}})} \right)=\chi _{\alpha ;1}^{2} }[/math]
This equation can be rewritten as:
- [math]\displaystyle{ L({{\theta }_{1}},{{\theta }_{2}})=L({{\widehat{\theta }}_{1}},{{\widehat{\theta }}_{2}})\cdot {{e}^{\tfrac{-\chi _{\alpha ;1}^{2}}{2}}} }[/math]
For complete data, the likelihood formula for the lognormal distribution is given by:
- [math]\displaystyle{ L({\mu }',{{\sigma }_{{{T}'}}})=\underset{i=1}{\overset{N}{\mathop \prod }}\,f({{x}_{i}};{\mu }',{{\sigma }_{{{T}'}}})=\underset{i=1}{\overset{N}{\mathop \prod }}\,\frac{1}{{{x}_{i}}\cdot {{\sigma }_{{{T}'}}}\cdot \sqrt{2\pi }}\cdot {{e}^{-\tfrac{1}{2}{{\left( \tfrac{\text{ln}({{x}_{i}})-{\mu }'}{{{\sigma }_{{{T}'}}}} \right)}^{2}}}} }[/math]
where the [math]\displaystyle{ {{x}_{i}} }[/math] values represent the original time-to-failure data. For a given value of [math]\displaystyle{ \alpha }[/math] , values for [math]\displaystyle{ {\mu }' }[/math] and [math]\displaystyle{ {{\sigma }_{{{T}'}}} }[/math] can be found which represent the maximum and minimum values that satisfy Eqn. (lratio3). These represent the confidence bounds for the parameters at a confidence level [math]\displaystyle{ \delta , }[/math] where [math]\displaystyle{ \alpha =\delta }[/math] for two-sided bounds and [math]\displaystyle{ \alpha =2\delta -1 }[/math] for one-sided.
Example 5
Five units are put on a reliability test and experience failures at 45, 60, 75, 90, and 115 hours. Assuming a lognormal distribution, the MLE parameter estimates are calculated to be [math]\displaystyle{ {{\widehat{\mu }}^{\prime }}=4.2926 }[/math] and [math]\displaystyle{ {{\widehat{\sigma }}_{{{T}'}}}=0.32361. }[/math] Calculate the two-sided 75% confidence bounds on these parameters using the likelihood ratio method.
Solution to Example 5
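The first step is to calculate the likelihood function for the parameter estimates:
- [math]\displaystyle{ \begin{align} L({{\widehat{\mu }}^{\prime }},{{\widehat{\sigma }}_{{{T}'}}})= & \underset{i=1}{\overset{N}{\mathop \prod }}\,\frac{1}{{{x}_{i}}\cdot {{\widehat{\sigma }}_{{{T}'}}}\cdot \sqrt{2\pi }}\cdot {{e}^{-\tfrac{1}{2}{{\left( \tfrac{\text{ln}({{x}_{i}})-{{\widehat{\mu }}^{\prime }}}{{{\widehat{\sigma }}_{{{T}'}}}} \right)}^{2}}}} \\ = & \underset{i=1}{\overset{5}{\mathop \prod }}\,\frac{1}{{{x}_{i}}\cdot 0.32361\cdot \sqrt{2\pi }}\cdot {{e}^{-\tfrac{1}{2}{{\left( \tfrac{\text{ln}({{x}_{i}})-4.2926}{0.32361} \right)}^{2}}}} \\ = & 1.115256\times {{10}^{-10}} \end{align} }[/math]
where [math]\displaystyle{ {{x}_{i}} }[/math] are the original time-to-failure data points. We can now rearrange Eqn. (lratio3) to the form: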
- [math]\displaystyle{ L({\mu }',{{\sigma }_{{{T}'}}})-L({{\widehat{\mu }}^{\prime }},{{\widehat{\sigma }}_{{{T}'}}})\cdot {{e}^{\tfrac{-\chi _{\alpha ;1}^{2}}{2}}}=0 }[/math]
Since our specified confidence level, [math]\displaystyle{ \delta }[/math] , is 75%, we can calculate the value of the chi-squared statistic, [math]\displaystyle{ \chi _{0.75;1}^{2}=1.323303. }[/math] We can now substitute this information into the equation:
- [math]\displaystyle{ \begin{align} & L({\mu }',{{\sigma }_{{{T}'}}})-L({{\widehat{\mu }}^{\prime }},{{\widehat{\sigma }}_{{{T}'}}})\cdot {{e}^{\tfrac{-\chi _{\alpha ;1}^{2}}{2}}}= & 0 \\ & L({\mu }',{{\sigma }_{{{T}'}}})-1.115256\times {{10}^{-10}}\cdot {{e}^{\tfrac{-1.323303}{2}}}= & 0 \\ & L({\mu }',{{\sigma }_{{{T}'}}})-5.754703\times {{10}^{-11}}= & 0 \end{align} }[/math]
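The numbers in this step can be reproduced with the short sketch below. It recomputes the ML estimates from the five failure times using the closed-form complete-data solution, evaluates the likelihood at those estimates, and forms the target likelihood value. The chi-squared quantile with one degree of freedom is obtained as the square of a standard normal quantile, which avoids a dependency outside the standard library; all variable names are illustrative.

```python
import math
from statistics import NormalDist

failures = [45, 60, 75, 90, 115]             # hours, from Example 5
logs = [math.log(x) for x in failures]
n = len(logs)

# Closed-form ML estimates for complete data
mu_hat = sum(logs) / n
sigma_hat = math.sqrt(sum((v - mu_hat) ** 2 for v in logs) / n)

def likelihood(mu, sigma):
    """Lognormal likelihood of the complete sample."""
    value = 1.0
    for x in failures:
        z = (math.log(x) - mu) / sigma
        value *= math.exp(-0.5 * z * z) / (x * sigma * math.sqrt(2.0 * math.pi))
    return value

# chi2(p; 1 dof) = [Phi^{-1}((1 + p) / 2)]^2
chi2 = NormalDist().inv_cdf((1.0 + 0.75) / 2.0) ** 2

l_hat = likelihood(mu_hat, sigma_hat)
target = l_hat * math.exp(-chi2 / 2.0)
print(f"mu'={mu_hat:.4f} sigma={sigma_hat:.5f} chi2={chi2:.6f}")
print(f"L_hat={l_hat:.6e} target={target:.6e}")
```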
It now remains to find the values of [math]\displaystyle{ {\mu }' }[/math] and [math]\displaystyle{ {{\sigma }_{{{T}'}}} }[/math] which satisfy this equation. This is an iterative process that requires setting the value of [math]\displaystyle{ {{\sigma }_{{{T}'}}} }[/math] and finding the appropriate values of [math]\displaystyle{ {\mu }' }[/math] , and vice versa.
The following table gives the values of [math]\displaystyle{ {\mu }' }[/math] based on given values of [math]\displaystyle{ {{\sigma }_{{{T}'}}} }[/math] .
These points are represented graphically in the following contour plot:
(Note that this plot is generated with degrees of freedom [math]\displaystyle{ k=1 }[/math] , as we are only determining bounds on one parameter. The contour plots generated in Weibull++ are done with degrees of freedom [math]\displaystyle{ k=2 }[/math] , for use in comparing both parameters simultaneously.)
As can be determined from the table, the lowest calculated value for [math]\displaystyle{ {\mu }' }[/math] is 4.1145, while the highest is 4.4708. These represent the two-sided 75% confidence limits on this parameter. Since solutions for the equation do not exist for values of [math]\displaystyle{ {{\sigma }_{{{T}'}}} }[/math] below 0.24 or above 0.48, these can be considered approximate two-sided 75% confidence limits for this parameter. In order to obtain more accurate values for the confidence limits on [math]\displaystyle{ {{\sigma }_{{{T}'}}} }[/math] , we can perform the same procedure as before, but finding the two values of [math]\displaystyle{ {{\sigma }_{{{T}'}}} }[/math] that correspond with a given value of [math]\displaystyle{ {\mu }'. }[/math] Using this method, we find that the 75% confidence limits on [math]\displaystyle{ {{\sigma }_{{{T}'}}} }[/math] are 0.23405 and 0.48936, which are close to the initial estimates of 0.24 and 0.48.
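The search described in this example can be sketched numerically. For complete data, the likelihood ratio equation can be rearranged algebraically to give the squared offset [math]\displaystyle{ {{({\mu }'-{{\widehat{\mu }}^{\prime }})}^{2}} }[/math] on the contour for any chosen [math]\displaystyle{ {{\sigma }_{{{T}'}}} }[/math] ; the code below uses that rearrangement (an assumption of this sketch, not a formula from the original text) and scans a grid of [math]\displaystyle{ {{\sigma }_{{{T}'}}} }[/math] values. The grid range and step are arbitrary illustrative choices.

```python
import math
from statistics import NormalDist

failures = [45, 60, 75, 90, 115]             # hours, from Example 5
logs = [math.log(x) for x in failures]
n = len(logs)
mu_hat = sum(logs) / n
sigma_hat = math.sqrt(sum((v - mu_hat) ** 2 for v in logs) / n)

# Two-sided 75% level, one degree of freedom
chi2 = NormalDist().inv_cdf((1.0 + 0.75) / 2.0) ** 2

def mu_offset_sq(sigma):
    """(mu - mu_hat)^2 on the likelihood-ratio contour; negative if sigma is off the contour."""
    return sigma ** 2 * (chi2 / n + 1.0 - 2.0 * math.log(sigma / sigma_hat)) - sigma_hat ** 2

# Scan sigma from 0.10 to 1.00 in steps of 1e-4 and keep points where a solution exists
on_contour = []
for i in range(9001):
    sigma = 0.10 + i * 1e-4
    d2 = mu_offset_sq(sigma)
    if d2 >= 0.0:
        on_contour.append((sigma, d2))

sigma_lo, sigma_hi = on_contour[0][0], on_contour[-1][0]
d_max = math.sqrt(max(d2 for _, d2 in on_contour))
mu_lo, mu_hi = mu_hat - d_max, mu_hat + d_max

print(f"mu' limits:   {mu_lo:.4f} to {mu_hi:.4f}")
print(f"sigma limits: {sigma_lo:.4f} to {sigma_hi:.4f}")
```

The limits obtained this way agree with the values quoted in the example to about three decimal places; the resolution of the limits on [math]\displaystyle{ {{\sigma }_{{{T}'}}} }[/math] is set by the grid step.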
Bounds on Time and Reliability
In order to calculate the bounds on a time estimate for a given reliability, or on a reliability estimate for a given time, the likelihood function needs to be rewritten in terms of one parameter and time/reliability, so that the maximum and minimum values of the time can be observed as the parameter is varied. This can be accomplished by substituting a form of the lognormal reliability equation into the likelihood function. The lognormal reliability equation can be written as:
- [math]\displaystyle{ R=1-\Phi \left( \frac{\text{ln}(t)-{\mu }'}{{{\sigma }_{{{T}'}}}} \right) }[/math]
This can be rearranged to the form:
- [math]\displaystyle{ {\mu }'=\text{ln}(t)-{{\sigma }_{{{T}'}}}\cdot {{\Phi }^{-1}}(1-R) }[/math]
where [math]\displaystyle{ {{\Phi }^{-1}} }[/math] is the inverse standard normal. This equation can now be substituted into Eqn. (lognormlikelihood) to produce a likelihood equation in terms of [math]\displaystyle{ {{\sigma }_{{{T}'}}}, }[/math] [math]\displaystyle{ t }[/math] and [math]\displaystyle{ R\ \ : }[/math]
- [math]\displaystyle{ L({{\sigma }_{{{T}'}}},t/R)=\underset{i=1}{\overset{N}{\mathop \prod }}\,\frac{1}{{{x}_{i}}\cdot {{\sigma }_{{{T}'}}}\cdot \sqrt{2\pi }}\cdot {{e}^{-\tfrac{1}{2}{{\left( \tfrac{\text{ln}({{x}_{i}})-\left( \text{ln}(t)-{{\sigma }_{{{T}'}}}\cdot {{\Phi }^{-1}}(1-R) \right)}{{{\sigma }_{{{T}'}}}} \right)}^{2}}}} }[/math]
The unknown variable [math]\displaystyle{ t/R }[/math] depends on what type of bounds are being determined. If one is trying to determine the bounds on time for a given reliability, then [math]\displaystyle{ R }[/math] is a known constant and [math]\displaystyle{ t }[/math] is the unknown variable. Conversely, if one is trying to determine the bounds on reliability for a given time, then [math]\displaystyle{ t }[/math] is a known constant and [math]\displaystyle{ R }[/math] is the unknown variable. Either way, Eqn. (lognormliketr) can be used to solve Eqn. (lratio3) for the values of interest.
Example 6
For the data given in Example 5, determine the two-sided 75% confidence bounds on the time estimate for a reliability of 80%. The ML estimate for the time at [math]\displaystyle{ R(t)=80\% }[/math] is 55.718.
Solution to Example 6
In this example, we are trying to determine the two-sided 75% confidence bounds on the time estimate of 55.718. This is accomplished by substituting [math]\displaystyle{ R=0.80 }[/math] and [math]\displaystyle{ \alpha =0.75 }[/math] into Eqn. (lognormliketr), and varying [math]\displaystyle{ {{\sigma }_{{{T}'}}} }[/math] until the maximum and minimum values of [math]\displaystyle{ t }[/math] are found. The following table gives the values of [math]\displaystyle{ t }[/math] based on given values of [math]\displaystyle{ {{\sigma }_{{{T}'}}} }[/math] .
This data set is represented graphically in the following contour plot:
As can be determined from the table, the lowest calculated value for [math]\displaystyle{ t }[/math] is 43.634, while the highest is 66.085. These represent the two-sided 75% confidence limits on the time at which reliability is equal to 80%.
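The search in Example 6 can be sketched as follows. Rather than substituting into the likelihood and solving for [math]\displaystyle{ t }[/math] directly, this sketch walks along the same likelihood ratio contour in [math]\displaystyle{ ({\mu }',{{\sigma }_{{{T}'}}}) }[/math] , using an algebraic rearrangement that holds for complete data (an assumption of this sketch), and evaluates [math]\displaystyle{ t=\exp ({\mu }'+{{\sigma }_{{{T}'}}}\cdot {{\Phi }^{-1}}(1-R)) }[/math] at each point. The grid range and step are illustrative choices.

```python
import math
from statistics import NormalDist

failures = [45, 60, 75, 90, 115]             # hours, from Example 5
logs = [math.log(x) for x in failures]
n = len(logs)
mu_hat = sum(logs) / n
sigma_hat = math.sqrt(sum((v - mu_hat) ** 2 for v in logs) / n)

std_normal = NormalDist()
chi2 = std_normal.inv_cdf((1.0 + 0.75) / 2.0) ** 2   # two-sided 75%, 1 dof
reliability = 0.80
z_r = std_normal.inv_cdf(1.0 - reliability)          # Phi^{-1}(1 - R)

def mu_offset_sq(sigma):
    """(mu - mu_hat)^2 on the likelihood-ratio contour; negative if no solution exists."""
    return sigma ** 2 * (chi2 / n + 1.0 - 2.0 * math.log(sigma / sigma_hat)) - sigma_hat ** 2

times = []
for i in range(9001):                                # sigma from 0.10 to 1.00
    sigma = 0.10 + i * 1e-4
    d2 = mu_offset_sq(sigma)
    if d2 < 0.0:
        continue
    d = math.sqrt(d2)
    for mu in (mu_hat - d, mu_hat + d):              # both branches of the contour
        times.append(math.exp(mu + sigma * z_r))

t_lo, t_hi = min(times), max(times)
t_hat = math.exp(mu_hat + sigma_hat * z_r)
print(f"t_hat={t_hat:.3f}  bounds: {t_lo:.3f} to {t_hi:.3f}")
```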
Example 7
For the data given in Example 5, determine the two-sided 75% confidence bounds on the reliability estimate for [math]\displaystyle{ t=65 }[/math] . The ML estimate for the reliability at [math]\displaystyle{ t=65 }[/math] is 64.261%.
Solution to Example 7
In this example, we are trying to determine the two-sided 75% confidence bounds on the reliability estimate of 64.261%. This is accomplished by substituting [math]\displaystyle{ t=65 }[/math] and [math]\displaystyle{ \alpha =0.75 }[/math] into Eqn. (lognormliketr), and varying [math]\displaystyle{ {{\sigma }_{{{T}'}}} }[/math] until the maximum and minimum values of [math]\displaystyle{ R }[/math] are found. The following table gives the values of [math]\displaystyle{ R }[/math] based on given values of [math]\displaystyle{ {{\sigma }_{{{T}'}}} }[/math] .
This data set is represented graphically in the following contour plot:
As can be determined from the table, the lowest calculated value for [math]\displaystyle{ R }[/math] is 43.444%, while the highest is 81.508%. These represent the two-sided 75% confidence limits on the reliability at [math]\displaystyle{ t=65 }[/math] .
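The search in Example 7 can be sketched in the same way. The code walks along the likelihood ratio contour in [math]\displaystyle{ ({\mu }',{{\sigma }_{{{T}'}}}) }[/math] , using an algebraic rearrangement valid for complete data (an assumption of this sketch), and evaluates the reliability at [math]\displaystyle{ t=65 }[/math] for each point. The grid range and step are illustrative choices.

```python
import math
from statistics import NormalDist

failures = [45, 60, 75, 90, 115]             # hours, from Example 5
logs = [math.log(x) for x in failures]
n = len(logs)
mu_hat = sum(logs) / n
sigma_hat = math.sqrt(sum((v - mu_hat) ** 2 for v in logs) / n)

std_normal = NormalDist()
chi2 = std_normal.inv_cdf((1.0 + 0.75) / 2.0) ** 2   # two-sided 75%, 1 dof
t = 65.0

def reliability_at(mu, sigma):
    """Lognormal reliability R = 1 - Phi((ln t - mu) / sigma) at the fixed time t."""
    return 1.0 - std_normal.cdf((math.log(t) - mu) / sigma)

def mu_offset_sq(sigma):
    """(mu - mu_hat)^2 on the likelihood-ratio contour; negative if no solution exists."""
    return sigma ** 2 * (chi2 / n + 1.0 - 2.0 * math.log(sigma / sigma_hat)) - sigma_hat ** 2

r_values = []
for i in range(9001):                                # sigma from 0.10 to 1.00
    sigma = 0.10 + i * 1e-4
    d2 = mu_offset_sq(sigma)
    if d2 < 0.0:
        continue
    d = math.sqrt(d2)
    for mu in (mu_hat - d, mu_hat + d):              # both branches of the contour
        r_values.append(reliability_at(mu, sigma))

r_lo, r_hi = min(r_values), max(r_values)
r_hat = reliability_at(mu_hat, sigma_hat)
print(f"R_hat={r_hat:.4f}  bounds: {r_lo:.4f} to {r_hi:.4f}")
```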