The Exponential Distribution: Difference between revisions
Kate Racaza (talk | contribs) No edit summary |
Kate Racaza (talk | contribs) No edit summary |
||
Line 240: | Line 240: | ||
'''The Correlation Coefficient''' | '''The Correlation Coefficient''' | ||
The estimator of <math>\rho </math> is the sample correlation coefficient, <math>\hat{\rho }</math>, given by: | The estimator of <math>\rho </math> is the sample correlation coefficient, <math>\hat{\rho }</math>, given by: | ||
Line 250: | Line 249: | ||
===Rank Regression on X for Exponential Distribution=== | |||
Similar to rank regression on Y, performing a rank regression on X requires that a straight line be fitted to a set of data points such that the sum of the squares of the horizontal deviations from the points to the line is minimized. | |||
Again the first task is to bring our exponential <math>cdf</math> function into a linear form. This step is exactly the same as in regression on Y analysis. The deviation from the previous analysis begins on the least squares fit step, since in this case we treat <math>x</math> as the dependent variable and <math>y</math> as the independent variable. The best-fitting straight line to the data, for regression on X (see Chapter [[Parameter Estimation]]), is the straight line: | |||
::<math>x=\hat{a}+\hat{b}y</math> | |||
The corresponding equations for <math>\hat{a}</math> and <math>\hat{b}</math> are: | |||
::<math>\hat{a}=\overline{x}-\hat{b}\overline{y}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}}{N}-\hat{b}\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}</math> | |||
and: | |||
::<math>\hat{b}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}-\tfrac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,y_{i}^{2}-\tfrac{{{\left( \underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}} \right)}^{2}}}{N}}</math> | |||
where: | |||
::<math>{{y}_{i}}=\ln [1-F({{t}_{i}})]</math> | |||
and: | |||
::<math>{{x}_{i}}={{t}_{i}}</math> | |||
The values of <math>F({{t}_{i}})</math> are estimated from the median ranks. Once <math>\hat{a}</math> and <math>\hat{b}</math> are obtained, solve for the unknown <math>y</math> value, which corresponds to: | |||
::<math>y=-\frac{\hat{a}}{\hat{b}}+\frac{1}{\hat{b}}x</math> | |||
Solving for the parameters from above equations we get: | |||
::<math>a=-\frac{\hat{a}}{\hat{b}}=\lambda \gamma \Rightarrow \gamma =\hat{a}</math> | |||
and: | |||
::<math>b=\frac{1}{\hat{b}}=-\lambda \Rightarrow \lambda =-\frac{1}{\hat{b}}</math> | |||
For the one-parameter exponential case, equations for estimating a and b become: | |||
::<math>\begin{align} | |||
\hat{a}= & 0 \\ | |||
\hat{b}= & \frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,y_{i}^{2}} | |||
\end{align}</math> | |||
The correlation coefficient is evaluated as before. | |||
'''Example 3:''' | |||
{{Example: 2 Parameter Exponential Distribution RRX}} | |||
===Maximum Likelihood Estimation for Exponential Distribution=== | |||
As outlined in Chapter [[Parameter Estimation]], maximum likelihood estimation works by developing a likelihood function based on the available data and finding the values of the parameter estimates that maximize the likelihood function. This can be achieved by using iterative methods to determine the parameter estimate values that maximize the likelihood function. This can be rather difficult and time-consuming, particularly when dealing with the three-parameter distribution. Another method of finding the parameter estimates involves taking the partial derivatives of the likelihood equation with respect to the parameters, setting the resulting equations equal to zero, and solving simultaneously to determine the values of the parameter estimates. The log-likelihood functions and associated partial derivatives used to determine maximum likelihood estimates for the exponential distribution are covered in [[Appendix: Distribution Log-Likelihood Equations|Appendix]]. | |||
'''Example 4:''' | |||
{{Example: MLE for Exponential Distribution}} | |||
==Confidence Bounds== | |||
In this section, we present the methods used in the application to estimate the different types of confidence bounds for exponentially distributed data. The complete derivations were presented in detail (for a general function) in the chapter for [[Confidence Bounds]]. | |||
At this time we should point out that exact confidence bounds for the exponential distribution have been derived, and exist in a closed form, utilizing the <math>{{\chi }^{2}}</math> distribution. These are described in detail in [[Appendix: Weibull References|Kececioglu [20]]], and are covered in the section in the [[Reliability Test Design|test design chapter]]. For most exponential data analyses, Weibull++ will use the approximate confidence bounds, provided from the Fisher information matrix or the likelihood ratio, in order to stay consistent with all of the other available distributions in the application. The <math>{{\chi }^{2}}</math> confidence bounds for the exponential distribution are discussed in more detail in the [[Reliability Test Design|test design chapter]]. | |||
===Fisher Matrix Bounds=== | |||
====Bounds on the Parameters==== | |||
For the failure rate <math>\hat{\lambda }</math> the upper (<math>{{\lambda }_{U}}</math>) and lower (<math>{{\lambda }_{L}}</math>) bounds are estimated by [[Appendix: Weibull References|[30]]]: | |||
::<math>\begin{align} | |||
& {{\lambda }_{U}}= & \hat{\lambda }\cdot {{e}^{\left[ \tfrac{{{K}_{\alpha }}\sqrt{Var(\hat{\lambda })}}{\hat{\lambda }} \right]}} \\ | |||
& & \\ | |||
& {{\lambda }_{L}}= & \frac{\hat{\lambda }}{{{e}^{\left[ \tfrac{{{K}_{\alpha }}\sqrt{Var(\hat{\lambda })}}{\hat{\lambda }} \right]}}} | |||
\end{align}</math> | |||
where <math>{{K}_{\alpha }}</math> is defined by: | |||
::<math>\alpha =\frac{1}{\sqrt{2\pi }}\int_{{{K}_{\alpha }}}^{\infty }{{e}^{-\tfrac{{{t}^{2}}}{2}}}dt=1-\Phi ({{K}_{\alpha }})</math> | |||
If <math>\delta </math> is the confidence level, then <math>\alpha =\tfrac{1-\delta }{2}</math> for the two-sided bounds, and <math>\alpha =1-\delta </math> for the one-sided bounds. | |||
The variance of <math>\hat{\lambda },</math> <math>Var(\hat{\lambda }),</math> is estimated from the Fisher matrix, as follows: | |||
::<math>Var(\hat{\lambda })={{\left( -\frac{{{\partial }^{2}}\Lambda }{\partial {{\lambda }^{2}}} \right)}^{-1}}</math> | |||
where <math>\Lambda </math> is the log-likelihood function of the exponential distribution, described in [[Appendix: Distribution Log-Likelihood Equations|Appendix for log-likelihood function]]. | |||
Note that no true MLE solution exists for the case of the two-parameter exponential distribution. The mathematics simply break down while trying to simultaneously solve the partial derivative equations for both the <math>\gamma </math> and <math>\lambda </math> parameters, resulting in unrealistic conditions. The way around this conundrum involves setting <math>\gamma ={{t}_{1}},</math> or the first time-to-failure, and calculating <math>\lambda </math> in the regular fashion for this methodology. Weibull++ treats <math>\gamma </math> as a constant when computing bounds, i.e. <math>Var(\hat{\gamma })=0.</math> (See the discussion in [[Appendix: Distribution Log-Likelihood Equations|Appendix for log-likelihood function]] for more information.) | |||
{{Bounds on Reliability FMB ED}} | |||
{{Bounds on Time FMB ED}} | |||
{{ | {{Likelihood Ratio Confidence Bounds FMB ED}} | ||
{{Bayesian Confidence Bounds ED}} | {{Bayesian Confidence Bounds ED}} |
Revision as of 10:11, 6 July 2012
The exponential distribution is a commonly used distribution in reliability engineering. Mathematically, it is a fairly simple distribution, which many times leads to its use in inappropriate situations. It is, in fact, a special case of the Weibull distribution where [math]\displaystyle{ \beta =1 }[/math]. The exponential distribution is used to model the behavior of units that have a constant failure rate (or units that do not degrade with time or wear out).
Exponential Probability Density Function
The Two-Parameter Exponential Distribution
The two-parameter exponential pdf is given by:
- [math]\displaystyle{ f(t)=\lambda {{e}^{-\lambda (t-\gamma )}},f(t)\ge 0,\lambda \gt 0,t\ge 0\text{ or }\gamma }[/math]
where [math]\displaystyle{ \gamma }[/math] is the location parameter. Some of the characteristics of the two-parameter exponential distribution are [19]:
- The location parameter, [math]\displaystyle{ \gamma }[/math], if positive, shifts the beginning of the distribution by a distance of [math]\displaystyle{ \gamma }[/math] to the right of the origin, signifying that the chance failures start to occur only after [math]\displaystyle{ \gamma }[/math] hours of operation, and cannot occur before.
- The scale parameter is [math]\displaystyle{ \tfrac{1}{\lambda }=\bar{t}-\gamma =m-\gamma }[/math].
- The exponential [math]\displaystyle{ pdf }[/math] has no shape parameter, as it has only one shape.
- The distribution starts at [math]\displaystyle{ t=\gamma }[/math] at the level of [math]\displaystyle{ f(t=\gamma )=\lambda }[/math] and decreases thereafter exponentially and monotonically as [math]\displaystyle{ t }[/math] increases beyond [math]\displaystyle{ \gamma }[/math] and is convex.
- As [math]\displaystyle{ t\to \infty }[/math], [math]\displaystyle{ f(t)\to 0 }[/math].
The One-Parameter Exponential Distribution
The one-parameter exponential [math]\displaystyle{ pdf }[/math] is obtained by setting [math]\displaystyle{ \gamma =0 }[/math], and is given by:
- [math]\displaystyle{ \begin{align}f(t)= & \lambda {{e}^{-\lambda t}}=\frac{1}{m}{{e}^{-\tfrac{1}{m}t}}, & t\ge 0, \lambda \gt 0,m\gt 0 \end{align} }[/math]
where:
- [math]\displaystyle{ \lambda }[/math] = constant rate, in failures per unit of measurement, e.g failures per hour, per cycle, etc.,
- [math]\displaystyle{ \lambda =\frac{1}{m} }[/math],
- [math]\displaystyle{ m }[/math] = mean time between failures, or to failure,
- [math]\displaystyle{ t }[/math] = operating time, life, or age, in hours, cycles, miles, actuations, etc.
This distribution requires the knowledge of only one parameter, [math]\displaystyle{ \lambda }[/math], for its application. Some of the characteristics of the one-parameter exponential distribution are [19]:
- The location parameter, [math]\displaystyle{ \gamma }[/math], is zero.
- The scale parameter is [math]\displaystyle{ \tfrac{1}{\lambda }=m }[/math].
- As [math]\displaystyle{ \lambda }[/math] is decreased in value, the distribution is stretched out to the right, and as [math]\displaystyle{ \lambda }[/math] is increased, the distribution is pushed toward the origin.
- This distribution has no shape parameter as it has only one shape, i.e. the exponential, and the only parameter it has is the failure rate, [math]\displaystyle{ \lambda }[/math].
- The distribution starts at [math]\displaystyle{ t=0 }[/math] at the level of [math]\displaystyle{ f(t=0)=\lambda }[/math] and decreases thereafter exponentially and monotonically as [math]\displaystyle{ t }[/math] increases, and is convex.
- As [math]\displaystyle{ t\to \infty }[/math] , [math]\displaystyle{ f(t)\to 0 }[/math].
- The [math]\displaystyle{ pdf }[/math] can be thought of as a special case of the Weibull [math]\displaystyle{ pdf }[/math] with [math]\displaystyle{ \gamma =0 }[/math] and [math]\displaystyle{ \beta =1 }[/math].
Exponential Statistical Properties
The Mean or MTTF
The mean, [math]\displaystyle{ \overline{T}, }[/math] or mean time to failure (MTTF) is given by:
- [math]\displaystyle{ \begin{align} \bar{T}= & \int_{\gamma }^{\infty }t\cdot f(t)dt \\ = & \int_{\gamma }^{\infty }t\cdot \lambda \cdot {{e}^{-\lambda t}}dt \\ = & \gamma +\frac{1}{\lambda }=m \end{align} }[/math]
Note that when [math]\displaystyle{ \gamma =0 }[/math], the MTTF is the inverse of the exponential distribution's constant failure rate. This is only true for the exponential distribution. Most other distributions do not have a constant failure rate. Consequently, the inverse relationship between failure rate and MTTF does not hold for these other distributions.
The Median
The median, [math]\displaystyle{ \breve{T}, }[/math] is:
- [math]\displaystyle{ \breve{T}=\gamma +\frac{1}{\lambda}\cdot 0.693 }[/math]
The Mode
The mode, [math]\displaystyle{ \tilde{T}, }[/math] is:
- [math]\displaystyle{ \tilde{T}=\gamma }[/math]
The Standard Deviation
The standard deviation, [math]\displaystyle{ {\sigma }_{T} }[/math], is:
- [math]\displaystyle{ {\sigma}_{T}=\frac{1}{\lambda }=m }[/math]
The Exponential Reliability Function
The equation for the two-parameter exponential cumulative density function, or [math]\displaystyle{ cdf, }[/math] is given by:
- [math]\displaystyle{ F(t)=Q(t)=1-{{e}^{-\lambda (t-\gamma )}} }[/math]
Recalling that the reliability function of a distribution is simply one minus the [math]\displaystyle{ cdf }[/math], the reliability function of the two-parameter exponential distribution is given by:
- [math]\displaystyle{ R(t)=1-Q(t)=1-\int_{0}^{t-\gamma }f(x)dx }[/math]
- [math]\displaystyle{ R(t)=1-\int_{0}^{t-\gamma }\lambda {{e}^{-\lambda x}}dx={{e}^{-\lambda (t-\gamma )}} }[/math]
One-Parameter Exponential Reliability Function
The one-parameter exponential reliability function is given by:
- [math]\displaystyle{ R(t)={{e}^{-\lambda t}}={{e}^{-\tfrac{t}{m}}} }[/math]
The Exponential Conditional Reliability
The exponential conditional reliability equation gives the reliability for a mission of [math]\displaystyle{ t }[/math] duration, having already successfully accumulated [math]\displaystyle{ T }[/math] hours of operation up to the start of this new mission. The exponential conditional reliability function is:
- [math]\displaystyle{ R(t|T)=\frac{R(T+t)}{R(T)}=\frac{{{e}^{-\lambda (T+t-\gamma )}}}{{{e}^{-\lambda (T-\gamma )}}}={{e}^{-\lambda t}} }[/math]
which says that the reliability for a mission of [math]\displaystyle{ t }[/math] duration undertaken after the component or equipment has already accumulated [math]\displaystyle{ T }[/math] hours of operation from age zero is only a function of the mission duration, and not a function of the age at the beginning of the mission. This is referred to as the memoryless property.
The Exponential Reliable Life
The reliable life, or the mission duration for a desired reliability goal, [math]\displaystyle{ {{t}_{R}} }[/math], for the one-parameter exponential distribution is:
- [math]\displaystyle{ R({{t}_{R}})={{e}^{-\lambda ({{t}_{R}}-\gamma )}} }[/math]
- [math]\displaystyle{ \ln[R({{t}_{R}})]=-\lambda({{t}_{R}}-\gamma ) }[/math]
or:
- [math]\displaystyle{ {{t}_{R}}=\gamma -\frac{\ln [R({{t}_{R}})]}{\lambda } }[/math]
The Exponential Failure Rate Function
The exponential failure rate function is:
- [math]\displaystyle{ \lambda (t)=\frac{f(t)}{R(t)}=\frac{\lambda {{e}^{-\lambda (t-\gamma )}}}{{{e}^{-\lambda (t-\gamma )}}}=\lambda =\text{constant} }[/math]
Once again, note that the constant failure rate is a characteristic of the exponential distribution, and special cases of other distributions only. Most other distributions have failure rates that are functions of time.
Characteristics of the Exponential Distribution
As mentioned before, the primary trait of the exponential distribution is that it is used for modeling the behavior of items with a constant failure rate. It has a fairly simple mathematical form, which makes it fairly easy to manipulate. Unfortunately, this fact also leads to the use of this model in situations where it is not appropriate. For example, it would not be appropriate to use the exponential distribution to model the reliability of an automobile. The constant failure rate of the exponential distribution would require the assumption that the automobile would be just as likely to experience a breakdown during the first mile as it would during the one-hundred-thousandth mile. Clearly, this is not a valid assumption. However, some inexperienced practitioners of reliability engineering and life data analysis will overlook this fact, lured by the siren-call of the exponential distribution's relatively simple mathematical models.
The Effect of [math]\displaystyle{ \lambda }[/math] and [math]\displaystyle{ \gamma }[/math] on the Exponential [math]\displaystyle{ pdf }[/math]
- The exponential [math]\displaystyle{ pdf }[/math] has no shape parameter, as it has only one shape.
- The exponential [math]\displaystyle{ pdf }[/math] is always convex and is stretched to the right as [math]\displaystyle{ \lambda }[/math] decreases in value.
- The value of the [math]\displaystyle{ pdf }[/math] function is always equal to the value of [math]\displaystyle{ \lambda }[/math] at [math]\displaystyle{ t=0 }[/math] (or [math]\displaystyle{ t=\gamma }[/math]).
- The location parameter, [math]\displaystyle{ \gamma }[/math], if positive, shifts the beginning of the distribution by a distance of [math]\displaystyle{ \gamma }[/math] to the right of the origin, signifying that the chance failures start to occur only after [math]\displaystyle{ \gamma }[/math] hours of operation, and cannot occur before this time.
- The scale parameter is [math]\displaystyle{ \tfrac{1}{\lambda }=\bar{T}-\gamma =m-\gamma }[/math].
- As [math]\displaystyle{ t\to \infty }[/math], [math]\displaystyle{ f(t)\to 0 }[/math].
The Effect of [math]\displaystyle{ \lambda }[/math] and [math]\displaystyle{ \gamma }[/math] on the Exponential Reliability Function
- The one-parameter exponential reliability function starts at the value of 100% at [math]\displaystyle{ t=0 }[/math], decreases thereafter monotonically and is convex.
- The two-parameter exponential reliability function remains at the value of 100% for [math]\displaystyle{ t=0 }[/math] up to [math]\displaystyle{ t=\gamma }[/math], and decreases thereafter monotonically and is convex.
- As [math]\displaystyle{ t\to \infty }[/math] , [math]\displaystyle{ R(t\to \infty )\to 0 }[/math].
- The reliability for a mission duration of [math]\displaystyle{ t=m=\tfrac{1}{\lambda } }[/math], or of one MTTF duration, is always equal to [math]\displaystyle{ 0.3679 }[/math] or 36.79%. This means that the reliability for a mission which is as long as one MTTF is relatively low and is not recommended because only 36.8% of the missions will be completed successfully. In other words, of the equipment undertaking such a mission, only 36.8% will survive their mission.
Estimation of the Exponential Parameters
Probability Plotting
Estimation of the parameters for the exponential distribution via probability plotting is very similar to the process used when dealing with the Weibull distribution. Recall, however, that the appearance of the probability plotting paper and the methods by which the parameters are estimated vary from distribution to distribution, so there will be some noticeable differences. In fact, due to the nature of the exponential [math]\displaystyle{ cdf }[/math], the exponential probability plot is the only one with a negative slope. This is because the y-axis of the exponential probability plotting paper represents the reliability, whereas the y-axis for most of the other life distributions represents the unreliability.
This is illustrated in the process of linearizing the [math]\displaystyle{ cdf }[/math], which is necessary to construct the exponential probability plotting paper. For the two-parameter exponential distribution the cumulative density function is given by:
- [math]\displaystyle{ F(t)=1-{{e}^{-\lambda (t-\gamma )}} }[/math]
Taking the natural logarithm of both sides of the above equation yields:
- [math]\displaystyle{ \ln \left[ 1-F(t) \right]=-\lambda (t-\gamma ) }[/math]
or:
- [math]\displaystyle{ \ln [1-F(t)]=\lambda \gamma -\lambda t }[/math]
Now, let:
- [math]\displaystyle{ y=\ln [1-F(t)] }[/math]
- [math]\displaystyle{ a=\lambda \gamma }[/math]
and:
- [math]\displaystyle{ b=-\lambda }[/math]
which results in the linear equation of:
- [math]\displaystyle{ y=a+bt }[/math]
Note that with the exponential probability plotting paper, the y-axis scale is logarithmic and the x-axis scale is linear. This means that the zero value is present only on the x-axis. For [math]\displaystyle{ t=0 }[/math], [math]\displaystyle{ R=1 }[/math] and [math]\displaystyle{ F(t)=0 }[/math]. So if we were to use [math]\displaystyle{ F(t) }[/math] for the y-axis, we would have to plot the point [math]\displaystyle{ (0,0) }[/math]. However, since the y-axis is logarithmic, there is no place to plot this on the exponential paper. Also, the failure rate, [math]\displaystyle{ \lambda }[/math], is the negative of the slope of the line, but there is an easier way to determine the value of [math]\displaystyle{ \lambda }[/math] from the probability plot, as will be illustrated in the following example.
Example 1:
1-Parameter Exponential Probability Plot Example
6 units are put on a life test and tested to failure. The failure times are 7, 12, 19, 29, 41, and 67 hours. Estimate the failure rate for a 1-parameter exponential distribution using the probability plotting method.
In order to plot the points for the probability plot, the appropriate reliability estimate values must be obtained. These will be equivalent to [math]\displaystyle{ 100%-MR\,\! }[/math] since the y-axis represents the reliability and the [math]\displaystyle{ MR\,\! }[/math] values represent unreliability estimates.
Next, these points are plotted on an exponential probability plotting paper. A sample of this type of plotting paper is shown next, with the sample points in place. Notice how these points describe a line with a negative slope.
Once the points are plotted, draw the best possible straight line through these points. The time value at which this line intersects with a horizontal line drawn at the 36.8% reliability mark is the mean life, and the reciprocal of this is the failure rate [math]\displaystyle{ \lambda\,\! }[/math]. This is because at [math]\displaystyle{ t=m=\tfrac{1}{\lambda }\,\! }[/math]:
- [math]\displaystyle{ \begin{align} R(t)= & {{e}^{-\lambda \cdot t}} \\ R(t)= & {{e}^{-\lambda \cdot \tfrac{1}{\lambda }}} \\ R(t)= & {{e}^{-1}}=0.368=36.8%. \end{align}\,\! }[/math]
The following plot shows that the best-fit line through the data points crosses the [math]\displaystyle{ R=36.8%\,\! }[/math] line at [math]\displaystyle{ t=33\,\! }[/math] hours. And because [math]\displaystyle{ \tfrac{1}{\lambda }=33\,\! }[/math] hours, [math]\displaystyle{ \lambda =0.0303\,\! }[/math] failures/hour.
Rank Regression on Y for Exponential Distribution
Performing a rank regression on Y requires that a straight line be fitted to the set of available data points such that the sum of the squares of the vertical deviations from the points to the line is minimized. The least squares parameter estimation method (regression analysis) was discussed in Chapter Parameter Estimation, and the following equations for rank regression on Y (RRY) were derived:
- [math]\displaystyle{ \hat{a}=\bar{y}-\hat{b}\bar{x}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}-\hat{b}\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}}{N} }[/math]
and:
- [math]\displaystyle{ \hat{b}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}-\tfrac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,x_{i}^{2}-\tfrac{{{\left( \underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}} \right)}^{2}}}{N}} }[/math]
In our case, the equations for [math]\displaystyle{ {{y}_{i}} }[/math] and [math]\displaystyle{ {{x}_{i}} }[/math] are:
- [math]\displaystyle{ {{y}_{i}}=\ln [1-F({{t}_{i}})] }[/math]
and:
- [math]\displaystyle{ {{x}_{i}}={{t}_{i}} }[/math]
and the [math]\displaystyle{ F({{t}_{i}}) }[/math] is estimated from the median ranks. Once [math]\displaystyle{ \hat{a} }[/math] and [math]\displaystyle{ \hat{b} }[/math] are obtained, then [math]\displaystyle{ \hat{\lambda } }[/math] and [math]\displaystyle{ \hat{\gamma } }[/math] can easily be obtained from above equations.
For the one-parameter exponential, equations for estimating a and b become:
- [math]\displaystyle{ \begin{align} \hat{a}= & 0, \\ \hat{b}= & \frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,x_{i}^{2}} \end{align} }[/math]
The Correlation Coefficient
The estimator of [math]\displaystyle{ \rho }[/math] is the sample correlation coefficient, [math]\displaystyle{ \hat{\rho } }[/math], given by:
- [math]\displaystyle{ \hat{\rho }=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,({{x}_{i}}-\overline{x})({{y}_{i}}-\overline{y})}{\sqrt{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{({{x}_{i}}-\overline{x})}^{2}}\cdot \underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{({{y}_{i}}-\overline{y})}^{2}}}} }[/math]
Example 2:
The exponential distribution is a commonly used distribution in reliability engineering. Mathematically, it is a fairly simple distribution, which many times leads to its use in inappropriate situations. It is, in fact, a special case of the Weibull distribution where [math]\displaystyle{ \beta =1 }[/math]. The exponential distribution is used to model the behavior of units that have a constant failure rate (or units that do not degrade with time or wear out).
Exponential Probability Density Function
The Two-Parameter Exponential Distribution
The two-parameter exponential pdf is given by:
- [math]\displaystyle{ f(t)=\lambda {{e}^{-\lambda (t-\gamma )}},f(t)\ge 0,\lambda \gt 0,t\ge 0\text{ or }\gamma }[/math]
where [math]\displaystyle{ \gamma }[/math] is the location parameter. Some of the characteristics of the two-parameter exponential distribution are [19]:
- The location parameter, [math]\displaystyle{ \gamma }[/math], if positive, shifts the beginning of the distribution by a distance of [math]\displaystyle{ \gamma }[/math] to the right of the origin, signifying that the chance failures start to occur only after [math]\displaystyle{ \gamma }[/math] hours of operation, and cannot occur before.
- The scale parameter is [math]\displaystyle{ \tfrac{1}{\lambda }=\bar{t}-\gamma =m-\gamma }[/math].
- The exponential [math]\displaystyle{ pdf }[/math] has no shape parameter, as it has only one shape.
- The distribution starts at [math]\displaystyle{ t=\gamma }[/math] at the level of [math]\displaystyle{ f(t=\gamma )=\lambda }[/math] and decreases thereafter exponentially and monotonically as [math]\displaystyle{ t }[/math] increases beyond [math]\displaystyle{ \gamma }[/math] and is convex.
- As [math]\displaystyle{ t\to \infty }[/math], [math]\displaystyle{ f(t)\to 0 }[/math].
The One-Parameter Exponential Distribution
The one-parameter exponential [math]\displaystyle{ pdf }[/math] is obtained by setting [math]\displaystyle{ \gamma =0 }[/math], and is given by:
- [math]\displaystyle{ \begin{align}f(t)= & \lambda {{e}^{-\lambda t}}=\frac{1}{m}{{e}^{-\tfrac{1}{m}t}}, & t\ge 0, \lambda \gt 0,m\gt 0 \end{align} }[/math]
where:
- [math]\displaystyle{ \lambda }[/math] = constant rate, in failures per unit of measurement, e.g failures per hour, per cycle, etc.,
- [math]\displaystyle{ \lambda =\frac{1}{m} }[/math],
- [math]\displaystyle{ m }[/math] = mean time between failures, or to failure,
- [math]\displaystyle{ t }[/math] = operating time, life, or age, in hours, cycles, miles, actuations, etc.
This distribution requires the knowledge of only one parameter, [math]\displaystyle{ \lambda }[/math], for its application. Some of the characteristics of the one-parameter exponential distribution are [19]:
- The location parameter, [math]\displaystyle{ \gamma }[/math], is zero.
- The scale parameter is [math]\displaystyle{ \tfrac{1}{\lambda }=m }[/math].
- As [math]\displaystyle{ \lambda }[/math] is decreased in value, the distribution is stretched out to the right, and as [math]\displaystyle{ \lambda }[/math] is increased, the distribution is pushed toward the origin.
- This distribution has no shape parameter as it has only one shape, i.e. the exponential, and the only parameter it has is the failure rate, [math]\displaystyle{ \lambda }[/math].
- The distribution starts at [math]\displaystyle{ t=0 }[/math] at the level of [math]\displaystyle{ f(t=0)=\lambda }[/math] and decreases thereafter exponentially and monotonically as [math]\displaystyle{ t }[/math] increases, and is convex.
- As [math]\displaystyle{ t\to \infty }[/math] , [math]\displaystyle{ f(t)\to 0 }[/math].
- The [math]\displaystyle{ pdf }[/math] can be thought of as a special case of the Weibull [math]\displaystyle{ pdf }[/math] with [math]\displaystyle{ \gamma =0 }[/math] and [math]\displaystyle{ \beta =1 }[/math].
Exponential Statistical Properties
The Mean or MTTF
The mean, [math]\displaystyle{ \overline{T}, }[/math] or mean time to failure (MTTF) is given by:
- [math]\displaystyle{ \begin{align} \bar{T}= & \int_{\gamma }^{\infty }t\cdot f(t)dt \\ = & \int_{\gamma }^{\infty }t\cdot \lambda \cdot {{e}^{-\lambda t}}dt \\ = & \gamma +\frac{1}{\lambda }=m \end{align} }[/math]
Note that when [math]\displaystyle{ \gamma =0 }[/math], the MTTF is the inverse of the exponential distribution's constant failure rate. This is only true for the exponential distribution. Most other distributions do not have a constant failure rate. Consequently, the inverse relationship between failure rate and MTTF does not hold for these other distributions.
The Median
The median, [math]\displaystyle{ \breve{T}, }[/math] is:
- [math]\displaystyle{ \breve{T}=\gamma +\frac{1}{\lambda}\cdot 0.693 }[/math]
The Mode
The mode, [math]\displaystyle{ \tilde{T}, }[/math] is:
- [math]\displaystyle{ \tilde{T}=\gamma }[/math]
The Standard Deviation
The standard deviation, [math]\displaystyle{ {\sigma }_{T} }[/math], is:
- [math]\displaystyle{ {\sigma}_{T}=\frac{1}{\lambda }=m }[/math]
The Exponential Reliability Function
The equation for the two-parameter exponential cumulative density function, or [math]\displaystyle{ cdf, }[/math] is given by:
- [math]\displaystyle{ F(t)=Q(t)=1-{{e}^{-\lambda (t-\gamma )}} }[/math]
Recalling that the reliability function of a distribution is simply one minus the [math]\displaystyle{ cdf }[/math], the reliability function of the two-parameter exponential distribution is given by:
- [math]\displaystyle{ R(t)=1-Q(t)=1-\int_{0}^{t-\gamma }f(x)dx }[/math]
- [math]\displaystyle{ R(t)=1-\int_{0}^{t-\gamma }\lambda {{e}^{-\lambda x}}dx={{e}^{-\lambda (t-\gamma )}} }[/math]
One-Parameter Exponential Reliability Function
The one-parameter exponential reliability function is given by:
- [math]\displaystyle{ R(t)={{e}^{-\lambda t}}={{e}^{-\tfrac{t}{m}}} }[/math]
The Exponential Conditional Reliability
The exponential conditional reliability equation gives the reliability for a mission of [math]\displaystyle{ t }[/math] duration, having already successfully accumulated [math]\displaystyle{ T }[/math] hours of operation up to the start of this new mission. The exponential conditional reliability function is:
- [math]\displaystyle{ R(t|T)=\frac{R(T+t)}{R(T)}=\frac{{{e}^{-\lambda (T+t-\gamma )}}}{{{e}^{-\lambda (T-\gamma )}}}={{e}^{-\lambda t}} }[/math]
which says that the reliability for a mission of [math]\displaystyle{ t }[/math] duration undertaken after the component or equipment has already accumulated [math]\displaystyle{ T }[/math] hours of operation from age zero is only a function of the mission duration, and not a function of the age at the beginning of the mission. This is referred to as the memoryless property.
The Exponential Reliable Life
The reliable life, or the mission duration for a desired reliability goal, [math]\displaystyle{ {{t}_{R}} }[/math], for the one-parameter exponential distribution is:
- [math]\displaystyle{ R({{t}_{R}})={{e}^{-\lambda ({{t}_{R}}-\gamma )}} }[/math]
- [math]\displaystyle{ \ln[R({{t}_{R}})]=-\lambda({{t}_{R}}-\gamma ) }[/math]
or:
- [math]\displaystyle{ {{t}_{R}}=\gamma -\frac{\ln [R({{t}_{R}})]}{\lambda } }[/math]
The Exponential Failure Rate Function
The exponential failure rate function is:
- [math]\displaystyle{ \lambda (t)=\frac{f(t)}{R(t)}=\frac{\lambda {{e}^{-\lambda (t-\gamma )}}}{{{e}^{-\lambda (t-\gamma )}}}=\lambda =\text{constant} }[/math]
Once again, note that the constant failure rate is a characteristic of the exponential distribution, and special cases of other distributions only. Most other distributions have failure rates that are functions of time.
Characteristics of the Exponential Distribution
As mentioned before, the primary trait of the exponential distribution is that it is used for modeling the behavior of items with a constant failure rate. It has a fairly simple mathematical form, which makes it fairly easy to manipulate. Unfortunately, this fact also leads to the use of this model in situations where it is not appropriate. For example, it would not be appropriate to use the exponential distribution to model the reliability of an automobile. The constant failure rate of the exponential distribution would require the assumption that the automobile would be just as likely to experience a breakdown during the first mile as it would during the one-hundred-thousandth mile. Clearly, this is not a valid assumption. However, some inexperienced practitioners of reliability engineering and life data analysis will overlook this fact, lured by the siren-call of the exponential distribution's relatively simple mathematical models.
The Effect of [math]\displaystyle{ \lambda }[/math] and [math]\displaystyle{ \gamma }[/math] on the Exponential [math]\displaystyle{ pdf }[/math]
- The exponential [math]\displaystyle{ pdf }[/math] has no shape parameter, as it has only one shape.
- The exponential [math]\displaystyle{ pdf }[/math] is always convex and is stretched to the right as [math]\displaystyle{ \lambda }[/math] decreases in value.
- The value of the [math]\displaystyle{ pdf }[/math] function is always equal to the value of [math]\displaystyle{ \lambda }[/math] at [math]\displaystyle{ t=0 }[/math] (or [math]\displaystyle{ t=\gamma }[/math]).
- The location parameter, [math]\displaystyle{ \gamma }[/math], if positive, shifts the beginning of the distribution by a distance of [math]\displaystyle{ \gamma }[/math] to the right of the origin, signifying that the chance failures start to occur only after [math]\displaystyle{ \gamma }[/math] hours of operation, and cannot occur before this time.
- The scale parameter is [math]\displaystyle{ \tfrac{1}{\lambda }=\bar{T}-\gamma =m-\gamma }[/math].
- As [math]\displaystyle{ t\to \infty }[/math], [math]\displaystyle{ f(t)\to 0 }[/math].
The Effect of [math]\displaystyle{ \lambda }[/math] and [math]\displaystyle{ \gamma }[/math] on the Exponential Reliability Function
- The one-parameter exponential reliability function starts at the value of 100% at [math]\displaystyle{ t=0 }[/math], decreases thereafter monotonically and is convex.
- The two-parameter exponential reliability function remains at the value of 100% for [math]\displaystyle{ t=0 }[/math] up to [math]\displaystyle{ t=\gamma }[/math], and decreases thereafter monotonically and is convex.
- As [math]\displaystyle{ t\to \infty }[/math] , [math]\displaystyle{ R(t\to \infty )\to 0 }[/math].
- The reliability for a mission duration of [math]\displaystyle{ t=m=\tfrac{1}{\lambda } }[/math], or of one MTTF duration, is always equal to [math]\displaystyle{ 0.3679 }[/math] or 36.79%. This means that the reliability for a mission which is as long as one MTTF is relatively low and is not recommended because only 36.8% of the missions will be completed successfully. In other words, of the equipment undertaking such a mission, only 36.8% will survive their mission.
Estimation of the Exponential Parameters
Probability Plotting
Estimation of the parameters for the exponential distribution via probability plotting is very similar to the process used when dealing with the Weibull distribution. Recall, however, that the appearance of the probability plotting paper and the methods by which the parameters are estimated vary from distribution to distribution, so there will be some noticeable differences. In fact, due to the nature of the exponential [math]\displaystyle{ cdf }[/math], the exponential probability plot is the only one with a negative slope. This is because the y-axis of the exponential probability plotting paper represents the reliability, whereas the y-axis for most of the other life distributions represents the unreliability.
This is illustrated in the process of linearizing the [math]\displaystyle{ cdf }[/math], which is necessary to construct the exponential probability plotting paper. For the two-parameter exponential distribution the cumulative density function is given by:
- [math]\displaystyle{ F(t)=1-{{e}^{-\lambda (t-\gamma )}} }[/math]
Taking the natural logarithm of both sides of the above equation yields:
- [math]\displaystyle{ \ln \left[ 1-F(t) \right]=-\lambda (t-\gamma ) }[/math]
or:
- [math]\displaystyle{ \ln [1-F(t)]=\lambda \gamma -\lambda t }[/math]
Now, let:
- [math]\displaystyle{ y=\ln [1-F(t)] }[/math]
- [math]\displaystyle{ a=\lambda \gamma }[/math]
and:
- [math]\displaystyle{ b=-\lambda }[/math]
which results in the linear equation of:
- [math]\displaystyle{ y=a+bt }[/math]
Note that with the exponential probability plotting paper, the y-axis scale is logarithmic and the x-axis scale is linear. This means that the zero value is present only on the x-axis. For [math]\displaystyle{ t=0 }[/math], [math]\displaystyle{ R=1 }[/math] and [math]\displaystyle{ F(t)=0 }[/math]. So if we were to use [math]\displaystyle{ F(t) }[/math] for the y-axis, we would have to plot the point [math]\displaystyle{ (0,0) }[/math]. However, since the y-axis is logarithmic, there is no place to plot this on the exponential paper. Also, the failure rate, [math]\displaystyle{ \lambda }[/math], is the negative of the slope of the line, but there is an easier way to determine the value of [math]\displaystyle{ \lambda }[/math] from the probability plot, as will be illustrated in the following example.
Example 1:
1-Parameter Exponential Probability Plot Example
6 units are put on a life test and tested to failure. The failure times are 7, 12, 19, 29, 41, and 67 hours. Estimate the failure rate for a 1-parameter exponential distribution using the probability plotting method.
In order to plot the points for the probability plot, the appropriate reliability estimate values must be obtained. These will be equivalent to [math]\displaystyle{ 100%-MR\,\! }[/math] since the y-axis represents the reliability and the [math]\displaystyle{ MR\,\! }[/math] values represent unreliability estimates.
Next, these points are plotted on an exponential probability plotting paper. A sample of this type of plotting paper is shown next, with the sample points in place. Notice how these points describe a line with a negative slope.
Once the points are plotted, draw the best possible straight line through these points. The time value at which this line intersects with a horizontal line drawn at the 36.8% reliability mark is the mean life, and the reciprocal of this is the failure rate [math]\displaystyle{ \lambda\,\! }[/math]. This is because at [math]\displaystyle{ t=m=\tfrac{1}{\lambda }\,\! }[/math]:
- [math]\displaystyle{ \begin{align} R(t)= & {{e}^{-\lambda \cdot t}} \\ R(t)= & {{e}^{-\lambda \cdot \tfrac{1}{\lambda }}} \\ R(t)= & {{e}^{-1}}=0.368=36.8%. \end{align}\,\! }[/math]
The following plot shows that the best-fit line through the data points crosses the [math]\displaystyle{ R=36.8%\,\! }[/math] line at [math]\displaystyle{ t=33\,\! }[/math] hours. And because [math]\displaystyle{ \tfrac{1}{\lambda }=33\,\! }[/math] hours, [math]\displaystyle{ \lambda =0.0303\,\! }[/math] failures/hour.
Rank Regression on Y for Exponential Distribution
Performing a rank regression on Y requires that a straight line be fitted to the set of available data points such that the sum of the squares of the vertical deviations from the points to the line is minimized. The least squares parameter estimation method (regression analysis) was discussed in Chapter Parameter Estimation, and the following equations for rank regression on Y (RRY) were derived:
- [math]\displaystyle{ \hat{a}=\bar{y}-\hat{b}\bar{x}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}-\hat{b}\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}}{N} }[/math]
and:
- [math]\displaystyle{ \hat{b}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}-\tfrac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,x_{i}^{2}-\tfrac{{{\left( \underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}} \right)}^{2}}}{N}} }[/math]
In our case, the equations for [math]\displaystyle{ {{y}_{i}} }[/math] and [math]\displaystyle{ {{x}_{i}} }[/math] are:
- [math]\displaystyle{ {{y}_{i}}=\ln [1-F({{t}_{i}})] }[/math]
and:
- [math]\displaystyle{ {{x}_{i}}={{t}_{i}} }[/math]
and the [math]\displaystyle{ F({{t}_{i}}) }[/math] is estimated from the median ranks. Once [math]\displaystyle{ \hat{a} }[/math] and [math]\displaystyle{ \hat{b} }[/math] are obtained, then [math]\displaystyle{ \hat{\lambda } }[/math] and [math]\displaystyle{ \hat{\gamma } }[/math] can easily be obtained from above equations.
For the one-parameter exponential, equations for estimating a and b become:
- [math]\displaystyle{ \begin{align} \hat{a}= & 0, \\ \hat{b}= & \frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,x_{i}^{2}} \end{align} }[/math]
The Correlation Coefficient
The estimator of [math]\displaystyle{ \rho }[/math] is the sample correlation coefficient, [math]\displaystyle{ \hat{\rho } }[/math], given by:
- [math]\displaystyle{ \hat{\rho }=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,({{x}_{i}}-\overline{x})({{y}_{i}}-\overline{y})}{\sqrt{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{({{x}_{i}}-\overline{x})}^{2}}\cdot \underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{({{y}_{i}}-\overline{y})}^{2}}}} }[/math]
Example 2:
Template loop detected: Template:2 parameter exponential distribution RRY example
Rank Regression on X for Exponential Distribution
Similar to rank regression on Y, performing a rank regression on X requires that a straight line be fitted to a set of data points such that the sum of the squares of the horizontal deviations from the points to the line is minimized.
Again the first task is to bring our exponential [math]\displaystyle{ cdf }[/math] function into a linear form. This step is exactly the same as in regression on Y analysis. The deviation from the previous analysis begins on the least squares fit step, since in this case we treat [math]\displaystyle{ x }[/math] as the dependent variable and [math]\displaystyle{ y }[/math] as the independent variable. The best-fitting straight line to the data, for regression on X (see Chapter Parameter Estimation), is the straight line:
- [math]\displaystyle{ x=\hat{a}+\hat{b}y }[/math]
The corresponding equations for [math]\displaystyle{ \hat{a} }[/math] and [math]\displaystyle{ \hat{b} }[/math] are:
- [math]\displaystyle{ \hat{a}=\overline{x}-\hat{b}\overline{y}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}}{N}-\hat{b}\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N} }[/math]
and:
- [math]\displaystyle{ \hat{b}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}-\tfrac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,y_{i}^{2}-\tfrac{{{\left( \underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}} \right)}^{2}}}{N}} }[/math]
where:
- [math]\displaystyle{ {{y}_{i}}=\ln [1-F({{t}_{i}})] }[/math]
and:
- [math]\displaystyle{ {{x}_{i}}={{t}_{i}} }[/math]
The values of [math]\displaystyle{ F({{t}_{i}}) }[/math] are estimated from the median ranks. Once [math]\displaystyle{ \hat{a} }[/math] and [math]\displaystyle{ \hat{b} }[/math] are obtained, solve for the unknown [math]\displaystyle{ y }[/math] value, which corresponds to:
- [math]\displaystyle{ y=-\frac{\hat{a}}{\hat{b}}+\frac{1}{\hat{b}}x }[/math]
Solving for the parameters from above equations we get:
- [math]\displaystyle{ a=-\frac{\hat{a}}{\hat{b}}=\lambda \gamma \Rightarrow \gamma =\hat{a} }[/math]
and:
- [math]\displaystyle{ b=\frac{1}{\hat{b}}=-\lambda \Rightarrow \lambda =-\frac{1}{\hat{b}} }[/math]
For the one-parameter exponential case, equations for estimating a and b become:
- [math]\displaystyle{ \begin{align} \hat{a}= & 0 \\ \hat{b}= & \frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,y_{i}^{2}} \end{align} }[/math]
The correlation coefficient is evaluated as before.
Example 3:
Template loop detected: Template:Example: 2 Parameter Exponential Distribution RRX
Maximum Likelihood Estimation for Exponential Distribution
As outlined in Chapter Parameter Estimation, maximum likelihood estimation works by developing a likelihood function based on the available data and finding the values of the parameter estimates that maximize the likelihood function. This can be achieved by using iterative methods to determine the parameter estimate values that maximize the likelihood function. This can be rather difficult and time-consuming, particularly when dealing with the three-parameter distribution. Another method of finding the parameter estimates involves taking the partial derivatives of the likelihood equation with respect to the parameters, setting the resulting equations equal to zero, and solving simultaneously to determine the values of the parameter estimates. The log-likelihood functions and associated partial derivatives used to determine maximum likelihood estimates for the exponential distribution are covered in Appendix.
Example 4: Template loop detected: Template:Example: MLE for Exponential Distribution
Confidence Bounds
In this section, we present the methods used in the application to estimate the different types of confidence bounds for exponentially distributed data. The complete derivations were presented in detail (for a general function) in the chapter for Confidence Bounds. At this time we should point out that exact confidence bounds for the exponential distribution have been derived, and exist in a closed form, utilizing the [math]\displaystyle{ {{\chi }^{2}} }[/math] distribution. These are described in detail in Kececioglu [20], and are covered in the section in the test design chapter. For most exponential data analyses, Weibull++ will use the approximate confidence bounds, provided from the Fisher information matrix or the likelihood ratio, in order to stay consistent with all of the other available distributions in the application. The [math]\displaystyle{ {{\chi }^{2}} }[/math] confidence bounds for the exponential distribution are discussed in more detail in the test design chapter.
Fisher Matrix Bounds
Bounds on the Parameters
For the failure rate [math]\displaystyle{ \hat{\lambda } }[/math] the upper ([math]\displaystyle{ {{\lambda }_{U}} }[/math]) and lower ([math]\displaystyle{ {{\lambda }_{L}} }[/math]) bounds are estimated by [30]:
- [math]\displaystyle{ \begin{align} & {{\lambda }_{U}}= & \hat{\lambda }\cdot {{e}^{\left[ \tfrac{{{K}_{\alpha }}\sqrt{Var(\hat{\lambda })}}{\hat{\lambda }} \right]}} \\ & & \\ & {{\lambda }_{L}}= & \frac{\hat{\lambda }}{{{e}^{\left[ \tfrac{{{K}_{\alpha }}\sqrt{Var(\hat{\lambda })}}{\hat{\lambda }} \right]}}} \end{align} }[/math]
where [math]\displaystyle{ {{K}_{\alpha }} }[/math] is defined by:
- [math]\displaystyle{ \alpha =\frac{1}{\sqrt{2\pi }}\int_{{{K}_{\alpha }}}^{\infty }{{e}^{-\tfrac{{{t}^{2}}}{2}}}dt=1-\Phi ({{K}_{\alpha }}) }[/math]
If [math]\displaystyle{ \delta }[/math] is the confidence level, then [math]\displaystyle{ \alpha =\tfrac{1-\delta }{2} }[/math] for the two-sided bounds, and [math]\displaystyle{ \alpha =1-\delta }[/math] for the one-sided bounds.
The variance of [math]\displaystyle{ \hat{\lambda }, }[/math] [math]\displaystyle{ Var(\hat{\lambda }), }[/math] is estimated from the Fisher matrix, as follows:
- [math]\displaystyle{ Var(\hat{\lambda })={{\left( -\frac{{{\partial }^{2}}\Lambda }{\partial {{\lambda }^{2}}} \right)}^{-1}} }[/math]
where [math]\displaystyle{ \Lambda }[/math] is the log-likelihood function of the exponential distribution, described in Appendix for log-likelihood function.
Note that no true MLE solution exists for the case of the two-parameter exponential distribution. The mathematics simply break down while trying to simultaneously solve the partial derivative equations for both the [math]\displaystyle{ \gamma }[/math] and [math]\displaystyle{ \lambda }[/math] parameters, resulting in unrealistic conditions. The way around this conundrum involves setting [math]\displaystyle{ \gamma ={{t}_{1}}, }[/math] or the first time-to-failure, and calculating [math]\displaystyle{ \lambda }[/math] in the regular fashion for this methodology. Weibull++ treats [math]\displaystyle{ \gamma }[/math] as a constant when computing bounds, i.e. [math]\displaystyle{ Var(\hat{\gamma })=0. }[/math] (See the discussion in Appendix for log-likelihood function for more information.)
Template loop detected: Template:Bounds on Reliability FMB ED
Template loop detected: Template:Bounds on Time FMB ED
Template loop detected: Template:Likelihood Ratio Confidence Bounds FMB ED
Template loop detected: Template:Bayesian Confidence Bounds ED
General Examples
Example 8:
20 units were reliability tested with the following results:
Table - Life Test Data | |
Number of Units in Group | Time-to-Failure |
---|---|
7 | 100 |
5 | 200 |
3 | 300 |
2 | 400 |
1 | 500 |
2 | 600 |
1. Assuming a 2-parameter exponential distribution, estimate the parameters by hand using the MLE analysis method.
2. Repeat the above using Weibull++. (Enter the data as grouped data to duplicate the results.)
3. Show the Probability plot for the analysis results.
4. Show the Reliability vs. Time plot for the results.
5. Show the pdf plot for the results.
6. Show the Failure Rate vs. Time plot for the results.
7. Estimate the parameters using the rank regression on Y (RRY) analysis method (and using grouped ranks).
Solution
1. For the 2-parameter exponential distribution and for [math]\displaystyle{ \hat{\gamma }=100\,\! }[/math] hours (first failure), the partial of the log-likelihood function, [math]\displaystyle{ \lambda\,\! }[/math], becomes:
- [math]\displaystyle{ \begin{align} \frac{\partial \Lambda }{\partial \lambda }= &\underset{i=1}{\overset{6}{\mathop \sum }}\,{N_i} \left[ \frac{1}{\lambda }-\left( {{T}_{i}}-100 \right) \right]=0\\ \Rightarrow & 7[\frac{1}{\lambda }-(100-100)]+5[\frac{1}{\lambda}-(200-100)] + \ldots +2[\frac{1}{\lambda}-(600-100)]\\ = & 0\\ \Rightarrow & \hat{\lambda}=\frac{20}{3100}=0.0065 \text{fr/hr} \end{align} \,\! }[/math]
2. Enter the data in a Weibull++ standard folio and calculate it as shown next.
3. On the Plot page of the folio, the exponential Probability plot will appear as shown next.
4. View the Reliability vs. Time plot.
5. View the pdf plot.
6. View the Failure Rate vs. Time plot.
Note that, as described at the beginning of this chapter, the failure rate for the exponential distribution is constant. Also note that the Failure Rate vs. Time plot does show values for times before the location parameter, [math]\displaystyle{ \gamma \,\! }[/math], at 100 hours.
7. In the case of grouped data, one must be cautious when estimating the parameters using a rank regression method. This is because the median rank values are determined from the total number of failures observed by time [math]\displaystyle{ {{T}_{i}}\,\! }[/math] where [math]\displaystyle{ i\,\! }[/math] indicates the group number. In this example, the total number of groups is [math]\displaystyle{ N=6\,\! }[/math] and the total number of units is [math]\displaystyle{ {{N}_{T}}=20\,\! }[/math]. Thus, the median rank values will be estimated for 20 units and for the total failed units ([math]\displaystyle{ {{N}_{{{F}_{i}}}}\,\! }[/math]) up to the [math]\displaystyle{ {{i}^{th}}\,\! }[/math] group, for the [math]\displaystyle{ {{i}^{th}}\,\! }[/math] rank value. The median ranks values can be found from rank tables or they can be estimated using ReliaSoft's Quick Statistical Reference tool.
For example, the median rank value of the fourth group will be the [math]\displaystyle{ {{17}^{th}}\,\! }[/math] rank out of a sample size of twenty units (or 81.945%).
The following table is then constructed.
Given the values in the table above, calculate [math]\displaystyle{ \hat{a}\,\! }[/math] and [math]\displaystyle{ \hat{b}\,\! }[/math]:
- [math]\displaystyle{ \begin{align} & \hat{b}= & \frac{\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{t}_{i}}{{y}_{i}}-(\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{t}_{i}})(\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{y}_{i}})/6}{\underset{i=1}{\overset{6}{\mathop{\sum }}}\,t_{i}^{2}-{{(\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{t}_{i}})}^{2}}/6} \\ & & \\ & \hat{b}= & \frac{-4320.3362-(2100)(-9.6476)/6}{910,000-{{(2100)}^{2}}/6} \end{align}\,\! }[/math]
or:
- [math]\displaystyle{ \hat{b}=-0.005392\,\! }[/math]
and:
- [math]\displaystyle{ \hat{a}=\overline{y}-\hat{b}\overline{t}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}-\hat{b}\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{t}_{i}}}{N}\,\! }[/math]
or:
- [math]\displaystyle{ \hat{a}=\frac{-9.6476}{6}-(-0.005392)\frac{2100}{6}=0.2793\,\! }[/math]
Therefore:
- [math]\displaystyle{ \hat{\lambda }=-\hat{b}=-(-0.005392)=0.05392\text{ failures/hour}\,\! }[/math]
and:
- [math]\displaystyle{ \hat{\gamma }=\frac{\hat{a}}{\hat{\lambda }}=\frac{0.2793}{0.005392}\,\! }[/math]
or:
- [math]\displaystyle{ \hat{\gamma }\simeq 51.8\text{ hours}\,\! }[/math]
Then:
- [math]\displaystyle{ f(T)=(0.005392){{e}^{-0.005392(T-51.8)}}\,\! }[/math]
Using Weibull++, the estimated parameters are:
- [math]\displaystyle{ \begin{align} \hat{\lambda }= & 0.0054\text{ failures/hour} \\ \hat{\gamma }= & 51.82\text{ hours} \end{align}\,\! }[/math]
The small difference in the values from Weibull++ is due to rounding. In the application, the calculations and the rank values are carried out up to the [math]\displaystyle{ 15^{th}\,\! }[/math] decimal point.
Example 9:
A number of leukemia patients were treated with either drug 6MP or a placebo, and the times in weeks until cancer symptoms returned were recorded. Analyze each treatment separately [21, p.175].
Table - Leukemia Treatment Results | |||
Time (weeks) | Number of Patients | Treament | Comments |
---|---|---|---|
1 | 2 | placebo | |
2 | 2 | placebo | |
3 | 1 | placebo | |
4 | 2 | placebo | |
5 | 2 | placebo | |
6 | 4 | 6MP | 3 patients completed |
7 | 1 | 6MP | |
8 | 4 | placebo | |
9 | 1 | 6MP | Not completed |
10 | 2 | 6MP | 1 patient completed |
11 | 2 | placebo | |
11 | 1 | 6MP | Not completed |
12 | 2 | placebo | |
13 | 1 | 6MP | |
15 | 1 | placebo | |
16 | 1 | 6MP | |
17 | 1 | placebo | |
17 | 1 | 6MP | Not completed |
19 | 1 | 6MP | Not completed |
20 | 1 | 6MP | Not completed |
22 | 1 | placebo | |
22 | 1 | 6MP | |
23 | 1 | placebo | |
23 | 1 | 6MP | |
25 | 1 | 6MP | Not completed |
32 | 2 | 6MP | Not completed |
34 | 1 | 6MP | Not completed |
35 | 1 | 6MP | Not completed |
Create a new Weibull++ standard folio that's configured for grouped times-to-failure data with suspensions. In the first column, enter the number of patients. Whenever there are uncompleted tests, enter the number of patients who completed the test separately from the number of patients who did not (e.g., if 4 patients had symptoms return after 6 weeks and only 3 of them completed the test, then enter 1 in one row and 3 in another). In the second column enter F if the patients completed the test and S if they didn't. In the third column enter the time, and in the fourth column (Subset ID) specify whether the 6MP drug or a placebo was used.
Next, open the Batch Auto Run utility and select to separate the 6MP drug from the placebo, as shown next.
The software will create two data sheets, one for each subset ID, as shown next.
Calculate both data sheets using the 2-parameter exponential distribution and the MLE analysis method, then insert an additional plot and select to show the analysis results for both data sheets on that plot, which will appear as shown next.
Rank Regression on X for Exponential Distribution
Similar to rank regression on Y, performing a rank regression on X requires that a straight line be fitted to a set of data points such that the sum of the squares of the horizontal deviations from the points to the line is minimized.
Again the first task is to bring our exponential [math]\displaystyle{ cdf }[/math] function into a linear form. This step is exactly the same as in regression on Y analysis. The deviation from the previous analysis begins on the least squares fit step, since in this case we treat [math]\displaystyle{ x }[/math] as the dependent variable and [math]\displaystyle{ y }[/math] as the independent variable. The best-fitting straight line to the data, for regression on X (see Chapter Parameter Estimation), is the straight line:
- [math]\displaystyle{ x=\hat{a}+\hat{b}y }[/math]
The corresponding equations for [math]\displaystyle{ \hat{a} }[/math] and [math]\displaystyle{ \hat{b} }[/math] are:
- [math]\displaystyle{ \hat{a}=\overline{x}-\hat{b}\overline{y}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}}{N}-\hat{b}\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N} }[/math]
and:
- [math]\displaystyle{ \hat{b}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}-\tfrac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,y_{i}^{2}-\tfrac{{{\left( \underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}} \right)}^{2}}}{N}} }[/math]
where:
- [math]\displaystyle{ {{y}_{i}}=\ln [1-F({{t}_{i}})] }[/math]
and:
- [math]\displaystyle{ {{x}_{i}}={{t}_{i}} }[/math]
The values of [math]\displaystyle{ F({{t}_{i}}) }[/math] are estimated from the median ranks. Once [math]\displaystyle{ \hat{a} }[/math] and [math]\displaystyle{ \hat{b} }[/math] are obtained, solve for the unknown [math]\displaystyle{ y }[/math] value, which corresponds to:
- [math]\displaystyle{ y=-\frac{\hat{a}}{\hat{b}}+\frac{1}{\hat{b}}x }[/math]
Solving for the parameters from above equations we get:
- [math]\displaystyle{ a=-\frac{\hat{a}}{\hat{b}}=\lambda \gamma \Rightarrow \gamma =\hat{a} }[/math]
and:
- [math]\displaystyle{ b=\frac{1}{\hat{b}}=-\lambda \Rightarrow \lambda =-\frac{1}{\hat{b}} }[/math]
For the one-parameter exponential case, equations for estimating a and b become:
- [math]\displaystyle{ \begin{align} \hat{a}= & 0 \\ \hat{b}= & \frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,y_{i}^{2}} \end{align} }[/math]
The correlation coefficient is evaluated as before.
Example 3:
The exponential distribution is a commonly used distribution in reliability engineering. Mathematically, it is a fairly simple distribution, which many times leads to its use in inappropriate situations. It is, in fact, a special case of the Weibull distribution where [math]\displaystyle{ \beta =1 }[/math]. The exponential distribution is used to model the behavior of units that have a constant failure rate (or units that do not degrade with time or wear out).
Exponential Probability Density Function
The Two-Parameter Exponential Distribution
The two-parameter exponential pdf is given by:
- [math]\displaystyle{ f(t)=\lambda {{e}^{-\lambda (t-\gamma )}},f(t)\ge 0,\lambda \gt 0,t\ge 0\text{ or }\gamma }[/math]
where [math]\displaystyle{ \gamma }[/math] is the location parameter. Some of the characteristics of the two-parameter exponential distribution are [19]:
- The location parameter, [math]\displaystyle{ \gamma }[/math], if positive, shifts the beginning of the distribution by a distance of [math]\displaystyle{ \gamma }[/math] to the right of the origin, signifying that the chance failures start to occur only after [math]\displaystyle{ \gamma }[/math] hours of operation, and cannot occur before.
- The scale parameter is [math]\displaystyle{ \tfrac{1}{\lambda }=\bar{t}-\gamma =m-\gamma }[/math].
- The exponential [math]\displaystyle{ pdf }[/math] has no shape parameter, as it has only one shape.
- The distribution starts at [math]\displaystyle{ t=\gamma }[/math] at the level of [math]\displaystyle{ f(t=\gamma )=\lambda }[/math] and decreases thereafter exponentially and monotonically as [math]\displaystyle{ t }[/math] increases beyond [math]\displaystyle{ \gamma }[/math] and is convex.
- As [math]\displaystyle{ t\to \infty }[/math], [math]\displaystyle{ f(t)\to 0 }[/math].
The One-Parameter Exponential Distribution
The one-parameter exponential [math]\displaystyle{ pdf }[/math] is obtained by setting [math]\displaystyle{ \gamma =0 }[/math], and is given by:
- [math]\displaystyle{ \begin{align}f(t)= & \lambda {{e}^{-\lambda t}}=\frac{1}{m}{{e}^{-\tfrac{1}{m}t}}, & t\ge 0, \lambda \gt 0,m\gt 0 \end{align} }[/math]
where:
- [math]\displaystyle{ \lambda }[/math] = constant rate, in failures per unit of measurement, e.g failures per hour, per cycle, etc.,
- [math]\displaystyle{ \lambda =\frac{1}{m} }[/math],
- [math]\displaystyle{ m }[/math] = mean time between failures, or to failure,
- [math]\displaystyle{ t }[/math] = operating time, life, or age, in hours, cycles, miles, actuations, etc.
This distribution requires the knowledge of only one parameter, [math]\displaystyle{ \lambda }[/math], for its application. Some of the characteristics of the one-parameter exponential distribution are [19]:
- The location parameter, [math]\displaystyle{ \gamma }[/math], is zero.
- The scale parameter is [math]\displaystyle{ \tfrac{1}{\lambda }=m }[/math].
- As [math]\displaystyle{ \lambda }[/math] is decreased in value, the distribution is stretched out to the right, and as [math]\displaystyle{ \lambda }[/math] is increased, the distribution is pushed toward the origin.
- This distribution has no shape parameter as it has only one shape, i.e. the exponential, and the only parameter it has is the failure rate, [math]\displaystyle{ \lambda }[/math].
- The distribution starts at [math]\displaystyle{ t=0 }[/math] at the level of [math]\displaystyle{ f(t=0)=\lambda }[/math] and decreases thereafter exponentially and monotonically as [math]\displaystyle{ t }[/math] increases, and is convex.
- As [math]\displaystyle{ t\to \infty }[/math] , [math]\displaystyle{ f(t)\to 0 }[/math].
- The [math]\displaystyle{ pdf }[/math] can be thought of as a special case of the Weibull [math]\displaystyle{ pdf }[/math] with [math]\displaystyle{ \gamma =0 }[/math] and [math]\displaystyle{ \beta =1 }[/math].
Exponential Statistical Properties
The Mean or MTTF
The mean, [math]\displaystyle{ \overline{T}, }[/math] or mean time to failure (MTTF) is given by:
- [math]\displaystyle{ \begin{align} \bar{T}= & \int_{\gamma }^{\infty }t\cdot f(t)dt \\ = & \int_{\gamma }^{\infty }t\cdot \lambda \cdot {{e}^{-\lambda t}}dt \\ = & \gamma +\frac{1}{\lambda }=m \end{align} }[/math]
Note that when [math]\displaystyle{ \gamma =0 }[/math], the MTTF is the inverse of the exponential distribution's constant failure rate. This is only true for the exponential distribution. Most other distributions do not have a constant failure rate. Consequently, the inverse relationship between failure rate and MTTF does not hold for these other distributions.
The Median
The median, [math]\displaystyle{ \breve{T}, }[/math] is:
- [math]\displaystyle{ \breve{T}=\gamma +\frac{1}{\lambda}\cdot 0.693 }[/math]
The Mode
The mode, [math]\displaystyle{ \tilde{T}, }[/math] is:
- [math]\displaystyle{ \tilde{T}=\gamma }[/math]
The Standard Deviation
The standard deviation, [math]\displaystyle{ {\sigma }_{T} }[/math], is:
- [math]\displaystyle{ {\sigma}_{T}=\frac{1}{\lambda }=m }[/math]
The Exponential Reliability Function
The equation for the two-parameter exponential cumulative density function, or [math]\displaystyle{ cdf, }[/math] is given by:
- [math]\displaystyle{ F(t)=Q(t)=1-{{e}^{-\lambda (t-\gamma )}} }[/math]
Recalling that the reliability function of a distribution is simply one minus the [math]\displaystyle{ cdf }[/math], the reliability function of the two-parameter exponential distribution is given by:
- [math]\displaystyle{ R(t)=1-Q(t)=1-\int_{0}^{t-\gamma }f(x)dx }[/math]
- [math]\displaystyle{ R(t)=1-\int_{0}^{t-\gamma }\lambda {{e}^{-\lambda x}}dx={{e}^{-\lambda (t-\gamma )}} }[/math]
One-Parameter Exponential Reliability Function
The one-parameter exponential reliability function is given by:
- [math]\displaystyle{ R(t)={{e}^{-\lambda t}}={{e}^{-\tfrac{t}{m}}} }[/math]
The Exponential Conditional Reliability
The exponential conditional reliability equation gives the reliability for a mission of [math]\displaystyle{ t }[/math] duration, having already successfully accumulated [math]\displaystyle{ T }[/math] hours of operation up to the start of this new mission. The exponential conditional reliability function is:
- [math]\displaystyle{ R(t|T)=\frac{R(T+t)}{R(T)}=\frac{{{e}^{-\lambda (T+t-\gamma )}}}{{{e}^{-\lambda (T-\gamma )}}}={{e}^{-\lambda t}} }[/math]
which says that the reliability for a mission of [math]\displaystyle{ t }[/math] duration undertaken after the component or equipment has already accumulated [math]\displaystyle{ T }[/math] hours of operation from age zero is only a function of the mission duration, and not a function of the age at the beginning of the mission. This is referred to as the memoryless property.
The Exponential Reliable Life
The reliable life, or the mission duration for a desired reliability goal, [math]\displaystyle{ {{t}_{R}} }[/math], for the one-parameter exponential distribution is:
- [math]\displaystyle{ R({{t}_{R}})={{e}^{-\lambda ({{t}_{R}}-\gamma )}} }[/math]
- [math]\displaystyle{ \ln[R({{t}_{R}})]=-\lambda({{t}_{R}}-\gamma ) }[/math]
or:
- [math]\displaystyle{ {{t}_{R}}=\gamma -\frac{\ln [R({{t}_{R}})]}{\lambda } }[/math]
The Exponential Failure Rate Function
The exponential failure rate function is:
- [math]\displaystyle{ \lambda (t)=\frac{f(t)}{R(t)}=\frac{\lambda {{e}^{-\lambda (t-\gamma )}}}{{{e}^{-\lambda (t-\gamma )}}}=\lambda =\text{constant} }[/math]
Once again, note that the constant failure rate is a characteristic of the exponential distribution, and special cases of other distributions only. Most other distributions have failure rates that are functions of time.
Characteristics of the Exponential Distribution
As mentioned before, the primary trait of the exponential distribution is that it is used for modeling the behavior of items with a constant failure rate. It has a fairly simple mathematical form, which makes it fairly easy to manipulate. Unfortunately, this fact also leads to the use of this model in situations where it is not appropriate. For example, it would not be appropriate to use the exponential distribution to model the reliability of an automobile. The constant failure rate of the exponential distribution would require the assumption that the automobile would be just as likely to experience a breakdown during the first mile as it would during the one-hundred-thousandth mile. Clearly, this is not a valid assumption. However, some inexperienced practitioners of reliability engineering and life data analysis will overlook this fact, lured by the siren-call of the exponential distribution's relatively simple mathematical models.
The Effect of [math]\displaystyle{ \lambda }[/math] and [math]\displaystyle{ \gamma }[/math] on the Exponential [math]\displaystyle{ pdf }[/math]
- The exponential [math]\displaystyle{ pdf }[/math] has no shape parameter, as it has only one shape.
- The exponential [math]\displaystyle{ pdf }[/math] is always convex and is stretched to the right as [math]\displaystyle{ \lambda }[/math] decreases in value.
- The value of the [math]\displaystyle{ pdf }[/math] function is always equal to the value of [math]\displaystyle{ \lambda }[/math] at [math]\displaystyle{ t=0 }[/math] (or [math]\displaystyle{ t=\gamma }[/math]).
- The location parameter, [math]\displaystyle{ \gamma }[/math], if positive, shifts the beginning of the distribution by a distance of [math]\displaystyle{ \gamma }[/math] to the right of the origin, signifying that the chance failures start to occur only after [math]\displaystyle{ \gamma }[/math] hours of operation, and cannot occur before this time.
- The scale parameter is [math]\displaystyle{ \tfrac{1}{\lambda }=\bar{T}-\gamma =m-\gamma }[/math].
- As [math]\displaystyle{ t\to \infty }[/math], [math]\displaystyle{ f(t)\to 0 }[/math].
The Effect of [math]\displaystyle{ \lambda }[/math] and [math]\displaystyle{ \gamma }[/math] on the Exponential Reliability Function
- The one-parameter exponential reliability function starts at the value of 100% at [math]\displaystyle{ t=0 }[/math], decreases thereafter monotonically and is convex.
- The two-parameter exponential reliability function remains at the value of 100% for [math]\displaystyle{ t=0 }[/math] up to [math]\displaystyle{ t=\gamma }[/math], and decreases thereafter monotonically and is convex.
- As [math]\displaystyle{ t\to \infty }[/math] , [math]\displaystyle{ R(t\to \infty )\to 0 }[/math].
- The reliability for a mission duration of [math]\displaystyle{ t=m=\tfrac{1}{\lambda } }[/math], or of one MTTF duration, is always equal to [math]\displaystyle{ 0.3679 }[/math] or 36.79%. This means that the reliability for a mission which is as long as one MTTF is relatively low and is not recommended because only 36.8% of the missions will be completed successfully. In other words, of the equipment undertaking such a mission, only 36.8% will survive their mission.
Estimation of the Exponential Parameters
Probability Plotting
Estimation of the parameters for the exponential distribution via probability plotting is very similar to the process used when dealing with the Weibull distribution. Recall, however, that the appearance of the probability plotting paper and the methods by which the parameters are estimated vary from distribution to distribution, so there will be some noticeable differences. In fact, due to the nature of the exponential [math]\displaystyle{ cdf }[/math], the exponential probability plot is the only one with a negative slope. This is because the y-axis of the exponential probability plotting paper represents the reliability, whereas the y-axis for most of the other life distributions represents the unreliability.
This is illustrated in the process of linearizing the [math]\displaystyle{ cdf }[/math], which is necessary to construct the exponential probability plotting paper. For the two-parameter exponential distribution the cumulative density function is given by:
- [math]\displaystyle{ F(t)=1-{{e}^{-\lambda (t-\gamma )}} }[/math]
Taking the natural logarithm of both sides of the above equation yields:
- [math]\displaystyle{ \ln \left[ 1-F(t) \right]=-\lambda (t-\gamma ) }[/math]
or:
- [math]\displaystyle{ \ln [1-F(t)]=\lambda \gamma -\lambda t }[/math]
Now, let:
- [math]\displaystyle{ y=\ln [1-F(t)] }[/math]
- [math]\displaystyle{ a=\lambda \gamma }[/math]
and:
- [math]\displaystyle{ b=-\lambda }[/math]
which results in the linear equation of:
- [math]\displaystyle{ y=a+bt }[/math]
Note that with the exponential probability plotting paper, the y-axis scale is logarithmic and the x-axis scale is linear. This means that the zero value is present only on the x-axis. For [math]\displaystyle{ t=0 }[/math], [math]\displaystyle{ R=1 }[/math] and [math]\displaystyle{ F(t)=0 }[/math]. So if we were to use [math]\displaystyle{ F(t) }[/math] for the y-axis, we would have to plot the point [math]\displaystyle{ (0,0) }[/math]. However, since the y-axis is logarithmic, there is no place to plot this on the exponential paper. Also, the failure rate, [math]\displaystyle{ \lambda }[/math], is the negative of the slope of the line, but there is an easier way to determine the value of [math]\displaystyle{ \lambda }[/math] from the probability plot, as will be illustrated in the following example.
Example 1:
1-Parameter Exponential Probability Plot Example
6 units are put on a life test and tested to failure. The failure times are 7, 12, 19, 29, 41, and 67 hours. Estimate the failure rate for a 1-parameter exponential distribution using the probability plotting method.
In order to plot the points for the probability plot, the appropriate reliability estimate values must be obtained. These will be equivalent to [math]\displaystyle{ 100%-MR\,\! }[/math] since the y-axis represents the reliability and the [math]\displaystyle{ MR\,\! }[/math] values represent unreliability estimates.
Next, these points are plotted on an exponential probability plotting paper. A sample of this type of plotting paper is shown next, with the sample points in place. Notice how these points describe a line with a negative slope.
Once the points are plotted, draw the best possible straight line through these points. The time value at which this line intersects with a horizontal line drawn at the 36.8% reliability mark is the mean life, and the reciprocal of this is the failure rate [math]\displaystyle{ \lambda\,\! }[/math]. This is because at [math]\displaystyle{ t=m=\tfrac{1}{\lambda }\,\! }[/math]:
- [math]\displaystyle{ \begin{align} R(t)= & {{e}^{-\lambda \cdot t}} \\ R(t)= & {{e}^{-\lambda \cdot \tfrac{1}{\lambda }}} \\ R(t)= & {{e}^{-1}}=0.368=36.8%. \end{align}\,\! }[/math]
The following plot shows that the best-fit line through the data points crosses the [math]\displaystyle{ R=36.8%\,\! }[/math] line at [math]\displaystyle{ t=33\,\! }[/math] hours. And because [math]\displaystyle{ \tfrac{1}{\lambda }=33\,\! }[/math] hours, [math]\displaystyle{ \lambda =0.0303\,\! }[/math] failures/hour.
Rank Regression on Y for Exponential Distribution
Performing a rank regression on Y requires that a straight line be fitted to the set of available data points such that the sum of the squares of the vertical deviations from the points to the line is minimized. The least squares parameter estimation method (regression analysis) was discussed in Chapter Parameter Estimation, and the following equations for rank regression on Y (RRY) were derived:
- [math]\displaystyle{ \hat{a}=\bar{y}-\hat{b}\bar{x}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}-\hat{b}\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}}{N} }[/math]
and:
- [math]\displaystyle{ \hat{b}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}-\tfrac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,x_{i}^{2}-\tfrac{{{\left( \underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}} \right)}^{2}}}{N}} }[/math]
In our case, the equations for [math]\displaystyle{ {{y}_{i}} }[/math] and [math]\displaystyle{ {{x}_{i}} }[/math] are:
- [math]\displaystyle{ {{y}_{i}}=\ln [1-F({{t}_{i}})] }[/math]
and:
- [math]\displaystyle{ {{x}_{i}}={{t}_{i}} }[/math]
and the [math]\displaystyle{ F({{t}_{i}}) }[/math] is estimated from the median ranks. Once [math]\displaystyle{ \hat{a} }[/math] and [math]\displaystyle{ \hat{b} }[/math] are obtained, then [math]\displaystyle{ \hat{\lambda } }[/math] and [math]\displaystyle{ \hat{\gamma } }[/math] can easily be obtained from above equations.
For the one-parameter exponential, equations for estimating a and b become:
- [math]\displaystyle{ \begin{align} \hat{a}= & 0, \\ \hat{b}= & \frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,x_{i}^{2}} \end{align} }[/math]
The Correlation Coefficient
The estimator of [math]\displaystyle{ \rho }[/math] is the sample correlation coefficient, [math]\displaystyle{ \hat{\rho } }[/math], given by:
- [math]\displaystyle{ \hat{\rho }=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,({{x}_{i}}-\overline{x})({{y}_{i}}-\overline{y})}{\sqrt{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{({{x}_{i}}-\overline{x})}^{2}}\cdot \underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{({{y}_{i}}-\overline{y})}^{2}}}} }[/math]
Example 2:
Template loop detected: Template:2 parameter exponential distribution RRY example
Rank Regression on X for Exponential Distribution
Similar to rank regression on Y, performing a rank regression on X requires that a straight line be fitted to a set of data points such that the sum of the squares of the horizontal deviations from the points to the line is minimized.
Again the first task is to bring our exponential [math]\displaystyle{ cdf }[/math] function into a linear form. This step is exactly the same as in regression on Y analysis. The deviation from the previous analysis begins on the least squares fit step, since in this case we treat [math]\displaystyle{ x }[/math] as the dependent variable and [math]\displaystyle{ y }[/math] as the independent variable. The best-fitting straight line to the data, for regression on X (see Chapter Parameter Estimation), is the straight line:
- [math]\displaystyle{ x=\hat{a}+\hat{b}y }[/math]
The corresponding equations for [math]\displaystyle{ \hat{a} }[/math] and [math]\displaystyle{ \hat{b} }[/math] are:
- [math]\displaystyle{ \hat{a}=\overline{x}-\hat{b}\overline{y}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}}{N}-\hat{b}\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N} }[/math]
and:
- [math]\displaystyle{ \hat{b}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}-\tfrac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,y_{i}^{2}-\tfrac{{{\left( \underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}} \right)}^{2}}}{N}} }[/math]
where:
- [math]\displaystyle{ {{y}_{i}}=\ln [1-F({{t}_{i}})] }[/math]
and:
- [math]\displaystyle{ {{x}_{i}}={{t}_{i}} }[/math]
The values of [math]\displaystyle{ F({{t}_{i}}) }[/math] are estimated from the median ranks. Once [math]\displaystyle{ \hat{a} }[/math] and [math]\displaystyle{ \hat{b} }[/math] are obtained, solve for the unknown [math]\displaystyle{ y }[/math] value, which corresponds to:
- [math]\displaystyle{ y=-\frac{\hat{a}}{\hat{b}}+\frac{1}{\hat{b}}x }[/math]
Solving for the parameters from above equations we get:
- [math]\displaystyle{ a=-\frac{\hat{a}}{\hat{b}}=\lambda \gamma \Rightarrow \gamma =\hat{a} }[/math]
and:
- [math]\displaystyle{ b=\frac{1}{\hat{b}}=-\lambda \Rightarrow \lambda =-\frac{1}{\hat{b}} }[/math]
For the one-parameter exponential case, equations for estimating a and b become:
- [math]\displaystyle{ \begin{align} \hat{a}= & 0 \\ \hat{b}= & \frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,y_{i}^{2}} \end{align} }[/math]
The correlation coefficient is evaluated as before.
Example 3:
Template loop detected: Template:Example: 2 Parameter Exponential Distribution RRX
Maximum Likelihood Estimation for Exponential Distribution
As outlined in Chapter Parameter Estimation, maximum likelihood estimation works by developing a likelihood function based on the available data and finding the values of the parameter estimates that maximize the likelihood function. This can be achieved by using iterative methods to determine the parameter estimate values that maximize the likelihood function. This can be rather difficult and time-consuming, particularly when dealing with the three-parameter distribution. Another method of finding the parameter estimates involves taking the partial derivatives of the likelihood equation with respect to the parameters, setting the resulting equations equal to zero, and solving simultaneously to determine the values of the parameter estimates. The log-likelihood functions and associated partial derivatives used to determine maximum likelihood estimates for the exponential distribution are covered in Appendix.
Example 4: Template loop detected: Template:Example: MLE for Exponential Distribution
Confidence Bounds
In this section, we present the methods used in the application to estimate the different types of confidence bounds for exponentially distributed data. The complete derivations were presented in detail (for a general function) in the chapter for Confidence Bounds. At this time we should point out that exact confidence bounds for the exponential distribution have been derived, and exist in a closed form, utilizing the [math]\displaystyle{ {{\chi }^{2}} }[/math] distribution. These are described in detail in Kececioglu [20], and are covered in the section in the test design chapter. For most exponential data analyses, Weibull++ will use the approximate confidence bounds, provided from the Fisher information matrix or the likelihood ratio, in order to stay consistent with all of the other available distributions in the application. The [math]\displaystyle{ {{\chi }^{2}} }[/math] confidence bounds for the exponential distribution are discussed in more detail in the test design chapter.
Fisher Matrix Bounds
Bounds on the Parameters
For the failure rate [math]\displaystyle{ \hat{\lambda } }[/math] the upper ([math]\displaystyle{ {{\lambda }_{U}} }[/math]) and lower ([math]\displaystyle{ {{\lambda }_{L}} }[/math]) bounds are estimated by [30]:
- [math]\displaystyle{ \begin{align} & {{\lambda }_{U}}= & \hat{\lambda }\cdot {{e}^{\left[ \tfrac{{{K}_{\alpha }}\sqrt{Var(\hat{\lambda })}}{\hat{\lambda }} \right]}} \\ & & \\ & {{\lambda }_{L}}= & \frac{\hat{\lambda }}{{{e}^{\left[ \tfrac{{{K}_{\alpha }}\sqrt{Var(\hat{\lambda })}}{\hat{\lambda }} \right]}}} \end{align} }[/math]
where [math]\displaystyle{ {{K}_{\alpha }} }[/math] is defined by:
- [math]\displaystyle{ \alpha =\frac{1}{\sqrt{2\pi }}\int_{{{K}_{\alpha }}}^{\infty }{{e}^{-\tfrac{{{t}^{2}}}{2}}}dt=1-\Phi ({{K}_{\alpha }}) }[/math]
If [math]\displaystyle{ \delta }[/math] is the confidence level, then [math]\displaystyle{ \alpha =\tfrac{1-\delta }{2} }[/math] for the two-sided bounds, and [math]\displaystyle{ \alpha =1-\delta }[/math] for the one-sided bounds.
The variance of [math]\displaystyle{ \hat{\lambda }, }[/math] [math]\displaystyle{ Var(\hat{\lambda }), }[/math] is estimated from the Fisher matrix, as follows:
- [math]\displaystyle{ Var(\hat{\lambda })={{\left( -\frac{{{\partial }^{2}}\Lambda }{\partial {{\lambda }^{2}}} \right)}^{-1}} }[/math]
where [math]\displaystyle{ \Lambda }[/math] is the log-likelihood function of the exponential distribution, described in Appendix for log-likelihood function.
Note that no true MLE solution exists for the case of the two-parameter exponential distribution. The mathematics simply break down while trying to simultaneously solve the partial derivative equations for both the [math]\displaystyle{ \gamma }[/math] and [math]\displaystyle{ \lambda }[/math] parameters, resulting in unrealistic conditions. The way around this conundrum involves setting [math]\displaystyle{ \gamma ={{t}_{1}}, }[/math] or the first time-to-failure, and calculating [math]\displaystyle{ \lambda }[/math] in the regular fashion for this methodology. Weibull++ treats [math]\displaystyle{ \gamma }[/math] as a constant when computing bounds, i.e. [math]\displaystyle{ Var(\hat{\gamma })=0. }[/math] (See the discussion in Appendix for log-likelihood function for more information.)
Template loop detected: Template:Bounds on Reliability FMB ED
Template loop detected: Template:Bounds on Time FMB ED
Template loop detected: Template:Likelihood Ratio Confidence Bounds FMB ED
Template loop detected: Template:Bayesian Confidence Bounds ED
General Examples
Example 8:
20 units were reliability tested with the following results:
Table - Life Test Data | |
Number of Units in Group | Time-to-Failure |
---|---|
7 | 100 |
5 | 200 |
3 | 300 |
2 | 400 |
1 | 500 |
2 | 600 |
1. Assuming a 2-parameter exponential distribution, estimate the parameters by hand using the MLE analysis method.
2. Repeat the above using Weibull++. (Enter the data as grouped data to duplicate the results.)
3. Show the Probability plot for the analysis results.
4. Show the Reliability vs. Time plot for the results.
5. Show the pdf plot for the results.
6. Show the Failure Rate vs. Time plot for the results.
7. Estimate the parameters using the rank regression on Y (RRY) analysis method (and using grouped ranks).
Solution
1. For the 2-parameter exponential distribution and for [math]\displaystyle{ \hat{\gamma }=100\,\! }[/math] hours (first failure), the partial of the log-likelihood function, [math]\displaystyle{ \lambda\,\! }[/math], becomes:
- [math]\displaystyle{ \begin{align} \frac{\partial \Lambda }{\partial \lambda }= &\underset{i=1}{\overset{6}{\mathop \sum }}\,{N_i} \left[ \frac{1}{\lambda }-\left( {{T}_{i}}-100 \right) \right]=0\\ \Rightarrow & 7[\frac{1}{\lambda }-(100-100)]+5[\frac{1}{\lambda}-(200-100)] + \ldots +2[\frac{1}{\lambda}-(600-100)]\\ = & 0\\ \Rightarrow & \hat{\lambda}=\frac{20}{3100}=0.0065 \text{fr/hr} \end{align} \,\! }[/math]
2. Enter the data in a Weibull++ standard folio and calculate it as shown next.
3. On the Plot page of the folio, the exponential Probability plot will appear as shown next.
4. View the Reliability vs. Time plot.
5. View the pdf plot.
6. View the Failure Rate vs. Time plot.
Note that, as described at the beginning of this chapter, the failure rate for the exponential distribution is constant. Also note that the Failure Rate vs. Time plot does show values for times before the location parameter, [math]\displaystyle{ \gamma \,\! }[/math], at 100 hours.
7. In the case of grouped data, one must be cautious when estimating the parameters using a rank regression method. This is because the median rank values are determined from the total number of failures observed by time [math]\displaystyle{ {{T}_{i}}\,\! }[/math] where [math]\displaystyle{ i\,\! }[/math] indicates the group number. In this example, the total number of groups is [math]\displaystyle{ N=6\,\! }[/math] and the total number of units is [math]\displaystyle{ {{N}_{T}}=20\,\! }[/math]. Thus, the median rank values will be estimated for 20 units and for the total failed units ([math]\displaystyle{ {{N}_{{{F}_{i}}}}\,\! }[/math]) up to the [math]\displaystyle{ {{i}^{th}}\,\! }[/math] group, for the [math]\displaystyle{ {{i}^{th}}\,\! }[/math] rank value. The median ranks values can be found from rank tables or they can be estimated using ReliaSoft's Quick Statistical Reference tool.
For example, the median rank value of the fourth group will be the [math]\displaystyle{ {{17}^{th}}\,\! }[/math] rank out of a sample size of twenty units (or 81.945%).
The following table is then constructed.
Given the values in the table above, calculate [math]\displaystyle{ \hat{a}\,\! }[/math] and [math]\displaystyle{ \hat{b}\,\! }[/math]:
- [math]\displaystyle{ \begin{align} & \hat{b}= & \frac{\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{t}_{i}}{{y}_{i}}-(\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{t}_{i}})(\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{y}_{i}})/6}{\underset{i=1}{\overset{6}{\mathop{\sum }}}\,t_{i}^{2}-{{(\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{t}_{i}})}^{2}}/6} \\ & & \\ & \hat{b}= & \frac{-4320.3362-(2100)(-9.6476)/6}{910,000-{{(2100)}^{2}}/6} \end{align}\,\! }[/math]
or:
- [math]\displaystyle{ \hat{b}=-0.005392\,\! }[/math]
and:
- [math]\displaystyle{ \hat{a}=\overline{y}-\hat{b}\overline{t}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}-\hat{b}\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{t}_{i}}}{N}\,\! }[/math]
or:
- [math]\displaystyle{ \hat{a}=\frac{-9.6476}{6}-(-0.005392)\frac{2100}{6}=0.2793\,\! }[/math]
Therefore:
- [math]\displaystyle{ \hat{\lambda }=-\hat{b}=-(-0.005392)=0.05392\text{ failures/hour}\,\! }[/math]
and:
- [math]\displaystyle{ \hat{\gamma }=\frac{\hat{a}}{\hat{\lambda }}=\frac{0.2793}{0.005392}\,\! }[/math]
or:
- [math]\displaystyle{ \hat{\gamma }\simeq 51.8\text{ hours}\,\! }[/math]
Then:
- [math]\displaystyle{ f(T)=(0.005392){{e}^{-0.005392(T-51.8)}}\,\! }[/math]
Using Weibull++, the estimated parameters are:
- [math]\displaystyle{ \begin{align} \hat{\lambda }= & 0.0054\text{ failures/hour} \\ \hat{\gamma }= & 51.82\text{ hours} \end{align}\,\! }[/math]
The small difference in the values from Weibull++ is due to rounding. In the application, the calculations and the rank values are carried out up to the [math]\displaystyle{ 15^{th}\,\! }[/math] decimal point.
Example 9:
A number of leukemia patients were treated with either drug 6MP or a placebo, and the times in weeks until cancer symptoms returned were recorded. Analyze each treatment separately [21, p.175].
Table - Leukemia Treatment Results | |||
Time (weeks) | Number of Patients | Treament | Comments |
---|---|---|---|
1 | 2 | placebo | |
2 | 2 | placebo | |
3 | 1 | placebo | |
4 | 2 | placebo | |
5 | 2 | placebo | |
6 | 4 | 6MP | 3 patients completed |
7 | 1 | 6MP | |
8 | 4 | placebo | |
9 | 1 | 6MP | Not completed |
10 | 2 | 6MP | 1 patient completed |
11 | 2 | placebo | |
11 | 1 | 6MP | Not completed |
12 | 2 | placebo | |
13 | 1 | 6MP | |
15 | 1 | placebo | |
16 | 1 | 6MP | |
17 | 1 | placebo | |
17 | 1 | 6MP | Not completed |
19 | 1 | 6MP | Not completed |
20 | 1 | 6MP | Not completed |
22 | 1 | placebo | |
22 | 1 | 6MP | |
23 | 1 | placebo | |
23 | 1 | 6MP | |
25 | 1 | 6MP | Not completed |
32 | 2 | 6MP | Not completed |
34 | 1 | 6MP | Not completed |
35 | 1 | 6MP | Not completed |
Create a new Weibull++ standard folio that's configured for grouped times-to-failure data with suspensions. In the first column, enter the number of patients. Whenever there are uncompleted tests, enter the number of patients who completed the test separately from the number of patients who did not (e.g., if 4 patients had symptoms return after 6 weeks and only 3 of them completed the test, then enter 1 in one row and 3 in another). In the second column enter F if the patients completed the test and S if they didn't. In the third column enter the time, and in the fourth column (Subset ID) specify whether the 6MP drug or a placebo was used.
Next, open the Batch Auto Run utility and select to separate the 6MP drug from the placebo, as shown next.
The software will create two data sheets, one for each subset ID, as shown next.
Calculate both data sheets using the 2-parameter exponential distribution and the MLE analysis method, then insert an additional plot and select to show the analysis results for both data sheets on that plot, which will appear as shown next.
Maximum Likelihood Estimation for Exponential Distribution
As outlined in Chapter Parameter Estimation, maximum likelihood estimation works by developing a likelihood function based on the available data and finding the values of the parameter estimates that maximize the likelihood function. This can be achieved by using iterative methods to determine the parameter estimate values that maximize the likelihood function. This can be rather difficult and time-consuming, particularly when dealing with the three-parameter distribution. Another method of finding the parameter estimates involves taking the partial derivatives of the likelihood equation with respect to the parameters, setting the resulting equations equal to zero, and solving simultaneously to determine the values of the parameter estimates. The log-likelihood functions and associated partial derivatives used to determine maximum likelihood estimates for the exponential distribution are covered in Appendix.
Example 4:
The exponential distribution is a commonly used distribution in reliability engineering. Mathematically, it is a fairly simple distribution, which many times leads to its use in inappropriate situations. It is, in fact, a special case of the Weibull distribution where [math]\displaystyle{ \beta =1 }[/math]. The exponential distribution is used to model the behavior of units that have a constant failure rate (or units that do not degrade with time or wear out).
Exponential Probability Density Function
The Two-Parameter Exponential Distribution
The two-parameter exponential pdf is given by:
- [math]\displaystyle{ f(t)=\lambda {{e}^{-\lambda (t-\gamma )}},f(t)\ge 0,\lambda \gt 0,t\ge 0\text{ or }\gamma }[/math]
where [math]\displaystyle{ \gamma }[/math] is the location parameter. Some of the characteristics of the two-parameter exponential distribution are [19]:
- The location parameter, [math]\displaystyle{ \gamma }[/math], if positive, shifts the beginning of the distribution by a distance of [math]\displaystyle{ \gamma }[/math] to the right of the origin, signifying that the chance failures start to occur only after [math]\displaystyle{ \gamma }[/math] hours of operation, and cannot occur before.
- The scale parameter is [math]\displaystyle{ \tfrac{1}{\lambda }=\bar{t}-\gamma =m-\gamma }[/math].
- The exponential [math]\displaystyle{ pdf }[/math] has no shape parameter, as it has only one shape.
- The distribution starts at [math]\displaystyle{ t=\gamma }[/math] at the level of [math]\displaystyle{ f(t=\gamma )=\lambda }[/math] and decreases thereafter exponentially and monotonically as [math]\displaystyle{ t }[/math] increases beyond [math]\displaystyle{ \gamma }[/math] and is convex.
- As [math]\displaystyle{ t\to \infty }[/math], [math]\displaystyle{ f(t)\to 0 }[/math].
The One-Parameter Exponential Distribution
The one-parameter exponential [math]\displaystyle{ pdf }[/math] is obtained by setting [math]\displaystyle{ \gamma =0 }[/math], and is given by:
- [math]\displaystyle{ \begin{align}f(t)= & \lambda {{e}^{-\lambda t}}=\frac{1}{m}{{e}^{-\tfrac{1}{m}t}}, & t\ge 0, \lambda \gt 0,m\gt 0 \end{align} }[/math]
where:
- [math]\displaystyle{ \lambda }[/math] = constant rate, in failures per unit of measurement, e.g failures per hour, per cycle, etc.,
- [math]\displaystyle{ \lambda =\frac{1}{m} }[/math],
- [math]\displaystyle{ m }[/math] = mean time between failures, or to failure,
- [math]\displaystyle{ t }[/math] = operating time, life, or age, in hours, cycles, miles, actuations, etc.
This distribution requires the knowledge of only one parameter, [math]\displaystyle{ \lambda }[/math], for its application. Some of the characteristics of the one-parameter exponential distribution are [19]:
- The location parameter, [math]\displaystyle{ \gamma }[/math], is zero.
- The scale parameter is [math]\displaystyle{ \tfrac{1}{\lambda }=m }[/math].
- As [math]\displaystyle{ \lambda }[/math] is decreased in value, the distribution is stretched out to the right, and as [math]\displaystyle{ \lambda }[/math] is increased, the distribution is pushed toward the origin.
- This distribution has no shape parameter as it has only one shape, i.e. the exponential, and the only parameter it has is the failure rate, [math]\displaystyle{ \lambda }[/math].
- The distribution starts at [math]\displaystyle{ t=0 }[/math] at the level of [math]\displaystyle{ f(t=0)=\lambda }[/math] and decreases thereafter exponentially and monotonically as [math]\displaystyle{ t }[/math] increases, and is convex.
- As [math]\displaystyle{ t\to \infty }[/math] , [math]\displaystyle{ f(t)\to 0 }[/math].
- The [math]\displaystyle{ pdf }[/math] can be thought of as a special case of the Weibull [math]\displaystyle{ pdf }[/math] with [math]\displaystyle{ \gamma =0 }[/math] and [math]\displaystyle{ \beta =1 }[/math].
Exponential Statistical Properties
The Mean or MTTF
The mean, [math]\displaystyle{ \overline{T}, }[/math] or mean time to failure (MTTF) is given by:
- [math]\displaystyle{ \begin{align} \bar{T}= & \int_{\gamma }^{\infty }t\cdot f(t)dt \\ = & \int_{\gamma }^{\infty }t\cdot \lambda \cdot {{e}^{-\lambda t}}dt \\ = & \gamma +\frac{1}{\lambda }=m \end{align} }[/math]
Note that when [math]\displaystyle{ \gamma =0 }[/math], the MTTF is the inverse of the exponential distribution's constant failure rate. This is only true for the exponential distribution. Most other distributions do not have a constant failure rate. Consequently, the inverse relationship between failure rate and MTTF does not hold for these other distributions.
The Median
The median, [math]\displaystyle{ \breve{T}, }[/math] is:
- [math]\displaystyle{ \breve{T}=\gamma +\frac{1}{\lambda}\cdot 0.693 }[/math]
The Mode
The mode, [math]\displaystyle{ \tilde{T}, }[/math] is:
- [math]\displaystyle{ \tilde{T}=\gamma }[/math]
The Standard Deviation
The standard deviation, [math]\displaystyle{ {\sigma }_{T} }[/math], is:
- [math]\displaystyle{ {\sigma}_{T}=\frac{1}{\lambda }=m }[/math]
The Exponential Reliability Function
The equation for the two-parameter exponential cumulative density function, or [math]\displaystyle{ cdf, }[/math] is given by:
- [math]\displaystyle{ F(t)=Q(t)=1-{{e}^{-\lambda (t-\gamma )}} }[/math]
Recalling that the reliability function of a distribution is simply one minus the [math]\displaystyle{ cdf }[/math], the reliability function of the two-parameter exponential distribution is given by:
- [math]\displaystyle{ R(t)=1-Q(t)=1-\int_{0}^{t-\gamma }f(x)dx }[/math]
- [math]\displaystyle{ R(t)=1-\int_{0}^{t-\gamma }\lambda {{e}^{-\lambda x}}dx={{e}^{-\lambda (t-\gamma )}} }[/math]
One-Parameter Exponential Reliability Function
The one-parameter exponential reliability function is given by:
- [math]\displaystyle{ R(t)={{e}^{-\lambda t}}={{e}^{-\tfrac{t}{m}}} }[/math]
The Exponential Conditional Reliability
The exponential conditional reliability equation gives the reliability for a mission of [math]\displaystyle{ t }[/math] duration, having already successfully accumulated [math]\displaystyle{ T }[/math] hours of operation up to the start of this new mission. The exponential conditional reliability function is:
- [math]\displaystyle{ R(t|T)=\frac{R(T+t)}{R(T)}=\frac{{{e}^{-\lambda (T+t-\gamma )}}}{{{e}^{-\lambda (T-\gamma )}}}={{e}^{-\lambda t}} }[/math]
which says that the reliability for a mission of [math]\displaystyle{ t }[/math] duration undertaken after the component or equipment has already accumulated [math]\displaystyle{ T }[/math] hours of operation from age zero is only a function of the mission duration, and not a function of the age at the beginning of the mission. This is referred to as the memoryless property.
The Exponential Reliable Life
The reliable life, or the mission duration for a desired reliability goal, [math]\displaystyle{ {{t}_{R}} }[/math], for the one-parameter exponential distribution is:
- [math]\displaystyle{ R({{t}_{R}})={{e}^{-\lambda ({{t}_{R}}-\gamma )}} }[/math]
- [math]\displaystyle{ \ln[R({{t}_{R}})]=-\lambda({{t}_{R}}-\gamma ) }[/math]
or:
- [math]\displaystyle{ {{t}_{R}}=\gamma -\frac{\ln [R({{t}_{R}})]}{\lambda } }[/math]
The Exponential Failure Rate Function
The exponential failure rate function is:
- [math]\displaystyle{ \lambda (t)=\frac{f(t)}{R(t)}=\frac{\lambda {{e}^{-\lambda (t-\gamma )}}}{{{e}^{-\lambda (t-\gamma )}}}=\lambda =\text{constant} }[/math]
Once again, note that the constant failure rate is a characteristic of the exponential distribution, and special cases of other distributions only. Most other distributions have failure rates that are functions of time.
Characteristics of the Exponential Distribution
As mentioned before, the primary trait of the exponential distribution is that it is used for modeling the behavior of items with a constant failure rate. It has a fairly simple mathematical form, which makes it fairly easy to manipulate. Unfortunately, this fact also leads to the use of this model in situations where it is not appropriate. For example, it would not be appropriate to use the exponential distribution to model the reliability of an automobile. The constant failure rate of the exponential distribution would require the assumption that the automobile would be just as likely to experience a breakdown during the first mile as it would during the one-hundred-thousandth mile. Clearly, this is not a valid assumption. However, some inexperienced practitioners of reliability engineering and life data analysis will overlook this fact, lured by the siren-call of the exponential distribution's relatively simple mathematical models.
The Effect of [math]\displaystyle{ \lambda }[/math] and [math]\displaystyle{ \gamma }[/math] on the Exponential [math]\displaystyle{ pdf }[/math]
- The exponential [math]\displaystyle{ pdf }[/math] has no shape parameter, as it has only one shape.
- The exponential [math]\displaystyle{ pdf }[/math] is always convex and is stretched to the right as [math]\displaystyle{ \lambda }[/math] decreases in value.
- The value of the [math]\displaystyle{ pdf }[/math] function is always equal to the value of [math]\displaystyle{ \lambda }[/math] at [math]\displaystyle{ t=0 }[/math] (or [math]\displaystyle{ t=\gamma }[/math]).
- The location parameter, [math]\displaystyle{ \gamma }[/math], if positive, shifts the beginning of the distribution by a distance of [math]\displaystyle{ \gamma }[/math] to the right of the origin, signifying that the chance failures start to occur only after [math]\displaystyle{ \gamma }[/math] hours of operation, and cannot occur before this time.
- The scale parameter is [math]\displaystyle{ \tfrac{1}{\lambda }=\bar{T}-\gamma =m-\gamma }[/math].
- As [math]\displaystyle{ t\to \infty }[/math], [math]\displaystyle{ f(t)\to 0 }[/math].
The Effect of [math]\displaystyle{ \lambda }[/math] and [math]\displaystyle{ \gamma }[/math] on the Exponential Reliability Function
- The one-parameter exponential reliability function starts at the value of 100% at [math]\displaystyle{ t=0 }[/math], decreases thereafter monotonically and is convex.
- The two-parameter exponential reliability function remains at the value of 100% for [math]\displaystyle{ t=0 }[/math] up to [math]\displaystyle{ t=\gamma }[/math], and decreases thereafter monotonically and is convex.
- As [math]\displaystyle{ t\to \infty }[/math] , [math]\displaystyle{ R(t\to \infty )\to 0 }[/math].
- The reliability for a mission duration of [math]\displaystyle{ t=m=\tfrac{1}{\lambda } }[/math], or of one MTTF duration, is always equal to [math]\displaystyle{ 0.3679 }[/math] or 36.79%. This means that the reliability for a mission which is as long as one MTTF is relatively low and is not recommended because only 36.8% of the missions will be completed successfully. In other words, of the equipment undertaking such a mission, only 36.8% will survive their mission.
Estimation of the Exponential Parameters
Probability Plotting
Estimation of the parameters for the exponential distribution via probability plotting is very similar to the process used when dealing with the Weibull distribution. Recall, however, that the appearance of the probability plotting paper and the methods by which the parameters are estimated vary from distribution to distribution, so there will be some noticeable differences. In fact, due to the nature of the exponential [math]\displaystyle{ cdf }[/math], the exponential probability plot is the only one with a negative slope. This is because the y-axis of the exponential probability plotting paper represents the reliability, whereas the y-axis for most of the other life distributions represents the unreliability.
This is illustrated in the process of linearizing the [math]\displaystyle{ cdf }[/math], which is necessary to construct the exponential probability plotting paper. For the two-parameter exponential distribution the cumulative density function is given by:
- [math]\displaystyle{ F(t)=1-{{e}^{-\lambda (t-\gamma )}} }[/math]
Taking the natural logarithm of both sides of the above equation yields:
- [math]\displaystyle{ \ln \left[ 1-F(t) \right]=-\lambda (t-\gamma ) }[/math]
or:
- [math]\displaystyle{ \ln [1-F(t)]=\lambda \gamma -\lambda t }[/math]
Now, let:
- [math]\displaystyle{ y=\ln [1-F(t)] }[/math]
- [math]\displaystyle{ a=\lambda \gamma }[/math]
and:
- [math]\displaystyle{ b=-\lambda }[/math]
which results in the linear equation of:
- [math]\displaystyle{ y=a+bt }[/math]
Note that with the exponential probability plotting paper, the y-axis scale is logarithmic and the x-axis scale is linear. This means that the zero value is present only on the x-axis. For [math]\displaystyle{ t=0 }[/math], [math]\displaystyle{ R=1 }[/math] and [math]\displaystyle{ F(t)=0 }[/math]. So if we were to use [math]\displaystyle{ F(t) }[/math] for the y-axis, we would have to plot the point [math]\displaystyle{ (0,0) }[/math]. However, since the y-axis is logarithmic, there is no place to plot this on the exponential paper. Also, the failure rate, [math]\displaystyle{ \lambda }[/math], is the negative of the slope of the line, but there is an easier way to determine the value of [math]\displaystyle{ \lambda }[/math] from the probability plot, as will be illustrated in the following example.
Example 1:
1-Parameter Exponential Probability Plot Example
6 units are put on a life test and tested to failure. The failure times are 7, 12, 19, 29, 41, and 67 hours. Estimate the failure rate for a 1-parameter exponential distribution using the probability plotting method.
In order to plot the points for the probability plot, the appropriate reliability estimate values must be obtained. These will be equivalent to [math]\displaystyle{ 100%-MR\,\! }[/math] since the y-axis represents the reliability and the [math]\displaystyle{ MR\,\! }[/math] values represent unreliability estimates.
Next, these points are plotted on an exponential probability plotting paper. A sample of this type of plotting paper is shown next, with the sample points in place. Notice how these points describe a line with a negative slope.
Once the points are plotted, draw the best possible straight line through these points. The time value at which this line intersects with a horizontal line drawn at the 36.8% reliability mark is the mean life, and the reciprocal of this is the failure rate [math]\displaystyle{ \lambda\,\! }[/math]. This is because at [math]\displaystyle{ t=m=\tfrac{1}{\lambda }\,\! }[/math]:
- [math]\displaystyle{ \begin{align} R(t)= & {{e}^{-\lambda \cdot t}} \\ R(t)= & {{e}^{-\lambda \cdot \tfrac{1}{\lambda }}} \\ R(t)= & {{e}^{-1}}=0.368=36.8%. \end{align}\,\! }[/math]
The following plot shows that the best-fit line through the data points crosses the [math]\displaystyle{ R=36.8%\,\! }[/math] line at [math]\displaystyle{ t=33\,\! }[/math] hours. And because [math]\displaystyle{ \tfrac{1}{\lambda }=33\,\! }[/math] hours, [math]\displaystyle{ \lambda =0.0303\,\! }[/math] failures/hour.
Rank Regression on Y for Exponential Distribution
Performing a rank regression on Y requires that a straight line be fitted to the set of available data points such that the sum of the squares of the vertical deviations from the points to the line is minimized. The least squares parameter estimation method (regression analysis) was discussed in Chapter Parameter Estimation, and the following equations for rank regression on Y (RRY) were derived:
- [math]\displaystyle{ \hat{a}=\bar{y}-\hat{b}\bar{x}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}-\hat{b}\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}}{N} }[/math]
and:
- [math]\displaystyle{ \hat{b}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}-\tfrac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,x_{i}^{2}-\tfrac{{{\left( \underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}} \right)}^{2}}}{N}} }[/math]
In our case, the equations for [math]\displaystyle{ {{y}_{i}} }[/math] and [math]\displaystyle{ {{x}_{i}} }[/math] are:
- [math]\displaystyle{ {{y}_{i}}=\ln [1-F({{t}_{i}})] }[/math]
and:
- [math]\displaystyle{ {{x}_{i}}={{t}_{i}} }[/math]
and the [math]\displaystyle{ F({{t}_{i}}) }[/math] is estimated from the median ranks. Once [math]\displaystyle{ \hat{a} }[/math] and [math]\displaystyle{ \hat{b} }[/math] are obtained, then [math]\displaystyle{ \hat{\lambda } }[/math] and [math]\displaystyle{ \hat{\gamma } }[/math] can easily be obtained from above equations.
For the one-parameter exponential, equations for estimating a and b become:
- [math]\displaystyle{ \begin{align} \hat{a}= & 0, \\ \hat{b}= & \frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,x_{i}^{2}} \end{align} }[/math]
The Correlation Coefficient
The estimator of [math]\displaystyle{ \rho }[/math] is the sample correlation coefficient, [math]\displaystyle{ \hat{\rho } }[/math], given by:
- [math]\displaystyle{ \hat{\rho }=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,({{x}_{i}}-\overline{x})({{y}_{i}}-\overline{y})}{\sqrt{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{({{x}_{i}}-\overline{x})}^{2}}\cdot \underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{({{y}_{i}}-\overline{y})}^{2}}}} }[/math]
Example 2:
Template loop detected: Template:2 parameter exponential distribution RRY example
Rank Regression on X for Exponential Distribution
Similar to rank regression on Y, performing a rank regression on X requires that a straight line be fitted to a set of data points such that the sum of the squares of the horizontal deviations from the points to the line is minimized.
Again the first task is to bring our exponential [math]\displaystyle{ cdf }[/math] function into a linear form. This step is exactly the same as in regression on Y analysis. The deviation from the previous analysis begins on the least squares fit step, since in this case we treat [math]\displaystyle{ x }[/math] as the dependent variable and [math]\displaystyle{ y }[/math] as the independent variable. The best-fitting straight line to the data, for regression on X (see Chapter Parameter Estimation), is the straight line:
- [math]\displaystyle{ x=\hat{a}+\hat{b}y }[/math]
The corresponding equations for [math]\displaystyle{ \hat{a} }[/math] and [math]\displaystyle{ \hat{b} }[/math] are:
- [math]\displaystyle{ \hat{a}=\overline{x}-\hat{b}\overline{y}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}}{N}-\hat{b}\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N} }[/math]
and:
- [math]\displaystyle{ \hat{b}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}-\tfrac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,y_{i}^{2}-\tfrac{{{\left( \underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}} \right)}^{2}}}{N}} }[/math]
where:
- [math]\displaystyle{ {{y}_{i}}=\ln [1-F({{t}_{i}})] }[/math]
and:
- [math]\displaystyle{ {{x}_{i}}={{t}_{i}} }[/math]
The values of [math]\displaystyle{ F({{t}_{i}}) }[/math] are estimated from the median ranks. Once [math]\displaystyle{ \hat{a} }[/math] and [math]\displaystyle{ \hat{b} }[/math] are obtained, solve for the unknown [math]\displaystyle{ y }[/math] value, which corresponds to:
- [math]\displaystyle{ y=-\frac{\hat{a}}{\hat{b}}+\frac{1}{\hat{b}}x }[/math]
Solving for the parameters from above equations we get:
- [math]\displaystyle{ a=-\frac{\hat{a}}{\hat{b}}=\lambda \gamma \Rightarrow \gamma =\hat{a} }[/math]
and:
- [math]\displaystyle{ b=\frac{1}{\hat{b}}=-\lambda \Rightarrow \lambda =-\frac{1}{\hat{b}} }[/math]
For the one-parameter exponential case, equations for estimating a and b become:
- [math]\displaystyle{ \begin{align} \hat{a}= & 0 \\ \hat{b}= & \frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,y_{i}^{2}} \end{align} }[/math]
The correlation coefficient is evaluated as before.
Example 3:
Template loop detected: Template:Example: 2 Parameter Exponential Distribution RRX
Maximum Likelihood Estimation for Exponential Distribution
As outlined in Chapter Parameter Estimation, maximum likelihood estimation works by developing a likelihood function based on the available data and finding the values of the parameter estimates that maximize the likelihood function. This can be achieved by using iterative methods to determine the parameter estimate values that maximize the likelihood function. This can be rather difficult and time-consuming, particularly when dealing with the three-parameter distribution. Another method of finding the parameter estimates involves taking the partial derivatives of the likelihood equation with respect to the parameters, setting the resulting equations equal to zero, and solving simultaneously to determine the values of the parameter estimates. The log-likelihood functions and associated partial derivatives used to determine maximum likelihood estimates for the exponential distribution are covered in Appendix.
Example 4: Template loop detected: Template:Example: MLE for Exponential Distribution
Confidence Bounds
In this section, we present the methods used in the application to estimate the different types of confidence bounds for exponentially distributed data. The complete derivations were presented in detail (for a general function) in the chapter for Confidence Bounds. At this time we should point out that exact confidence bounds for the exponential distribution have been derived, and exist in a closed form, utilizing the [math]\displaystyle{ {{\chi }^{2}} }[/math] distribution. These are described in detail in Kececioglu [20], and are covered in the section in the test design chapter. For most exponential data analyses, Weibull++ will use the approximate confidence bounds, provided from the Fisher information matrix or the likelihood ratio, in order to stay consistent with all of the other available distributions in the application. The [math]\displaystyle{ {{\chi }^{2}} }[/math] confidence bounds for the exponential distribution are discussed in more detail in the test design chapter.
Fisher Matrix Bounds
Bounds on the Parameters
For the failure rate [math]\displaystyle{ \hat{\lambda } }[/math] the upper ([math]\displaystyle{ {{\lambda }_{U}} }[/math]) and lower ([math]\displaystyle{ {{\lambda }_{L}} }[/math]) bounds are estimated by [30]:
- [math]\displaystyle{ \begin{align} & {{\lambda }_{U}}= & \hat{\lambda }\cdot {{e}^{\left[ \tfrac{{{K}_{\alpha }}\sqrt{Var(\hat{\lambda })}}{\hat{\lambda }} \right]}} \\ & & \\ & {{\lambda }_{L}}= & \frac{\hat{\lambda }}{{{e}^{\left[ \tfrac{{{K}_{\alpha }}\sqrt{Var(\hat{\lambda })}}{\hat{\lambda }} \right]}}} \end{align} }[/math]
where [math]\displaystyle{ {{K}_{\alpha }} }[/math] is defined by:
- [math]\displaystyle{ \alpha =\frac{1}{\sqrt{2\pi }}\int_{{{K}_{\alpha }}}^{\infty }{{e}^{-\tfrac{{{t}^{2}}}{2}}}dt=1-\Phi ({{K}_{\alpha }}) }[/math]
If [math]\displaystyle{ \delta }[/math] is the confidence level, then [math]\displaystyle{ \alpha =\tfrac{1-\delta }{2} }[/math] for the two-sided bounds, and [math]\displaystyle{ \alpha =1-\delta }[/math] for the one-sided bounds.
The variance of [math]\displaystyle{ \hat{\lambda }, }[/math] [math]\displaystyle{ Var(\hat{\lambda }), }[/math] is estimated from the Fisher matrix, as follows:
- [math]\displaystyle{ Var(\hat{\lambda })={{\left( -\frac{{{\partial }^{2}}\Lambda }{\partial {{\lambda }^{2}}} \right)}^{-1}} }[/math]
where [math]\displaystyle{ \Lambda }[/math] is the log-likelihood function of the exponential distribution, described in Appendix for log-likelihood function.
Note that no true MLE solution exists for the case of the two-parameter exponential distribution. The mathematics simply break down while trying to simultaneously solve the partial derivative equations for both the [math]\displaystyle{ \gamma }[/math] and [math]\displaystyle{ \lambda }[/math] parameters, resulting in unrealistic conditions. The way around this conundrum involves setting [math]\displaystyle{ \gamma ={{t}_{1}}, }[/math] or the first time-to-failure, and calculating [math]\displaystyle{ \lambda }[/math] in the regular fashion for this methodology. Weibull++ treats [math]\displaystyle{ \gamma }[/math] as a constant when computing bounds, i.e. [math]\displaystyle{ Var(\hat{\gamma })=0. }[/math] (See the discussion in Appendix for log-likelihood function for more information.)
Template loop detected: Template:Bounds on Reliability FMB ED
Template loop detected: Template:Bounds on Time FMB ED
Template loop detected: Template:Likelihood Ratio Confidence Bounds FMB ED
Template loop detected: Template:Bayesian Confidence Bounds ED
General Examples
Example 8:
20 units were reliability tested with the following results:
Table - Life Test Data | |
Number of Units in Group | Time-to-Failure |
---|---|
7 | 100 |
5 | 200 |
3 | 300 |
2 | 400 |
1 | 500 |
2 | 600 |
1. Assuming a 2-parameter exponential distribution, estimate the parameters by hand using the MLE analysis method.
2. Repeat the above using Weibull++. (Enter the data as grouped data to duplicate the results.)
3. Show the Probability plot for the analysis results.
4. Show the Reliability vs. Time plot for the results.
5. Show the pdf plot for the results.
6. Show the Failure Rate vs. Time plot for the results.
7. Estimate the parameters using the rank regression on Y (RRY) analysis method (and using grouped ranks).
Solution
1. For the 2-parameter exponential distribution and for [math]\displaystyle{ \hat{\gamma }=100\,\! }[/math] hours (first failure), the partial of the log-likelihood function, [math]\displaystyle{ \lambda\,\! }[/math], becomes:
- [math]\displaystyle{ \begin{align} \frac{\partial \Lambda }{\partial \lambda }= &\underset{i=1}{\overset{6}{\mathop \sum }}\,{N_i} \left[ \frac{1}{\lambda }-\left( {{T}_{i}}-100 \right) \right]=0\\ \Rightarrow & 7[\frac{1}{\lambda }-(100-100)]+5[\frac{1}{\lambda}-(200-100)] + \ldots +2[\frac{1}{\lambda}-(600-100)]\\ = & 0\\ \Rightarrow & \hat{\lambda}=\frac{20}{3100}=0.0065 \text{fr/hr} \end{align} \,\! }[/math]
2. Enter the data in a Weibull++ standard folio and calculate it as shown next.
3. On the Plot page of the folio, the exponential Probability plot will appear as shown next.
4. View the Reliability vs. Time plot.
5. View the pdf plot.
6. View the Failure Rate vs. Time plot.
Note that, as described at the beginning of this chapter, the failure rate for the exponential distribution is constant. Also note that the Failure Rate vs. Time plot does show values for times before the location parameter, [math]\displaystyle{ \gamma \,\! }[/math], at 100 hours.
7. In the case of grouped data, one must be cautious when estimating the parameters using a rank regression method. This is because the median rank values are determined from the total number of failures observed by time [math]\displaystyle{ {{T}_{i}}\,\! }[/math] where [math]\displaystyle{ i\,\! }[/math] indicates the group number. In this example, the total number of groups is [math]\displaystyle{ N=6\,\! }[/math] and the total number of units is [math]\displaystyle{ {{N}_{T}}=20\,\! }[/math]. Thus, the median rank values will be estimated for 20 units and for the total failed units ([math]\displaystyle{ {{N}_{{{F}_{i}}}}\,\! }[/math]) up to the [math]\displaystyle{ {{i}^{th}}\,\! }[/math] group, for the [math]\displaystyle{ {{i}^{th}}\,\! }[/math] rank value. The median ranks values can be found from rank tables or they can be estimated using ReliaSoft's Quick Statistical Reference tool.
For example, the median rank value of the fourth group will be the [math]\displaystyle{ {{17}^{th}}\,\! }[/math] rank out of a sample size of twenty units (or 81.945%).
The following table is then constructed.
Given the values in the table above, calculate [math]\displaystyle{ \hat{a}\,\! }[/math] and [math]\displaystyle{ \hat{b}\,\! }[/math]:
- [math]\displaystyle{ \begin{align} & \hat{b}= & \frac{\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{t}_{i}}{{y}_{i}}-(\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{t}_{i}})(\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{y}_{i}})/6}{\underset{i=1}{\overset{6}{\mathop{\sum }}}\,t_{i}^{2}-{{(\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{t}_{i}})}^{2}}/6} \\ & & \\ & \hat{b}= & \frac{-4320.3362-(2100)(-9.6476)/6}{910,000-{{(2100)}^{2}}/6} \end{align}\,\! }[/math]
or:
- [math]\displaystyle{ \hat{b}=-0.005392\,\! }[/math]
and:
- [math]\displaystyle{ \hat{a}=\overline{y}-\hat{b}\overline{t}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}-\hat{b}\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{t}_{i}}}{N}\,\! }[/math]
or:
- [math]\displaystyle{ \hat{a}=\frac{-9.6476}{6}-(-0.005392)\frac{2100}{6}=0.2793\,\! }[/math]
Therefore:
- [math]\displaystyle{ \hat{\lambda }=-\hat{b}=-(-0.005392)=0.05392\text{ failures/hour}\,\! }[/math]
and:
- [math]\displaystyle{ \hat{\gamma }=\frac{\hat{a}}{\hat{\lambda }}=\frac{0.2793}{0.005392}\,\! }[/math]
or:
- [math]\displaystyle{ \hat{\gamma }\simeq 51.8\text{ hours}\,\! }[/math]
Then:
- [math]\displaystyle{ f(T)=(0.005392){{e}^{-0.005392(T-51.8)}}\,\! }[/math]
Using Weibull++, the estimated parameters are:
- [math]\displaystyle{ \begin{align} \hat{\lambda }= & 0.0054\text{ failures/hour} \\ \hat{\gamma }= & 51.82\text{ hours} \end{align}\,\! }[/math]
The small difference in the values from Weibull++ is due to rounding. In the application, the calculations and the rank values are carried out up to the [math]\displaystyle{ 15^{th}\,\! }[/math] decimal point.
Example 9:
A number of leukemia patients were treated with either drug 6MP or a placebo, and the times in weeks until cancer symptoms returned were recorded. Analyze each treatment separately [21, p.175].
Table - Leukemia Treatment Results | |||
Time (weeks) | Number of Patients | Treament | Comments |
---|---|---|---|
1 | 2 | placebo | |
2 | 2 | placebo | |
3 | 1 | placebo | |
4 | 2 | placebo | |
5 | 2 | placebo | |
6 | 4 | 6MP | 3 patients completed |
7 | 1 | 6MP | |
8 | 4 | placebo | |
9 | 1 | 6MP | Not completed |
10 | 2 | 6MP | 1 patient completed |
11 | 2 | placebo | |
11 | 1 | 6MP | Not completed |
12 | 2 | placebo | |
13 | 1 | 6MP | |
15 | 1 | placebo | |
16 | 1 | 6MP | |
17 | 1 | placebo | |
17 | 1 | 6MP | Not completed |
19 | 1 | 6MP | Not completed |
20 | 1 | 6MP | Not completed |
22 | 1 | placebo | |
22 | 1 | 6MP | |
23 | 1 | placebo | |
23 | 1 | 6MP | |
25 | 1 | 6MP | Not completed |
32 | 2 | 6MP | Not completed |
34 | 1 | 6MP | Not completed |
35 | 1 | 6MP | Not completed |
Create a new Weibull++ standard folio that's configured for grouped times-to-failure data with suspensions. In the first column, enter the number of patients. Whenever there are uncompleted tests, enter the number of patients who completed the test separately from the number of patients who did not (e.g., if 4 patients had symptoms return after 6 weeks and only 3 of them completed the test, then enter 1 in one row and 3 in another). In the second column enter F if the patients completed the test and S if they didn't. In the third column enter the time, and in the fourth column (Subset ID) specify whether the 6MP drug or a placebo was used.
Next, open the Batch Auto Run utility and select to separate the 6MP drug from the placebo, as shown next.
The software will create two data sheets, one for each subset ID, as shown next.
Calculate both data sheets using the 2-parameter exponential distribution and the MLE analysis method, then insert an additional plot and select to show the analysis results for both data sheets on that plot, which will appear as shown next.
Confidence Bounds
In this section, we present the methods used in the application to estimate the different types of confidence bounds for exponentially distributed data. The complete derivations were presented in detail (for a general function) in the chapter for Confidence Bounds. At this time we should point out that exact confidence bounds for the exponential distribution have been derived, and exist in a closed form, utilizing the [math]\displaystyle{ {{\chi }^{2}} }[/math] distribution. These are described in detail in Kececioglu [20], and are covered in the section in the test design chapter. For most exponential data analyses, Weibull++ will use the approximate confidence bounds, provided from the Fisher information matrix or the likelihood ratio, in order to stay consistent with all of the other available distributions in the application. The [math]\displaystyle{ {{\chi }^{2}} }[/math] confidence bounds for the exponential distribution are discussed in more detail in the test design chapter.
Fisher Matrix Bounds
Bounds on the Parameters
For the failure rate [math]\displaystyle{ \hat{\lambda } }[/math] the upper ([math]\displaystyle{ {{\lambda }_{U}} }[/math]) and lower ([math]\displaystyle{ {{\lambda }_{L}} }[/math]) bounds are estimated by [30]:
- [math]\displaystyle{ \begin{align} & {{\lambda }_{U}}= & \hat{\lambda }\cdot {{e}^{\left[ \tfrac{{{K}_{\alpha }}\sqrt{Var(\hat{\lambda })}}{\hat{\lambda }} \right]}} \\ & & \\ & {{\lambda }_{L}}= & \frac{\hat{\lambda }}{{{e}^{\left[ \tfrac{{{K}_{\alpha }}\sqrt{Var(\hat{\lambda })}}{\hat{\lambda }} \right]}}} \end{align} }[/math]
where [math]\displaystyle{ {{K}_{\alpha }} }[/math] is defined by:
- [math]\displaystyle{ \alpha =\frac{1}{\sqrt{2\pi }}\int_{{{K}_{\alpha }}}^{\infty }{{e}^{-\tfrac{{{t}^{2}}}{2}}}dt=1-\Phi ({{K}_{\alpha }}) }[/math]
If [math]\displaystyle{ \delta }[/math] is the confidence level, then [math]\displaystyle{ \alpha =\tfrac{1-\delta }{2} }[/math] for the two-sided bounds, and [math]\displaystyle{ \alpha =1-\delta }[/math] for the one-sided bounds.
The variance of [math]\displaystyle{ \hat{\lambda }, }[/math] [math]\displaystyle{ Var(\hat{\lambda }), }[/math] is estimated from the Fisher matrix, as follows:
- [math]\displaystyle{ Var(\hat{\lambda })={{\left( -\frac{{{\partial }^{2}}\Lambda }{\partial {{\lambda }^{2}}} \right)}^{-1}} }[/math]
where [math]\displaystyle{ \Lambda }[/math] is the log-likelihood function of the exponential distribution, described in Appendix for log-likelihood function.
Note that no true MLE solution exists for the case of the two-parameter exponential distribution. The mathematics simply break down while trying to simultaneously solve the partial derivative equations for both the [math]\displaystyle{ \gamma }[/math] and [math]\displaystyle{ \lambda }[/math] parameters, resulting in unrealistic conditions. The way around this conundrum involves setting [math]\displaystyle{ \gamma ={{t}_{1}}, }[/math] or the first time-to-failure, and calculating [math]\displaystyle{ \lambda }[/math] in the regular fashion for this methodology. Weibull++ treats [math]\displaystyle{ \gamma }[/math] as a constant when computing bounds, i.e. [math]\displaystyle{ Var(\hat{\gamma })=0. }[/math] (See the discussion in Appendix for log-likelihood function for more information.)
The exponential distribution is a commonly used distribution in reliability engineering. Mathematically, it is a fairly simple distribution, which many times leads to its use in inappropriate situations. It is, in fact, a special case of the Weibull distribution where [math]\displaystyle{ \beta =1 }[/math]. The exponential distribution is used to model the behavior of units that have a constant failure rate (or units that do not degrade with time or wear out).
Exponential Probability Density Function
The Two-Parameter Exponential Distribution
The two-parameter exponential pdf is given by:
- [math]\displaystyle{ f(t)=\lambda {{e}^{-\lambda (t-\gamma )}},f(t)\ge 0,\lambda \gt 0,t\ge 0\text{ or }\gamma }[/math]
where [math]\displaystyle{ \gamma }[/math] is the location parameter. Some of the characteristics of the two-parameter exponential distribution are [19]:
- The location parameter, [math]\displaystyle{ \gamma }[/math], if positive, shifts the beginning of the distribution by a distance of [math]\displaystyle{ \gamma }[/math] to the right of the origin, signifying that the chance failures start to occur only after [math]\displaystyle{ \gamma }[/math] hours of operation, and cannot occur before.
- The scale parameter is [math]\displaystyle{ \tfrac{1}{\lambda }=\bar{t}-\gamma =m-\gamma }[/math].
- The exponential [math]\displaystyle{ pdf }[/math] has no shape parameter, as it has only one shape.
- The distribution starts at [math]\displaystyle{ t=\gamma }[/math] at the level of [math]\displaystyle{ f(t=\gamma )=\lambda }[/math] and decreases thereafter exponentially and monotonically as [math]\displaystyle{ t }[/math] increases beyond [math]\displaystyle{ \gamma }[/math] and is convex.
- As [math]\displaystyle{ t\to \infty }[/math], [math]\displaystyle{ f(t)\to 0 }[/math].
The One-Parameter Exponential Distribution
The one-parameter exponential [math]\displaystyle{ pdf }[/math] is obtained by setting [math]\displaystyle{ \gamma =0 }[/math], and is given by:
- [math]\displaystyle{ \begin{align}f(t)= & \lambda {{e}^{-\lambda t}}=\frac{1}{m}{{e}^{-\tfrac{1}{m}t}}, & t\ge 0, \lambda \gt 0,m\gt 0 \end{align} }[/math]
where:
- [math]\displaystyle{ \lambda }[/math] = constant rate, in failures per unit of measurement, e.g failures per hour, per cycle, etc.,
- [math]\displaystyle{ \lambda =\frac{1}{m} }[/math],
- [math]\displaystyle{ m }[/math] = mean time between failures, or to failure,
- [math]\displaystyle{ t }[/math] = operating time, life, or age, in hours, cycles, miles, actuations, etc.
This distribution requires the knowledge of only one parameter, [math]\displaystyle{ \lambda }[/math], for its application. Some of the characteristics of the one-parameter exponential distribution are [19]:
- The location parameter, [math]\displaystyle{ \gamma }[/math], is zero.
- The scale parameter is [math]\displaystyle{ \tfrac{1}{\lambda }=m }[/math].
- As [math]\displaystyle{ \lambda }[/math] is decreased in value, the distribution is stretched out to the right, and as [math]\displaystyle{ \lambda }[/math] is increased, the distribution is pushed toward the origin.
- This distribution has no shape parameter as it has only one shape, i.e. the exponential, and the only parameter it has is the failure rate, [math]\displaystyle{ \lambda }[/math].
- The distribution starts at [math]\displaystyle{ t=0 }[/math] at the level of [math]\displaystyle{ f(t=0)=\lambda }[/math] and decreases thereafter exponentially and monotonically as [math]\displaystyle{ t }[/math] increases, and is convex.
- As [math]\displaystyle{ t\to \infty }[/math] , [math]\displaystyle{ f(t)\to 0 }[/math].
- The [math]\displaystyle{ pdf }[/math] can be thought of as a special case of the Weibull [math]\displaystyle{ pdf }[/math] with [math]\displaystyle{ \gamma =0 }[/math] and [math]\displaystyle{ \beta =1 }[/math].
Exponential Statistical Properties
The Mean or MTTF
The mean, [math]\displaystyle{ \overline{T}, }[/math] or mean time to failure (MTTF) is given by:
- [math]\displaystyle{ \begin{align} \bar{T}= & \int_{\gamma }^{\infty }t\cdot f(t)dt \\ = & \int_{\gamma }^{\infty }t\cdot \lambda \cdot {{e}^{-\lambda t}}dt \\ = & \gamma +\frac{1}{\lambda }=m \end{align} }[/math]
Note that when [math]\displaystyle{ \gamma =0 }[/math], the MTTF is the inverse of the exponential distribution's constant failure rate. This is only true for the exponential distribution. Most other distributions do not have a constant failure rate. Consequently, the inverse relationship between failure rate and MTTF does not hold for these other distributions.
The Median
The median, [math]\displaystyle{ \breve{T}, }[/math] is:
- [math]\displaystyle{ \breve{T}=\gamma +\frac{1}{\lambda}\cdot 0.693 }[/math]
The Mode
The mode, [math]\displaystyle{ \tilde{T}, }[/math] is:
- [math]\displaystyle{ \tilde{T}=\gamma }[/math]
The Standard Deviation
The standard deviation, [math]\displaystyle{ {\sigma }_{T} }[/math], is:
- [math]\displaystyle{ {\sigma}_{T}=\frac{1}{\lambda }=m }[/math]
The Exponential Reliability Function
The equation for the two-parameter exponential cumulative density function, or [math]\displaystyle{ cdf, }[/math] is given by:
- [math]\displaystyle{ F(t)=Q(t)=1-{{e}^{-\lambda (t-\gamma )}} }[/math]
Recalling that the reliability function of a distribution is simply one minus the [math]\displaystyle{ cdf }[/math], the reliability function of the two-parameter exponential distribution is given by:
- [math]\displaystyle{ R(t)=1-Q(t)=1-\int_{0}^{t-\gamma }f(x)dx }[/math]
- [math]\displaystyle{ R(t)=1-\int_{0}^{t-\gamma }\lambda {{e}^{-\lambda x}}dx={{e}^{-\lambda (t-\gamma )}} }[/math]
One-Parameter Exponential Reliability Function
The one-parameter exponential reliability function is given by:
- [math]\displaystyle{ R(t)={{e}^{-\lambda t}}={{e}^{-\tfrac{t}{m}}} }[/math]
The Exponential Conditional Reliability
The exponential conditional reliability equation gives the reliability for a mission of [math]\displaystyle{ t }[/math] duration, having already successfully accumulated [math]\displaystyle{ T }[/math] hours of operation up to the start of this new mission. The exponential conditional reliability function is:
- [math]\displaystyle{ R(t|T)=\frac{R(T+t)}{R(T)}=\frac{{{e}^{-\lambda (T+t-\gamma )}}}{{{e}^{-\lambda (T-\gamma )}}}={{e}^{-\lambda t}} }[/math]
which says that the reliability for a mission of [math]\displaystyle{ t }[/math] duration undertaken after the component or equipment has already accumulated [math]\displaystyle{ T }[/math] hours of operation from age zero is only a function of the mission duration, and not a function of the age at the beginning of the mission. This is referred to as the memoryless property.
The Exponential Reliable Life
The reliable life, or the mission duration for a desired reliability goal, [math]\displaystyle{ {{t}_{R}} }[/math], for the one-parameter exponential distribution is:
- [math]\displaystyle{ R({{t}_{R}})={{e}^{-\lambda ({{t}_{R}}-\gamma )}} }[/math]
- [math]\displaystyle{ \ln[R({{t}_{R}})]=-\lambda({{t}_{R}}-\gamma ) }[/math]
or:
- [math]\displaystyle{ {{t}_{R}}=\gamma -\frac{\ln [R({{t}_{R}})]}{\lambda } }[/math]
The Exponential Failure Rate Function
The exponential failure rate function is:
- [math]\displaystyle{ \lambda (t)=\frac{f(t)}{R(t)}=\frac{\lambda {{e}^{-\lambda (t-\gamma )}}}{{{e}^{-\lambda (t-\gamma )}}}=\lambda =\text{constant} }[/math]
Once again, note that the constant failure rate is a characteristic of the exponential distribution, and special cases of other distributions only. Most other distributions have failure rates that are functions of time.
Characteristics of the Exponential Distribution
As mentioned before, the primary trait of the exponential distribution is that it is used for modeling the behavior of items with a constant failure rate. It has a fairly simple mathematical form, which makes it fairly easy to manipulate. Unfortunately, this fact also leads to the use of this model in situations where it is not appropriate. For example, it would not be appropriate to use the exponential distribution to model the reliability of an automobile. The constant failure rate of the exponential distribution would require the assumption that the automobile would be just as likely to experience a breakdown during the first mile as it would during the one-hundred-thousandth mile. Clearly, this is not a valid assumption. However, some inexperienced practitioners of reliability engineering and life data analysis will overlook this fact, lured by the siren-call of the exponential distribution's relatively simple mathematical models.
The Effect of [math]\displaystyle{ \lambda }[/math] and [math]\displaystyle{ \gamma }[/math] on the Exponential [math]\displaystyle{ pdf }[/math]
- The exponential [math]\displaystyle{ pdf }[/math] has no shape parameter, as it has only one shape.
- The exponential [math]\displaystyle{ pdf }[/math] is always convex and is stretched to the right as [math]\displaystyle{ \lambda }[/math] decreases in value.
- The value of the [math]\displaystyle{ pdf }[/math] function is always equal to the value of [math]\displaystyle{ \lambda }[/math] at [math]\displaystyle{ t=0 }[/math] (or [math]\displaystyle{ t=\gamma }[/math]).
- The location parameter, [math]\displaystyle{ \gamma }[/math], if positive, shifts the beginning of the distribution by a distance of [math]\displaystyle{ \gamma }[/math] to the right of the origin, signifying that the chance failures start to occur only after [math]\displaystyle{ \gamma }[/math] hours of operation, and cannot occur before this time.
- The scale parameter is [math]\displaystyle{ \tfrac{1}{\lambda }=\bar{T}-\gamma =m-\gamma }[/math].
- As [math]\displaystyle{ t\to \infty }[/math], [math]\displaystyle{ f(t)\to 0 }[/math].
The Effect of [math]\displaystyle{ \lambda }[/math] and [math]\displaystyle{ \gamma }[/math] on the Exponential Reliability Function
- The one-parameter exponential reliability function starts at the value of 100% at [math]\displaystyle{ t=0 }[/math], decreases thereafter monotonically and is convex.
- The two-parameter exponential reliability function remains at the value of 100% for [math]\displaystyle{ t=0 }[/math] up to [math]\displaystyle{ t=\gamma }[/math], and decreases thereafter monotonically and is convex.
- As [math]\displaystyle{ t\to \infty }[/math] , [math]\displaystyle{ R(t\to \infty )\to 0 }[/math].
- The reliability for a mission duration of [math]\displaystyle{ t=m=\tfrac{1}{\lambda } }[/math], or of one MTTF duration, is always equal to [math]\displaystyle{ 0.3679 }[/math] or 36.79%. This means that the reliability for a mission which is as long as one MTTF is relatively low and is not recommended because only 36.8% of the missions will be completed successfully. In other words, of the equipment undertaking such a mission, only 36.8% will survive their mission.
Estimation of the Exponential Parameters
Probability Plotting
Estimation of the parameters for the exponential distribution via probability plotting is very similar to the process used when dealing with the Weibull distribution. Recall, however, that the appearance of the probability plotting paper and the methods by which the parameters are estimated vary from distribution to distribution, so there will be some noticeable differences. In fact, due to the nature of the exponential [math]\displaystyle{ cdf }[/math], the exponential probability plot is the only one with a negative slope. This is because the y-axis of the exponential probability plotting paper represents the reliability, whereas the y-axis for most of the other life distributions represents the unreliability.
This is illustrated in the process of linearizing the [math]\displaystyle{ cdf }[/math], which is necessary to construct the exponential probability plotting paper. For the two-parameter exponential distribution the cumulative density function is given by:
- [math]\displaystyle{ F(t)=1-{{e}^{-\lambda (t-\gamma )}} }[/math]
Taking the natural logarithm of both sides of the above equation yields:
- [math]\displaystyle{ \ln \left[ 1-F(t) \right]=-\lambda (t-\gamma ) }[/math]
or:
- [math]\displaystyle{ \ln [1-F(t)]=\lambda \gamma -\lambda t }[/math]
Now, let:
- [math]\displaystyle{ y=\ln [1-F(t)] }[/math]
- [math]\displaystyle{ a=\lambda \gamma }[/math]
and:
- [math]\displaystyle{ b=-\lambda }[/math]
which results in the linear equation of:
- [math]\displaystyle{ y=a+bt }[/math]
Note that with the exponential probability plotting paper, the y-axis scale is logarithmic and the x-axis scale is linear. This means that the zero value is present only on the x-axis. For [math]\displaystyle{ t=0 }[/math], [math]\displaystyle{ R=1 }[/math] and [math]\displaystyle{ F(t)=0 }[/math]. So if we were to use [math]\displaystyle{ F(t) }[/math] for the y-axis, we would have to plot the point [math]\displaystyle{ (0,0) }[/math]. However, since the y-axis is logarithmic, there is no place to plot this on the exponential paper. Also, the failure rate, [math]\displaystyle{ \lambda }[/math], is the negative of the slope of the line, but there is an easier way to determine the value of [math]\displaystyle{ \lambda }[/math] from the probability plot, as will be illustrated in the following example.
Example 1:
1-Parameter Exponential Probability Plot Example
6 units are put on a life test and tested to failure. The failure times are 7, 12, 19, 29, 41, and 67 hours. Estimate the failure rate for a 1-parameter exponential distribution using the probability plotting method.
In order to plot the points for the probability plot, the appropriate reliability estimate values must be obtained. These will be equivalent to [math]\displaystyle{ 100%-MR\,\! }[/math] since the y-axis represents the reliability and the [math]\displaystyle{ MR\,\! }[/math] values represent unreliability estimates.
Next, these points are plotted on an exponential probability plotting paper. A sample of this type of plotting paper is shown next, with the sample points in place. Notice how these points describe a line with a negative slope.
Once the points are plotted, draw the best possible straight line through these points. The time value at which this line intersects with a horizontal line drawn at the 36.8% reliability mark is the mean life, and the reciprocal of this is the failure rate [math]\displaystyle{ \lambda\,\! }[/math]. This is because at [math]\displaystyle{ t=m=\tfrac{1}{\lambda }\,\! }[/math]:
- [math]\displaystyle{ \begin{align} R(t)= & {{e}^{-\lambda \cdot t}} \\ R(t)= & {{e}^{-\lambda \cdot \tfrac{1}{\lambda }}} \\ R(t)= & {{e}^{-1}}=0.368=36.8%. \end{align}\,\! }[/math]
The following plot shows that the best-fit line through the data points crosses the [math]\displaystyle{ R=36.8%\,\! }[/math] line at [math]\displaystyle{ t=33\,\! }[/math] hours. And because [math]\displaystyle{ \tfrac{1}{\lambda }=33\,\! }[/math] hours, [math]\displaystyle{ \lambda =0.0303\,\! }[/math] failures/hour.
Rank Regression on Y for Exponential Distribution
Performing a rank regression on Y requires that a straight line be fitted to the set of available data points such that the sum of the squares of the vertical deviations from the points to the line is minimized. The least squares parameter estimation method (regression analysis) was discussed in Chapter Parameter Estimation, and the following equations for rank regression on Y (RRY) were derived:
- [math]\displaystyle{ \hat{a}=\bar{y}-\hat{b}\bar{x}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}-\hat{b}\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}}{N} }[/math]
and:
- [math]\displaystyle{ \hat{b}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}-\tfrac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,x_{i}^{2}-\tfrac{{{\left( \underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}} \right)}^{2}}}{N}} }[/math]
In our case, the equations for [math]\displaystyle{ {{y}_{i}} }[/math] and [math]\displaystyle{ {{x}_{i}} }[/math] are:
- [math]\displaystyle{ {{y}_{i}}=\ln [1-F({{t}_{i}})] }[/math]
and:
- [math]\displaystyle{ {{x}_{i}}={{t}_{i}} }[/math]
and the [math]\displaystyle{ F({{t}_{i}}) }[/math] is estimated from the median ranks. Once [math]\displaystyle{ \hat{a} }[/math] and [math]\displaystyle{ \hat{b} }[/math] are obtained, then [math]\displaystyle{ \hat{\lambda } }[/math] and [math]\displaystyle{ \hat{\gamma } }[/math] can easily be obtained from above equations.
For the one-parameter exponential, equations for estimating a and b become:
- [math]\displaystyle{ \begin{align} \hat{a}= & 0, \\ \hat{b}= & \frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,x_{i}^{2}} \end{align} }[/math]
The Correlation Coefficient
The estimator of [math]\displaystyle{ \rho }[/math] is the sample correlation coefficient, [math]\displaystyle{ \hat{\rho } }[/math], given by:
- [math]\displaystyle{ \hat{\rho }=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,({{x}_{i}}-\overline{x})({{y}_{i}}-\overline{y})}{\sqrt{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{({{x}_{i}}-\overline{x})}^{2}}\cdot \underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{({{y}_{i}}-\overline{y})}^{2}}}} }[/math]
Example 2:
Template loop detected: Template:2 parameter exponential distribution RRY example
Rank Regression on X for Exponential Distribution
Similar to rank regression on Y, performing a rank regression on X requires that a straight line be fitted to a set of data points such that the sum of the squares of the horizontal deviations from the points to the line is minimized.
Again the first task is to bring our exponential [math]\displaystyle{ cdf }[/math] function into a linear form. This step is exactly the same as in regression on Y analysis. The deviation from the previous analysis begins on the least squares fit step, since in this case we treat [math]\displaystyle{ x }[/math] as the dependent variable and [math]\displaystyle{ y }[/math] as the independent variable. The best-fitting straight line to the data, for regression on X (see Chapter Parameter Estimation), is the straight line:
- [math]\displaystyle{ x=\hat{a}+\hat{b}y }[/math]
The corresponding equations for [math]\displaystyle{ \hat{a} }[/math] and [math]\displaystyle{ \hat{b} }[/math] are:
- [math]\displaystyle{ \hat{a}=\overline{x}-\hat{b}\overline{y}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}}{N}-\hat{b}\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N} }[/math]
and:
- [math]\displaystyle{ \hat{b}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}-\tfrac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,y_{i}^{2}-\tfrac{{{\left( \underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}} \right)}^{2}}}{N}} }[/math]
where:
- [math]\displaystyle{ {{y}_{i}}=\ln [1-F({{t}_{i}})] }[/math]
and:
- [math]\displaystyle{ {{x}_{i}}={{t}_{i}} }[/math]
The values of [math]\displaystyle{ F({{t}_{i}}) }[/math] are estimated from the median ranks. Once [math]\displaystyle{ \hat{a} }[/math] and [math]\displaystyle{ \hat{b} }[/math] are obtained, solve for the unknown [math]\displaystyle{ y }[/math] value, which corresponds to:
- [math]\displaystyle{ y=-\frac{\hat{a}}{\hat{b}}+\frac{1}{\hat{b}}x }[/math]
Solving for the parameters from above equations we get:
- [math]\displaystyle{ a=-\frac{\hat{a}}{\hat{b}}=\lambda \gamma \Rightarrow \gamma =\hat{a} }[/math]
and:
- [math]\displaystyle{ b=\frac{1}{\hat{b}}=-\lambda \Rightarrow \lambda =-\frac{1}{\hat{b}} }[/math]
For the one-parameter exponential case, equations for estimating a and b become:
- [math]\displaystyle{ \begin{align} \hat{a}= & 0 \\ \hat{b}= & \frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,y_{i}^{2}} \end{align} }[/math]
The correlation coefficient is evaluated as before.
Example 3:
Template loop detected: Template:Example: 2 Parameter Exponential Distribution RRX
Maximum Likelihood Estimation for Exponential Distribution
As outlined in Chapter Parameter Estimation, maximum likelihood estimation works by developing a likelihood function based on the available data and finding the values of the parameter estimates that maximize the likelihood function. This can be achieved by using iterative methods to determine the parameter estimate values that maximize the likelihood function. This can be rather difficult and time-consuming, particularly when dealing with the three-parameter distribution. Another method of finding the parameter estimates involves taking the partial derivatives of the likelihood equation with respect to the parameters, setting the resulting equations equal to zero, and solving simultaneously to determine the values of the parameter estimates. The log-likelihood functions and associated partial derivatives used to determine maximum likelihood estimates for the exponential distribution are covered in Appendix.
Example 4: Template loop detected: Template:Example: MLE for Exponential Distribution
Confidence Bounds
In this section, we present the methods used in the application to estimate the different types of confidence bounds for exponentially distributed data. The complete derivations were presented in detail (for a general function) in the chapter for Confidence Bounds. At this time we should point out that exact confidence bounds for the exponential distribution have been derived, and exist in a closed form, utilizing the [math]\displaystyle{ {{\chi }^{2}} }[/math] distribution. These are described in detail in Kececioglu [20], and are covered in the section in the test design chapter. For most exponential data analyses, Weibull++ will use the approximate confidence bounds, provided from the Fisher information matrix or the likelihood ratio, in order to stay consistent with all of the other available distributions in the application. The [math]\displaystyle{ {{\chi }^{2}} }[/math] confidence bounds for the exponential distribution are discussed in more detail in the test design chapter.
Fisher Matrix Bounds
Bounds on the Parameters
For the failure rate [math]\displaystyle{ \hat{\lambda } }[/math] the upper ([math]\displaystyle{ {{\lambda }_{U}} }[/math]) and lower ([math]\displaystyle{ {{\lambda }_{L}} }[/math]) bounds are estimated by [30]:
- [math]\displaystyle{ \begin{align} & {{\lambda }_{U}}= & \hat{\lambda }\cdot {{e}^{\left[ \tfrac{{{K}_{\alpha }}\sqrt{Var(\hat{\lambda })}}{\hat{\lambda }} \right]}} \\ & & \\ & {{\lambda }_{L}}= & \frac{\hat{\lambda }}{{{e}^{\left[ \tfrac{{{K}_{\alpha }}\sqrt{Var(\hat{\lambda })}}{\hat{\lambda }} \right]}}} \end{align} }[/math]
where [math]\displaystyle{ {{K}_{\alpha }} }[/math] is defined by:
- [math]\displaystyle{ \alpha =\frac{1}{\sqrt{2\pi }}\int_{{{K}_{\alpha }}}^{\infty }{{e}^{-\tfrac{{{t}^{2}}}{2}}}dt=1-\Phi ({{K}_{\alpha }}) }[/math]
If [math]\displaystyle{ \delta }[/math] is the confidence level, then [math]\displaystyle{ \alpha =\tfrac{1-\delta }{2} }[/math] for the two-sided bounds, and [math]\displaystyle{ \alpha =1-\delta }[/math] for the one-sided bounds.
The variance of [math]\displaystyle{ \hat{\lambda }, }[/math] [math]\displaystyle{ Var(\hat{\lambda }), }[/math] is estimated from the Fisher matrix, as follows:
- [math]\displaystyle{ Var(\hat{\lambda })={{\left( -\frac{{{\partial }^{2}}\Lambda }{\partial {{\lambda }^{2}}} \right)}^{-1}} }[/math]
where [math]\displaystyle{ \Lambda }[/math] is the log-likelihood function of the exponential distribution, described in Appendix for log-likelihood function.
Note that no true MLE solution exists for the case of the two-parameter exponential distribution. The mathematics simply break down while trying to simultaneously solve the partial derivative equations for both the [math]\displaystyle{ \gamma }[/math] and [math]\displaystyle{ \lambda }[/math] parameters, resulting in unrealistic conditions. The way around this conundrum involves setting [math]\displaystyle{ \gamma ={{t}_{1}}, }[/math] or the first time-to-failure, and calculating [math]\displaystyle{ \lambda }[/math] in the regular fashion for this methodology. Weibull++ treats [math]\displaystyle{ \gamma }[/math] as a constant when computing bounds, i.e. [math]\displaystyle{ Var(\hat{\gamma })=0. }[/math] (See the discussion in Appendix for log-likelihood function for more information.)
Template loop detected: Template:Bounds on Reliability FMB ED
Template loop detected: Template:Bounds on Time FMB ED
Template loop detected: Template:Likelihood Ratio Confidence Bounds FMB ED
Template loop detected: Template:Bayesian Confidence Bounds ED
General Examples
Example 8:
20 units were reliability tested with the following results:
Table - Life Test Data | |
Number of Units in Group | Time-to-Failure |
---|---|
7 | 100 |
5 | 200 |
3 | 300 |
2 | 400 |
1 | 500 |
2 | 600 |
1. Assuming a 2-parameter exponential distribution, estimate the parameters by hand using the MLE analysis method.
2. Repeat the above using Weibull++. (Enter the data as grouped data to duplicate the results.)
3. Show the Probability plot for the analysis results.
4. Show the Reliability vs. Time plot for the results.
5. Show the pdf plot for the results.
6. Show the Failure Rate vs. Time plot for the results.
7. Estimate the parameters using the rank regression on Y (RRY) analysis method (and using grouped ranks).
Solution
1. For the 2-parameter exponential distribution and for [math]\displaystyle{ \hat{\gamma }=100\,\! }[/math] hours (first failure), the partial of the log-likelihood function, [math]\displaystyle{ \lambda\,\! }[/math], becomes:
- [math]\displaystyle{ \begin{align} \frac{\partial \Lambda }{\partial \lambda }= &\underset{i=1}{\overset{6}{\mathop \sum }}\,{N_i} \left[ \frac{1}{\lambda }-\left( {{T}_{i}}-100 \right) \right]=0\\ \Rightarrow & 7[\frac{1}{\lambda }-(100-100)]+5[\frac{1}{\lambda}-(200-100)] + \ldots +2[\frac{1}{\lambda}-(600-100)]\\ = & 0\\ \Rightarrow & \hat{\lambda}=\frac{20}{3100}=0.0065 \text{fr/hr} \end{align} \,\! }[/math]
2. Enter the data in a Weibull++ standard folio and calculate it as shown next.
3. On the Plot page of the folio, the exponential Probability plot will appear as shown next.
4. View the Reliability vs. Time plot.
5. View the pdf plot.
6. View the Failure Rate vs. Time plot.
Note that, as described at the beginning of this chapter, the failure rate for the exponential distribution is constant. Also note that the Failure Rate vs. Time plot does show values for times before the location parameter, [math]\displaystyle{ \gamma \,\! }[/math], at 100 hours.
7. In the case of grouped data, one must be cautious when estimating the parameters using a rank regression method. This is because the median rank values are determined from the total number of failures observed by time [math]\displaystyle{ {{T}_{i}}\,\! }[/math] where [math]\displaystyle{ i\,\! }[/math] indicates the group number. In this example, the total number of groups is [math]\displaystyle{ N=6\,\! }[/math] and the total number of units is [math]\displaystyle{ {{N}_{T}}=20\,\! }[/math]. Thus, the median rank values will be estimated for 20 units and for the total failed units ([math]\displaystyle{ {{N}_{{{F}_{i}}}}\,\! }[/math]) up to the [math]\displaystyle{ {{i}^{th}}\,\! }[/math] group, for the [math]\displaystyle{ {{i}^{th}}\,\! }[/math] rank value. The median ranks values can be found from rank tables or they can be estimated using ReliaSoft's Quick Statistical Reference tool.
For example, the median rank value of the fourth group will be the [math]\displaystyle{ {{17}^{th}}\,\! }[/math] rank out of a sample size of twenty units (or 81.945%).
The following table is then constructed.
Given the values in the table above, calculate [math]\displaystyle{ \hat{a}\,\! }[/math] and [math]\displaystyle{ \hat{b}\,\! }[/math]:
- [math]\displaystyle{ \begin{align} & \hat{b}= & \frac{\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{t}_{i}}{{y}_{i}}-(\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{t}_{i}})(\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{y}_{i}})/6}{\underset{i=1}{\overset{6}{\mathop{\sum }}}\,t_{i}^{2}-{{(\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{t}_{i}})}^{2}}/6} \\ & & \\ & \hat{b}= & \frac{-4320.3362-(2100)(-9.6476)/6}{910,000-{{(2100)}^{2}}/6} \end{align}\,\! }[/math]
or:
- [math]\displaystyle{ \hat{b}=-0.005392\,\! }[/math]
and:
- [math]\displaystyle{ \hat{a}=\overline{y}-\hat{b}\overline{t}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}-\hat{b}\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{t}_{i}}}{N}\,\! }[/math]
or:
- [math]\displaystyle{ \hat{a}=\frac{-9.6476}{6}-(-0.005392)\frac{2100}{6}=0.2793\,\! }[/math]
Therefore:
- [math]\displaystyle{ \hat{\lambda }=-\hat{b}=-(-0.005392)=0.05392\text{ failures/hour}\,\! }[/math]
and:
- [math]\displaystyle{ \hat{\gamma }=\frac{\hat{a}}{\hat{\lambda }}=\frac{0.2793}{0.005392}\,\! }[/math]
or:
- [math]\displaystyle{ \hat{\gamma }\simeq 51.8\text{ hours}\,\! }[/math]
Then:
- [math]\displaystyle{ f(T)=(0.005392){{e}^{-0.005392(T-51.8)}}\,\! }[/math]
Using Weibull++, the estimated parameters are:
- [math]\displaystyle{ \begin{align} \hat{\lambda }= & 0.0054\text{ failures/hour} \\ \hat{\gamma }= & 51.82\text{ hours} \end{align}\,\! }[/math]
The small difference in the values from Weibull++ is due to rounding. In the application, the calculations and the rank values are carried out up to the [math]\displaystyle{ 15^{th}\,\! }[/math] decimal point.
Example 9:
A number of leukemia patients were treated with either drug 6MP or a placebo, and the times in weeks until cancer symptoms returned were recorded. Analyze each treatment separately [21, p.175].
Table - Leukemia Treatment Results | |||
Time (weeks) | Number of Patients | Treament | Comments |
---|---|---|---|
1 | 2 | placebo | |
2 | 2 | placebo | |
3 | 1 | placebo | |
4 | 2 | placebo | |
5 | 2 | placebo | |
6 | 4 | 6MP | 3 patients completed |
7 | 1 | 6MP | |
8 | 4 | placebo | |
9 | 1 | 6MP | Not completed |
10 | 2 | 6MP | 1 patient completed |
11 | 2 | placebo | |
11 | 1 | 6MP | Not completed |
12 | 2 | placebo | |
13 | 1 | 6MP | |
15 | 1 | placebo | |
16 | 1 | 6MP | |
17 | 1 | placebo | |
17 | 1 | 6MP | Not completed |
19 | 1 | 6MP | Not completed |
20 | 1 | 6MP | Not completed |
22 | 1 | placebo | |
22 | 1 | 6MP | |
23 | 1 | placebo | |
23 | 1 | 6MP | |
25 | 1 | 6MP | Not completed |
32 | 2 | 6MP | Not completed |
34 | 1 | 6MP | Not completed |
35 | 1 | 6MP | Not completed |
Create a new Weibull++ standard folio that's configured for grouped times-to-failure data with suspensions. In the first column, enter the number of patients. Whenever there are uncompleted tests, enter the number of patients who completed the test separately from the number of patients who did not (e.g., if 4 patients had symptoms return after 6 weeks and only 3 of them completed the test, then enter 1 in one row and 3 in another). In the second column enter F if the patients completed the test and S if they didn't. In the third column enter the time, and in the fourth column (Subset ID) specify whether the 6MP drug or a placebo was used.
Next, open the Batch Auto Run utility and select to separate the 6MP drug from the placebo, as shown next.
The software will create two data sheets, one for each subset ID, as shown next.
Calculate both data sheets using the 2-parameter exponential distribution and the MLE analysis method, then insert an additional plot and select to show the analysis results for both data sheets on that plot, which will appear as shown next.
The exponential distribution is a commonly used distribution in reliability engineering. Mathematically, it is a fairly simple distribution, which many times leads to its use in inappropriate situations. It is, in fact, a special case of the Weibull distribution where [math]\displaystyle{ \beta =1 }[/math]. The exponential distribution is used to model the behavior of units that have a constant failure rate (or units that do not degrade with time or wear out).
Exponential Probability Density Function
The Two-Parameter Exponential Distribution
The two-parameter exponential pdf is given by:
- [math]\displaystyle{ f(t)=\lambda {{e}^{-\lambda (t-\gamma )}},f(t)\ge 0,\lambda \gt 0,t\ge 0\text{ or }\gamma }[/math]
where [math]\displaystyle{ \gamma }[/math] is the location parameter. Some of the characteristics of the two-parameter exponential distribution are [19]:
- The location parameter, [math]\displaystyle{ \gamma }[/math], if positive, shifts the beginning of the distribution by a distance of [math]\displaystyle{ \gamma }[/math] to the right of the origin, signifying that the chance failures start to occur only after [math]\displaystyle{ \gamma }[/math] hours of operation, and cannot occur before.
- The scale parameter is [math]\displaystyle{ \tfrac{1}{\lambda }=\bar{t}-\gamma =m-\gamma }[/math].
- The exponential [math]\displaystyle{ pdf }[/math] has no shape parameter, as it has only one shape.
- The distribution starts at [math]\displaystyle{ t=\gamma }[/math] at the level of [math]\displaystyle{ f(t=\gamma )=\lambda }[/math] and decreases thereafter exponentially and monotonically as [math]\displaystyle{ t }[/math] increases beyond [math]\displaystyle{ \gamma }[/math] and is convex.
- As [math]\displaystyle{ t\to \infty }[/math], [math]\displaystyle{ f(t)\to 0 }[/math].
The One-Parameter Exponential Distribution
The one-parameter exponential [math]\displaystyle{ pdf }[/math] is obtained by setting [math]\displaystyle{ \gamma =0 }[/math], and is given by:
- [math]\displaystyle{ \begin{align}f(t)= & \lambda {{e}^{-\lambda t}}=\frac{1}{m}{{e}^{-\tfrac{1}{m}t}}, & t\ge 0, \lambda \gt 0,m\gt 0 \end{align} }[/math]
where:
- [math]\displaystyle{ \lambda }[/math] = constant rate, in failures per unit of measurement, e.g failures per hour, per cycle, etc.,
- [math]\displaystyle{ \lambda =\frac{1}{m} }[/math],
- [math]\displaystyle{ m }[/math] = mean time between failures, or to failure,
- [math]\displaystyle{ t }[/math] = operating time, life, or age, in hours, cycles, miles, actuations, etc.
This distribution requires the knowledge of only one parameter, [math]\displaystyle{ \lambda }[/math], for its application. Some of the characteristics of the one-parameter exponential distribution are [19]:
- The location parameter, [math]\displaystyle{ \gamma }[/math], is zero.
- The scale parameter is [math]\displaystyle{ \tfrac{1}{\lambda }=m }[/math].
- As [math]\displaystyle{ \lambda }[/math] is decreased in value, the distribution is stretched out to the right, and as [math]\displaystyle{ \lambda }[/math] is increased, the distribution is pushed toward the origin.
- This distribution has no shape parameter as it has only one shape, i.e. the exponential, and the only parameter it has is the failure rate, [math]\displaystyle{ \lambda }[/math].
- The distribution starts at [math]\displaystyle{ t=0 }[/math] at the level of [math]\displaystyle{ f(t=0)=\lambda }[/math] and decreases thereafter exponentially and monotonically as [math]\displaystyle{ t }[/math] increases, and is convex.
- As [math]\displaystyle{ t\to \infty }[/math] , [math]\displaystyle{ f(t)\to 0 }[/math].
- The [math]\displaystyle{ pdf }[/math] can be thought of as a special case of the Weibull [math]\displaystyle{ pdf }[/math] with [math]\displaystyle{ \gamma =0 }[/math] and [math]\displaystyle{ \beta =1 }[/math].
Exponential Statistical Properties
The Mean or MTTF
The mean, [math]\displaystyle{ \overline{T}, }[/math] or mean time to failure (MTTF) is given by:
- [math]\displaystyle{ \begin{align} \bar{T}= & \int_{\gamma }^{\infty }t\cdot f(t)dt \\ = & \int_{\gamma }^{\infty }t\cdot \lambda \cdot {{e}^{-\lambda t}}dt \\ = & \gamma +\frac{1}{\lambda }=m \end{align} }[/math]
Note that when [math]\displaystyle{ \gamma =0 }[/math], the MTTF is the inverse of the exponential distribution's constant failure rate. This is only true for the exponential distribution. Most other distributions do not have a constant failure rate. Consequently, the inverse relationship between failure rate and MTTF does not hold for these other distributions.
The Median
The median, [math]\displaystyle{ \breve{T}, }[/math] is:
- [math]\displaystyle{ \breve{T}=\gamma +\frac{1}{\lambda}\cdot 0.693 }[/math]
The Mode
The mode, [math]\displaystyle{ \tilde{T}, }[/math] is:
- [math]\displaystyle{ \tilde{T}=\gamma }[/math]
The Standard Deviation
The standard deviation, [math]\displaystyle{ {\sigma }_{T} }[/math], is:
- [math]\displaystyle{ {\sigma}_{T}=\frac{1}{\lambda }=m }[/math]
The Exponential Reliability Function
The equation for the two-parameter exponential cumulative density function, or [math]\displaystyle{ cdf, }[/math] is given by:
- [math]\displaystyle{ F(t)=Q(t)=1-{{e}^{-\lambda (t-\gamma )}} }[/math]
Recalling that the reliability function of a distribution is simply one minus the [math]\displaystyle{ cdf }[/math], the reliability function of the two-parameter exponential distribution is given by:
- [math]\displaystyle{ R(t)=1-Q(t)=1-\int_{0}^{t-\gamma }f(x)dx }[/math]
- [math]\displaystyle{ R(t)=1-\int_{0}^{t-\gamma }\lambda {{e}^{-\lambda x}}dx={{e}^{-\lambda (t-\gamma )}} }[/math]
One-Parameter Exponential Reliability Function
The one-parameter exponential reliability function is given by:
- [math]\displaystyle{ R(t)={{e}^{-\lambda t}}={{e}^{-\tfrac{t}{m}}} }[/math]
The Exponential Conditional Reliability
The exponential conditional reliability equation gives the reliability for a mission of [math]\displaystyle{ t }[/math] duration, having already successfully accumulated [math]\displaystyle{ T }[/math] hours of operation up to the start of this new mission. The exponential conditional reliability function is:
- [math]\displaystyle{ R(t|T)=\frac{R(T+t)}{R(T)}=\frac{{{e}^{-\lambda (T+t-\gamma )}}}{{{e}^{-\lambda (T-\gamma )}}}={{e}^{-\lambda t}} }[/math]
which says that the reliability for a mission of [math]\displaystyle{ t }[/math] duration undertaken after the component or equipment has already accumulated [math]\displaystyle{ T }[/math] hours of operation from age zero is only a function of the mission duration, and not a function of the age at the beginning of the mission. This is referred to as the memoryless property.
The Exponential Reliable Life
The reliable life, or the mission duration for a desired reliability goal, [math]\displaystyle{ {{t}_{R}} }[/math], for the one-parameter exponential distribution is:
- [math]\displaystyle{ R({{t}_{R}})={{e}^{-\lambda ({{t}_{R}}-\gamma )}} }[/math]
- [math]\displaystyle{ \ln[R({{t}_{R}})]=-\lambda({{t}_{R}}-\gamma ) }[/math]
or:
- [math]\displaystyle{ {{t}_{R}}=\gamma -\frac{\ln [R({{t}_{R}})]}{\lambda } }[/math]
The Exponential Failure Rate Function
The exponential failure rate function is:
- [math]\displaystyle{ \lambda (t)=\frac{f(t)}{R(t)}=\frac{\lambda {{e}^{-\lambda (t-\gamma )}}}{{{e}^{-\lambda (t-\gamma )}}}=\lambda =\text{constant} }[/math]
Once again, note that the constant failure rate is a characteristic of the exponential distribution, and special cases of other distributions only. Most other distributions have failure rates that are functions of time.
Characteristics of the Exponential Distribution
As mentioned before, the primary trait of the exponential distribution is that it is used for modeling the behavior of items with a constant failure rate. It has a fairly simple mathematical form, which makes it fairly easy to manipulate. Unfortunately, this fact also leads to the use of this model in situations where it is not appropriate. For example, it would not be appropriate to use the exponential distribution to model the reliability of an automobile. The constant failure rate of the exponential distribution would require the assumption that the automobile would be just as likely to experience a breakdown during the first mile as it would during the one-hundred-thousandth mile. Clearly, this is not a valid assumption. However, some inexperienced practitioners of reliability engineering and life data analysis will overlook this fact, lured by the siren-call of the exponential distribution's relatively simple mathematical models.
The Effect of [math]\displaystyle{ \lambda }[/math] and [math]\displaystyle{ \gamma }[/math] on the Exponential [math]\displaystyle{ pdf }[/math]
- The exponential [math]\displaystyle{ pdf }[/math] has no shape parameter, as it has only one shape.
- The exponential [math]\displaystyle{ pdf }[/math] is always convex and is stretched to the right as [math]\displaystyle{ \lambda }[/math] decreases in value.
- The value of the [math]\displaystyle{ pdf }[/math] function is always equal to the value of [math]\displaystyle{ \lambda }[/math] at [math]\displaystyle{ t=0 }[/math] (or [math]\displaystyle{ t=\gamma }[/math]).
- The location parameter, [math]\displaystyle{ \gamma }[/math], if positive, shifts the beginning of the distribution by a distance of [math]\displaystyle{ \gamma }[/math] to the right of the origin, signifying that the chance failures start to occur only after [math]\displaystyle{ \gamma }[/math] hours of operation, and cannot occur before this time.
- The scale parameter is [math]\displaystyle{ \tfrac{1}{\lambda }=\bar{T}-\gamma =m-\gamma }[/math].
- As [math]\displaystyle{ t\to \infty }[/math], [math]\displaystyle{ f(t)\to 0 }[/math].
The Effect of [math]\displaystyle{ \lambda }[/math] and [math]\displaystyle{ \gamma }[/math] on the Exponential Reliability Function
- The one-parameter exponential reliability function starts at the value of 100% at [math]\displaystyle{ t=0 }[/math], decreases thereafter monotonically and is convex.
- The two-parameter exponential reliability function remains at the value of 100% for [math]\displaystyle{ t=0 }[/math] up to [math]\displaystyle{ t=\gamma }[/math], and decreases thereafter monotonically and is convex.
- As [math]\displaystyle{ t\to \infty }[/math] , [math]\displaystyle{ R(t\to \infty )\to 0 }[/math].
- The reliability for a mission duration of [math]\displaystyle{ t=m=\tfrac{1}{\lambda } }[/math], or of one MTTF duration, is always equal to [math]\displaystyle{ 0.3679 }[/math] or 36.79%. This means that the reliability for a mission which is as long as one MTTF is relatively low and is not recommended because only 36.8% of the missions will be completed successfully. In other words, of the equipment undertaking such a mission, only 36.8% will survive their mission.
Estimation of the Exponential Parameters
Probability Plotting
Estimation of the parameters for the exponential distribution via probability plotting is very similar to the process used when dealing with the Weibull distribution. Recall, however, that the appearance of the probability plotting paper and the methods by which the parameters are estimated vary from distribution to distribution, so there will be some noticeable differences. In fact, due to the nature of the exponential [math]\displaystyle{ cdf }[/math], the exponential probability plot is the only one with a negative slope. This is because the y-axis of the exponential probability plotting paper represents the reliability, whereas the y-axis for most of the other life distributions represents the unreliability.
This is illustrated in the process of linearizing the [math]\displaystyle{ cdf }[/math], which is necessary to construct the exponential probability plotting paper. For the two-parameter exponential distribution the cumulative density function is given by:
- [math]\displaystyle{ F(t)=1-{{e}^{-\lambda (t-\gamma )}} }[/math]
Taking the natural logarithm of both sides of the above equation yields:
- [math]\displaystyle{ \ln \left[ 1-F(t) \right]=-\lambda (t-\gamma ) }[/math]
or:
- [math]\displaystyle{ \ln [1-F(t)]=\lambda \gamma -\lambda t }[/math]
Now, let:
- [math]\displaystyle{ y=\ln [1-F(t)] }[/math]
- [math]\displaystyle{ a=\lambda \gamma }[/math]
and:
- [math]\displaystyle{ b=-\lambda }[/math]
which results in the linear equation of:
- [math]\displaystyle{ y=a+bt }[/math]
Note that with the exponential probability plotting paper, the y-axis scale is logarithmic and the x-axis scale is linear. This means that the zero value is present only on the x-axis. For [math]\displaystyle{ t=0 }[/math], [math]\displaystyle{ R=1 }[/math] and [math]\displaystyle{ F(t)=0 }[/math]. So if we were to use [math]\displaystyle{ F(t) }[/math] for the y-axis, we would have to plot the point [math]\displaystyle{ (0,0) }[/math]. However, since the y-axis is logarithmic, there is no place to plot this on the exponential paper. Also, the failure rate, [math]\displaystyle{ \lambda }[/math], is the negative of the slope of the line, but there is an easier way to determine the value of [math]\displaystyle{ \lambda }[/math] from the probability plot, as will be illustrated in the following example.
Example 1:
1-Parameter Exponential Probability Plot Example
6 units are put on a life test and tested to failure. The failure times are 7, 12, 19, 29, 41, and 67 hours. Estimate the failure rate for a 1-parameter exponential distribution using the probability plotting method.
In order to plot the points for the probability plot, the appropriate reliability estimate values must be obtained. These will be equivalent to [math]\displaystyle{ 100%-MR\,\! }[/math] since the y-axis represents the reliability and the [math]\displaystyle{ MR\,\! }[/math] values represent unreliability estimates.
Next, these points are plotted on an exponential probability plotting paper. A sample of this type of plotting paper is shown next, with the sample points in place. Notice how these points describe a line with a negative slope.
Once the points are plotted, draw the best possible straight line through these points. The time value at which this line intersects with a horizontal line drawn at the 36.8% reliability mark is the mean life, and the reciprocal of this is the failure rate [math]\displaystyle{ \lambda\,\! }[/math]. This is because at [math]\displaystyle{ t=m=\tfrac{1}{\lambda }\,\! }[/math]:
- [math]\displaystyle{ \begin{align} R(t)= & {{e}^{-\lambda \cdot t}} \\ R(t)= & {{e}^{-\lambda \cdot \tfrac{1}{\lambda }}} \\ R(t)= & {{e}^{-1}}=0.368=36.8%. \end{align}\,\! }[/math]
The following plot shows that the best-fit line through the data points crosses the [math]\displaystyle{ R=36.8%\,\! }[/math] line at [math]\displaystyle{ t=33\,\! }[/math] hours. And because [math]\displaystyle{ \tfrac{1}{\lambda }=33\,\! }[/math] hours, [math]\displaystyle{ \lambda =0.0303\,\! }[/math] failures/hour.
Rank Regression on Y for Exponential Distribution
Performing a rank regression on Y requires that a straight line be fitted to the set of available data points such that the sum of the squares of the vertical deviations from the points to the line is minimized. The least squares parameter estimation method (regression analysis) was discussed in Chapter Parameter Estimation, and the following equations for rank regression on Y (RRY) were derived:
- [math]\displaystyle{ \hat{a}=\bar{y}-\hat{b}\bar{x}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}-\hat{b}\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}}{N} }[/math]
and:
- [math]\displaystyle{ \hat{b}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}-\tfrac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,x_{i}^{2}-\tfrac{{{\left( \underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}} \right)}^{2}}}{N}} }[/math]
In our case, the equations for [math]\displaystyle{ {{y}_{i}} }[/math] and [math]\displaystyle{ {{x}_{i}} }[/math] are:
- [math]\displaystyle{ {{y}_{i}}=\ln [1-F({{t}_{i}})] }[/math]
and:
- [math]\displaystyle{ {{x}_{i}}={{t}_{i}} }[/math]
and the [math]\displaystyle{ F({{t}_{i}}) }[/math] is estimated from the median ranks. Once [math]\displaystyle{ \hat{a} }[/math] and [math]\displaystyle{ \hat{b} }[/math] are obtained, then [math]\displaystyle{ \hat{\lambda } }[/math] and [math]\displaystyle{ \hat{\gamma } }[/math] can easily be obtained from above equations.
For the one-parameter exponential, equations for estimating a and b become:
- [math]\displaystyle{ \begin{align} \hat{a}= & 0, \\ \hat{b}= & \frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,x_{i}^{2}} \end{align} }[/math]
The Correlation Coefficient
The estimator of [math]\displaystyle{ \rho }[/math] is the sample correlation coefficient, [math]\displaystyle{ \hat{\rho } }[/math], given by:
- [math]\displaystyle{ \hat{\rho }=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,({{x}_{i}}-\overline{x})({{y}_{i}}-\overline{y})}{\sqrt{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{({{x}_{i}}-\overline{x})}^{2}}\cdot \underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{({{y}_{i}}-\overline{y})}^{2}}}} }[/math]
Example 2:
Template loop detected: Template:2 parameter exponential distribution RRY example
Rank Regression on X for Exponential Distribution
Similar to rank regression on Y, performing a rank regression on X requires that a straight line be fitted to a set of data points such that the sum of the squares of the horizontal deviations from the points to the line is minimized.
Again the first task is to bring our exponential [math]\displaystyle{ cdf }[/math] function into a linear form. This step is exactly the same as in regression on Y analysis. The deviation from the previous analysis begins on the least squares fit step, since in this case we treat [math]\displaystyle{ x }[/math] as the dependent variable and [math]\displaystyle{ y }[/math] as the independent variable. The best-fitting straight line to the data, for regression on X (see Chapter Parameter Estimation), is the straight line:
- [math]\displaystyle{ x=\hat{a}+\hat{b}y }[/math]
The corresponding equations for [math]\displaystyle{ \hat{a} }[/math] and [math]\displaystyle{ \hat{b} }[/math] are:
- [math]\displaystyle{ \hat{a}=\overline{x}-\hat{b}\overline{y}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}}{N}-\hat{b}\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N} }[/math]
and:
- [math]\displaystyle{ \hat{b}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}-\tfrac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,y_{i}^{2}-\tfrac{{{\left( \underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}} \right)}^{2}}}{N}} }[/math]
where:
- [math]\displaystyle{ {{y}_{i}}=\ln [1-F({{t}_{i}})] }[/math]
and:
- [math]\displaystyle{ {{x}_{i}}={{t}_{i}} }[/math]
The values of [math]\displaystyle{ F({{t}_{i}}) }[/math] are estimated from the median ranks. Once [math]\displaystyle{ \hat{a} }[/math] and [math]\displaystyle{ \hat{b} }[/math] are obtained, solve for the unknown [math]\displaystyle{ y }[/math] value, which corresponds to:
- [math]\displaystyle{ y=-\frac{\hat{a}}{\hat{b}}+\frac{1}{\hat{b}}x }[/math]
Solving for the parameters from above equations we get:
- [math]\displaystyle{ a=-\frac{\hat{a}}{\hat{b}}=\lambda \gamma \Rightarrow \gamma =\hat{a} }[/math]
and:
- [math]\displaystyle{ b=\frac{1}{\hat{b}}=-\lambda \Rightarrow \lambda =-\frac{1}{\hat{b}} }[/math]
For the one-parameter exponential case, equations for estimating a and b become:
- [math]\displaystyle{ \begin{align} \hat{a}= & 0 \\ \hat{b}= & \frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,y_{i}^{2}} \end{align} }[/math]
The correlation coefficient is evaluated as before.
Example 3:
Template loop detected: Template:Example: 2 Parameter Exponential Distribution RRX
Maximum Likelihood Estimation for Exponential Distribution
As outlined in Chapter Parameter Estimation, maximum likelihood estimation works by developing a likelihood function based on the available data and finding the values of the parameter estimates that maximize the likelihood function. This can be achieved by using iterative methods to determine the parameter estimate values that maximize the likelihood function. This can be rather difficult and time-consuming, particularly when dealing with the three-parameter distribution. Another method of finding the parameter estimates involves taking the partial derivatives of the likelihood equation with respect to the parameters, setting the resulting equations equal to zero, and solving simultaneously to determine the values of the parameter estimates. The log-likelihood functions and associated partial derivatives used to determine maximum likelihood estimates for the exponential distribution are covered in Appendix.
Example 4: Template loop detected: Template:Example: MLE for Exponential Distribution
Confidence Bounds
In this section, we present the methods used in the application to estimate the different types of confidence bounds for exponentially distributed data. The complete derivations were presented in detail (for a general function) in the chapter for Confidence Bounds. At this time we should point out that exact confidence bounds for the exponential distribution have been derived, and exist in a closed form, utilizing the [math]\displaystyle{ {{\chi }^{2}} }[/math] distribution. These are described in detail in Kececioglu [20], and are covered in the section in the test design chapter. For most exponential data analyses, Weibull++ will use the approximate confidence bounds, provided from the Fisher information matrix or the likelihood ratio, in order to stay consistent with all of the other available distributions in the application. The [math]\displaystyle{ {{\chi }^{2}} }[/math] confidence bounds for the exponential distribution are discussed in more detail in the test design chapter.
Fisher Matrix Bounds
Bounds on the Parameters
For the failure rate [math]\displaystyle{ \hat{\lambda } }[/math] the upper ([math]\displaystyle{ {{\lambda }_{U}} }[/math]) and lower ([math]\displaystyle{ {{\lambda }_{L}} }[/math]) bounds are estimated by [30]:
- [math]\displaystyle{ \begin{align} & {{\lambda }_{U}}= & \hat{\lambda }\cdot {{e}^{\left[ \tfrac{{{K}_{\alpha }}\sqrt{Var(\hat{\lambda })}}{\hat{\lambda }} \right]}} \\ & & \\ & {{\lambda }_{L}}= & \frac{\hat{\lambda }}{{{e}^{\left[ \tfrac{{{K}_{\alpha }}\sqrt{Var(\hat{\lambda })}}{\hat{\lambda }} \right]}}} \end{align} }[/math]
where [math]\displaystyle{ {{K}_{\alpha }} }[/math] is defined by:
- [math]\displaystyle{ \alpha =\frac{1}{\sqrt{2\pi }}\int_{{{K}_{\alpha }}}^{\infty }{{e}^{-\tfrac{{{t}^{2}}}{2}}}dt=1-\Phi ({{K}_{\alpha }}) }[/math]
If [math]\displaystyle{ \delta }[/math] is the confidence level, then [math]\displaystyle{ \alpha =\tfrac{1-\delta }{2} }[/math] for the two-sided bounds, and [math]\displaystyle{ \alpha =1-\delta }[/math] for the one-sided bounds.
The variance of [math]\displaystyle{ \hat{\lambda }, }[/math] [math]\displaystyle{ Var(\hat{\lambda }), }[/math] is estimated from the Fisher matrix, as follows:
- [math]\displaystyle{ Var(\hat{\lambda })={{\left( -\frac{{{\partial }^{2}}\Lambda }{\partial {{\lambda }^{2}}} \right)}^{-1}} }[/math]
where [math]\displaystyle{ \Lambda }[/math] is the log-likelihood function of the exponential distribution, described in Appendix for log-likelihood function.
Note that no true MLE solution exists for the case of the two-parameter exponential distribution. The mathematics simply break down while trying to simultaneously solve the partial derivative equations for both the [math]\displaystyle{ \gamma }[/math] and [math]\displaystyle{ \lambda }[/math] parameters, resulting in unrealistic conditions. The way around this conundrum involves setting [math]\displaystyle{ \gamma ={{t}_{1}}, }[/math] or the first time-to-failure, and calculating [math]\displaystyle{ \lambda }[/math] in the regular fashion for this methodology. Weibull++ treats [math]\displaystyle{ \gamma }[/math] as a constant when computing bounds, i.e. [math]\displaystyle{ Var(\hat{\gamma })=0. }[/math] (See the discussion in Appendix for log-likelihood function for more information.)
Template loop detected: Template:Bounds on Reliability FMB ED
Template loop detected: Template:Bounds on Time FMB ED
Template loop detected: Template:Likelihood Ratio Confidence Bounds FMB ED
Template loop detected: Template:Bayesian Confidence Bounds ED
General Examples
Example 8:
20 units were reliability tested with the following results:
Table - Life Test Data | |
Number of Units in Group | Time-to-Failure |
---|---|
7 | 100 |
5 | 200 |
3 | 300 |
2 | 400 |
1 | 500 |
2 | 600 |
1. Assuming a 2-parameter exponential distribution, estimate the parameters by hand using the MLE analysis method.
2. Repeat the above using Weibull++. (Enter the data as grouped data to duplicate the results.)
3. Show the Probability plot for the analysis results.
4. Show the Reliability vs. Time plot for the results.
5. Show the pdf plot for the results.
6. Show the Failure Rate vs. Time plot for the results.
7. Estimate the parameters using the rank regression on Y (RRY) analysis method (and using grouped ranks).
Solution
1. For the 2-parameter exponential distribution and for [math]\displaystyle{ \hat{\gamma }=100\,\! }[/math] hours (first failure), the partial of the log-likelihood function, [math]\displaystyle{ \lambda\,\! }[/math], becomes:
- [math]\displaystyle{ \begin{align} \frac{\partial \Lambda }{\partial \lambda }= &\underset{i=1}{\overset{6}{\mathop \sum }}\,{N_i} \left[ \frac{1}{\lambda }-\left( {{T}_{i}}-100 \right) \right]=0\\ \Rightarrow & 7[\frac{1}{\lambda }-(100-100)]+5[\frac{1}{\lambda}-(200-100)] + \ldots +2[\frac{1}{\lambda}-(600-100)]\\ = & 0\\ \Rightarrow & \hat{\lambda}=\frac{20}{3100}=0.0065 \text{fr/hr} \end{align} \,\! }[/math]
2. Enter the data in a Weibull++ standard folio and calculate it as shown next.
3. On the Plot page of the folio, the exponential Probability plot will appear as shown next.
4. View the Reliability vs. Time plot.
5. View the pdf plot.
6. View the Failure Rate vs. Time plot.
Note that, as described at the beginning of this chapter, the failure rate for the exponential distribution is constant. Also note that the Failure Rate vs. Time plot does show values for times before the location parameter, [math]\displaystyle{ \gamma \,\! }[/math], at 100 hours.
7. In the case of grouped data, one must be cautious when estimating the parameters using a rank regression method. This is because the median rank values are determined from the total number of failures observed by time [math]\displaystyle{ {{T}_{i}}\,\! }[/math] where [math]\displaystyle{ i\,\! }[/math] indicates the group number. In this example, the total number of groups is [math]\displaystyle{ N=6\,\! }[/math] and the total number of units is [math]\displaystyle{ {{N}_{T}}=20\,\! }[/math]. Thus, the median rank values will be estimated for 20 units and for the total failed units ([math]\displaystyle{ {{N}_{{{F}_{i}}}}\,\! }[/math]) up to the [math]\displaystyle{ {{i}^{th}}\,\! }[/math] group, for the [math]\displaystyle{ {{i}^{th}}\,\! }[/math] rank value. The median ranks values can be found from rank tables or they can be estimated using ReliaSoft's Quick Statistical Reference tool.
For example, the median rank value of the fourth group will be the [math]\displaystyle{ {{17}^{th}}\,\! }[/math] rank out of a sample size of twenty units (or 81.945%).
The following table is then constructed.
Given the values in the table above, calculate [math]\displaystyle{ \hat{a}\,\! }[/math] and [math]\displaystyle{ \hat{b}\,\! }[/math]:
- [math]\displaystyle{ \begin{align} & \hat{b}= & \frac{\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{t}_{i}}{{y}_{i}}-(\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{t}_{i}})(\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{y}_{i}})/6}{\underset{i=1}{\overset{6}{\mathop{\sum }}}\,t_{i}^{2}-{{(\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{t}_{i}})}^{2}}/6} \\ & & \\ & \hat{b}= & \frac{-4320.3362-(2100)(-9.6476)/6}{910,000-{{(2100)}^{2}}/6} \end{align}\,\! }[/math]
or:
- [math]\displaystyle{ \hat{b}=-0.005392\,\! }[/math]
and:
- [math]\displaystyle{ \hat{a}=\overline{y}-\hat{b}\overline{t}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}-\hat{b}\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{t}_{i}}}{N}\,\! }[/math]
or:
- [math]\displaystyle{ \hat{a}=\frac{-9.6476}{6}-(-0.005392)\frac{2100}{6}=0.2793\,\! }[/math]
Therefore:
- [math]\displaystyle{ \hat{\lambda }=-\hat{b}=-(-0.005392)=0.05392\text{ failures/hour}\,\! }[/math]
and:
- [math]\displaystyle{ \hat{\gamma }=\frac{\hat{a}}{\hat{\lambda }}=\frac{0.2793}{0.005392}\,\! }[/math]
or:
- [math]\displaystyle{ \hat{\gamma }\simeq 51.8\text{ hours}\,\! }[/math]
Then:
- [math]\displaystyle{ f(T)=(0.005392){{e}^{-0.005392(T-51.8)}}\,\! }[/math]
Using Weibull++, the estimated parameters are:
- [math]\displaystyle{ \begin{align} \hat{\lambda }= & 0.0054\text{ failures/hour} \\ \hat{\gamma }= & 51.82\text{ hours} \end{align}\,\! }[/math]
The small difference in the values from Weibull++ is due to rounding. In the application, the calculations and the rank values are carried out up to the [math]\displaystyle{ 15^{th}\,\! }[/math] decimal point.
Example 9:
A number of leukemia patients were treated with either drug 6MP or a placebo, and the times in weeks until cancer symptoms returned were recorded. Analyze each treatment separately [21, p.175].
Table - Leukemia Treatment Results | |||
Time (weeks) | Number of Patients | Treament | Comments |
---|---|---|---|
1 | 2 | placebo | |
2 | 2 | placebo | |
3 | 1 | placebo | |
4 | 2 | placebo | |
5 | 2 | placebo | |
6 | 4 | 6MP | 3 patients completed |
7 | 1 | 6MP | |
8 | 4 | placebo | |
9 | 1 | 6MP | Not completed |
10 | 2 | 6MP | 1 patient completed |
11 | 2 | placebo | |
11 | 1 | 6MP | Not completed |
12 | 2 | placebo | |
13 | 1 | 6MP | |
15 | 1 | placebo | |
16 | 1 | 6MP | |
17 | 1 | placebo | |
17 | 1 | 6MP | Not completed |
19 | 1 | 6MP | Not completed |
20 | 1 | 6MP | Not completed |
22 | 1 | placebo | |
22 | 1 | 6MP | |
23 | 1 | placebo | |
23 | 1 | 6MP | |
25 | 1 | 6MP | Not completed |
32 | 2 | 6MP | Not completed |
34 | 1 | 6MP | Not completed |
35 | 1 | 6MP | Not completed |
Create a new Weibull++ standard folio that's configured for grouped times-to-failure data with suspensions. In the first column, enter the number of patients. Whenever there are uncompleted tests, enter the number of patients who completed the test separately from the number of patients who did not (e.g., if 4 patients had symptoms return after 6 weeks and only 3 of them completed the test, then enter 1 in one row and 3 in another). In the second column enter F if the patients completed the test and S if they didn't. In the third column enter the time, and in the fourth column (Subset ID) specify whether the 6MP drug or a placebo was used.
Next, open the Batch Auto Run utility and select to separate the 6MP drug from the placebo, as shown next.
The software will create two data sheets, one for each subset ID, as shown next.
Calculate both data sheets using the 2-parameter exponential distribution and the MLE analysis method, then insert an additional plot and select to show the analysis results for both data sheets on that plot, which will appear as shown next.
The exponential distribution is a commonly used distribution in reliability engineering. Mathematically, it is a fairly simple distribution, which many times leads to its use in inappropriate situations. It is, in fact, a special case of the Weibull distribution where [math]\displaystyle{ \beta =1 }[/math]. The exponential distribution is used to model the behavior of units that have a constant failure rate (or units that do not degrade with time or wear out).
Exponential Probability Density Function
The Two-Parameter Exponential Distribution
The two-parameter exponential pdf is given by:
- [math]\displaystyle{ f(t)=\lambda {{e}^{-\lambda (t-\gamma )}},f(t)\ge 0,\lambda \gt 0,t\ge 0\text{ or }\gamma }[/math]
where [math]\displaystyle{ \gamma }[/math] is the location parameter. Some of the characteristics of the two-parameter exponential distribution are [19]:
- The location parameter, [math]\displaystyle{ \gamma }[/math], if positive, shifts the beginning of the distribution by a distance of [math]\displaystyle{ \gamma }[/math] to the right of the origin, signifying that the chance failures start to occur only after [math]\displaystyle{ \gamma }[/math] hours of operation, and cannot occur before.
- The scale parameter is [math]\displaystyle{ \tfrac{1}{\lambda }=\bar{t}-\gamma =m-\gamma }[/math].
- The exponential [math]\displaystyle{ pdf }[/math] has no shape parameter, as it has only one shape.
- The distribution starts at [math]\displaystyle{ t=\gamma }[/math] at the level of [math]\displaystyle{ f(t=\gamma )=\lambda }[/math] and decreases thereafter exponentially and monotonically as [math]\displaystyle{ t }[/math] increases beyond [math]\displaystyle{ \gamma }[/math] and is convex.
- As [math]\displaystyle{ t\to \infty }[/math], [math]\displaystyle{ f(t)\to 0 }[/math].
The One-Parameter Exponential Distribution
The one-parameter exponential [math]\displaystyle{ pdf }[/math] is obtained by setting [math]\displaystyle{ \gamma =0 }[/math], and is given by:
- [math]\displaystyle{ \begin{align}f(t)= & \lambda {{e}^{-\lambda t}}=\frac{1}{m}{{e}^{-\tfrac{1}{m}t}}, & t\ge 0, \lambda \gt 0,m\gt 0 \end{align} }[/math]
where:
- [math]\displaystyle{ \lambda }[/math] = constant rate, in failures per unit of measurement, e.g failures per hour, per cycle, etc.,
- [math]\displaystyle{ \lambda =\frac{1}{m} }[/math],
- [math]\displaystyle{ m }[/math] = mean time between failures, or to failure,
- [math]\displaystyle{ t }[/math] = operating time, life, or age, in hours, cycles, miles, actuations, etc.
This distribution requires the knowledge of only one parameter, [math]\displaystyle{ \lambda }[/math], for its application. Some of the characteristics of the one-parameter exponential distribution are [19]:
- The location parameter, [math]\displaystyle{ \gamma }[/math], is zero.
- The scale parameter is [math]\displaystyle{ \tfrac{1}{\lambda }=m }[/math].
- As [math]\displaystyle{ \lambda }[/math] is decreased in value, the distribution is stretched out to the right, and as [math]\displaystyle{ \lambda }[/math] is increased, the distribution is pushed toward the origin.
- This distribution has no shape parameter as it has only one shape, i.e. the exponential, and the only parameter it has is the failure rate, [math]\displaystyle{ \lambda }[/math].
- The distribution starts at [math]\displaystyle{ t=0 }[/math] at the level of [math]\displaystyle{ f(t=0)=\lambda }[/math] and decreases thereafter exponentially and monotonically as [math]\displaystyle{ t }[/math] increases, and is convex.
- As [math]\displaystyle{ t\to \infty }[/math] , [math]\displaystyle{ f(t)\to 0 }[/math].
- The [math]\displaystyle{ pdf }[/math] can be thought of as a special case of the Weibull [math]\displaystyle{ pdf }[/math] with [math]\displaystyle{ \gamma =0 }[/math] and [math]\displaystyle{ \beta =1 }[/math].
Exponential Statistical Properties
The Mean or MTTF
The mean, [math]\displaystyle{ \overline{T}, }[/math] or mean time to failure (MTTF) is given by:
- [math]\displaystyle{ \begin{align} \bar{T}= & \int_{\gamma }^{\infty }t\cdot f(t)dt \\ = & \int_{\gamma }^{\infty }t\cdot \lambda \cdot {{e}^{-\lambda t}}dt \\ = & \gamma +\frac{1}{\lambda }=m \end{align} }[/math]
Note that when [math]\displaystyle{ \gamma =0 }[/math], the MTTF is the inverse of the exponential distribution's constant failure rate. This is only true for the exponential distribution. Most other distributions do not have a constant failure rate. Consequently, the inverse relationship between failure rate and MTTF does not hold for these other distributions.
The Median
The median, [math]\displaystyle{ \breve{T}, }[/math] is:
- [math]\displaystyle{ \breve{T}=\gamma +\frac{1}{\lambda}\cdot 0.693 }[/math]
The Mode
The mode, [math]\displaystyle{ \tilde{T}, }[/math] is:
- [math]\displaystyle{ \tilde{T}=\gamma }[/math]
The Standard Deviation
The standard deviation, [math]\displaystyle{ {\sigma }_{T} }[/math], is:
- [math]\displaystyle{ {\sigma}_{T}=\frac{1}{\lambda }=m }[/math]
The Exponential Reliability Function
The equation for the two-parameter exponential cumulative density function, or [math]\displaystyle{ cdf, }[/math] is given by:
- [math]\displaystyle{ F(t)=Q(t)=1-{{e}^{-\lambda (t-\gamma )}} }[/math]
Recalling that the reliability function of a distribution is simply one minus the [math]\displaystyle{ cdf }[/math], the reliability function of the two-parameter exponential distribution is given by:
- [math]\displaystyle{ R(t)=1-Q(t)=1-\int_{0}^{t-\gamma }f(x)dx }[/math]
- [math]\displaystyle{ R(t)=1-\int_{0}^{t-\gamma }\lambda {{e}^{-\lambda x}}dx={{e}^{-\lambda (t-\gamma )}} }[/math]
One-Parameter Exponential Reliability Function
The one-parameter exponential reliability function is given by:
- [math]\displaystyle{ R(t)={{e}^{-\lambda t}}={{e}^{-\tfrac{t}{m}}} }[/math]
The Exponential Conditional Reliability
The exponential conditional reliability equation gives the reliability for a mission of [math]\displaystyle{ t }[/math] duration, having already successfully accumulated [math]\displaystyle{ T }[/math] hours of operation up to the start of this new mission. The exponential conditional reliability function is:
- [math]\displaystyle{ R(t|T)=\frac{R(T+t)}{R(T)}=\frac{{{e}^{-\lambda (T+t-\gamma )}}}{{{e}^{-\lambda (T-\gamma )}}}={{e}^{-\lambda t}} }[/math]
which says that the reliability for a mission of [math]\displaystyle{ t }[/math] duration undertaken after the component or equipment has already accumulated [math]\displaystyle{ T }[/math] hours of operation from age zero is only a function of the mission duration, and not a function of the age at the beginning of the mission. This is referred to as the memoryless property.
The Exponential Reliable Life
The reliable life, or the mission duration for a desired reliability goal, [math]\displaystyle{ {{t}_{R}} }[/math], for the one-parameter exponential distribution is:
- [math]\displaystyle{ R({{t}_{R}})={{e}^{-\lambda ({{t}_{R}}-\gamma )}} }[/math]
- [math]\displaystyle{ \ln[R({{t}_{R}})]=-\lambda({{t}_{R}}-\gamma ) }[/math]
or:
- [math]\displaystyle{ {{t}_{R}}=\gamma -\frac{\ln [R({{t}_{R}})]}{\lambda } }[/math]
The Exponential Failure Rate Function
The exponential failure rate function is:
- [math]\displaystyle{ \lambda (t)=\frac{f(t)}{R(t)}=\frac{\lambda {{e}^{-\lambda (t-\gamma )}}}{{{e}^{-\lambda (t-\gamma )}}}=\lambda =\text{constant} }[/math]
Once again, note that the constant failure rate is a characteristic of the exponential distribution, and special cases of other distributions only. Most other distributions have failure rates that are functions of time.
Characteristics of the Exponential Distribution
As mentioned before, the primary trait of the exponential distribution is that it is used for modeling the behavior of items with a constant failure rate. It has a fairly simple mathematical form, which makes it fairly easy to manipulate. Unfortunately, this fact also leads to the use of this model in situations where it is not appropriate. For example, it would not be appropriate to use the exponential distribution to model the reliability of an automobile. The constant failure rate of the exponential distribution would require the assumption that the automobile would be just as likely to experience a breakdown during the first mile as it would during the one-hundred-thousandth mile. Clearly, this is not a valid assumption. However, some inexperienced practitioners of reliability engineering and life data analysis will overlook this fact, lured by the siren-call of the exponential distribution's relatively simple mathematical models.
The Effect of [math]\displaystyle{ \lambda }[/math] and [math]\displaystyle{ \gamma }[/math] on the Exponential [math]\displaystyle{ pdf }[/math]
- The exponential [math]\displaystyle{ pdf }[/math] has no shape parameter, as it has only one shape.
- The exponential [math]\displaystyle{ pdf }[/math] is always convex and is stretched to the right as [math]\displaystyle{ \lambda }[/math] decreases in value.
- The value of the [math]\displaystyle{ pdf }[/math] function is always equal to the value of [math]\displaystyle{ \lambda }[/math] at [math]\displaystyle{ t=0 }[/math] (or [math]\displaystyle{ t=\gamma }[/math]).
- The location parameter, [math]\displaystyle{ \gamma }[/math], if positive, shifts the beginning of the distribution by a distance of [math]\displaystyle{ \gamma }[/math] to the right of the origin, signifying that the chance failures start to occur only after [math]\displaystyle{ \gamma }[/math] hours of operation, and cannot occur before this time.
- The scale parameter is [math]\displaystyle{ \tfrac{1}{\lambda }=\bar{T}-\gamma =m-\gamma }[/math].
- As [math]\displaystyle{ t\to \infty }[/math], [math]\displaystyle{ f(t)\to 0 }[/math].
The Effect of [math]\displaystyle{ \lambda }[/math] and [math]\displaystyle{ \gamma }[/math] on the Exponential Reliability Function
- The one-parameter exponential reliability function starts at the value of 100% at [math]\displaystyle{ t=0 }[/math], decreases thereafter monotonically and is convex.
- The two-parameter exponential reliability function remains at the value of 100% for [math]\displaystyle{ t=0 }[/math] up to [math]\displaystyle{ t=\gamma }[/math], and decreases thereafter monotonically and is convex.
- As [math]\displaystyle{ t\to \infty }[/math] , [math]\displaystyle{ R(t\to \infty )\to 0 }[/math].
- The reliability for a mission duration of [math]\displaystyle{ t=m=\tfrac{1}{\lambda } }[/math], or of one MTTF duration, is always equal to [math]\displaystyle{ 0.3679 }[/math] or 36.79%. This means that the reliability for a mission which is as long as one MTTF is relatively low and is not recommended because only 36.8% of the missions will be completed successfully. In other words, of the equipment undertaking such a mission, only 36.8% will survive their mission.
Estimation of the Exponential Parameters
Probability Plotting
Estimation of the parameters for the exponential distribution via probability plotting is very similar to the process used when dealing with the Weibull distribution. Recall, however, that the appearance of the probability plotting paper and the methods by which the parameters are estimated vary from distribution to distribution, so there will be some noticeable differences. In fact, due to the nature of the exponential [math]\displaystyle{ cdf }[/math], the exponential probability plot is the only one with a negative slope. This is because the y-axis of the exponential probability plotting paper represents the reliability, whereas the y-axis for most of the other life distributions represents the unreliability.
This is illustrated in the process of linearizing the [math]\displaystyle{ cdf }[/math], which is necessary to construct the exponential probability plotting paper. For the two-parameter exponential distribution the cumulative density function is given by:
- [math]\displaystyle{ F(t)=1-{{e}^{-\lambda (t-\gamma )}} }[/math]
Taking the natural logarithm of both sides of the above equation yields:
- [math]\displaystyle{ \ln \left[ 1-F(t) \right]=-\lambda (t-\gamma ) }[/math]
or:
- [math]\displaystyle{ \ln [1-F(t)]=\lambda \gamma -\lambda t }[/math]
Now, let:
- [math]\displaystyle{ y=\ln [1-F(t)] }[/math]
- [math]\displaystyle{ a=\lambda \gamma }[/math]
and:
- [math]\displaystyle{ b=-\lambda }[/math]
which results in the linear equation of:
- [math]\displaystyle{ y=a+bt }[/math]
Note that with the exponential probability plotting paper, the y-axis scale is logarithmic and the x-axis scale is linear. This means that the zero value is present only on the x-axis. For [math]\displaystyle{ t=0 }[/math], [math]\displaystyle{ R=1 }[/math] and [math]\displaystyle{ F(t)=0 }[/math]. So if we were to use [math]\displaystyle{ F(t) }[/math] for the y-axis, we would have to plot the point [math]\displaystyle{ (0,0) }[/math]. However, since the y-axis is logarithmic, there is no place to plot this on the exponential paper. Also, the failure rate, [math]\displaystyle{ \lambda }[/math], is the negative of the slope of the line, but there is an easier way to determine the value of [math]\displaystyle{ \lambda }[/math] from the probability plot, as will be illustrated in the following example.
Example 1:
1-Parameter Exponential Probability Plot Example
6 units are put on a life test and tested to failure. The failure times are 7, 12, 19, 29, 41, and 67 hours. Estimate the failure rate for a 1-parameter exponential distribution using the probability plotting method.
In order to plot the points for the probability plot, the appropriate reliability estimate values must be obtained. These will be equivalent to [math]\displaystyle{ 100%-MR\,\! }[/math] since the y-axis represents the reliability and the [math]\displaystyle{ MR\,\! }[/math] values represent unreliability estimates.
Next, these points are plotted on an exponential probability plotting paper. A sample of this type of plotting paper is shown next, with the sample points in place. Notice how these points describe a line with a negative slope.
Once the points are plotted, draw the best possible straight line through these points. The time value at which this line intersects with a horizontal line drawn at the 36.8% reliability mark is the mean life, and the reciprocal of this is the failure rate [math]\displaystyle{ \lambda\,\! }[/math]. This is because at [math]\displaystyle{ t=m=\tfrac{1}{\lambda }\,\! }[/math]:
- [math]\displaystyle{ \begin{align} R(t)= & {{e}^{-\lambda \cdot t}} \\ R(t)= & {{e}^{-\lambda \cdot \tfrac{1}{\lambda }}} \\ R(t)= & {{e}^{-1}}=0.368=36.8%. \end{align}\,\! }[/math]
The following plot shows that the best-fit line through the data points crosses the [math]\displaystyle{ R=36.8%\,\! }[/math] line at [math]\displaystyle{ t=33\,\! }[/math] hours. And because [math]\displaystyle{ \tfrac{1}{\lambda }=33\,\! }[/math] hours, [math]\displaystyle{ \lambda =0.0303\,\! }[/math] failures/hour.
Rank Regression on Y for Exponential Distribution
Performing a rank regression on Y requires that a straight line be fitted to the set of available data points such that the sum of the squares of the vertical deviations from the points to the line is minimized. The least squares parameter estimation method (regression analysis) was discussed in Chapter Parameter Estimation, and the following equations for rank regression on Y (RRY) were derived:
- [math]\displaystyle{ \hat{a}=\bar{y}-\hat{b}\bar{x}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}-\hat{b}\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}}{N} }[/math]
and:
- [math]\displaystyle{ \hat{b}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}-\tfrac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,x_{i}^{2}-\tfrac{{{\left( \underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}} \right)}^{2}}}{N}} }[/math]
In our case, the equations for [math]\displaystyle{ {{y}_{i}} }[/math] and [math]\displaystyle{ {{x}_{i}} }[/math] are:
- [math]\displaystyle{ {{y}_{i}}=\ln [1-F({{t}_{i}})] }[/math]
and:
- [math]\displaystyle{ {{x}_{i}}={{t}_{i}} }[/math]
and the [math]\displaystyle{ F({{t}_{i}}) }[/math] is estimated from the median ranks. Once [math]\displaystyle{ \hat{a} }[/math] and [math]\displaystyle{ \hat{b} }[/math] are obtained, then [math]\displaystyle{ \hat{\lambda } }[/math] and [math]\displaystyle{ \hat{\gamma } }[/math] can easily be obtained from above equations.
For the one-parameter exponential, equations for estimating a and b become:
- [math]\displaystyle{ \begin{align} \hat{a}= & 0, \\ \hat{b}= & \frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,x_{i}^{2}} \end{align} }[/math]
The Correlation Coefficient
The estimator of [math]\displaystyle{ \rho }[/math] is the sample correlation coefficient, [math]\displaystyle{ \hat{\rho } }[/math], given by:
- [math]\displaystyle{ \hat{\rho }=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,({{x}_{i}}-\overline{x})({{y}_{i}}-\overline{y})}{\sqrt{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{({{x}_{i}}-\overline{x})}^{2}}\cdot \underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{({{y}_{i}}-\overline{y})}^{2}}}} }[/math]
Example 2:
Template loop detected: Template:2 parameter exponential distribution RRY example
Rank Regression on X for Exponential Distribution
Similar to rank regression on Y, performing a rank regression on X requires that a straight line be fitted to a set of data points such that the sum of the squares of the horizontal deviations from the points to the line is minimized.
Again the first task is to bring our exponential [math]\displaystyle{ cdf }[/math] function into a linear form. This step is exactly the same as in regression on Y analysis. The deviation from the previous analysis begins on the least squares fit step, since in this case we treat [math]\displaystyle{ x }[/math] as the dependent variable and [math]\displaystyle{ y }[/math] as the independent variable. The best-fitting straight line to the data, for regression on X (see Chapter Parameter Estimation), is the straight line:
- [math]\displaystyle{ x=\hat{a}+\hat{b}y }[/math]
The corresponding equations for [math]\displaystyle{ \hat{a} }[/math] and [math]\displaystyle{ \hat{b} }[/math] are:
- [math]\displaystyle{ \hat{a}=\overline{x}-\hat{b}\overline{y}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}}{N}-\hat{b}\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N} }[/math]
and:
- [math]\displaystyle{ \hat{b}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}-\tfrac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,y_{i}^{2}-\tfrac{{{\left( \underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}} \right)}^{2}}}{N}} }[/math]
where:
- [math]\displaystyle{ {{y}_{i}}=\ln [1-F({{t}_{i}})] }[/math]
and:
- [math]\displaystyle{ {{x}_{i}}={{t}_{i}} }[/math]
The values of [math]\displaystyle{ F({{t}_{i}}) }[/math] are estimated from the median ranks. Once [math]\displaystyle{ \hat{a} }[/math] and [math]\displaystyle{ \hat{b} }[/math] are obtained, solve for the unknown [math]\displaystyle{ y }[/math] value, which corresponds to:
- [math]\displaystyle{ y=-\frac{\hat{a}}{\hat{b}}+\frac{1}{\hat{b}}x }[/math]
Solving for the parameters from above equations we get:
- [math]\displaystyle{ a=-\frac{\hat{a}}{\hat{b}}=\lambda \gamma \Rightarrow \gamma =\hat{a} }[/math]
and:
- [math]\displaystyle{ b=\frac{1}{\hat{b}}=-\lambda \Rightarrow \lambda =-\frac{1}{\hat{b}} }[/math]
For the one-parameter exponential case, equations for estimating a and b become:
- [math]\displaystyle{ \begin{align} \hat{a}= & 0 \\ \hat{b}= & \frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,y_{i}^{2}} \end{align} }[/math]
The correlation coefficient is evaluated as before.
Example 3:
Template loop detected: Template:Example: 2 Parameter Exponential Distribution RRX
Maximum Likelihood Estimation for Exponential Distribution
As outlined in Chapter Parameter Estimation, maximum likelihood estimation works by developing a likelihood function based on the available data and finding the values of the parameter estimates that maximize the likelihood function. This can be achieved by using iterative methods to determine the parameter estimate values that maximize the likelihood function. This can be rather difficult and time-consuming, particularly when dealing with the three-parameter distribution. Another method of finding the parameter estimates involves taking the partial derivatives of the likelihood equation with respect to the parameters, setting the resulting equations equal to zero, and solving simultaneously to determine the values of the parameter estimates. The log-likelihood functions and associated partial derivatives used to determine maximum likelihood estimates for the exponential distribution are covered in Appendix.
Example 4: Template loop detected: Template:Example: MLE for Exponential Distribution
Confidence Bounds
In this section, we present the methods used in the application to estimate the different types of confidence bounds for exponentially distributed data. The complete derivations were presented in detail (for a general function) in the chapter for Confidence Bounds. At this time we should point out that exact confidence bounds for the exponential distribution have been derived, and exist in a closed form, utilizing the [math]\displaystyle{ {{\chi }^{2}} }[/math] distribution. These are described in detail in Kececioglu [20], and are covered in the section in the test design chapter. For most exponential data analyses, Weibull++ will use the approximate confidence bounds, provided from the Fisher information matrix or the likelihood ratio, in order to stay consistent with all of the other available distributions in the application. The [math]\displaystyle{ {{\chi }^{2}} }[/math] confidence bounds for the exponential distribution are discussed in more detail in the test design chapter.
Fisher Matrix Bounds
Bounds on the Parameters
For the failure rate [math]\displaystyle{ \hat{\lambda } }[/math] the upper ([math]\displaystyle{ {{\lambda }_{U}} }[/math]) and lower ([math]\displaystyle{ {{\lambda }_{L}} }[/math]) bounds are estimated by [30]:
- [math]\displaystyle{ \begin{align} & {{\lambda }_{U}}= & \hat{\lambda }\cdot {{e}^{\left[ \tfrac{{{K}_{\alpha }}\sqrt{Var(\hat{\lambda })}}{\hat{\lambda }} \right]}} \\ & & \\ & {{\lambda }_{L}}= & \frac{\hat{\lambda }}{{{e}^{\left[ \tfrac{{{K}_{\alpha }}\sqrt{Var(\hat{\lambda })}}{\hat{\lambda }} \right]}}} \end{align} }[/math]
where [math]\displaystyle{ {{K}_{\alpha }} }[/math] is defined by:
- [math]\displaystyle{ \alpha =\frac{1}{\sqrt{2\pi }}\int_{{{K}_{\alpha }}}^{\infty }{{e}^{-\tfrac{{{t}^{2}}}{2}}}dt=1-\Phi ({{K}_{\alpha }}) }[/math]
If [math]\displaystyle{ \delta }[/math] is the confidence level, then [math]\displaystyle{ \alpha =\tfrac{1-\delta }{2} }[/math] for the two-sided bounds, and [math]\displaystyle{ \alpha =1-\delta }[/math] for the one-sided bounds.
The variance of [math]\displaystyle{ \hat{\lambda }, }[/math] [math]\displaystyle{ Var(\hat{\lambda }), }[/math] is estimated from the Fisher matrix, as follows:
- [math]\displaystyle{ Var(\hat{\lambda })={{\left( -\frac{{{\partial }^{2}}\Lambda }{\partial {{\lambda }^{2}}} \right)}^{-1}} }[/math]
where [math]\displaystyle{ \Lambda }[/math] is the log-likelihood function of the exponential distribution, described in Appendix for log-likelihood function.
Note that no true MLE solution exists for the case of the two-parameter exponential distribution. The mathematics simply break down while trying to simultaneously solve the partial derivative equations for both the [math]\displaystyle{ \gamma }[/math] and [math]\displaystyle{ \lambda }[/math] parameters, resulting in unrealistic conditions. The way around this conundrum involves setting [math]\displaystyle{ \gamma ={{t}_{1}}, }[/math] or the first time-to-failure, and calculating [math]\displaystyle{ \lambda }[/math] in the regular fashion for this methodology. Weibull++ treats [math]\displaystyle{ \gamma }[/math] as a constant when computing bounds, i.e. [math]\displaystyle{ Var(\hat{\gamma })=0. }[/math] (See the discussion in Appendix for log-likelihood function for more information.)
Template loop detected: Template:Bounds on Reliability FMB ED
Template loop detected: Template:Bounds on Time FMB ED
Template loop detected: Template:Likelihood Ratio Confidence Bounds FMB ED
Template loop detected: Template:Bayesian Confidence Bounds ED
General Examples
Example 8:
20 units were reliability tested with the following results:
Table - Life Test Data | |
Number of Units in Group | Time-to-Failure |
---|---|
7 | 100 |
5 | 200 |
3 | 300 |
2 | 400 |
1 | 500 |
2 | 600 |
1. Assuming a 2-parameter exponential distribution, estimate the parameters by hand using the MLE analysis method.
2. Repeat the above using Weibull++. (Enter the data as grouped data to duplicate the results.)
3. Show the Probability plot for the analysis results.
4. Show the Reliability vs. Time plot for the results.
5. Show the pdf plot for the results.
6. Show the Failure Rate vs. Time plot for the results.
7. Estimate the parameters using the rank regression on Y (RRY) analysis method (and using grouped ranks).
Solution
1. For the 2-parameter exponential distribution and for [math]\displaystyle{ \hat{\gamma }=100\,\! }[/math] hours (first failure), the partial of the log-likelihood function, [math]\displaystyle{ \lambda\,\! }[/math], becomes:
- [math]\displaystyle{ \begin{align} \frac{\partial \Lambda }{\partial \lambda }= &\underset{i=1}{\overset{6}{\mathop \sum }}\,{N_i} \left[ \frac{1}{\lambda }-\left( {{T}_{i}}-100 \right) \right]=0\\ \Rightarrow & 7[\frac{1}{\lambda }-(100-100)]+5[\frac{1}{\lambda}-(200-100)] + \ldots +2[\frac{1}{\lambda}-(600-100)]\\ = & 0\\ \Rightarrow & \hat{\lambda}=\frac{20}{3100}=0.0065 \text{fr/hr} \end{align} \,\! }[/math]
2. Enter the data in a Weibull++ standard folio and calculate it as shown next.
3. On the Plot page of the folio, the exponential Probability plot will appear as shown next.
4. View the Reliability vs. Time plot.
5. View the pdf plot.
6. View the Failure Rate vs. Time plot.
Note that, as described at the beginning of this chapter, the failure rate for the exponential distribution is constant. Also note that the Failure Rate vs. Time plot does show values for times before the location parameter, [math]\displaystyle{ \gamma \,\! }[/math], at 100 hours.
7. In the case of grouped data, one must be cautious when estimating the parameters using a rank regression method. This is because the median rank values are determined from the total number of failures observed by time [math]\displaystyle{ {{T}_{i}}\,\! }[/math] where [math]\displaystyle{ i\,\! }[/math] indicates the group number. In this example, the total number of groups is [math]\displaystyle{ N=6\,\! }[/math] and the total number of units is [math]\displaystyle{ {{N}_{T}}=20\,\! }[/math]. Thus, the median rank values will be estimated for 20 units and for the total failed units ([math]\displaystyle{ {{N}_{{{F}_{i}}}}\,\! }[/math]) up to the [math]\displaystyle{ {{i}^{th}}\,\! }[/math] group, for the [math]\displaystyle{ {{i}^{th}}\,\! }[/math] rank value. The median ranks values can be found from rank tables or they can be estimated using ReliaSoft's Quick Statistical Reference tool.
For example, the median rank value of the fourth group will be the [math]\displaystyle{ {{17}^{th}}\,\! }[/math] rank out of a sample size of twenty units (or 81.945%).
The following table is then constructed.
Given the values in the table above, calculate [math]\displaystyle{ \hat{a}\,\! }[/math] and [math]\displaystyle{ \hat{b}\,\! }[/math]:
- [math]\displaystyle{ \begin{align} & \hat{b}= & \frac{\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{t}_{i}}{{y}_{i}}-(\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{t}_{i}})(\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{y}_{i}})/6}{\underset{i=1}{\overset{6}{\mathop{\sum }}}\,t_{i}^{2}-{{(\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{t}_{i}})}^{2}}/6} \\ & & \\ & \hat{b}= & \frac{-4320.3362-(2100)(-9.6476)/6}{910,000-{{(2100)}^{2}}/6} \end{align}\,\! }[/math]
or:
- [math]\displaystyle{ \hat{b}=-0.005392\,\! }[/math]
and:
- [math]\displaystyle{ \hat{a}=\overline{y}-\hat{b}\overline{t}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}-\hat{b}\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{t}_{i}}}{N}\,\! }[/math]
or:
- [math]\displaystyle{ \hat{a}=\frac{-9.6476}{6}-(-0.005392)\frac{2100}{6}=0.2793\,\! }[/math]
Therefore:
- [math]\displaystyle{ \hat{\lambda }=-\hat{b}=-(-0.005392)=0.05392\text{ failures/hour}\,\! }[/math]
and:
- [math]\displaystyle{ \hat{\gamma }=\frac{\hat{a}}{\hat{\lambda }}=\frac{0.2793}{0.005392}\,\! }[/math]
or:
- [math]\displaystyle{ \hat{\gamma }\simeq 51.8\text{ hours}\,\! }[/math]
Then:
- [math]\displaystyle{ f(T)=(0.005392){{e}^{-0.005392(T-51.8)}}\,\! }[/math]
Using Weibull++, the estimated parameters are:
- [math]\displaystyle{ \begin{align} \hat{\lambda }= & 0.0054\text{ failures/hour} \\ \hat{\gamma }= & 51.82\text{ hours} \end{align}\,\! }[/math]
The small difference in the values from Weibull++ is due to rounding. In the application, the calculations and the rank values are carried out up to the [math]\displaystyle{ 15^{th}\,\! }[/math] decimal point.
Example 9:
A number of leukemia patients were treated with either drug 6MP or a placebo, and the times in weeks until cancer symptoms returned were recorded. Analyze each treatment separately [21, p.175].
Table - Leukemia Treatment Results | |||
Time (weeks) | Number of Patients | Treament | Comments |
---|---|---|---|
1 | 2 | placebo | |
2 | 2 | placebo | |
3 | 1 | placebo | |
4 | 2 | placebo | |
5 | 2 | placebo | |
6 | 4 | 6MP | 3 patients completed |
7 | 1 | 6MP | |
8 | 4 | placebo | |
9 | 1 | 6MP | Not completed |
10 | 2 | 6MP | 1 patient completed |
11 | 2 | placebo | |
11 | 1 | 6MP | Not completed |
12 | 2 | placebo | |
13 | 1 | 6MP | |
15 | 1 | placebo | |
16 | 1 | 6MP | |
17 | 1 | placebo | |
17 | 1 | 6MP | Not completed |
19 | 1 | 6MP | Not completed |
20 | 1 | 6MP | Not completed |
22 | 1 | placebo | |
22 | 1 | 6MP | |
23 | 1 | placebo | |
23 | 1 | 6MP | |
25 | 1 | 6MP | Not completed |
32 | 2 | 6MP | Not completed |
34 | 1 | 6MP | Not completed |
35 | 1 | 6MP | Not completed |
Create a new Weibull++ standard folio that's configured for grouped times-to-failure data with suspensions. In the first column, enter the number of patients. Whenever there are uncompleted tests, enter the number of patients who completed the test separately from the number of patients who did not (e.g., if 4 patients had symptoms return after 6 weeks and only 3 of them completed the test, then enter 1 in one row and 3 in another). In the second column enter F if the patients completed the test and S if they didn't. In the third column enter the time, and in the fourth column (Subset ID) specify whether the 6MP drug or a placebo was used.
Next, open the Batch Auto Run utility and select to separate the 6MP drug from the placebo, as shown next.
The software will create two data sheets, one for each subset ID, as shown next.
Calculate both data sheets using the 2-parameter exponential distribution and the MLE analysis method, then insert an additional plot and select to show the analysis results for both data sheets on that plot, which will appear as shown next.
The exponential distribution is a commonly used distribution in reliability engineering. Mathematically, it is a fairly simple distribution, which many times leads to its use in inappropriate situations. It is, in fact, a special case of the Weibull distribution where [math]\displaystyle{ \beta =1 }[/math]. The exponential distribution is used to model the behavior of units that have a constant failure rate (or units that do not degrade with time or wear out).
Exponential Probability Density Function
The Two-Parameter Exponential Distribution
The two-parameter exponential pdf is given by:
- [math]\displaystyle{ f(t)=\lambda {{e}^{-\lambda (t-\gamma )}},f(t)\ge 0,\lambda \gt 0,t\ge 0\text{ or }\gamma }[/math]
where [math]\displaystyle{ \gamma }[/math] is the location parameter. Some of the characteristics of the two-parameter exponential distribution are [19]:
- The location parameter, [math]\displaystyle{ \gamma }[/math], if positive, shifts the beginning of the distribution by a distance of [math]\displaystyle{ \gamma }[/math] to the right of the origin, signifying that the chance failures start to occur only after [math]\displaystyle{ \gamma }[/math] hours of operation, and cannot occur before.
- The scale parameter is [math]\displaystyle{ \tfrac{1}{\lambda }=\bar{t}-\gamma =m-\gamma }[/math].
- The exponential [math]\displaystyle{ pdf }[/math] has no shape parameter, as it has only one shape.
- The distribution starts at [math]\displaystyle{ t=\gamma }[/math] at the level of [math]\displaystyle{ f(t=\gamma )=\lambda }[/math] and decreases thereafter exponentially and monotonically as [math]\displaystyle{ t }[/math] increases beyond [math]\displaystyle{ \gamma }[/math] and is convex.
- As [math]\displaystyle{ t\to \infty }[/math], [math]\displaystyle{ f(t)\to 0 }[/math].
The One-Parameter Exponential Distribution
The one-parameter exponential [math]\displaystyle{ pdf }[/math] is obtained by setting [math]\displaystyle{ \gamma =0 }[/math], and is given by:
- [math]\displaystyle{ \begin{align}f(t)= & \lambda {{e}^{-\lambda t}}=\frac{1}{m}{{e}^{-\tfrac{1}{m}t}}, & t\ge 0, \lambda \gt 0,m\gt 0 \end{align} }[/math]
where:
- [math]\displaystyle{ \lambda }[/math] = constant rate, in failures per unit of measurement, e.g failures per hour, per cycle, etc.,
- [math]\displaystyle{ \lambda =\frac{1}{m} }[/math],
- [math]\displaystyle{ m }[/math] = mean time between failures, or to failure,
- [math]\displaystyle{ t }[/math] = operating time, life, or age, in hours, cycles, miles, actuations, etc.
This distribution requires the knowledge of only one parameter, [math]\displaystyle{ \lambda }[/math], for its application. Some of the characteristics of the one-parameter exponential distribution are [19]:
- The location parameter, [math]\displaystyle{ \gamma }[/math], is zero.
- The scale parameter is [math]\displaystyle{ \tfrac{1}{\lambda }=m }[/math].
- As [math]\displaystyle{ \lambda }[/math] is decreased in value, the distribution is stretched out to the right, and as [math]\displaystyle{ \lambda }[/math] is increased, the distribution is pushed toward the origin.
- This distribution has no shape parameter as it has only one shape, i.e. the exponential, and the only parameter it has is the failure rate, [math]\displaystyle{ \lambda }[/math].
- The distribution starts at [math]\displaystyle{ t=0 }[/math] at the level of [math]\displaystyle{ f(t=0)=\lambda }[/math] and decreases thereafter exponentially and monotonically as [math]\displaystyle{ t }[/math] increases, and is convex.
- As [math]\displaystyle{ t\to \infty }[/math] , [math]\displaystyle{ f(t)\to 0 }[/math].
- The [math]\displaystyle{ pdf }[/math] can be thought of as a special case of the Weibull [math]\displaystyle{ pdf }[/math] with [math]\displaystyle{ \gamma =0 }[/math] and [math]\displaystyle{ \beta =1 }[/math].
Exponential Statistical Properties
The Mean or MTTF
The mean, [math]\displaystyle{ \overline{T}, }[/math] or mean time to failure (MTTF) is given by:
- [math]\displaystyle{ \begin{align} \bar{T}= & \int_{\gamma }^{\infty }t\cdot f(t)dt \\ = & \int_{\gamma }^{\infty }t\cdot \lambda \cdot {{e}^{-\lambda t}}dt \\ = & \gamma +\frac{1}{\lambda }=m \end{align} }[/math]
Note that when [math]\displaystyle{ \gamma =0 }[/math], the MTTF is the inverse of the exponential distribution's constant failure rate. This is only true for the exponential distribution. Most other distributions do not have a constant failure rate. Consequently, the inverse relationship between failure rate and MTTF does not hold for these other distributions.
The Median
The median, [math]\displaystyle{ \breve{T}, }[/math] is:
- [math]\displaystyle{ \breve{T}=\gamma +\frac{1}{\lambda}\cdot 0.693 }[/math]
The Mode
The mode, [math]\displaystyle{ \tilde{T}, }[/math] is:
- [math]\displaystyle{ \tilde{T}=\gamma }[/math]
The Standard Deviation
The standard deviation, [math]\displaystyle{ {\sigma }_{T} }[/math], is:
- [math]\displaystyle{ {\sigma}_{T}=\frac{1}{\lambda }=m }[/math]
The Exponential Reliability Function
The equation for the two-parameter exponential cumulative density function, or [math]\displaystyle{ cdf, }[/math] is given by:
- [math]\displaystyle{ F(t)=Q(t)=1-{{e}^{-\lambda (t-\gamma )}} }[/math]
Recalling that the reliability function of a distribution is simply one minus the [math]\displaystyle{ cdf }[/math], the reliability function of the two-parameter exponential distribution is given by:
- [math]\displaystyle{ R(t)=1-Q(t)=1-\int_{0}^{t-\gamma }f(x)dx }[/math]
- [math]\displaystyle{ R(t)=1-\int_{0}^{t-\gamma }\lambda {{e}^{-\lambda x}}dx={{e}^{-\lambda (t-\gamma )}} }[/math]
One-Parameter Exponential Reliability Function
The one-parameter exponential reliability function is given by:
- [math]\displaystyle{ R(t)={{e}^{-\lambda t}}={{e}^{-\tfrac{t}{m}}} }[/math]
The Exponential Conditional Reliability
The exponential conditional reliability equation gives the reliability for a mission of [math]\displaystyle{ t }[/math] duration, having already successfully accumulated [math]\displaystyle{ T }[/math] hours of operation up to the start of this new mission. The exponential conditional reliability function is:
- [math]\displaystyle{ R(t|T)=\frac{R(T+t)}{R(T)}=\frac{{{e}^{-\lambda (T+t-\gamma )}}}{{{e}^{-\lambda (T-\gamma )}}}={{e}^{-\lambda t}} }[/math]
which says that the reliability for a mission of [math]\displaystyle{ t }[/math] duration undertaken after the component or equipment has already accumulated [math]\displaystyle{ T }[/math] hours of operation from age zero is only a function of the mission duration, and not a function of the age at the beginning of the mission. This is referred to as the memoryless property.
The Exponential Reliable Life
The reliable life, or the mission duration for a desired reliability goal, [math]\displaystyle{ {{t}_{R}} }[/math], for the one-parameter exponential distribution is:
- [math]\displaystyle{ R({{t}_{R}})={{e}^{-\lambda ({{t}_{R}}-\gamma )}} }[/math]
- [math]\displaystyle{ \ln[R({{t}_{R}})]=-\lambda({{t}_{R}}-\gamma ) }[/math]
or:
- [math]\displaystyle{ {{t}_{R}}=\gamma -\frac{\ln [R({{t}_{R}})]}{\lambda } }[/math]
The Exponential Failure Rate Function
The exponential failure rate function is:
- [math]\displaystyle{ \lambda (t)=\frac{f(t)}{R(t)}=\frac{\lambda {{e}^{-\lambda (t-\gamma )}}}{{{e}^{-\lambda (t-\gamma )}}}=\lambda =\text{constant} }[/math]
Once again, note that the constant failure rate is a characteristic of the exponential distribution, and special cases of other distributions only. Most other distributions have failure rates that are functions of time.
Characteristics of the Exponential Distribution
As mentioned before, the primary trait of the exponential distribution is that it is used for modeling the behavior of items with a constant failure rate. It has a fairly simple mathematical form, which makes it fairly easy to manipulate. Unfortunately, this fact also leads to the use of this model in situations where it is not appropriate. For example, it would not be appropriate to use the exponential distribution to model the reliability of an automobile. The constant failure rate of the exponential distribution would require the assumption that the automobile would be just as likely to experience a breakdown during the first mile as it would during the one-hundred-thousandth mile. Clearly, this is not a valid assumption. However, some inexperienced practitioners of reliability engineering and life data analysis will overlook this fact, lured by the siren-call of the exponential distribution's relatively simple mathematical models.
The Effect of [math]\displaystyle{ \lambda }[/math] and [math]\displaystyle{ \gamma }[/math] on the Exponential [math]\displaystyle{ pdf }[/math]
- The exponential [math]\displaystyle{ pdf }[/math] has no shape parameter, as it has only one shape.
- The exponential [math]\displaystyle{ pdf }[/math] is always convex and is stretched to the right as [math]\displaystyle{ \lambda }[/math] decreases in value.
- The value of the [math]\displaystyle{ pdf }[/math] function is always equal to the value of [math]\displaystyle{ \lambda }[/math] at [math]\displaystyle{ t=0 }[/math] (or [math]\displaystyle{ t=\gamma }[/math]).
- The location parameter, [math]\displaystyle{ \gamma }[/math], if positive, shifts the beginning of the distribution by a distance of [math]\displaystyle{ \gamma }[/math] to the right of the origin, signifying that the chance failures start to occur only after [math]\displaystyle{ \gamma }[/math] hours of operation, and cannot occur before this time.
- The scale parameter is [math]\displaystyle{ \tfrac{1}{\lambda }=\bar{T}-\gamma =m-\gamma }[/math].
- As [math]\displaystyle{ t\to \infty }[/math], [math]\displaystyle{ f(t)\to 0 }[/math].
The Effect of [math]\displaystyle{ \lambda }[/math] and [math]\displaystyle{ \gamma }[/math] on the Exponential Reliability Function
- The one-parameter exponential reliability function starts at the value of 100% at [math]\displaystyle{ t=0 }[/math], decreases thereafter monotonically and is convex.
- The two-parameter exponential reliability function remains at the value of 100% for [math]\displaystyle{ t=0 }[/math] up to [math]\displaystyle{ t=\gamma }[/math], and decreases thereafter monotonically and is convex.
- As [math]\displaystyle{ t\to \infty }[/math] , [math]\displaystyle{ R(t\to \infty )\to 0 }[/math].
- The reliability for a mission duration of [math]\displaystyle{ t=m=\tfrac{1}{\lambda } }[/math], or of one MTTF duration, is always equal to [math]\displaystyle{ 0.3679 }[/math] or 36.79%. This means that the reliability for a mission which is as long as one MTTF is relatively low and is not recommended because only 36.8% of the missions will be completed successfully. In other words, of the equipment undertaking such a mission, only 36.8% will survive their mission.
Estimation of the Exponential Parameters
Probability Plotting
Estimation of the parameters for the exponential distribution via probability plotting is very similar to the process used when dealing with the Weibull distribution. Recall, however, that the appearance of the probability plotting paper and the methods by which the parameters are estimated vary from distribution to distribution, so there will be some noticeable differences. In fact, due to the nature of the exponential [math]\displaystyle{ cdf }[/math], the exponential probability plot is the only one with a negative slope. This is because the y-axis of the exponential probability plotting paper represents the reliability, whereas the y-axis for most of the other life distributions represents the unreliability.
This is illustrated in the process of linearizing the [math]\displaystyle{ cdf }[/math], which is necessary to construct the exponential probability plotting paper. For the two-parameter exponential distribution the cumulative density function is given by:
- [math]\displaystyle{ F(t)=1-{{e}^{-\lambda (t-\gamma )}} }[/math]
Taking the natural logarithm of both sides of the above equation yields:
- [math]\displaystyle{ \ln \left[ 1-F(t) \right]=-\lambda (t-\gamma ) }[/math]
or:
- [math]\displaystyle{ \ln [1-F(t)]=\lambda \gamma -\lambda t }[/math]
Now, let:
- [math]\displaystyle{ y=\ln [1-F(t)] }[/math]
- [math]\displaystyle{ a=\lambda \gamma }[/math]
and:
- [math]\displaystyle{ b=-\lambda }[/math]
which results in the linear equation of:
- [math]\displaystyle{ y=a+bt }[/math]
Note that with the exponential probability plotting paper, the y-axis scale is logarithmic and the x-axis scale is linear. This means that the zero value is present only on the x-axis. For [math]\displaystyle{ t=0 }[/math], [math]\displaystyle{ R=1 }[/math] and [math]\displaystyle{ F(t)=0 }[/math]. So if we were to use [math]\displaystyle{ F(t) }[/math] for the y-axis, we would have to plot the point [math]\displaystyle{ (0,0) }[/math]. However, since the y-axis is logarithmic, there is no place to plot this on the exponential paper. Also, the failure rate, [math]\displaystyle{ \lambda }[/math], is the negative of the slope of the line, but there is an easier way to determine the value of [math]\displaystyle{ \lambda }[/math] from the probability plot, as will be illustrated in the following example.
Example 1:
1-Parameter Exponential Probability Plot Example
6 units are put on a life test and tested to failure. The failure times are 7, 12, 19, 29, 41, and 67 hours. Estimate the failure rate for a 1-parameter exponential distribution using the probability plotting method.
In order to plot the points for the probability plot, the appropriate reliability estimate values must be obtained. These will be equivalent to [math]\displaystyle{ 100%-MR\,\! }[/math] since the y-axis represents the reliability and the [math]\displaystyle{ MR\,\! }[/math] values represent unreliability estimates.
Next, these points are plotted on an exponential probability plotting paper. A sample of this type of plotting paper is shown next, with the sample points in place. Notice how these points describe a line with a negative slope.
Once the points are plotted, draw the best possible straight line through these points. The time value at which this line intersects with a horizontal line drawn at the 36.8% reliability mark is the mean life, and the reciprocal of this is the failure rate [math]\displaystyle{ \lambda\,\! }[/math]. This is because at [math]\displaystyle{ t=m=\tfrac{1}{\lambda }\,\! }[/math]:
- [math]\displaystyle{ \begin{align} R(t)= & {{e}^{-\lambda \cdot t}} \\ R(t)= & {{e}^{-\lambda \cdot \tfrac{1}{\lambda }}} \\ R(t)= & {{e}^{-1}}=0.368=36.8%. \end{align}\,\! }[/math]
The following plot shows that the best-fit line through the data points crosses the [math]\displaystyle{ R=36.8%\,\! }[/math] line at [math]\displaystyle{ t=33\,\! }[/math] hours. And because [math]\displaystyle{ \tfrac{1}{\lambda }=33\,\! }[/math] hours, [math]\displaystyle{ \lambda =0.0303\,\! }[/math] failures/hour.
Rank Regression on Y for Exponential Distribution
Performing a rank regression on Y requires that a straight line be fitted to the set of available data points such that the sum of the squares of the vertical deviations from the points to the line is minimized. The least squares parameter estimation method (regression analysis) was discussed in Chapter Parameter Estimation, and the following equations for rank regression on Y (RRY) were derived:
- [math]\displaystyle{ \hat{a}=\bar{y}-\hat{b}\bar{x}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}-\hat{b}\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}}{N} }[/math]
and:
- [math]\displaystyle{ \hat{b}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}-\tfrac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,x_{i}^{2}-\tfrac{{{\left( \underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}} \right)}^{2}}}{N}} }[/math]
In our case, the equations for [math]\displaystyle{ {{y}_{i}} }[/math] and [math]\displaystyle{ {{x}_{i}} }[/math] are:
- [math]\displaystyle{ {{y}_{i}}=\ln [1-F({{t}_{i}})] }[/math]
and:
- [math]\displaystyle{ {{x}_{i}}={{t}_{i}} }[/math]
and the [math]\displaystyle{ F({{t}_{i}}) }[/math] is estimated from the median ranks. Once [math]\displaystyle{ \hat{a} }[/math] and [math]\displaystyle{ \hat{b} }[/math] are obtained, then [math]\displaystyle{ \hat{\lambda } }[/math] and [math]\displaystyle{ \hat{\gamma } }[/math] can easily be obtained from above equations.
For the one-parameter exponential, equations for estimating a and b become:
- [math]\displaystyle{ \begin{align} \hat{a}= & 0, \\ \hat{b}= & \frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,x_{i}^{2}} \end{align} }[/math]
The Correlation Coefficient
The estimator of [math]\displaystyle{ \rho }[/math] is the sample correlation coefficient, [math]\displaystyle{ \hat{\rho } }[/math], given by:
- [math]\displaystyle{ \hat{\rho }=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,({{x}_{i}}-\overline{x})({{y}_{i}}-\overline{y})}{\sqrt{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{({{x}_{i}}-\overline{x})}^{2}}\cdot \underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{({{y}_{i}}-\overline{y})}^{2}}}} }[/math]
Example 2:
Template loop detected: Template:2 parameter exponential distribution RRY example
Rank Regression on X for Exponential Distribution
Similar to rank regression on Y, performing a rank regression on X requires that a straight line be fitted to a set of data points such that the sum of the squares of the horizontal deviations from the points to the line is minimized.
Again the first task is to bring our exponential [math]\displaystyle{ cdf }[/math] function into a linear form. This step is exactly the same as in regression on Y analysis. The deviation from the previous analysis begins on the least squares fit step, since in this case we treat [math]\displaystyle{ x }[/math] as the dependent variable and [math]\displaystyle{ y }[/math] as the independent variable. The best-fitting straight line to the data, for regression on X (see Chapter Parameter Estimation), is the straight line:
- [math]\displaystyle{ x=\hat{a}+\hat{b}y }[/math]
The corresponding equations for [math]\displaystyle{ \hat{a} }[/math] and [math]\displaystyle{ \hat{b} }[/math] are:
- [math]\displaystyle{ \hat{a}=\overline{x}-\hat{b}\overline{y}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}}{N}-\hat{b}\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N} }[/math]
and:
- [math]\displaystyle{ \hat{b}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}-\tfrac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,y_{i}^{2}-\tfrac{{{\left( \underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}} \right)}^{2}}}{N}} }[/math]
where:
- [math]\displaystyle{ {{y}_{i}}=\ln [1-F({{t}_{i}})] }[/math]
and:
- [math]\displaystyle{ {{x}_{i}}={{t}_{i}} }[/math]
The values of [math]\displaystyle{ F({{t}_{i}}) }[/math] are estimated from the median ranks. Once [math]\displaystyle{ \hat{a} }[/math] and [math]\displaystyle{ \hat{b} }[/math] are obtained, solve for the unknown [math]\displaystyle{ y }[/math] value, which corresponds to:
- [math]\displaystyle{ y=-\frac{\hat{a}}{\hat{b}}+\frac{1}{\hat{b}}x }[/math]
Solving for the parameters from above equations we get:
- [math]\displaystyle{ a=-\frac{\hat{a}}{\hat{b}}=\lambda \gamma \Rightarrow \gamma =\hat{a} }[/math]
and:
- [math]\displaystyle{ b=\frac{1}{\hat{b}}=-\lambda \Rightarrow \lambda =-\frac{1}{\hat{b}} }[/math]
For the one-parameter exponential case, equations for estimating a and b become:
- [math]\displaystyle{ \begin{align} \hat{a}= & 0 \\ \hat{b}= & \frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{x}_{i}}{{y}_{i}}}{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,y_{i}^{2}} \end{align} }[/math]
The correlation coefficient is evaluated as before.
Example 3:
Template loop detected: Template:Example: 2 Parameter Exponential Distribution RRX
Maximum Likelihood Estimation for Exponential Distribution
As outlined in Chapter Parameter Estimation, maximum likelihood estimation works by developing a likelihood function based on the available data and finding the values of the parameter estimates that maximize the likelihood function. This can be achieved by using iterative methods to determine the parameter estimate values that maximize the likelihood function. This can be rather difficult and time-consuming, particularly when dealing with the three-parameter distribution. Another method of finding the parameter estimates involves taking the partial derivatives of the likelihood equation with respect to the parameters, setting the resulting equations equal to zero, and solving simultaneously to determine the values of the parameter estimates. The log-likelihood functions and associated partial derivatives used to determine maximum likelihood estimates for the exponential distribution are covered in Appendix.
Example 4: Template loop detected: Template:Example: MLE for Exponential Distribution
Confidence Bounds
In this section, we present the methods used in the application to estimate the different types of confidence bounds for exponentially distributed data. The complete derivations were presented in detail (for a general function) in the chapter for Confidence Bounds. At this time we should point out that exact confidence bounds for the exponential distribution have been derived, and exist in a closed form, utilizing the [math]\displaystyle{ {{\chi }^{2}} }[/math] distribution. These are described in detail in Kececioglu [20], and are covered in the section in the test design chapter. For most exponential data analyses, Weibull++ will use the approximate confidence bounds, provided from the Fisher information matrix or the likelihood ratio, in order to stay consistent with all of the other available distributions in the application. The [math]\displaystyle{ {{\chi }^{2}} }[/math] confidence bounds for the exponential distribution are discussed in more detail in the test design chapter.
Fisher Matrix Bounds
Bounds on the Parameters
For the failure rate [math]\displaystyle{ \hat{\lambda } }[/math] the upper ([math]\displaystyle{ {{\lambda }_{U}} }[/math]) and lower ([math]\displaystyle{ {{\lambda }_{L}} }[/math]) bounds are estimated by [30]:
- [math]\displaystyle{ \begin{align} & {{\lambda }_{U}}= & \hat{\lambda }\cdot {{e}^{\left[ \tfrac{{{K}_{\alpha }}\sqrt{Var(\hat{\lambda })}}{\hat{\lambda }} \right]}} \\ & & \\ & {{\lambda }_{L}}= & \frac{\hat{\lambda }}{{{e}^{\left[ \tfrac{{{K}_{\alpha }}\sqrt{Var(\hat{\lambda })}}{\hat{\lambda }} \right]}}} \end{align} }[/math]
where [math]\displaystyle{ {{K}_{\alpha }} }[/math] is defined by:
- [math]\displaystyle{ \alpha =\frac{1}{\sqrt{2\pi }}\int_{{{K}_{\alpha }}}^{\infty }{{e}^{-\tfrac{{{t}^{2}}}{2}}}dt=1-\Phi ({{K}_{\alpha }}) }[/math]
If [math]\displaystyle{ \delta }[/math] is the confidence level, then [math]\displaystyle{ \alpha =\tfrac{1-\delta }{2} }[/math] for the two-sided bounds, and [math]\displaystyle{ \alpha =1-\delta }[/math] for the one-sided bounds.
The variance of [math]\displaystyle{ \hat{\lambda }, }[/math] [math]\displaystyle{ Var(\hat{\lambda }), }[/math] is estimated from the Fisher matrix, as follows:
- [math]\displaystyle{ Var(\hat{\lambda })={{\left( -\frac{{{\partial }^{2}}\Lambda }{\partial {{\lambda }^{2}}} \right)}^{-1}} }[/math]
where [math]\displaystyle{ \Lambda }[/math] is the log-likelihood function of the exponential distribution, described in Appendix for log-likelihood function.
Note that no true MLE solution exists for the case of the two-parameter exponential distribution. The mathematics simply break down while trying to simultaneously solve the partial derivative equations for both the [math]\displaystyle{ \gamma }[/math] and [math]\displaystyle{ \lambda }[/math] parameters, resulting in unrealistic conditions. The way around this conundrum involves setting [math]\displaystyle{ \gamma ={{t}_{1}}, }[/math] or the first time-to-failure, and calculating [math]\displaystyle{ \lambda }[/math] in the regular fashion for this methodology. Weibull++ treats [math]\displaystyle{ \gamma }[/math] as a constant when computing bounds, i.e. [math]\displaystyle{ Var(\hat{\gamma })=0. }[/math] (See the discussion in Appendix for log-likelihood function for more information.)
Template loop detected: Template:Bounds on Reliability FMB ED
Template loop detected: Template:Bounds on Time FMB ED
Template loop detected: Template:Likelihood Ratio Confidence Bounds FMB ED
Template loop detected: Template:Bayesian Confidence Bounds ED
General Examples
Example 8:
20 units were reliability tested with the following results:
Table - Life Test Data | |
Number of Units in Group | Time-to-Failure |
---|---|
7 | 100 |
5 | 200 |
3 | 300 |
2 | 400 |
1 | 500 |
2 | 600 |
1. Assuming a 2-parameter exponential distribution, estimate the parameters by hand using the MLE analysis method.
2. Repeat the above using Weibull++. (Enter the data as grouped data to duplicate the results.)
3. Show the Probability plot for the analysis results.
4. Show the Reliability vs. Time plot for the results.
5. Show the pdf plot for the results.
6. Show the Failure Rate vs. Time plot for the results.
7. Estimate the parameters using the rank regression on Y (RRY) analysis method (and using grouped ranks).
Solution
1. For the 2-parameter exponential distribution and for [math]\displaystyle{ \hat{\gamma }=100\,\! }[/math] hours (first failure), the partial of the log-likelihood function, [math]\displaystyle{ \lambda\,\! }[/math], becomes:
- [math]\displaystyle{ \begin{align} \frac{\partial \Lambda }{\partial \lambda }= &\underset{i=1}{\overset{6}{\mathop \sum }}\,{N_i} \left[ \frac{1}{\lambda }-\left( {{T}_{i}}-100 \right) \right]=0\\ \Rightarrow & 7[\frac{1}{\lambda }-(100-100)]+5[\frac{1}{\lambda}-(200-100)] + \ldots +2[\frac{1}{\lambda}-(600-100)]\\ = & 0\\ \Rightarrow & \hat{\lambda}=\frac{20}{3100}=0.0065 \text{fr/hr} \end{align} \,\! }[/math]
2. Enter the data in a Weibull++ standard folio and calculate it as shown next.
3. On the Plot page of the folio, the exponential Probability plot will appear as shown next.
4. View the Reliability vs. Time plot.
5. View the pdf plot.
6. View the Failure Rate vs. Time plot.
Note that, as described at the beginning of this chapter, the failure rate for the exponential distribution is constant. Also note that the Failure Rate vs. Time plot does show values for times before the location parameter, [math]\displaystyle{ \gamma \,\! }[/math], at 100 hours.
7. In the case of grouped data, one must be cautious when estimating the parameters using a rank regression method. This is because the median rank values are determined from the total number of failures observed by time [math]\displaystyle{ {{T}_{i}}\,\! }[/math] where [math]\displaystyle{ i\,\! }[/math] indicates the group number. In this example, the total number of groups is [math]\displaystyle{ N=6\,\! }[/math] and the total number of units is [math]\displaystyle{ {{N}_{T}}=20\,\! }[/math]. Thus, the median rank values will be estimated for 20 units and for the total failed units ([math]\displaystyle{ {{N}_{{{F}_{i}}}}\,\! }[/math]) up to the [math]\displaystyle{ {{i}^{th}}\,\! }[/math] group, for the [math]\displaystyle{ {{i}^{th}}\,\! }[/math] rank value. The median ranks values can be found from rank tables or they can be estimated using ReliaSoft's Quick Statistical Reference tool.
For example, the median rank value of the fourth group will be the [math]\displaystyle{ {{17}^{th}}\,\! }[/math] rank out of a sample size of twenty units (or 81.945%).
The following table is then constructed.
Given the values in the table above, calculate [math]\displaystyle{ \hat{a}\,\! }[/math] and [math]\displaystyle{ \hat{b}\,\! }[/math]:
- [math]\displaystyle{ \begin{align} & \hat{b}= & \frac{\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{t}_{i}}{{y}_{i}}-(\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{t}_{i}})(\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{y}_{i}})/6}{\underset{i=1}{\overset{6}{\mathop{\sum }}}\,t_{i}^{2}-{{(\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{t}_{i}})}^{2}}/6} \\ & & \\ & \hat{b}= & \frac{-4320.3362-(2100)(-9.6476)/6}{910,000-{{(2100)}^{2}}/6} \end{align}\,\! }[/math]
or:
- [math]\displaystyle{ \hat{b}=-0.005392\,\! }[/math]
and:
- [math]\displaystyle{ \hat{a}=\overline{y}-\hat{b}\overline{t}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}-\hat{b}\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{t}_{i}}}{N}\,\! }[/math]
or:
- [math]\displaystyle{ \hat{a}=\frac{-9.6476}{6}-(-0.005392)\frac{2100}{6}=0.2793\,\! }[/math]
Therefore:
- [math]\displaystyle{ \hat{\lambda }=-\hat{b}=-(-0.005392)=0.05392\text{ failures/hour}\,\! }[/math]
and:
- [math]\displaystyle{ \hat{\gamma }=\frac{\hat{a}}{\hat{\lambda }}=\frac{0.2793}{0.005392}\,\! }[/math]
or:
- [math]\displaystyle{ \hat{\gamma }\simeq 51.8\text{ hours}\,\! }[/math]
Then:
- [math]\displaystyle{ f(T)=(0.005392){{e}^{-0.005392(T-51.8)}}\,\! }[/math]
Using Weibull++, the estimated parameters are:
- [math]\displaystyle{ \begin{align} \hat{\lambda }= & 0.0054\text{ failures/hour} \\ \hat{\gamma }= & 51.82\text{ hours} \end{align}\,\! }[/math]
The small difference in the values from Weibull++ is due to rounding. In the application, the calculations and the rank values are carried out up to the [math]\displaystyle{ 15^{th}\,\! }[/math] decimal point.
Example 9:
A number of leukemia patients were treated with either drug 6MP or a placebo, and the times in weeks until cancer symptoms returned were recorded. Analyze each treatment separately [21, p.175].
Table - Leukemia Treatment Results | |||
Time (weeks) | Number of Patients | Treament | Comments |
---|---|---|---|
1 | 2 | placebo | |
2 | 2 | placebo | |
3 | 1 | placebo | |
4 | 2 | placebo | |
5 | 2 | placebo | |
6 | 4 | 6MP | 3 patients completed |
7 | 1 | 6MP | |
8 | 4 | placebo | |
9 | 1 | 6MP | Not completed |
10 | 2 | 6MP | 1 patient completed |
11 | 2 | placebo | |
11 | 1 | 6MP | Not completed |
12 | 2 | placebo | |
13 | 1 | 6MP | |
15 | 1 | placebo | |
16 | 1 | 6MP | |
17 | 1 | placebo | |
17 | 1 | 6MP | Not completed |
19 | 1 | 6MP | Not completed |
20 | 1 | 6MP | Not completed |
22 | 1 | placebo | |
22 | 1 | 6MP | |
23 | 1 | placebo | |
23 | 1 | 6MP | |
25 | 1 | 6MP | Not completed |
32 | 2 | 6MP | Not completed |
34 | 1 | 6MP | Not completed |
35 | 1 | 6MP | Not completed |
Create a new Weibull++ standard folio that's configured for grouped times-to-failure data with suspensions. In the first column, enter the number of patients. Whenever there are uncompleted tests, enter the number of patients who completed the test separately from the number of patients who did not (e.g., if 4 patients had symptoms return after 6 weeks and only 3 of them completed the test, then enter 1 in one row and 3 in another). In the second column enter F if the patients completed the test and S if they didn't. In the third column enter the time, and in the fourth column (Subset ID) specify whether the 6MP drug or a placebo was used.
Next, open the Batch Auto Run utility and select to separate the 6MP drug from the placebo, as shown next.
The software will create two data sheets, one for each subset ID, as shown next.
Calculate both data sheets using the 2-parameter exponential distribution and the MLE analysis method, then insert an additional plot and select to show the analysis results for both data sheets on that plot, which will appear as shown next.
General Examples
Example 8:
20 units were reliability tested with the following results:
Table - Life Test Data | |
Number of Units in Group | Time-to-Failure |
---|---|
7 | 100 |
5 | 200 |
3 | 300 |
2 | 400 |
1 | 500 |
2 | 600 |
1. Assuming a 2-parameter exponential distribution, estimate the parameters by hand using the MLE analysis method.
2. Repeat the above using Weibull++. (Enter the data as grouped data to duplicate the results.)
3. Show the Probability plot for the analysis results.
4. Show the Reliability vs. Time plot for the results.
5. Show the pdf plot for the results.
6. Show the Failure Rate vs. Time plot for the results.
7. Estimate the parameters using the rank regression on Y (RRY) analysis method (and using grouped ranks).
Solution
1. For the 2-parameter exponential distribution and for [math]\displaystyle{ \hat{\gamma }=100\,\! }[/math] hours (first failure), the partial of the log-likelihood function, [math]\displaystyle{ \lambda\,\! }[/math], becomes:
- [math]\displaystyle{ \begin{align} \frac{\partial \Lambda }{\partial \lambda }= &\underset{i=1}{\overset{6}{\mathop \sum }}\,{N_i} \left[ \frac{1}{\lambda }-\left( {{T}_{i}}-100 \right) \right]=0\\ \Rightarrow & 7[\frac{1}{\lambda }-(100-100)]+5[\frac{1}{\lambda}-(200-100)] + \ldots +2[\frac{1}{\lambda}-(600-100)]\\ = & 0\\ \Rightarrow & \hat{\lambda}=\frac{20}{3100}=0.0065 \text{fr/hr} \end{align} \,\! }[/math]
2. Enter the data in a Weibull++ standard folio and calculate it as shown next.
3. On the Plot page of the folio, the exponential Probability plot will appear as shown next.
4. View the Reliability vs. Time plot.
5. View the pdf plot.
6. View the Failure Rate vs. Time plot.
Note that, as described at the beginning of this chapter, the failure rate for the exponential distribution is constant. Also note that the Failure Rate vs. Time plot does show values for times before the location parameter, [math]\displaystyle{ \gamma \,\! }[/math], at 100 hours.
7. In the case of grouped data, one must be cautious when estimating the parameters using a rank regression method. This is because the median rank values are determined from the total number of failures observed by time [math]\displaystyle{ {{T}_{i}}\,\! }[/math] where [math]\displaystyle{ i\,\! }[/math] indicates the group number. In this example, the total number of groups is [math]\displaystyle{ N=6\,\! }[/math] and the total number of units is [math]\displaystyle{ {{N}_{T}}=20\,\! }[/math]. Thus, the median rank values will be estimated for 20 units and for the total failed units ([math]\displaystyle{ {{N}_{{{F}_{i}}}}\,\! }[/math]) up to the [math]\displaystyle{ {{i}^{th}}\,\! }[/math] group, for the [math]\displaystyle{ {{i}^{th}}\,\! }[/math] rank value. The median ranks values can be found from rank tables or they can be estimated using ReliaSoft's Quick Statistical Reference tool.
For example, the median rank value of the fourth group will be the [math]\displaystyle{ {{17}^{th}}\,\! }[/math] rank out of a sample size of twenty units (or 81.945%).
The following table is then constructed.
Given the values in the table above, calculate [math]\displaystyle{ \hat{a}\,\! }[/math] and [math]\displaystyle{ \hat{b}\,\! }[/math]:
- [math]\displaystyle{ \begin{align} & \hat{b}= & \frac{\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{t}_{i}}{{y}_{i}}-(\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{t}_{i}})(\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{y}_{i}})/6}{\underset{i=1}{\overset{6}{\mathop{\sum }}}\,t_{i}^{2}-{{(\underset{i=1}{\overset{6}{\mathop{\sum }}}\,{{t}_{i}})}^{2}}/6} \\ & & \\ & \hat{b}= & \frac{-4320.3362-(2100)(-9.6476)/6}{910,000-{{(2100)}^{2}}/6} \end{align}\,\! }[/math]
or:
- [math]\displaystyle{ \hat{b}=-0.005392\,\! }[/math]
and:
- [math]\displaystyle{ \hat{a}=\overline{y}-\hat{b}\overline{t}=\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{y}_{i}}}{N}-\hat{b}\frac{\underset{i=1}{\overset{N}{\mathop{\sum }}}\,{{t}_{i}}}{N}\,\! }[/math]
or:
- [math]\displaystyle{ \hat{a}=\frac{-9.6476}{6}-(-0.005392)\frac{2100}{6}=0.2793\,\! }[/math]
Therefore:
- [math]\displaystyle{ \hat{\lambda }=-\hat{b}=-(-0.005392)=0.05392\text{ failures/hour}\,\! }[/math]
and:
- [math]\displaystyle{ \hat{\gamma }=\frac{\hat{a}}{\hat{\lambda }}=\frac{0.2793}{0.005392}\,\! }[/math]
or:
- [math]\displaystyle{ \hat{\gamma }\simeq 51.8\text{ hours}\,\! }[/math]
Then:
- [math]\displaystyle{ f(T)=(0.005392){{e}^{-0.005392(T-51.8)}}\,\! }[/math]
Using Weibull++, the estimated parameters are:
- [math]\displaystyle{ \begin{align} \hat{\lambda }= & 0.0054\text{ failures/hour} \\ \hat{\gamma }= & 51.82\text{ hours} \end{align}\,\! }[/math]
The small difference in the values from Weibull++ is due to rounding. In the application, the calculations and the rank values are carried out up to the [math]\displaystyle{ 15^{th}\,\! }[/math] decimal point.
Example 9:
A number of leukemia patients were treated with either drug 6MP or a placebo, and the times in weeks until cancer symptoms returned were recorded. Analyze each treatment separately [21, p.175].
Table - Leukemia Treatment Results | |||
Time (weeks) | Number of Patients | Treament | Comments |
---|---|---|---|
1 | 2 | placebo | |
2 | 2 | placebo | |
3 | 1 | placebo | |
4 | 2 | placebo | |
5 | 2 | placebo | |
6 | 4 | 6MP | 3 patients completed |
7 | 1 | 6MP | |
8 | 4 | placebo | |
9 | 1 | 6MP | Not completed |
10 | 2 | 6MP | 1 patient completed |
11 | 2 | placebo | |
11 | 1 | 6MP | Not completed |
12 | 2 | placebo | |
13 | 1 | 6MP | |
15 | 1 | placebo | |
16 | 1 | 6MP | |
17 | 1 | placebo | |
17 | 1 | 6MP | Not completed |
19 | 1 | 6MP | Not completed |
20 | 1 | 6MP | Not completed |
22 | 1 | placebo | |
22 | 1 | 6MP | |
23 | 1 | placebo | |
23 | 1 | 6MP | |
25 | 1 | 6MP | Not completed |
32 | 2 | 6MP | Not completed |
34 | 1 | 6MP | Not completed |
35 | 1 | 6MP | Not completed |
Create a new Weibull++ standard folio that's configured for grouped times-to-failure data with suspensions. In the first column, enter the number of patients. Whenever there are uncompleted tests, enter the number of patients who completed the test separately from the number of patients who did not (e.g., if 4 patients had symptoms return after 6 weeks and only 3 of them completed the test, then enter 1 in one row and 3 in another). In the second column enter F if the patients completed the test and S if they didn't. In the third column enter the time, and in the fourth column (Subset ID) specify whether the 6MP drug or a placebo was used.
Next, open the Batch Auto Run utility and select to separate the 6MP drug from the placebo, as shown next.
The software will create two data sheets, one for each subset ID, as shown next.
Calculate both data sheets using the 2-parameter exponential distribution and the MLE analysis method, then insert an additional plot and select to show the analysis results for both data sheets on that plot, which will appear as shown next.