Highly Fractional Factorial Designs
Reliability analysis is commonly thought of as an approach to model failures of existing products. The usual reliability analysis involves characterization of failures of the products using distributions such as exponential, Weibull and lognormal. Based on the fitted distribution, failures are mitigated, or warranty returns are predicted, or maintenance actions are planned. However, reliability analysis can also be used as a powerful tool to design robust products that operate with minimal failures, by adopting the methodology of Design for Reliability (DFR). In DFR, reliability analysis is carried out in conjunction with physics of failure and experiment design techniques. Under this approach, Design of Experiments (DOE) uses life data to "build" reliability into the products, not just quantify the existing reliability. Such an approach, if properly implemented, can result in significant cost savings, especially in terms of fewer warranty returns or repair and maintenance actions. Although DOE techniques can be used to improve product reliability and also make this reliability robust to noise factors, the discussion in this chapter is focused on reliability improvement.
Reliability DOE Analysis
Reliability DOE (R-DOE) analysis is fairly similar to the analysis of other designed experiments except that the response is the life of the product in the respective units (e.g., for an automobile component the units of life may be miles, for a mechanical component this may be cycles, and for a pharmaceutical product this may be months or years). However, two important differences exist that make R-DOE analysis unique. The first is that the life data of most products are typically well modeled by either the lognormal, Weibull or exponential distribution, but usually do not follow the normal distribution. Traditional DOE techniques follow the assumption that response values at any treatment level follow the normal distribution and therefore, the error terms,
Stresses affecting the life of the product may also be investigated using R-DOE analysis. In this case, the primary purpose of any R-DOE analysis is to identify which of the investigated stresses affect the life of the product (by investigating if a change in the level of any stress leads to a significant change in the life of the product). Once the important stresses affecting the life of the product have been identified, detailed analyses can be carried out using ReliaSoft's ALTA software. ALTA includes a number of life-stress relationships (LSRs) to model the relation between life and the stress affecting the life of the product.
R-DOE Analysis of Lognormally Distributed Data
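Assume that the life, $T$, of a product is lognormally distributed. The probability density function of the lognormal distribution is:

$$f(T) = \frac{1}{T\sigma'\sqrt{2\pi}}\exp\left[-\frac{1}{2}\left(\frac{\ln T-\mu'}{\sigma'}\right)^{2}\right]$$

where $\mu'$ is the mean and $\sigma'$ is the standard deviation of the natural logarithms of the times-to-failure. To investigate a single two level factor that may affect the life, the following model may be used:

$$T_i = \mu_i + \xi_i$$

where:
- $T_i$ represents the times-to-failure at the $i$th treatment level of the factor
- $\mu_i$ represents the mean value of $T_i$ for the $i$th treatment
- $\xi_i$ is the random error term
- The subscript $i$ represents the treatment level of the factor, with $i = 1, 2$ for a two level factor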
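The model of the equation given above is analogous to the ANOVA model, $Y_i = \mu_i + \epsilon_i$, used in traditional DOE analyses, except that the error term is not normally distributed because the life is lognormally distributed. Since the logarithm of a lognormally distributed variable follows the normal distribution, the model can be written in terms of the logarithmic times-to-failure as:

$$\ln(T_i) = \mu'_i + \epsilon_i$$

where:
- $\ln(T_i)$ represents the logarithmic times-to-failure at the $i$th treatment
- $\mu'_i$ represents the mean of the natural logarithm of the times-to-failure at the $i$th treatment
- $\sigma'$ represents the standard deviation of the natural logarithms of the times-to-failure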
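The random error term, $\epsilon_i$, is normally distributed because the response, $\ln(T_i)$, is normally distributed, so regression techniques can be applied. For a two level factor represented by a single indicator variable, $x_1$, the model can be written as:

$$\ln(T_i) = \beta_0 + \beta_1 x_{i1} + \epsilon_i$$

where $\beta_0$ is the intercept term and $\beta_1$ is the effect coefficient for the investigated factor.

The natural logarithm of the times-to-failure at any factor level, $\mu'_i$, is referred to as the life characteristic because it represents a characteristic point of the underlying life distribution. For the lognormal distribution it is modeled as:

$$\mu'_i = \beta_0 + \beta_1 x_{i1}$$

where $x_{i1}$ is the value of the indicator variable at the $i$th treatment level.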
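In general, the model to investigate a given number of factors can be expressed as:

$$\mu' = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \dots + \beta_{12} x_1 x_2 + \dots$$

where the $x_j$ are the indicator variables for the factors, the $\beta_j$ are the main effect coefficients and terms such as $\beta_{12}$ are interaction coefficients.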
Based on the model equations given so far, an R-DOE analysis of lognormally distributed life data can be carried out using standard regression techniques on the logarithms of the failure times. However, this is no longer true once the data include censored observations. For censored data, the analysis has to be carried out using maximum likelihood estimation (MLE).
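The regression route for complete data can be sketched as follows. This is a minimal illustration and not the DOE++ implementation: the function name `fit_log_life`, the coded levels of -1 and +1, and the data are all hypothetical. It fits the logarithms of the failure times by least squares and reports the maximum likelihood form of the standard deviation (residual sum of squares divided by $n$).

```python
import math

def fit_log_life(times, levels):
    """Least-squares fit of ln(t) = b0 + b1*x for one factor coded x = -1 or +1."""
    y = [math.log(t) for t in times]
    n = len(y)
    sx = sum(levels)
    sy = sum(y)
    sxx = sum(x * x for x in levels)
    sxy = sum(x * v for x, v in zip(levels, y))
    # Solve the 2x2 normal equations for the intercept b0 and effect coefficient b1.
    det = n * sxx - sx * sx
    b1 = (n * sxy - sx * sy) / det
    b0 = (sy - b1 * sx) / n
    # ML estimate of sigma' divides the residual sum of squares by n (not n - 2).
    sse = sum((v - b0 - b1 * x) ** 2 for x, v in zip(levels, y))
    sigma = math.sqrt(sse / n)
    return b0, b1, sigma

# Hypothetical complete data: three units at each level of a single factor.
times = [20.0, 25.0, 30.0, 55.0, 60.0, 70.0]
levels = [-1, -1, -1, 1, 1, 1]
b0, b1, sigma = fit_log_life(times, levels)
```

In this balanced layout, `exp(b0 - b1)` and `exp(b0 + b1)` are the geometric means of the failure times at the low and high levels, which is a convenient sanity check on the fit.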
Maximum Likelihood Estimation for the Lognormal Distribution
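The maximum likelihood estimation method can be used to estimate parameters in R-DOE analyses when censored data are present. The likelihood function is calculated for each observed time-to-failure, $t_i$, and the parameters are obtained by maximizing the log-likelihood function. For complete data following the lognormal distribution, the likelihood function is:

$$L_{failures} = \prod_{i=1}^{F_e}\left[\frac{1}{\sigma' t_i}\,\phi\left(\frac{\ln(t_i)-\mu'_i}{\sigma'}\right)\right]$$

where:
- $F_e$ is the total number of observed times-to-failure
- $\mu'_i$ is the life characteristic and has been substituted based on the model used to investigate a given number of factors
- $t_i$ is the time of the $i$th failure
- $\phi$ is the standard normal probability density function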
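For right censored data, the likelihood function is [Life Data Analysis Reference]:

$$L_{suspensions} = \prod_{i=1}^{S_e}\left[1-\Phi\left(\frac{\ln(t_i)-\mu'_i}{\sigma'}\right)\right]$$

where:
- $S_e$ is the total number of observed suspensions
- $t_i$ is the time of the $i$th suspension
- $\Phi$ is the standard normal cumulative distribution function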
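For interval data, the likelihood function is [Life Data Analysis Reference]:

$$L_{interval} = \prod_{i=1}^{F_I}\left[\Phi\left(\frac{\ln(t_i^{(2)})-\mu'_i}{\sigma'}\right)-\Phi\left(\frac{\ln(t_i^{(1)})-\mu'_i}{\sigma'}\right)\right]$$

where:
- $F_I$ is the total number of interval data
- $t_i^{(1)}$ is the beginning time of the $i$th interval
- $t_i^{(2)}$ is the end time of the $i$th interval
- $\Phi$ is the standard normal cumulative distribution function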
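When all types of data (complete, right censored and interval) are present, the complete likelihood function is the product of the three contributions:

$$L = L_{failures}\times L_{suspensions}\times L_{interval}$$

Then the log-likelihood function is:

$$\Lambda = \ln(L)$$

The MLE estimates are obtained by solving for the parameters $(\sigma', \beta_0, \beta_1, \dots)$ so that:

$$\frac{\partial\Lambda}{\partial\sigma'} = 0,\quad \frac{\partial\Lambda}{\partial\beta_0} = 0,\quad \frac{\partial\Lambda}{\partial\beta_1} = 0,\ \dots$$

Once the estimates are obtained, the significance of any parameter, $\beta_j$, can be assessed using the likelihood ratio test.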
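As a rough sketch of how the three kinds of observations enter the lognormal log-likelihood, the function below sums the log of the density for exact failures, the log of the survival probability for suspensions, and the log of the probability mass for intervals. The function name, argument layout and data are hypothetical; an optimizer of your choice would be needed to maximize the returned value, which is not shown here.

```python
import math
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal distribution: provides pdf and cdf

def lognormal_log_likelihood(beta, sigma, failures=(), suspensions=(), intervals=()):
    """Log-likelihood for lognormal life data with a linear life characteristic.

    beta        -- coefficients [b0, b1, ...]
    sigma       -- standard deviation of ln(t)
    failures    -- iterable of (t, x): exact failure time t, covariate row x
    suspensions -- iterable of (t, x): unit still running at time t
    intervals   -- iterable of (t_lo, t_hi, x): failure between t_lo and t_hi
    """
    def mu(x):
        # Life characteristic: mu' = b0 + b1*x1 + b2*x2 + ...
        return beta[0] + sum(b * xi for b, xi in zip(beta[1:], x))

    def z(t, x):
        return (math.log(t) - mu(x)) / sigma

    ll = 0.0
    for t, x in failures:
        ll += math.log(STD_NORMAL.pdf(z(t, x)) / (sigma * t))
    for t, x in suspensions:
        ll += math.log(1.0 - STD_NORMAL.cdf(z(t, x)))
    for t_lo, t_hi, x in intervals:
        ll += math.log(STD_NORMAL.cdf(z(t_hi, x)) - STD_NORMAL.cdf(z(t_lo, x)))
    return ll

# Hypothetical data: one factor coded -1/+1, four failures and one suspension.
fails = [(20.0, (-1,)), (30.0, (-1,)), (60.0, (1,)), (70.0, (1,))]
susp = [(50.0, (1,))]
value = lognormal_log_likelihood([3.6, 0.5], 0.3, failures=fails, suspensions=susp)
```

Each suspension or interval adds the logarithm of a probability, so it can only lower the total relative to the failures-only value at the same parameters.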
Hypothesis Tests
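Hypothesis testing in R-DOE analyses is carried out using the likelihood ratio test. To test the significance of a factor, the corresponding effect coefficient, $\beta_i$, is tested using the hypotheses:

$$H_0: \beta_i = 0 \qquad H_1: \beta_i \neq 0$$

The statistic used for the test is the likelihood ratio, $LR$, calculated as:

$$LR = -2\ln\left[\frac{L(\hat\theta_{(-i)})}{L(\hat\theta)}\right]$$

where:
- $\hat\theta$ is the vector of all parameter estimates obtained using MLE (i.e., $\hat\theta = [\hat\sigma'\ \hat\beta_0\ \hat\beta_1\ \dots]'$)
- $\hat\theta_{(-i)}$ is the vector of all parameter estimates excluding the estimate of $\beta_i$
- $L(\hat\theta)$ is the value of the likelihood function when all parameters are included in the model
- $L(\hat\theta_{(-i)})$ is the value of the likelihood function when all parameters except $\beta_i$ are included in the model

If the null hypothesis, $H_0$, is true, then $LR$ follows the chi-squared distribution with one degree of freedom. Therefore, $H_0$ is rejected at a significance level, $\alpha$, if $LR$ is greater than the critical value $\chi^2_{1,\alpha}$.

The likelihood ratio test can also be used to test the significance of a number of parameters, $r$, at the same time. In this case, $LR$ follows the chi-squared distribution with $r$ degrees of freedom.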
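For complete lognormal data the likelihood ratio has a closed form, because the maximized log-likelihood depends on the fit only through the residual sum of squares of the logarithmic times: $LR = n\ln(SSE_{reduced}/SSE_{full})$. The sketch below uses that shortcut for one two level factor. The function names and data are hypothetical, and the shortcut does not apply once censored observations are present.

```python
import math

def chi2_sf_1df(x):
    """Upper-tail probability of the chi-squared distribution with 1 degree of freedom."""
    return math.erfc(math.sqrt(x / 2.0))

def lr_test_single_factor(times, levels):
    """Likelihood ratio test of H0: b1 = 0 for complete lognormal data, one coded factor."""
    y = [math.log(t) for t in times]
    n = len(y)
    grand = sum(y) / n
    # Reduced model (intercept only): residuals about the grand mean.
    sse_reduced = sum((v - grand) ** 2 for v in y)
    # Full model: with one two-level factor the fitted values are the level means.
    sse_full = 0.0
    for lev in set(levels):
        group = [v for v, x in zip(y, levels) if x == lev]
        m = sum(group) / len(group)
        sse_full += sum((v - m) ** 2 for v in group)
    lr = n * math.log(sse_reduced / sse_full)
    return lr, chi2_sf_1df(lr)

# Hypothetical complete data: three units at each level.
lr, p_value = lr_test_single_factor([20.0, 25.0, 30.0, 55.0, 60.0, 70.0],
                                    [-1, -1, -1, 1, 1, 1])
significant = p_value < 0.1  # compare with the chosen significance level
```

Comparing the p value with the significance level is equivalent to comparing $LR$ with the chi-squared critical value.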
Example
To illustrate the use of MLE in R-DOE analysis, consider the case where the life of a product is thought to be affected by two factors,

The resulting experiment design and the corresponding times-to-failure data obtained are shown next. Note that, although the life data set contains only complete data and regression techniques are therefore applicable, the calculations are shown using MLE. DOE++ uses MLE for all R-DOE analysis calculations.
Because the purpose of the experiment is to study two factors without considering their interaction, the applicable model for the lognormally distributed response data is:
where
The following hypotheses need to be tested in this example:
- 1)
- 1)
This test investigates the main effect of factor
where
- 2)
- 2)
This test investigates the main effect of factor
where
To calculate the test statistics, the maximum likelihood estimates of the parameters must be known. The estimates are obtained next.
MLE Estimates
Since the life data for the present experiment are complete and follow the lognormal distribution, the likelihood function can be written as:
Substituting
Then the log-likelihood function is:
To obtain the MLE estimates of the parameters,
Equating the
Substituting the values of
Thus:
Setting
Thus:
Setting
Thus:
Knowing
Thus:
Once the estimates have been calculated, the likelihood ratio test can be carried out for the two factors.
Likelihood Ratio Test
The likelihood ratio test for factor
The corresponding logarithmic value is
The corresponding logarithmic value is
The
Assuming that the desired significance level for the present experiment is 0.1, since
The likelihood ratio to test factor
The
Since

Fisher Matrix Bounds on Parameters
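In general, the MLE estimates of the parameters are asymptotically normal. This means that for large sample sizes the distribution of the estimates from the same population would be very close to the normal distribution [Meeker and Escobar]. If $\hat\theta$ is the MLE estimate of any parameter, $\theta$, then the $100(1-\alpha)\%$ two-sided confidence bounds on the parameter are:

$$\hat\theta - z_{\alpha/2}\sqrt{Var(\hat\theta)} < \theta < \hat\theta + z_{\alpha/2}\sqrt{Var(\hat\theta)}$$

where $Var(\hat\theta)$ is the variance of $\hat\theta$ and $z_{\alpha/2}$ is the critical value corresponding to a significance level of $\alpha/2$ on the standard normal distribution. The variance is obtained from the Fisher information matrix, whose elements are the negative second partial derivatives of the log-likelihood function, $\Lambda$, with respect to the parameters.

The variance-covariance matrix is obtained by inverting the Fisher matrix.

Once the variance-covariance matrix is known, the variance of any parameter can be obtained from the diagonal elements of the matrix. Note that if a parameter, $\theta$, can take only positive values (such as $\sigma'$), then $\ln(\hat\theta)$ is treated as normally distributed instead. Using the approximation $Var(\ln\hat\theta) \approx Var(\hat\theta)/\hat\theta^{2}$, the bounds are first obtained on $\ln(\theta)$. Knowing these, the bounds on $\theta$ are:

$$\hat\theta\,\exp\left(-\frac{z_{\alpha/2}\sqrt{Var(\hat\theta)}}{\hat\theta}\right) < \theta < \hat\theta\,\exp\left(\frac{z_{\alpha/2}\sqrt{Var(\hat\theta)}}{\hat\theta}\right)$$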
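The two kinds of bounds can be sketched as follows, assuming the variance of the estimate has already been read from the diagonal of the variance-covariance matrix. The function names and the numbers are hypothetical: `normal_bounds` is meant for effect coefficients, and `positive_bounds` for parameters that must stay positive, such as the lognormal standard deviation.

```python
import math
from statistics import NormalDist

def normal_bounds(estimate, variance, confidence=0.90):
    """Two-sided bounds treating the estimate itself as normally distributed."""
    z = NormalDist().inv_cdf(1.0 - (1.0 - confidence) / 2.0)
    half_width = z * math.sqrt(variance)
    return estimate - half_width, estimate + half_width

def positive_bounds(estimate, variance, confidence=0.90):
    """Two-sided bounds treating ln(estimate) as normally distributed."""
    z = NormalDist().inv_cdf(1.0 - (1.0 - confidence) / 2.0)
    factor = math.exp(z * math.sqrt(variance) / estimate)
    return estimate / factor, estimate * factor

# Hypothetical estimates and variances.
coef_lo, coef_hi = normal_bounds(1.0, 0.04)
sd_lo, sd_hi = positive_bounds(0.3, 0.0025)
```

The log-transformed bounds are asymmetric about the estimate and can never cross zero, which is why they are preferred for positive parameters.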
Example
Continuing with the example, the confidence bounds on the MLE estimates of the parameters
The variance-covariance matrix can be obtained by taking the inverse of the Fisher matrix
Inverting
Therefore, the variance of the parameter estimates are:
Knowing the variance, the confidence bounds on the parameters can be calculated. For example, the 90% bounds (
The 90% bounds on
The standard error for the parameters can be obtained by taking the positive square root of the variance. For example, the standard error for
The
The
The previous calculation results are displayed as MLE Information in the results obtained from DOE++ as shown in the following figure. In the figure, the Effect corresponding to each factor is simply twice the MLE estimate of the coefficient for that factor. Generally, the

R-DOE Analysis of Data Following the Weibull Distribution
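The probability density function for the 2-parameter Weibull distribution is:

$$f(T) = \frac{\beta}{\eta}\left(\frac{T}{\eta}\right)^{\beta-1}\exp\left[-\left(\frac{T}{\eta}\right)^{\beta}\right]$$

where $\eta$ is the scale parameter and $\beta$ is the shape parameter of the Weibull distribution. (The unsubscripted $\beta$ denotes the shape parameter; the subscripted $\beta_0, \beta_1, \dots$ below are effect coefficients.) For Weibull data, the life characteristic used in R-DOE analysis is the scale parameter, $\eta$. Because $\eta$ cannot take negative values, a logarithmic transformation is applied to it. For a two factor experiment with each factor at two levels, the resulting model is:

$$\ln(\eta_i) = \beta_0 + \beta_1 x_{i1} + \beta_2 x_{i2} + \beta_{12} x_{i1} x_{i2}$$

where:
- $\eta_i$ is the value of the scale parameter at the $i$th treatment combination of the two factors
- $x_1$ is the indicator variable representing the level of the first factor
- $x_2$ is the indicator variable representing the level of the second factor
- $\beta_0$ is the intercept term
- $\beta_1$ and $\beta_2$ are the effect coefficients for the two factors
- $\beta_{12}$ is the effect coefficient for the interaction of the two factors

The model can be easily expanded to include other factors and their interactions. Note that when data follow the Weibull distribution, the logarithmic transformation of the data follows the extreme-value distribution, whose probability density function is:

$$f(\ln T) = \frac{1}{\sigma}\exp\left[\frac{\ln T-\mu}{\sigma}-\exp\left(\frac{\ln T-\mu}{\sigma}\right)\right]$$

where the location parameter is $\mu = \ln(\eta)$ and the scale parameter is $\sigma = 1/\beta$.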
Maximum Likelihood Estimation for the Weibull Distribution
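The likelihood function for complete data in R-DOE analysis of Weibull distributed life data is:

$$L_{failures} = \prod_{i=1}^{F_e}\left[\frac{\beta}{\eta_i}\left(\frac{t_i}{\eta_i}\right)^{\beta-1}\exp\left(-\left(\frac{t_i}{\eta_i}\right)^{\beta}\right)\right]$$

where:
- $F_e$ is the total number of observed times-to-failure
- $\eta_i$ is the life characteristic at the $i$th treatment
- $\beta$ is the Weibull shape parameter
- $t_i$ is the time of the $i$th failure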
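For right censored data, the likelihood function is:

$$L_{suspensions} = \prod_{i=1}^{S_e}\exp\left[-\left(\frac{t_i}{\eta_i}\right)^{\beta}\right]$$

where:
- $S_e$ is the total number of observed suspensions
- $t_i$ is the time of the $i$th suspension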
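For interval data, the likelihood function is:

$$L_{interval} = \prod_{i=1}^{F_I}\left[\exp\left(-\left(\frac{t_i^{(1)}}{\eta_i}\right)^{\beta}\right)-\exp\left(-\left(\frac{t_i^{(2)}}{\eta_i}\right)^{\beta}\right)\right]$$

where:
- $F_I$ is the total number of interval data
- $t_i^{(1)}$ is the beginning time of the $i$th interval
- $t_i^{(2)}$ is the end time of the $i$th interval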
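In each of the likelihood functions, $\eta_i$ is substituted based on the model as $\eta_i = \exp(\beta_0 + \beta_1 x_{i1} + \beta_2 x_{i2} + \dots)$.

The complete likelihood function when all types of data (complete, right censored and interval) are present is:

$$L = L_{failures}\times L_{suspensions}\times L_{interval}$$

Then the log-likelihood function is:

$$\Lambda = \ln(L)$$

The MLE estimates are obtained by solving for the parameters (the shape parameter $\beta$ and the coefficients $\beta_0, \beta_1, \dots$) so that the partial derivative of $\Lambda$ with respect to each parameter is zero.

Once the estimates are obtained, the significance of any parameter, $\beta_j$, can be assessed using the likelihood ratio test.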
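A sketch of the Weibull log-likelihood with a log-linear scale parameter is given below. The function name, argument layout and data are hypothetical, and maximizing the function is left to a numerical optimizer of your choice.

```python
import math

def weibull_log_likelihood(coef, shape, failures=(), suspensions=(), intervals=()):
    """Log-likelihood for Weibull life data with ln(eta) = c0 + c1*x1 + c2*x2 + ...

    failures    -- iterable of (t, x): exact failure time t, covariate row x
    suspensions -- iterable of (t, x): unit still running at time t
    intervals   -- iterable of (t_lo, t_hi, x): failure between t_lo and t_hi
    """
    def eta(x):
        return math.exp(coef[0] + sum(c * xi for c, xi in zip(coef[1:], x)))

    def log_reliability(t, x):
        # ln R(t) = -(t / eta)^shape
        return -((t / eta(x)) ** shape)

    ll = 0.0
    for t, x in failures:
        e = eta(x)
        ll += math.log(shape / e) + (shape - 1.0) * math.log(t / e) + log_reliability(t, x)
    for t, x in suspensions:
        ll += log_reliability(t, x)
    for t_lo, t_hi, x in intervals:
        ll += math.log(math.exp(log_reliability(t_lo, x)) - math.exp(log_reliability(t_hi, x)))
    return ll

# Hypothetical data: two coded factors, three failures and one suspension.
fails = [(120.0, (-1, -1)), (200.0, (1, -1)), (310.0, (1, 1))]
susp = [(250.0, (-1, 1))]
value = weibull_log_likelihood([5.5, 0.3, 0.2], 1.8, failures=fails, suspensions=susp)
```

Calling the same function with `shape=1.0` gives the exponential log-likelihood, since the exponential distribution is the Weibull distribution with a shape parameter of 1.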
R-DOE Analysis of Data Following the Exponential Distribution
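The exponential distribution is a special case of the Weibull distribution when the shape parameter, $\beta$, is equal to 1. Substituting $\beta = 1$ in the Weibull probability density function gives:

$$f(T) = \frac{1}{\eta}\exp\left(-\frac{T}{\eta}\right) = \lambda\exp(-\lambda T)$$

where $\lambda = 1/\eta$ is the failure rate. R-DOE analysis of exponentially distributed data can therefore be carried out by setting $\beta = 1$ in the Weibull analysis.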
Model Diagnostics
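Residual plots can be used to check whether the model obtained from the MLE estimates is a good fit to the data. DOE++ uses standardized residuals for R-DOE analyses. If the data follow the lognormal distribution, the standardized residuals are calculated as:

$$\hat e_i = \frac{\ln(t_i)-\hat\mu'_i}{\hat\sigma'}$$

where $t_i$ is the $i$th observed time, $\hat\mu'_i$ is the estimated life characteristic for that observation and $\hat\sigma'$ is the estimated standard deviation of the logarithmic times.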
For the probability plot, the standardized residuals are displayed on a normal probability plot. This is because under the assumed model for the lognormal distribution, the standardized residuals should follow a normal distribution with a mean of 0 and a standard deviation of 1.
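For data that follow the Weibull distribution, the standardized residuals are calculated as:

$$\hat e_i = \hat\beta\left[\ln(t_i)-\ln(\hat\eta_i)\right]$$

where $\hat\beta$ is the estimated shape parameter and $\hat\eta_i$ is the estimated scale parameter for the $i$th observation.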
The probability plot, in this case, is used to check if the residuals follow the extreme-value distribution with a mean of 0. Note that in all residual plots, when an observation,
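The two standardized residual definitions described above can be written as small helper functions. The names are hypothetical, and the fitted values passed in are assumed to come from a previously fitted model.

```python
import math

def lognormal_std_residual(t, mu_hat, sigma_hat):
    """Standardized residual for lognormal data: (ln t - mu') / sigma'."""
    return (math.log(t) - mu_hat) / sigma_hat

def weibull_std_residual(t, eta_hat, shape_hat):
    """Standardized residual for Weibull data: shape * (ln t - ln eta)."""
    return shape_hat * (math.log(t) - math.log(eta_hat))

# Hypothetical fitted values for two observations.
r_lognormal = lognormal_std_residual(60.0, mu_hat=4.0, sigma_hat=0.3)
r_weibull = weibull_std_residual(150.0, eta_hat=200.0, shape_hat=1.8)
```

An observation that fails later than its fitted life characteristic has a positive residual, and one that fails earlier has a negative residual.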
Application Examples
Example
This example illustrates the use of R-DOE analysis to design reliability into a product. An experiment was carried out to investigate the effect of five factors (each at two levels) on the reliability of fluorescent lights [Taguchi, 1987, p. 930]. The factors,

The short duration of the experiment and failure times were probably because the lights were tested under conditions which resulted in stress higher than normal conditions. The failure of the lights was assumed to follow the lognormal distribution.
The analysis results from DOE++ for this experiment are shown next.

The results are obtained by selecting the main effects of the five factors and the interaction
- Factor
should be set at the lower level of since its coefficient is negative - Factor
should be set at the higher level of since its coefficient is positive - Factor
should be set at the lower level of since its coefficient is negative
Note that, since actual factor levels are not disclosed (presumably for proprietary reasons), predictions beyond the test conditions cannot be carried out in this case.
Example
Consider a product whose reliability is thought to be affected by eight potential factors -
The results from DOE++ for this experiment are shown in the figure below.

The results show that only factors
Assume that, in terms of the actual units, the
- Factor
should be set at the lower level of 333 since its coefficient is negative - Factor
should be set at the higher level of 2000 since its coefficient is positive
Now assume that the use conditions for the product for the significant factors,

ALTA allows for modeling of the nature of relationship between life and stress. It is assumed that the relation between life of the product and temperature follows the Arrhenius relation [Accelerated Life Testing Reference] while the relation between life and fan-speed follows the inverse power law relation [Accelerated Life Testing Reference]. Using these relations ALTA fits the following model for the data in the figure below:

Based on this model the B10 life of the product at the use conditions is obtained as shown next. The Weibull reliability equation is:
Substituting the value of
Finally, substituting the use conditions (Temp
Therefore, at the use conditions, the B10 life of the product is 225 time units. This result and other reliability metrics can be directly obtained from ALTA.
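The B10 calculation amounts to solving the Weibull reliability equation $R(t) = \exp[-(t/\eta)^{\beta}]$ for the time at which reliability equals 0.90. The sketch below assumes the scale parameter at the use conditions has already been obtained from the fitted life-stress model. The parameter values are hypothetical and are not the values fitted in this example.

```python
import math

def weibull_reliability(t, eta, shape):
    """Weibull reliability R(t) = exp(-(t / eta)^shape)."""
    return math.exp(-((t / eta) ** shape))

def weibull_b_life(eta, shape, fraction_failed=0.10):
    """Time by which the given fraction of units is expected to have failed."""
    return eta * (-math.log(1.0 - fraction_failed)) ** (1.0 / shape)

# Hypothetical scale and shape parameters at the use conditions.
b10 = weibull_b_life(eta=1000.0, shape=2.0)
check = weibull_reliability(b10, eta=1000.0, shape=2.0)  # should be 0.90
```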
Additional R-DOE Analyses
DOE++ also allows for the analysis of single factor R-DOE experiments. This analysis is similar to the analysis of single factor designed experiments mentioned in Two Level Factorial Experiments. In single factor R-DOE analysis, the focus is on discovering whether a change in the level of a factor affects reliability and how each factor level differs from the others. The analysis models and calculations are similar to those of multi-factor R-DOE analysis.
Example
To illustrate single factor R-DOE analysis, consider the data in the table where life data readings for a product are taken at three levels of a certain factor,
where
The following hypothesis test needs to be carried out in this example:
where
where

MLE Estimates
Following the procedure used in the analysis of multi-factor R-DOE experiments, MLE estimates of the parameters are obtained by differentiating the log-likelihood function
Substituting

Likelihood Ratio Test
Knowing the MLE estimates, the likelihood ratio test for the significance of factor
The likelihood value for the reduced model,
Then the likelihood ratio is:
If the null hypothesis,
Assuming that the desired significance is 0.1, since
Life Characteristic Summary Results
Results in the Life Characteristic Summary table, include information about the life characteristic corresponding to each treatment level of the factor. If
The respective equations for all three treatment levels for a single replicate of the experiment can be expressed in matrix notation as:
where:
Knowing
Thus:
The variance for the predicted values of life characteristic can be calculated using the following equation:
where
From the previous matrix,
Since
Results for other levels can be calculated in a similar manner and are shown in the figure below.
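The prediction and its variance for one treatment level can be sketched as a row of the design matrix multiplied by the coefficient estimates, with the variance taken as the quadratic form of that row with the variance-covariance matrix of the coefficients. The quadratic form is an assumption about the variance equation referred to above, and the coefficient values and matrix are hypothetical, not the results of this example.

```python
def predict_life_characteristic(x_row, beta_hat, cov):
    """Return the predicted life characteristic x'b and its variance x'Cx."""
    k = len(x_row)
    y_hat = sum(x * b for x, b in zip(x_row, beta_hat))
    var = sum(x_row[i] * cov[i][j] * x_row[j] for i in range(k) for j in range(k))
    return y_hat, var

# Hypothetical estimates for an intercept and two indicator variables.
beta_hat = [4.0, 0.5, -0.2]
cov = [[0.010, -0.005, -0.005],
       [-0.005, 0.020, 0.000],
       [-0.005, 0.000, 0.020]]
# Row of the design matrix for the level coded (1, 0) on the two indicators.
y_hat, var = predict_life_characteristic([1, 1, 0], beta_hat, cov)
```

For lognormal data the prediction is on the logarithmic scale, so `math.exp(y_hat)` would give the corresponding median life.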

Life Comparisons Results
Results under Life Comparisons include information on how life is different at a level in comparison to any other level of the factor. For example, the difference between the predicted values of life at levels 1 and 2 is (in terms of the logarithmic transformation):
The pooled standard error for this difference can be obtained as:
If the covariance between
This is the value displayed by DOE++. Knowing the pooled standard error the confidence interval on the difference can be calculated. The 90% confidence interval on the difference in (logarithmic) life between levels 1 and 2 of factor
Since the confidence interval does not include zero it can be concluded that the two levels are significantly different at
The
Since
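The life comparison between two levels can be sketched as follows: the pooled standard error combines the variances of the two predicted (logarithmic) lives, and the confidence interval on the difference is built with the standard normal critical value. The function name and numbers are hypothetical. The covariance defaults to zero here, and a non-zero value can be passed if it is known.

```python
import math
from statistics import NormalDist

def difference_interval(y1, y2, var1, var2, cov12=0.0, confidence=0.90):
    """Difference in predicted life between two levels, with pooled standard error and interval."""
    diff = y1 - y2
    pooled_se = math.sqrt(var1 + var2 - 2.0 * cov12)
    z = NormalDist().inv_cdf(1.0 - (1.0 - confidence) / 2.0)
    return diff, pooled_se, (diff - z * pooled_se, diff + z * pooled_se)

# Hypothetical predicted log-lives and variances at two levels.
diff, pooled_se, (lower, upper) = difference_interval(5.0, 4.0, 0.09, 0.16)
levels_differ = not (lower <= 0.0 <= upper)  # True when the interval excludes zero
```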