Two Level Factorial Experiments: Difference between revisions
Chris Kahn (talk | contribs) No edit summary |
Lisa Hacker (talk | contribs) No edit summary |
||
(141 intermediate revisions by 5 users not shown) | |||
Line 1: | Line 1: | ||
{{Template:Doebook| | {{Template:Doebook|8}} | ||
Two level factorial experiments are factorial experiments in which each factor is investigated at only two levels. The early stages of experimentation usually involve the investigation of a large number of potential factors to discover the "vital few" factors. Two level factorial experiments are used during these stages to quickly filter out unwanted effects so that attention can then be focused on the important ones. | Two level factorial experiments are factorial experiments in which each factor is investigated at only two levels. The early stages of experimentation usually involve the investigation of a large number of potential factors to discover the "vital few" factors. Two level factorial experiments are used during these stages to quickly filter out unwanted effects so that attention can then be focused on the important ones. | ||
==2<sup>''k''</sup> Designs== | ==2<sup>''k''</sup> Designs== | ||
The factorial experiments, where all combination of the levels of the factors are run, are usually referred to as ''full factorial experiments''. Full factorial two level experiments are also referred to as <math>{2}^{k}\,\!</math> designs where <math>k\,\!</math> denotes the number of factors being investigated in the experiment. In | The factorial experiments, where all combination of the levels of the factors are run, are usually referred to as ''full factorial experiments''. Full factorial two level experiments are also referred to as <math>{2}^{k}\,\!</math> designs where <math>k\,\!</math> denotes the number of factors being investigated in the experiment. In Weibull++ DOE folios, these designs are referred to as 2 Level Factorial Designs as shown in the figure below. | ||
[[Image: | [[Image:doe7_1.png|center|487px|Selection of full factorial experiments with two levels in Weibull++.|link=]] | ||
A full factorial two level design with <math>k\,\!</math> factors requires <math>{{2}^{k}}\,\!</math> runs for a single replicate. For example, a two level experiment with three factors will require <math>2\times 2\times 2={{2}^{3}}=8\,\!</math> runs. The choice of the two levels of factors used in two level experiments depends on the factor; some factors naturally have two levels. For example, if gender is a factor, then male and female are the two levels. For other factors, the limits of the range of interest are usually used. For example, if temperature is a factor that varies from <math>{45}^{o}C\,\!</math> to <math>{90}^{o}C\,\!</math>, then the two levels used in the <math>{2}^{k}\,\!</math> design for this factor would be <math>{45}^{o}\,\!C\,\!</math> and <math>{90}^{o}\,\!C\,\!</math>. | A full factorial two level design with <math>k\,\!</math> factors requires <math>{{2}^{k}}\,\!</math> runs for a single replicate. For example, a two level experiment with three factors will require <math>2\times 2\times 2={{2}^{3}}=8\,\!</math> runs. The choice of the two levels of factors used in two level experiments depends on the factor; some factors naturally have two levels. For example, if gender is a factor, then male and female are the two levels. For other factors, the limits of the range of interest are usually used. For example, if temperature is a factor that varies from <math>{45}^{o}C\,\!</math> to <math>{90}^{o}C\,\!</math>, then the two levels used in the <math>{2}^{k}\,\!</math> design for this factor would be <math>{45}^{o}\,\!C\,\!</math> and <math>{90}^{o}\,\!C\,\!</math>. | ||
The two levels of the factor in the <math>{2}^{k}\,\!</math> design are usually represented as <math>-1\,\!</math> (for the first level) and <math>1\,\!</math> (for the second level). Note that this representation is reversed from the coding used in [[ | The two levels of the factor in the <math>{2}^{k}\,\!</math> design are usually represented as <math>-1\,\!</math> (for the first level) and <math>1\,\!</math> (for the second level). Note that this representation is reversed from the coding used in [[General Full Factorial Designs]] for the indicator variables that represent two level factors in ANOVA models. For ANOVA models, the first level of the factor was represented using a value of <math>1\,\!</math> for the indicator variable, while the second level was represented using a value of <math>-1\,\!</math>. For details on the notation used for two level experiments refer to [[Two_Level_Factorial_Experiments#Notation| Notation]]. | ||
Line 19: | Line 19: | ||
[[Image:doe7.2.png | [[Image:doe7.2.png|center|400px|The <math>2^2\,\!</math> design. Figure (a) displays the experiment design, (b) displays the design matrix and (c) displays the geometric representation for the design. In Figure (b), the column names I, A, B and AB are used. Column I represents the intercept term. Columns A and B represent the respective factor settings. Column AB represents the interaction and is the product of columns A and B.]] | ||
<br> | <br> | ||
Line 27: | Line 27: | ||
[[Image:doe7.3.png | [[Image:doe7.3.png|center|324px|The <math>2^3\,\!</math> design. Figure (a) shows the experiment design and (b) shows the design matrix.]] | ||
[[Image:doe7.4.png | [[Image:doe7.4.png|center|290px|Geometric representation of the <math>2^3\,\!</math> design.]] | ||
Line 40: | Line 40: | ||
===Notation=== | ===Notation=== | ||
Based on the notation | Based on the notation used in [[General Full Factorial Designs]], the ANOVA model for a two level factorial experiment with three factors would be as follows: | ||
::<math>\begin{align} | ::<math>\begin{align} | ||
Line 70: | Line 70: | ||
The notation for the regression model is much more convenient, especially for the case when a large number of higher order interactions are present. In two level experiments, the ANOVA model requires only one indicator variable to represent each factor for both qualitative and quantitative factors. Therefore, the notation for the multiple linear regression model can be applied to the ANOVA model of the experiment that has all the factors at two levels. For example, for the experiment of the ANOVA model given above, <math>{{\beta }_{0}}\,\!</math> can represent the overall mean instead of <math>\mu \,\!</math>, and <math>{{\beta }_{1}}\,\!</math> can represent the independent effect, <math>{{\tau }_{1}}\,\!</math>, of factor <math>A\,\!</math>. Other main effects can be represented in a similar manner. The notation for the interaction effects is much more simplified (e.g., <math>{{\beta }_{123}}\,\!</math> can be used to represent the three factor interaction effect, <math>{{(\tau \beta \gamma )}_{111}}\,\!</math>). | The notation for the regression model is much more convenient, especially for the case when a large number of higher order interactions are present. In two level experiments, the ANOVA model requires only one indicator variable to represent each factor for both qualitative and quantitative factors. Therefore, the notation for the multiple linear regression model can be applied to the ANOVA model of the experiment that has all the factors at two levels. For example, for the experiment of the ANOVA model given above, <math>{{\beta }_{0}}\,\!</math> can represent the overall mean instead of <math>\mu \,\!</math>, and <math>{{\beta }_{1}}\,\!</math> can represent the independent effect, <math>{{\tau }_{1}}\,\!</math>, of factor <math>A\,\!</math>. Other main effects can be represented in a similar manner. The notation for the interaction effects is much more simplified (e.g., <math>{{\beta }_{123}}\,\!</math> can be used to represent the three factor interaction effect, <math>{{(\tau \beta \gamma )}_{111}}\,\!</math>). | ||
As mentioned earlier, it is important to note that the coding for the indicator variables for the ANOVA models of two level factorial experiments is reversed from the coding followed in | As mentioned earlier, it is important to note that the coding for the indicator variables for the ANOVA models of two level factorial experiments is reversed from the coding followed in [[General Full Factorial Designs]]. Here <math>-1\,\!</math> represents the first level of the factor while <math>1\,\!</math> represents the second level. This is because for a two level factor a single variable is needed to represent the factor for both qualitative and quantitative factors. For quantitative factors, using <math>-1\,\!</math> for the first level (which is the low level) and 1 for the second level (which is the high level) keeps the coding consistent with the numerical value of the factors. The change in coding between the two coding schemes does not affect the analysis except that signs of the estimated effect coefficients will be reversed (i.e., numerical values of <math>{{\hat{\tau }}_{1}}\,\!</math>, obtained based on the coding of [[General Full Factorial Designs]], and <math>{{\hat{\beta }}_{1}}\,\!</math>, obtained based on the new coding, will be the same but their signs would be opposite). | ||
Line 114: | Line 114: | ||
::<math>\begin{align} | ::<math>\begin{align} | ||
{{({{X}^{\prime }}X)}^{-1}}= & 0.125\cdot I | {{({{X}^{\prime }}X)}^{-1}}= & 0.125\cdot I = & \frac{1}{8}\cdot I = & \frac{1}{{{2}^{3}}}\cdot I | ||
= & \frac{1}{8}\cdot I | |||
= & \frac{1}{{{2}^{3}}}\cdot I | |||
\end{align}\,\!</math> | \end{align}\,\!</math> | ||
Line 136: | Line 134: | ||
::<math>\begin{align} | ::<math>\begin{align} | ||
C= & {{{\hat{\sigma }}}^{2}}\cdot {{({{X}^{\prime }}X)}^{-1}} | C= & {{{\hat{\sigma }}}^{2}}\cdot {{({{X}^{\prime }}X)}^{-1}} = & M{{S}_{E}}\cdot {{({{X}^{\prime }}X)}^{-1}} = & \frac{M{{S}_{E}}}{({{2}^{k}}\cdot m)}\cdot I | ||
= & M{{S}_{E}}\cdot {{({{X}^{\prime }}X)}^{-1}} | |||
= & \frac{M{{S}_{E}}}{({{2}^{k}}\cdot m)}\cdot I | |||
\end{align}\,\!</math> | \end{align}\,\!</math> | ||
Line 148: | Line 144: | ||
::<math>\begin{align} | ::<math>\begin{align} | ||
se({{{\hat{\beta }}}_{j}})= & \sqrt{{{C}_{jj}}} | se({{{\hat{\beta }}}_{j}})= & \sqrt{{{C}_{jj}}} = & \sqrt{\frac{M{{S}_{E}}}{({{2}^{k}}\cdot m)}}\text{ }for\text{ }all\text{ }j | ||
= & \sqrt{\frac{M{{S}_{E}}}{({{2}^{k}}\cdot m)}}\text{ }for\text{ }all\text{ }j | |||
\end{align}\,\!</math> | \end{align}\,\!</math> | ||
This property is used to construct the normal probability plot of effects in <math>{2}^{k}\,\!</math> designs and identify significant effects using graphical techniques. For details on the normal probability plot of effects in | This property is used to construct the normal probability plot of effects in <math>{2}^{k}\,\!</math> designs and identify significant effects using graphical techniques. For details on the normal probability plot of effects in a Weibull++ DOE folio, refer to [[Two_Level_Factorial_Experiments#Normal_Probability_Plot_of_Effects| Normal Probability Plot of Effects]]. | ||
====Example==== | ====Example==== | ||
To illustrate the analysis of a full factorial <math>{2}^{k}\,\!</math> design, consider a three factor experiment to investigate the effect of honing pressure, number of strokes and cycle time on the surface finish of automobile brake drums. Each of these factors is investigated at two levels. The honing pressure is investigated at levels of 200 <math>psi\,\!</math> and 400 <math>psi\,\!</math>, the number of strokes used is 3 and 5 and the two levels of the cycle time are 3 and 5 seconds. The design for this experiment is set up in | To illustrate the analysis of a full factorial <math>{2}^{k}\,\!</math> design, consider a three factor experiment to investigate the effect of honing pressure, number of strokes and cycle time on the surface finish of automobile brake drums. Each of these factors is investigated at two levels. The honing pressure is investigated at levels of 200 <math>psi\,\!</math> and 400 <math>psi\,\!</math>, the number of strokes used is 3 and 5 and the two levels of the cycle time are 3 and 5 seconds. The design for this experiment is set up in a Weibull++ DOE folio as shown in the first two following figures. It is decided to run two replicates for this experiment. The surface finish data collected from each run (using randomization) and the complete design is shown in the third following figure. The analysis of the experiment data is explained next. | ||
[[Image: | [[Image:doe7_5.png|center|877px|Design properties for the experiment in the example.|link=]] | ||
[[Image: | [[Image:doe7_6.png|center|859px|Design summary for the experiment in the example.|link=]] | ||
[[Image: | [[Image:doe7_7.png|center|778px|Experiment design for the example to investigate the surface finish of automobile brake drums.|link=]] | ||
The applicable | The applicable model using the notation for <math>{2}^{k}\,\!</math> designs is: | ||
Line 194: | Line 189: | ||
:<math>{{H}_{0}}\ \ :\ \ {{\beta }_{1}}=0\,\!</math> | :<math>{{H}_{0}}\ \ :\ \ {{\beta }_{1}}=0\,\!</math> | ||
:<math>{{H}_{1}}\ \ :\ \ {{\beta }_{1}}\ne 0\,\!</math> | :<math>{{H}_{1}}\ \ :\ \ {{\beta }_{1}}\ne 0\,\!</math> | ||
This test investigates the main effect of factor <math>A\,\!</math> (honing pressure). The statistic for this test is: | This test investigates the main effect of factor <math>A\,\!</math> (honing pressure). The statistic for this test is: | ||
Line 204: | Line 198: | ||
:<math>{{H}_{0}}\ \ :\ \ {{\beta }_{12}}=0\,\!</math> | :<math>{{H}_{0}}\ \ :\ \ {{\beta }_{12}}=0\,\!</math> | ||
:<math>{{H}_{1}}\ \ :\ \ {{\beta }_{12}}\ne 0\,\!</math> | :<math>{{H}_{1}}\ \ :\ \ {{\beta }_{12}}\ne 0\,\!</math> | ||
This test investigates the two factor interaction <math>AB\,\!</math>. The statistic for this test is: | This test investigates the two factor interaction <math>AB\,\!</math>. The statistic for this test is: | ||
Line 214: | Line 207: | ||
:<math>{{H}_{0}}\ \ :\ \ {{\beta }_{123}}=0\,\!</math> | :<math>{{H}_{0}}\ \ :\ \ {{\beta }_{123}}=0\,\!</math> | ||
:<math>{{H}_{1}}\ \ :\ \ {{\beta }_{123}}\ne 0\,\!</math> | :<math>{{H}_{1}}\ \ :\ \ {{\beta }_{123}}\ne 0\,\!</math> | ||
This test investigates the three factor interaction <math>ABC\,\!</math>. The statistic for this test is: | This test investigates the three factor interaction <math>ABC\,\!</math>. The statistic for this test is: | ||
Line 229: | Line 221: | ||
::<math>y=X\beta +\epsilon \,\!</math> | ::<math>y=X\beta +\epsilon \,\!</math> | ||
where: | |||
<center><math>y=\left[ \begin{matrix} | <center><math>y=\left[ \begin{matrix} | ||
Line 287: | Line 279: | ||
::<math>\begin{align} | ::<math>\begin{align} | ||
S{{S}_{A}}= & Model\text{ }Sum\text{ }of\text{ }Squares- | S{{S}_{A}}= & Model\text{ }Sum\text{ }of\text{ }Squares - Sum\text{ }of\text{ }Squares\text{ }of\text{ }model\text{ }excluding\text{ }the\text{ }main\text{ }effect\text{ }of\text{ }A \ | ||
= & {{y}^{\prime }}[H-(1/16)J]y-{{y}^{\prime }}[{{H}_{\tilde{\ }A}}-(1/16)J]y | = & {{y}^{\prime }}[H-(1/16)J]y-{{y}^{\prime }}[{{H}_{\tilde{\ }A}}-(1/16)J]y | ||
\end{align}\,\!</math> | \end{align}\,\!</math> | ||
Line 325: | Line 316: | ||
= & 0.9797 | = & 0.9797 | ||
\end{align}\,\!</math> | \end{align}\,\!</math> | ||
where <math>M{{S}_{AB}}\,\!</math> is the mean square for the <math>AB\,\!</math> interaction and <math>M{{S}_{E}}\,\!</math> is the error mean square. The <math>p\,\!</math> value corresponding to the statistic, <math>{{({{f}_{0}})}_{AB}}=0.9797\,\!</math>, based on the <math>F\,\!</math> distribution with one degree of freedom in the numerator and eight degrees of freedom in the denominator is: | where <math>M{{S}_{AB}}\,\!</math> is the mean square for the <math>AB\,\!</math> interaction and <math>M{{S}_{E}}\,\!</math> is the error mean square. The <math>p\,\!</math> value corresponding to the statistic, <math>{{({{f}_{0}})}_{AB}}=0.9797\,\!</math>, based on the <math>F\,\!</math> distribution with one degree of freedom in the numerator and eight degrees of freedom in the denominator is: | ||
Line 334: | Line 324: | ||
= & 0.3513 | = & 0.3513 | ||
\end{align}\,\!</math> | \end{align}\,\!</math> | ||
Assuming that the desired significance is 0.1, since <math>p\,\!</math> value > 0.1, it can be concluded that the interaction between honing pressure and number of strokes does not affect the surface finish of the brake drums. Tests for other effects can be carried out in a similar manner. The results are shown in the ANOVA Table in the following figure. The values S, R-sq and R-sq(adj) in the figure indicate how well the model fits the data. The value of S represents the standard error of the model, R-sq represents the coefficient of multiple determination and R-sq(adj) represents the adjusted coefficient of multiple determination. For details on these values refer to [[Multiple_Linear_Regression_Analysis|Multiple Linear Regression Analysis]]. | Assuming that the desired significance is 0.1, since <math>p\,\!</math> value > 0.1, it can be concluded that the interaction between honing pressure and number of strokes does not affect the surface finish of the brake drums. Tests for other effects can be carried out in a similar manner. The results are shown in the ANOVA Table in the following figure. The values S, R-sq and R-sq(adj) in the figure indicate how well the model fits the data. The value of S represents the standard error of the model, R-sq represents the coefficient of multiple determination and R-sq(adj) represents the adjusted coefficient of multiple determination. For details on these values refer to [[Multiple_Linear_Regression_Analysis|Multiple Linear Regression Analysis]]. | ||
[[Image: | [[Image:doe7_8.png|center|799px|ANOVA table for the experiment in the [[Two_Level_Factorial_Experiments#Example| example]].|link=]] | ||
====Calculation of Effect Coefficients==== | ====Calculation of Effect Coefficients==== | ||
Line 361: | Line 350: | ||
[[Image: | [[Image:doe7_9.png|center|800px|Regression Information table for the experiment in the [[Two_Level_Factorial_Experiments#Example| example]].|link=]] | ||
The coefficients and related results are shown in the Regression Information table above. In the table, the Effect column displays the effects, which are simply twice the coefficients. The Standard Error column displays the standard error, <math>se({{\hat{\beta }}_{j}})\,\!</math>. The Low CI and High CI columns display the confidence interval on the coefficients. The interval shown is the 90% interval as the significance is chosen as 0.1. The T Value column displays the <math>t\,\!</math> statistic, <math>{{t}_{0}}\,\!</math>, corresponding to the coefficients. The P Value column displays the <math>p\,\!</math> value corresponding to the <math>t\,\!</math> statistic. (For details on how these results are calculated, refer to [[ | The coefficients and related results are shown in the Regression Information table above. In the table, the Effect column displays the effects, which are simply twice the coefficients. The Standard Error column displays the standard error, <math>se({{\hat{\beta }}_{j}})\,\!</math>. The Low CI and High CI columns display the confidence interval on the coefficients. The interval shown is the 90% interval as the significance is chosen as 0.1. The T Value column displays the <math>t\,\!</math> statistic, <math>{{t}_{0}}\,\!</math>, corresponding to the coefficients. The P Value column displays the <math>p\,\!</math> value corresponding to the <math>t\,\!</math> statistic. (For details on how these results are calculated, refer to [[General Full Factorial Designs]]). Plots of residuals can also be obtained from the DOE folio to ensure that the assumptions related to the ANOVA model are not violated. | ||
====Model Equation==== | ====Model Equation==== | ||
From the analysis results in the figure within | From the analysis results in the above figure within [[Two_Level_Factorial_Experiments#Calculation_of_Effect_Coefficients|calculation of effect coefficients]] section, it is seen that effects <math>A\,\!</math>, <math>B\,\!</math> and <math>AC\,\!</math> are significant. In a DOE folio, the <math>p\,\!</math> values for the significant effects are displayed in red in the ANOVA Table for easy identification. Using the values of the estimated effect coefficients, the model for the present <math>{2}^{3}\,\!</math> design in terms of the coded values can be written as: | ||
::<math>\begin{align} | ::<math>\begin{align} | ||
Line 373: | Line 363: | ||
= & 86.4375+2.5625{{x}_{1}}-4.9375{{x}_{2}}+2.4375{{x}_{1}}{{x}_{3}} | = & 86.4375+2.5625{{x}_{1}}-4.9375{{x}_{2}}+2.4375{{x}_{1}}{{x}_{3}} | ||
\end{align}\,\!</math> | \end{align}\,\!</math> | ||
To make the model hierarchical, the main effect, <math>C\,\!</math>, needs to be included in the model (because the interaction <math>AC\,\!</math> is included in the model). The resulting model is: | To make the model hierarchical, the main effect, <math>C\,\!</math>, needs to be included in the model (because the interaction <math>AC\,\!</math> is included in the model). The resulting model is: | ||
::<math>\hat{y}=86.4375+2.5625{{x}_{1}}-4.9375{{x}_{2}}+1.0625{{x}_{3}}+2.4375{{x}_{1}}{{x}_{3}}\,\!</math> | ::<math>\hat{y}=86.4375+2.5625{{x}_{1}}-4.9375{{x}_{2}}+1.0625{{x}_{3}}+2.4375{{x}_{1}}{{x}_{3}}\,\!</math> | ||
[[Image: | This equation can be viewed in a DOE folio, as shown in the following figure, using the Show Analysis Summary icon in the Control Panel. The equation shown in the figure will match the hierarchical model once the required terms are selected using the Select Effects icon. | ||
[[Image:doe7_10.png|center|557px|The model equation for the experiment of the [[Two_Level_Factorial_Experiments#Example| example]].|link=]] | |||
==Replicated and Repeated Runs== | ==Replicated and Repeated Runs== | ||
In the case of replicated experiments, it is important to note the difference between replicated runs and repeated runs. Both repeated and replicated runs are multiple response readings taken at the same factor levels. However, repeated runs are response observations taken at the same time or in succession. Replicated runs are response observations recorded in a random order. Therefore, replicated runs include more variation than repeated runs. For example, a baker, who wants to investigate the effect of two factors on the quality of cakes, will have to bake four cakes to complete one replicate of a <math>{2}^{2}\,\!</math> design. Assume that the baker bakes eight cakes in all. If, for each of the four treatments of the <math>{2}^{2}\,\!</math> design, the baker selects one treatment at random and then bakes two cakes for this treatment at the same time then this is a case of two repeated runs. If, however, the baker bakes all the eight cakes randomly, then the eight cakes represent two sets of replicated runs. | In the case of replicated experiments, it is important to note the difference between replicated runs and repeated runs. Both repeated and replicated runs are multiple response readings taken at the same factor levels. However, repeated runs are response observations taken at the same time or in succession. Replicated runs are response observations recorded in a random order. Therefore, replicated runs include more variation than repeated runs. For example, a baker, who wants to investigate the effect of two factors on the quality of cakes, will have to bake four cakes to complete one replicate of a <math>{2}^{2}\,\!</math> design. Assume that the baker bakes eight cakes in all. If, for each of the four treatments of the <math>{2}^{2}\,\!</math> design, the baker selects one treatment at random and then bakes two cakes for this treatment at the same time then this is a case of two repeated runs. If, however, the baker bakes all the eight cakes randomly, then the eight cakes represent two sets of replicated runs. | ||
For repeated measurements, the average values of the response for each treatment should be entered into DOE | For repeated measurements, the average values of the response for each treatment should be entered into a DOE folio as shown in the following figure (a) when the two cakes for a particular treatment are baked together. For replicated measurements, when all the cakes are baked randomly, the data is entered as shown in the following figure (b). | ||
[[Image:doe7.11.png|center|300px|Data entry for repeated and replicated runs. Figure (a) shows repeated runs and (b) shows replicated runs.]] | |||
==Unreplicated 2<sup>''k''</sup> Designs== | ==Unreplicated 2<sup>''k''</sup> Designs== | ||
If a factorial experiment is run only for a single replicate then it is not possible to test hypotheses about the main effects and interactions as the error sum of squares cannot be obtained. This is because the number of observations in a single replicate equals the number of terms in the ANOVA model. Hence the model fits the data perfectly and no degrees of freedom are available to obtain the error sum of squares. | |||
However, sometimes it is only possible to run a single replicate of the <math>{2}^{k}\,\!</math> design because of constraints on resources and time. In the absence of the error sum of squares, hypothesis tests to identify significant factors cannot be conducted. A number of methods of analyzing information obtained from unreplicated <math>{2}^{k}\,\!</math> designs are available. These include pooling higher order interactions, using the normal probability plot of effects or including center point replicates in the design. | |||
===Pooling Higher Order Interactions=== | ===Pooling Higher Order Interactions=== | ||
Line 399: | Line 394: | ||
===Normal Probability Plot of Effects=== | ===Normal Probability Plot of Effects=== | ||
Another way to use unreplicated <math>{2}^{k}\,\!</math> designs to identify significant effects is to construct the normal probability plot of the effects. As mentioned in [[Two_Level_Factorial_Experiments#Special_Features| Special Features]], the standard error for all effect coefficients in the <math>{2}^{k}\,\!</math> designs is the same. Therefore, on a normal probability plot of effect coefficients, all non-significant effect coefficients (with <math>\beta =0\,\!</math>) will fall along the straight line representative of the normal distribution, N(<math>0,{{\sigma }^{2}}/({{2}^{k}}\cdot m)\,\!</math>). Effect coefficients that show large deviations from this line will be significant since they do not come from this normal distribution. Similarly, since effects <math>=2\times \,\!</math> effect coefficients, all non-significant effects will also follow a straight line on the normal probability plot of effects. For replicated designs, the Effects Probability plot of DOE | Another way to use unreplicated <math>{2}^{k}\,\!</math> designs to identify significant effects is to construct the normal probability plot of the effects. As mentioned in [[Two_Level_Factorial_Experiments#Special_Features| Special Features]], the standard error for all effect coefficients in the <math>{2}^{k}\,\!</math> designs is the same. Therefore, on a normal probability plot of effect coefficients, all non-significant effect coefficients (with <math>\beta =0\,\!</math>) will fall along the straight line representative of the normal distribution, N(<math>0,{{\sigma }^{2}}/({{2}^{k}}\cdot m)\,\!</math>). Effect coefficients that show large deviations from this line will be significant since they do not come from this normal distribution. Similarly, since effects <math>=2\times \,\!</math> effect coefficients, all non-significant effects will also follow a straight line on the normal probability plot of effects. For replicated designs, the Effects Probability plot of a DOE folio plots the normalized effect values (or the T Values) on the standard normal probability line, N(0,1). However, in the case of unreplicated <math>{2}^{k}\,\!</math> designs, <math>{{\sigma }^{2}}\,\!</math> remains unknown since <math>M{{S}_{E}}\,\!</math> cannot be obtained. Lenth's method is used in this case to estimate the variance of the effects. For details on Lenth's method, please refer to [[DOE References| Montgomery (2001)]]. The DOE folio then uses this variance value to plot effects along the N(0, Lenth's effect variance) line. The | ||
method is illustrated in the following example. | method is illustrated in the following example. | ||
====Example==== | ====Example==== | ||
Vinyl panels, used as instrument panels in a certain automobile, are seen to develop defects after a certain amount of time. To investigate the issue, it is decided to carry out a two level factorial experiment. Potential factors to be investigated in the experiment are vacuum rate (factor <math>A\,\!</math>), material temperature (factor <math>B\,\!</math>), element intensity (factor <math>C\,\!</math>) and pre-stretch (factor <math>D\,\!</math>). The two levels of the factors used in the experiment are as shown in | Vinyl panels, used as instrument panels in a certain automobile, are seen to develop defects after a certain amount of time. To investigate the issue, it is decided to carry out a two level factorial experiment. Potential factors to be investigated in the experiment are vacuum rate (factor <math>A\,\!</math>), material temperature (factor <math>B\,\!</math>), element intensity (factor <math>C\,\!</math>) and pre-stretch (factor <math>D\,\!</math>). The two levels of the factors used in the experiment are as shown in below. | ||
[[Image:doet7.1.png|center|300px|Factors to investigate defects in vinyl panels.]] | |||
With a <math>{2}^{4}\,\!</math> design requiring 16 runs per replicate it is only feasible for the manufacturer to run a single replicate. | |||
The experiment design and data, collected as percent defects, are shown in the following figure. Since the present experiment design contains only a single replicate, it is not possible to obtain an estimate of the error sum of squares, <math>S{{S}_{E}}\,\!</math>. It is decided to use the normal probability plot of effects to identify the significant effects. The effect values for each term are obtained as shown in the following figure. | |||
[[Image:doe7_13.png|center|650px|Experiment design for the [[Two_Level_Factorial_Experiments#Example_2| example]].|link=]] | |||
[[ | Lenth's method uses these values to estimate the variance. As described in [[DOE_References|[Lenth, 1989]]], if all effects are arranged in ascending order, using their absolute values, then <math>{{s}_{0}}\,\!</math> is defined as 1.5 times the median value: | ||
::<math>\begin{align} | ::<math>\begin{align} | ||
Line 426: | Line 422: | ||
Using <math>{{s}_{0}}\,\!</math>, the "pseudo standard error" (<math>PSE\,\!</math>) is calculated as 1.5 times the median value of all effects that are less than 2.5 <math>{{s}_{0}}\,\!</math> : | Using <math>{{s}_{0}}\,\!</math>, the "pseudo standard error" (<math>PSE\,\!</math>) is calculated as 1.5 times the median value of all effects that are less than 2.5 <math>{{s}_{0}}\,\!</math> : | ||
::<math>\begin{align} | ::<math>\begin{align} | ||
Line 437: | Line 434: | ||
The significant effects can also be identified by comparing individual effect values to the margin of error or the threshold value using the pareto chart (see the third following figure). If the required significance is 0.1, then: | The significant effects can also be identified by comparing individual effect values to the margin of error or the threshold value using the pareto chart (see the third following figure). If the required significance is 0.1, then: | ||
::<math>margin\text{ }of\text{ }error={{t}_{\alpha /2,d}}\cdot PSE\,\!</math> | ::<math>margin\text{ }of\text{ }error={{t}_{\alpha /2,d}}\cdot PSE\,\!</math> | ||
Line 442: | Line 440: | ||
The <math>t\,\!</math> statistic, <math>{{t}_{\alpha /2,d}}\,\!</math>, is calculated at a significance of <math>\alpha /2\,\!</math> (for the two-sided hypothesis) and degrees of freedom <math>d=(\,\!</math> number of effects <math>)/3\,\!</math>. Thus: | The <math>t\,\!</math> statistic, <math>{{t}_{\alpha /2,d}}\,\!</math>, is calculated at a significance of <math>\alpha /2\,\!</math> (for the two-sided hypothesis) and degrees of freedom <math>d=(\,\!</math> number of effects <math>)/3\,\!</math>. Thus: | ||
::<math>\begin{align} | ::<math>\begin{align} | ||
Line 453: | Line 452: | ||
[[Image: | [[Image:doe7_14.png|center|800px|Effect values for the experiment in the [[Two_Level_Factorial_Experiments#Example_2| example]].|link=]] | ||
[[Image: | [[Image:doe7_15.png|center|650px|Normal probability plot of effects for the experiment in the [[Two_Level_Factorial_Experiments#Example_2| example]].|link=]] | ||
[[Image: | [[Image:doe7_16.png|center|650px|Pareto chart for the experiment in the [[Two_Level_Factorial_Experiments#Example_2| example]].|link=]] | ||
===Center Point Replicates=== | ===Center Point Replicates=== | ||
Line 465: | Line 464: | ||
Another method of dealing with unreplicated <math>{2}^{k}\,\!</math> designs that only have quantitative factors is to use replicated runs at the center point. The center point is the response corresponding to the treatment exactly midway between the two levels of all factors. Running multiple replicates at this point provides an estimate of pure error. Although running multiple replicates at any treatment level can provide an estimate of pure error, the other advantage of running center point replicates in the <math>{2}^{k}\,\!</math> design is in checking for the presence of curvature. The test for curvature investigates whether the model between the response and the factors is linear and is discussed in [[Two_Level_Factorial_Experiments#Using_Center_Point_Replicates_to_Test_Curvature| Center Pt. Replicates to Test Curvature]]. | Another method of dealing with unreplicated <math>{2}^{k}\,\!</math> designs that only have quantitative factors is to use replicated runs at the center point. The center point is the response corresponding to the treatment exactly midway between the two levels of all factors. Running multiple replicates at this point provides an estimate of pure error. Although running multiple replicates at any treatment level can provide an estimate of pure error, the other advantage of running center point replicates in the <math>{2}^{k}\,\!</math> design is in checking for the presence of curvature. The test for curvature investigates whether the model between the response and the factors is linear and is discussed in [[Two_Level_Factorial_Experiments#Using_Center_Point_Replicates_to_Test_Curvature| Center Pt. Replicates to Test Curvature]]. | ||
====Example==== | ====Example: Use Center Point to Get Pure Error==== | ||
Consider a <math>{2}^{2}\,\!</math> experiment design to investigate the effect of two factors, <math>A\,\!</math> and <math>B\,\!</math>, on a certain response. The energy consumed when the treatments of the <math>{2}^{2}\,\!</math> design are run is considerably larger than the energy consumed for the center point run (because at the center point the factors are at their middle levels). Therefore, the analyst decides to run only a single replicate of the design and augment the design by five replicated runs at the center point as shown in the following figure. The design properties for this experiment are shown in the second following figure. The complete experiment design is shown in the third following figure. The center points can be used in the identification of significant effects as shown next. | |||
: | [[Image:doe7.17.png||center|300px|<math>2^2\,\!</math> design augmented by five center point runs.]] | ||
[[Image:doe7_18.png|center|537px|Design properties for the experiment in the [[Two_Level_Factorial_Experiments#Example_3| example]].]] | |||
[[Image:doe7_19.png|center|630px|Experiment design for the [[Two_Level_Factorial_Experiments#Example_3| example]].]] | |||
Since the present <math>{2}^{2}\,\!</math> design is unreplicated, there are no degrees of freedom available to calculate the error sum of squares. By augmenting this design with five center points, the response values at the center points, <math>y_{i}^{c}\,\!</math>, can be used to obtain an estimate of pure error, <math>S{{S}_{PE}}\,\!</math>. Let <math>{{\bar{y}}^{c}}\,\!</math> represent the average response for the five replicates at the center. Then: | |||
::<math>S{{S}_{PE}}=Sum\text{ }of\text{ }Squares\text{ }for\text{ }center\text{ }points\,\!</math> | |||
Line 535: | Line 531: | ||
Assuming that the desired significance is 0.1, since <math>p\,\!</math> value < 0.1, it can be concluded that the main effect of factor <math>A\,\!</math> significantly affects the response. This result is displayed in the ANOVA table as shown in the following figure. Test for the significance of other factors can be carried out in a similar manner. | Assuming that the desired significance is 0.1, since <math>p\,\!</math> value < 0.1, it can be concluded that the main effect of factor <math>A\,\!</math> significantly affects the response. This result is displayed in the ANOVA table as shown in the following figure. Test for the significance of other factors can be carried out in a similar manner. | ||
[[Image:doe7_20.png|center|649px|Results for the experiment in the [[Two_Level_Factorial_Experiments#Example_3| example]].]] | |||
===Using Center Point Replicates to Test Curvature=== | ===Using Center Point Replicates to Test Curvature=== | ||
Center point replicates can also be used to check for curvature in replicated or unreplicated <math>{2}^{k}\,\!</math> designs. The test for curvature investigates whether the model between the response and the factors is linear. The way DOE | Center point replicates can also be used to check for curvature in replicated or unreplicated <math>{2}^{k}\,\!</math> designs. The test for curvature investigates whether the model between the response and the factors is linear. The way the DOE folio handles center point replicates is similar to its handling of blocks. The center point replicates are treated as an additional factor in the model. The factor is labeled as Curvature in the results of the DOE folio. If Curvature turns out to be a significant factor in the results, then this indicates the presence of curvature in the model. | ||
====Example: Use Center Point to Test Curvature==== | |||
To illustrate the use of center point replicates in testing for curvature, consider again the data of the single replicate <math>{2}^{2}\,\!</math> experiment from a preceding figure(labeled "<math>2^2</math> design augmented by five center point runs"). Let <math>{{x}_{1}}\,\!</math> be the indicator variable to indicate if the run is a center point: | |||
To illustrate the use of center point replicates in testing for curvature, consider again the data of the single replicate <math>{2}^{2}\,\!</math> experiment from a preceding figure(labeled "<math>2^2</math> design | |||
Line 664: | Line 659: | ||
Assuming that the desired significance is 0.1, since <math>p\,\!</math> value > 0.1, it can be concluded that curvature does not exist for this design. This results is shown in the ANOVA table in the figure above. The surface of the fitted model based on these results, along with the observed response values, is shown in the figure below. | Assuming that the desired significance is 0.1, since <math>p\,\!</math> value > 0.1, it can be concluded that curvature does not exist for this design. This results is shown in the ANOVA table in the figure above. The surface of the fitted model based on these results, along with the observed response values, is shown in the figure below. | ||
[[Image:doe7.21.png | [[Image:doe7.21.png|center|400px|Model surface and observed response values for the design in the [[Two_Level_Factorial_Experiments#Example_4| example]].]] | ||
<br> | <br> | ||
Line 670: | Line 665: | ||
==Blocking in 2<sup>k</sup> Designs== | ==Blocking in 2<sup>k</sup> Designs== | ||
Blocking can be used in the <math>{2}^{k}\,\!</math> designs to deal with cases when replicates cannot be run under identical conditions. Randomized complete block designs that were discussed in [[ | Blocking can be used in the <math>{2}^{k}\,\!</math> designs to deal with cases when replicates cannot be run under identical conditions. Randomized complete block designs that were discussed in [[Randomization and Blocking in DOE]] for factorial experiments are also applicable here. At times, even with just two levels per factor, it is not possible to run all treatment combinations for one replicate of the experiment under homogeneous conditions. For example, each replicate of the <math>{2}^{2}\,\!</math> design requires four runs. If each run requires two hours and testing facilities are available for only four hours per day, two days of testing would be required to run one complete replicate. Blocking can be used to separate the treatment runs on the two different days. Blocks that do not contain all treatments of a replicate are called incomplete blocks. In incomplete block designs, the block effect is confounded with certain effect(s) under investigation. For the <math>{2}^{2}\,\!</math> design assume that treatments <math>(1)\,\!</math> and <math>ab\,\!</math> were run on the first day and treatments <math>a\,\!</math> and <math>b\,\!</math> were run on the second day. Then, the incomplete block design for this experiment is: | ||
Line 745: | Line 740: | ||
Therefore, to confound the interaction <math>AB\,\!</math> with the block effect in the <math>{2}^{2}\,\!</math> incomplete block design, treatments <math>(1)\,\!</math> and <math>ab\,\!</math> (with <math>L=0\,\!</math>) should be assigned to block 2 and treatment combinations <math>a\,\!</math> and <math>b\,\!</math> (with <math>L=1\,\!</math>) should be assigned to block 1. | Therefore, to confound the interaction <math>AB\,\!</math> with the block effect in the <math>{2}^{2}\,\!</math> incomplete block design, treatments <math>(1)\,\!</math> and <math>ab\,\!</math> (with <math>L=0\,\!</math>) should be assigned to block 2 and treatment combinations <math>a\,\!</math> and <math>b\,\!</math> (with <math>L=1\,\!</math>) should be assigned to block 1. | ||
====Example==== | ====Example: Two Level Factorial Design with Two Blocks==== | ||
This example illustrates how treatments can be allocated to two blocks for an unreplicated <math>{2}^{k}\,\!</math> design. Consider the unreplicated <math>{2}^{4}\,\!</math> design to investigate the four factors affecting the defects in automobile vinyl panels discussed in [[Two_Level_Factorial_Experiments#Normal_Probability_Plot_of_Effects| Normal Probability Plot of Effects]]. Assume that the 16 treatments required for this experiment were run by two different operators with each operator conducting 8 runs. This experiment is an example of an incomplete block design. The analyst in charge of this experiment assumed that the interaction <math>ABCD\,\!</math> was not significant and decided to allocate treatments to the two operators so that the <math>ABCD\,\!</math> interaction was confounded with the block effect (the two operators are the blocks). The allocation scheme to assign treatments to the two operators can be obtained as follows. | This example illustrates how treatments can be allocated to two blocks for an unreplicated <math>{2}^{k}\,\!</math> design. Consider the unreplicated <math>{2}^{4}\,\!</math> design to investigate the four factors affecting the defects in automobile vinyl panels discussed in [[Two_Level_Factorial_Experiments#Normal_Probability_Plot_of_Effects| Normal Probability Plot of Effects]]. Assume that the 16 treatments required for this experiment were run by two different operators with each operator conducting 8 runs. This experiment is an example of an incomplete block design. The analyst in charge of this experiment assumed that the interaction <math>ABCD\,\!</math> was not significant and decided to allocate treatments to the two operators so that the <math>ABCD\,\!</math> interaction was confounded with the block effect (the two operators are the blocks). The allocation scheme to assign treatments to the two operators can be obtained as follows. | ||
Line 763: | Line 758: | ||
::<math>ab\ \ :\ \ \text{ }L=1+1+0+0=2=0\text{ (mod 2)}\,\!</math> | ::<math>ab\ \ :\ \ \text{ }L=1+1+0+0=2=0\text{ (mod 2)}\,\!</math> | ||
[[Image:doe7.22.png | [[Image:doe7.22.png|center|200px| Allocation of treatments to two blocks for the <math>2^4</math> design in the example by confounding interaction of <math>ABCD</math> with the blocks.]] | ||
Therefore, <math>ab\,\!</math> should be assigned to Block 2 or the second operator. Other treatments can be allocated to the two operators in a similar manner to arrive at the allocation scheme shown in the figure below. | Therefore, <math>ab\,\!</math> should be assigned to Block 2 or the second operator. Other treatments can be allocated to the two operators in a similar manner to arrive at the allocation scheme shown in the figure below. | ||
In DOE | In a DOE folio, to confound the <math>ABCD\,\!</math> interaction for the <math>{2}^{4}\,\!</math> design into two blocks, the number of blocks are specified as shown in the figure below. Then the interaction <math>ABCD\,\!</math> is entered in the Block Generator window (second following figure) which is available using the Block Generator button in the following figure. The design generated by the Weibull++ DOE folio is shown in the third of the following figures. This design matches the allocation scheme of the preceding figure. | ||
[[Image:doe7_23.png|center|700px| Adding block properties for the experiment in the example.|link=]] | |||
= | [[Image:doe7_24.png|center|700px|Specifying the interaction ABCD as the interaction to be confounded with the blocks for the [[Two_Level_Factorial_Experiments#Example_5| example]].|link=]] | ||
[[Image:doe7_25.png|center|800px|Two block design for the experiment in the [[Two_Level_Factorial_Experiments#Example_5| example]].|link=]] | |||
For the analysis of this design, the sum of squares for all effects are calculated assuming no blocking. Then, to account for blocking, the sum of squares corresponding to the <math>ABCD\,\!</math> interaction is considered as the sum of squares due to blocks and <math>ABCD\,\!</math>. In the DOE folio, this is done by displaying this sum of squares as the sum of squares due to the blocks. This is shown in the following figure where the sum of squares in question is obtained as 72.25 and is displayed against Block. The interaction ABCD, which is confounded with the blocks, is not displayed. Since the design is unreplicated, any of the methods to analyze unreplicated designs mentioned in [[Two_Level_Factorial_Experiments#Unreplicated_2__Designs_in_2__Blocks| Unreplicated <math>2^k</math> designs]] have to be used to identify significant effects. | |||
[[Image: | [[Image:doe7_26.png|center|793px|ANOVA table for the experiment of the [[Two_Level_Factorial_Experiments#Example_5| example]].|link=]] | ||
===Unreplicated 2<sup>''k''</sup> Designs in 2<sup>''p''</sup> Blocks=== | |||
A single replicate of the <math>{2}^{k}\,\!</math> design can be run in up to <math>{2}^{p}\,\!</math> blocks where <math>p<k\,\!</math>. The number of effects confounded with the blocks equals the degrees of freedom associated with the block effect. | |||
Line 827: | Line 823: | ||
In general, when an unreplicated <math>{2}^{k}\,\!</math> design is confounded in <math>{2}^{p}\,\!</math> blocks, <math>p\,\!</math> contrasts are needed (<math>{{L}_{1}},{{L}_{2}}...{{L}_{p}}\,\!</math>). <math>p\,\!</math> effects are selected to define these contrasts such that none of these effects are the generalized interaction of the others. The <math>{2}^{p}\,\!</math> blocks can then be assigned the treatments using the <math>p\,\!</math> contrasts. <math>{{2}^{p}}-(p+1)\,\!</math> effects, that are also confounded with the blocks, are then obtained as the generalized interaction of the <math>p\,\!</math> effects. In the statistical analysis of these designs, the sum of squares are computed as if no blocking were used. Then the block sum of squares is obtained by adding the sum of squares for all the effects confounded with the blocks. | In general, when an unreplicated <math>{2}^{k}\,\!</math> design is confounded in <math>{2}^{p}\,\!</math> blocks, <math>p\,\!</math> contrasts are needed (<math>{{L}_{1}},{{L}_{2}}...{{L}_{p}}\,\!</math>). <math>p\,\!</math> effects are selected to define these contrasts such that none of these effects are the generalized interaction of the others. The <math>{2}^{p}\,\!</math> blocks can then be assigned the treatments using the <math>p\,\!</math> contrasts. <math>{{2}^{p}}-(p+1)\,\!</math> effects, that are also confounded with the blocks, are then obtained as the generalized interaction of the <math>p\,\!</math> effects. In the statistical analysis of these designs, the sum of squares are computed as if no blocking were used. Then the block sum of squares is obtained by adding the sum of squares for all the effects confounded with the blocks. | ||
====Example==== | ====Example: 2 Level Factorial Design with Four Blocks==== | ||
This example illustrates how DOE | This example illustrates how a DOE folio obtains the sum of squares when treatments for an unreplicated <math>{2}^{k}\,\!</math> design are allocated among four blocks. Consider again the unreplicated <math>{2}^{4}\,\!</math> design used to investigate the defects in automobile vinyl panels presented in [[Two_Level_Factorial_Experiments#Normal_Probability_Plot_of_Effects| Normal Probability Plot of Effects]]. Assume that the 16 treatments needed to complete the experiment were run by four operators. Therefore, there are four blocks. Assume that the treatments were allocated to the blocks using the generators mentioned in the previous section, i.e., treatments were allocated among the four operators by confounding the effects, <math>AC\,\!</math> and <math>BD,\,\!</math> with the blocks. These effects can be specified as Block Generators as shown in the following figure. (The generalized interaction of these two effects, interaction <math>ABCD\,\!</math>, will also get confounded with the blocks.) The resulting design is shown in the second following figure and matches the allocation scheme obtained in the previous section. | ||
[[Image: | [[Image:doe7_27.png|center|700px|Specifying the interactions AC and BD as block generators for the [[Two_Level_Factorial_Experiments#Example_6| example]].|link=]] | ||
Line 838: | Line 834: | ||
[[Image: | [[Image:doe7_28.png|center|800px|Design for the experiment in the [[Two_Level_Factorial_Experiments#Example_6| example]].|link=]] | ||
[[Image: | [[Image:doe7_29.png|center|793px|ANOVA table for the experiment in the [[Two_Level_Factorial_Experiments#Example_6| example]].|link=]] | ||
==Variability Analysis== | ==Variability Analysis== | ||
For replicated two level factorial experiments, DOE | For replicated two level factorial experiments, the DOE folio provides the option of conducting variability analysis (using the Variability Analysis icon under the Data menu). The analysis is used to identify the treatment that results in the least amount of variation in the product or process being investigated. Variability analysis is conducted by treating the standard deviation of the response for each treatment of the experiment as an additional response. The standard deviation for a treatment is obtained by using the replicated response values at that treatment run. As an example, consider the <math>{2}^{3}\,\!</math> design shown in the following figure where each run is replicated four times. A variability analysis can be conducted for this design. The DOE folio calculates eight standard deviation values corresponding to each treatment of the design (see second following figure). Then, the design is analyzed as an unreplicated <math>{2}^{3}\,\!</math> design with the standard deviations (displayed as Y Standard Deviation. in second following figure) as the response. The normal probability plot of effects identifies <math>AC\,\!</math> as the effect that influences variability (see third figure following). Based on the effect coefficients obtained in the fourth figure following, the model for Y Std. is: | ||
Line 858: | Line 854: | ||
[[Image:doe7.30.png | [[Image:doe7.30.png|center|391px|A <math>2^3\,\!</math> design with four replicated response values that can be used to conduct a variability analysis.]] | ||
<br> | <br> | ||
[[Image:doe7_31.png|center|727px|Variability analysis in DOE++.|link=]] | |||
<br> | <br> | ||
[[Image:doe7_32.png|center|650px|Normal probability plot of effects for the variability analysis example.|link=]] | |||
<br> | <br> | ||
[[Image:doe7_33.png|center|727px|Effect coefficients for the variability analysis example.|link=]] | |||
<br> | <br> | ||
Latest revision as of 23:58, 16 March 2023
Two level factorial experiments are factorial experiments in which each factor is investigated at only two levels. The early stages of experimentation usually involve the investigation of a large number of potential factors to discover the "vital few" factors. Two level factorial experiments are used during these stages to quickly filter out unwanted effects so that attention can then be focused on the important ones.
2k Designs
The factorial experiments, where all combination of the levels of the factors are run, are usually referred to as full factorial experiments. Full factorial two level experiments are also referred to as

A full factorial two level design with
The two levels of the factor in the
The 22 Design
The simplest of the two level factorial experiments is the
The 23 Design
The
The
Analysis of 2k Designs
The
Notation
Based on the notation used in General Full Factorial Designs, the ANOVA model for a two level factorial experiment with three factors would be as follows:
where:
- •
represents the overall mean - •
represents the independent effect of the first factor (factor ) out of the two effects and - •
represents the independent effect of the second factor (factor ) out of the two effects and - •
represents the independent effect of the interaction out of the other interaction effects - •
represents the effect of the third factor (factor ) out of the two effects and - •
represents the effect of the interaction out of the other interaction effects - •
represents the effect of the interaction out of the other interaction effects - •
represents the effect of the interaction out of the other interaction effects
and
The notation for a linear regression model having three predictor variables with interactions is:
The notation for the regression model is much more convenient, especially for the case when a large number of higher order interactions are present. In two level experiments, the ANOVA model requires only one indicator variable to represent each factor for both qualitative and quantitative factors. Therefore, the notation for the multiple linear regression model can be applied to the ANOVA model of the experiment that has all the factors at two levels. For example, for the experiment of the ANOVA model given above,
As mentioned earlier, it is important to note that the coding for the indicator variables for the ANOVA models of two level factorial experiments is reversed from the coding followed in General Full Factorial Designs. Here
In summary, the ANOVA model for the experiments with all factors at two levels is different from the ANOVA models for other experiments in terms of the notation in the following two ways:
- • The notation of the regression models is used for the effect coefficients.
- • The coding of the indicator variables is reversed.
Special Features
Consider the design matrix,
Notice that, due to the orthogonal design of the
where
The
Then the variance-covariance matrix for the
Note that the variance-covariance matrix for the
It can also be noted from the equation given above, that in addition to the
This property is used to construct the normal probability plot of effects in
Example
To illustrate the analysis of a full factorial



The applicable model using the notation for
where the indicator variable,
If the subscripts for the run (
To investigate how the given factors affect the response, the following hypothesis tests need to be carried:
This test investigates the main effect of factor
where
This test investigates the two factor interaction
where
This test investigates the three factor interaction
where
Expression of the ANOVA Model as
In matrix notation, the ANOVA model can be expressed as:
where:
Calculation of the Extra Sum of Squares for the Factors
Knowing the matrices
where
Similarly, the extra sum of squares for the interaction effect
The extra sum of squares for other effects can be obtained in a similar manner.
Calculation of the Test Statistics
Knowing the extra sum of squares, the test statistic for the effects can be calculated. For example, the test statistic for the interaction
where
Assuming that the desired significance is 0.1, since

Calculation of Effect Coefficients
The estimate of effect coefficients can also be obtained:

The coefficients and related results are shown in the Regression Information table above. In the table, the Effect column displays the effects, which are simply twice the coefficients. The Standard Error column displays the standard error,
Model Equation
From the analysis results in the above figure within calculation of effect coefficients section, it is seen that effects
To make the model hierarchical, the main effect,
This equation can be viewed in a DOE folio, as shown in the following figure, using the Show Analysis Summary icon in the Control Panel. The equation shown in the figure will match the hierarchical model once the required terms are selected using the Select Effects icon.

Replicated and Repeated Runs
In the case of replicated experiments, it is important to note the difference between replicated runs and repeated runs. Both repeated and replicated runs are multiple response readings taken at the same factor levels. However, repeated runs are response observations taken at the same time or in succession. Replicated runs are response observations recorded in a random order. Therefore, replicated runs include more variation than repeated runs. For example, a baker, who wants to investigate the effect of two factors on the quality of cakes, will have to bake four cakes to complete one replicate of a
Unreplicated 2k Designs
If a factorial experiment is run only for a single replicate then it is not possible to test hypotheses about the main effects and interactions as the error sum of squares cannot be obtained. This is because the number of observations in a single replicate equals the number of terms in the ANOVA model. Hence the model fits the data perfectly and no degrees of freedom are available to obtain the error sum of squares.
However, sometimes it is only possible to run a single replicate of the
Pooling Higher Order Interactions
One of the ways to deal with unreplicated
Normal Probability Plot of Effects
Another way to use unreplicated
Example
Vinyl panels, used as instrument panels in a certain automobile, are seen to develop defects after a certain amount of time. To investigate the issue, it is decided to carry out a two level factorial experiment. Potential factors to be investigated in the experiment are vacuum rate (factor
With a
The experiment design and data, collected as percent defects, are shown in the following figure. Since the present experiment design contains only a single replicate, it is not possible to obtain an estimate of the error sum of squares,

Lenth's method uses these values to estimate the variance. As described in [Lenth, 1989], if all effects are arranged in ascending order, using their absolute values, then
Using
Using
The significant effects can also be identified by comparing individual effect values to the margin of error or the threshold value using the pareto chart (see the third following figure). If the required significance is 0.1, then:
The
The value of 4.534 is shown as the critical value line in the third following figure. All effects with absolute values greater than the margin of error can be considered to be significant. These effects are



Center Point Replicates
Another method of dealing with unreplicated
Example: Use Center Point to Get Pure Error
Consider a
Since the present
Then the corresponding mean square is:
Alternatively,
Once
Then, the test statistic to test the significance of the main effect of factor
The
Assuming that the desired significance is 0.1, since
Using Center Point Replicates to Test Curvature
Center point replicates can also be used to check for curvature in replicated or unreplicated
Example: Use Center Point to Test Curvature
To illustrate the use of center point replicates in testing for curvature, consider again the data of the single replicate
If
To investigate the presence of curvature, the following hypotheses need to be tested:
The test statistic to be used for this test is:
where
Calculation of the Sum of Squares
The
The sum of squares can now be calculated. For example, the error sum of squares is:
where
where
This extra sum of squares can be used to test for the significance of curvature. The corresponding mean square is:
Calculation of the Test Statistic
Knowing the mean squares, the statistic to check the significance of curvature can be calculated.
The
Assuming that the desired significance is 0.1, since
Blocking in 2k Designs
Blocking can be used in the
For this design the block effect may be calculated as:
The
The two equations given above show that, in this design, the
where the
The value of
Once the defining contrast is known, it can be used to allocate treatments to the blocks. For the
Note that the value of
Therefore, to confound the interaction
Example: Two Level Factorial Design with Two Blocks
This example illustrates how treatments can be allocated to two blocks for an unreplicated
The defining contrast for the
The treatments can be allocated to the two operators using the values of the defining contrast. Assume that
Therefore, treatment
Therefore,



For the analysis of this design, the sum of squares for all effects are calculated assuming no blocking. Then, to account for blocking, the sum of squares corresponding to the

Unreplicated 2k Designs in 2p Blocks
A single replicate of the
If two blocks are used (the block effect has two levels), then one (
Based on the values of
Since the block effect has three degrees of freedom, three effects are confounded with the block effect. In addition to
Example: 2 Level Factorial Design with Four Blocks
This example illustrates how a DOE folio obtains the sum of squares when treatments for an unreplicated

The sum of squares in this case can be obtained by calculating the sum of squares for each of the effects assuming there is no blocking. Once the individual sum of squares have been obtained, the block sum of squares can be calculated. The block sum of squares is the sum of the sum of squares of effects,


Variability Analysis
For replicated two level factorial experiments, the DOE folio provides the option of conducting variability analysis (using the Variability Analysis icon under the Data menu). The analysis is used to identify the treatment that results in the least amount of variation in the product or process being investigated. Variability analysis is conducted by treating the standard deviation of the response for each treatment of the experiment as an additional response. The standard deviation for a treatment is obtained by using the replicated response values at that treatment run. As an example, consider the
Based on the model, the experimenter has two choices to minimize variability (by minimizing Y Std.). The first choice is that


