|
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Data
Sheet Examples Demo |
Power
and Size Advantages of Exact Parametric Methods
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Treatment A | 9.1 | 7.8 | 8.3 | 8.4 | 10.0 | 9.4 | 8.4 | 9.6 |
| Treatment B | 11.4 | 10.2 | 10.7 | 12.7 | 8.2 | 9.4 | 11.1 | 10.4 |
| Treatment C | 13.6 | 8.4 | 10.3 | 4.1 | 10.5 | 9.6 | 10.8 | 11.9 |
XPro computes the p-value for testing the equality of treatment means both under the (unreasonable) assumption of equal variances and without that assumption. The p-values of the two sets computed using XPro are as follows:
The discrepancy of the p-values in this example clearly demonstrates the serious weakness of the classical F-test in the presence of heterogeneity. Because it ignores the problem of heterogeneity, the classical F-test failes to detect significant differences in treatments, even though there is sufficient data.
Are you willing to risk your imporatnt findings from an experiment when the lack of power of a test is low? In applications such as biomedical experiments, weakness of a procedure at this magnitude is unacceptable. In summary, this example demonstrates how exact methods can help conserve resources by concluding experiments in a timely manner at a reduced cost.
Example 2: Size performance, a comparison with widely used MLE based methodsThis comparison and underlying exact methods have been presented at ASA courses and technical sessions. Although the study findings have implications for all higher-way balanced mixed models, we present here the results as applied to a simple a x n one-way layout (with n being the number of repetitions), so that you can easily reproduce the results and check our claims. In this simulation, the error variance is fixed at 1 and the variance component of the random factor is set to three values around 1. The following table shows Type I error of each test when the test is carried out at the .05 confidence level (using 10,000 simulated samples). This also represents the rate at which the corresponding 95% confidence intervals did not contain the true value of the variance component.
Type I (false positive) Error of Competing .05-Size Tests
| ----------------------------- | a = 2 , n = 10 | a =10, n =10 | a=5, n=100 |
| Variance component | .10 | 1.0 | 10.0 | .10 | 1.0 | 10.0 | .10 | 1.0 | 10 |
| Generalized tests | .05 | .05 | .05 | .05 | .05 | .05 | .05 | .05 | .05 |
| MLE based asymptotic test | .57 | .58 | .59 | .19 | .20 | .20 | .31 | .31 | .31 |
| RMLE based asymptotic test | .38 | .40 | .40 | .13 | .14 | .14 | .20 | .21 | .21 |
It is evident from this simulation study that regardless of how large n is, the Type I error rates of the asymptotic test range between approximately .2 and .6, unless the number of levels of the random factor is unusually large. XPro computes exact and/or size guaranteed p-values and confidence intervals on variance components of random effects and mixed models. It also allows one to compare variance components from two balanced models.
Application Example 3: _______________________________________
The Table below shows a representative data set from a pharmaceutical study on drug for treating male erectile dysfunction. The objective of the study was to determine the efficacy and safety of active treatment at doses 0.5mg, 1.5mg, 5mg compared to placebo. The four groups are denoted as 1 (placebo),2,3,and 4 in the table. The data was collected over a period of 10 weeks from 20 subjects.
| Group | Sub\Time | T1 | T2 | T3 | T4 | T5 |
| 1 | 1 | 29.90 | 29.22 | 30.91 | 31.35 | 29.27 |
| 1 | 2 | 29.75 | 30.90 | 29.11 | 31.76 | 29.01 |
| 1 | 3 | 29.16 | 32.64 | 31.68 | 30.19 | 30.30 |
| 1 | 4 | 29.33 | 27.54 | 29.16 | 33.77 | 31.21 |
| 1 | 5 | 29.28 | 29.93 | 31.35 | 29.03 | 30.64 |
| 2 | 6 | 27.06 | 32.16 | 31.88 | 35.33 | 38.35 |
| 2 | 7 | 32.08 | 38.20 | 21.75 | 29.53 | 33.51 |
| 2 | 8 | 20.74 | 23.14 | 34.34 | 33.63 | 29.96 |
| 2 | 9 | 30.53 | 31.57 | 35.36 | 33.04 | 24.58 |
| 2 | 10 | 31.27 | 21.92 | 41.40 | 38.48 | 22.38 |
| 3 | 11 | 30.99 | 30.85 | 32.49 | 33.13 | 33.58 |
| 3 | 12 | 32.68 | 30.68 | 30.30 | 33.11 | 32.76 |
| 3 | 13 | 30.99 | 33.33 | 31.09 | 31.78 | 31.87 |
| 3 | 14 | 31.83 | 31.58 | 33.62 | 34.79 | 33.89 |
| 3 | 15 | 34.13 | 31.09 | 33.44 | 31.52 | 29.31 |
| 4 | 16 | 24.77 | 27.53 | 28.88 | 34.23 | 35.21 |
| 4 | 17 | 35.62 | 32.55 | 29.92 | 26.54 | 28.22 |
| 4 | 18 | 36.16 | 31.28 | 33.63 | 40.04 | 33.28 |
| 4 | 19 | 33.84 | 29.90 | 26.45 | 30.10 | 30.76 |
| 4 | 20 | 28.01 | 30.62 | 30.91 | 25.20 | 28.53 |
It can be seen that this data has highly unequal group variances indicating that the assumption of equal variances that one assumes for applying the classical F-test is not reasonable. However, if the assumption has no bearing on the conclusion of the test, that is not a problem. Let us see what happens. Shown below is the ANOVA table for above data after running XPro. The table includes the p-values by the F-test available from general software packages and the generalized F-test obtained using XPro.
| Source | DF | SS | MS | F-val | P-val | Generalized P-val |
| Treatments | 3 | 49.77 | 16.59 | 1.54 | 0.2416 | 0.0005 |
| Occasions | 4 | 54.43 | 13.60 | 0.95 | 0.4436 | 0.253 |
| Within Treat. | 16 | 171.8 | 10.74 | |||
| Treat.X Occa. | 12 | 87.97 | 7.33 | 0.51 | 0.9011 | 0.988 |
| Error | 64 | 921.0 | 14.39 |
As evident from the generalized p-value of 0.005 in the ANOVA table that the differences in treatments is highly significant. In fact the effect of second and third dosages are higher than that of placebo. Despite the substantial evidence available in this data set to reject the hypothesis of equal treatment effect, the F-test available from most software packages would completely fail to detect the significance of treatment effects. With a p-value of .2416 the F-test does not even suggest us to continue with the experiment to establish statistical significance of results. Such a large p-value would have completely discouraged the researchers. In real world applications this would be a multi-million dollar mistake. With XPro you will not take the chance of making such mistakes because it computes p-values with and without assuming equal variances.
References
[1]. Krutchkoff, R.G. (1988). "One-Way
Fixed Effects Analysis of Variance When the Error Variances May be Unequal",
Journal of Statistics, Computational Simulation, 30, 177-183.
[2]. Thursby, J.G. (1992). "A Comparison of Several Exact and Approximate Tests for Structured Shift Under Hetereoscedasticity", Journal of Econometrics, 53, 363-386.
[3]. Weerahandi, S. (1995). Exact Statistical Methods for Data Analysis, Springer-erlag, New York.
[4]. Weerahandi, S. (1995). "ANOVA Under Unequal Error Variances", Biometrics, 51, 589-599.
[5]. Zhou, L. and Mathew, T. (1994), "Some Tests for Variance Components Using Generalized p-Values", Technometrics, 4, 394-402.
[6]. Weerahandi, S. (1991), "Testing Variance Components in Mixed Models with Generalized p Values," Journal of the American Statistical Association, 86, 151-153.
[7]. Chi, E. and Weerahandi, S (1998), "Comparing Treatments Under Growth Curve Models with Compound-Symmetric Covariance Structure: Exact Tests Using Generalized p-Values,", To appear in JSPI.
[ Home | Products | Services | FAQs | Downloads | Order Info | About Dataxiom ]
©2002 Dataxiom Software, Inc.. All rights reserved. e-Mail comments and questions to support@dataxiom.com.