References
Box, G. E. P., and J. S. Hunter. 1957. “Multi-Factor Experimental
Designs for Exploring Response Surfaces.” Annals of
Mathematical Statistics 28 (1): 195–241. https://doi.org/10.1214/aoms/1177707047.
Bullough, Richard C., and Christopher L. Melby. 1993. “Effect of
Inpatient Versus Outpatient Measurement Protocol on Resting Metabolic
Rate and Respiratory Exchange Ratio.” Annals of Nutrition and
Metabolism 37 (1): 24–32. https://doi.org/10.1159/000177745.
Chang, Clarence D., Oleg K. Kononenko, and Raymond E. Franklin. 1960.
“Maximum Data Through a Statistical Design.” Industrial
& Engineering Chemistry 52 (11): 939–42. https://doi.org/10.1021/ie50611a030.
Czitrom, Veronica. 1999. “One-Factor-at-a-Time Versus Designed
Experiments.” The American Statistician 53 (2): 126–31.
https://doi.org/10.1080/00031305.1999.10474445.
Dean, Angela, Daniel Voss, and Danel Draguljić. 2017. Design and
Analysis of Experiments. 2nd ed. Springer-Verlag. https://doi.org/10.1007/978-3-319-52250-0.
DeLuca, Laura S., Alex Reinhart, Gordon Weinberg, Michael Laudenbach,
Sydney Miller, and David West Brown. 2025. “Developing Students’
Statistical Expertise Through Writing in the Age of
AI.” Journal of Statistics and Data Science
Education 33 (3): 266–78. https://doi.org/10.1080/26939169.2025.2497547.
Giesbrecht, Francis G., and Marcia L. Gumpertz. 2004. Planning,
Construction, and Statistical Analysis of Comparative Experiments.
John Wiley & Sons, Inc. https://doi.org/10.1002/0471476471.
Imbens, Guido W., and Donald B. Rubin. 2015. Causal Inference for
Statistics, Social, and Biomedical Sciences. Cambridge University
Press. https://doi.org/10.1017/CBO9781139025751.
King, James R. 1992. “Presenting Experimental Data
Effectively.” Quality Engineering 4 (3): 399–412. https://doi.org/10.1080/08982119208918921.
Klotz, Jerome. 1969. “A Simple Proof of Scheffé’s
Multiple Comparison Theorem for Contrasts in the One-Way Layout.”
The American Statistician 23 (5): 44–45. https://doi.org/10.2307/2682195.
Lattimore, Tor, and Csaba Szepesvári. 2020. Bandit Algorithms.
Cambridge University Press. https://tor-lattimore.com/downloads/book/book.pdf.
Lock Morgan, Kari, and Donald B. Rubin. 2012. “Rerandomization to
Improve Covariate Balance in Experiments.” Annals of
Statistics 40 (2): 1263–82. https://doi.org/10.1214/12-AOS1008.
Maxwell, Scott E., Ken Kelley, and Joseph R. Rausch. 2008. “Sample
Size Planning for Statistical Power and Accuracy in Parameter
Estimation.” Annual Review of Psychology 59: 537–63. https://doi.org/10.1146/annurev.psych.59.103006.093735.
Mead, R., S. G. Gilmour, and A. Mead. 2012. Statistical Principles
for the Design of Experiments. Cambridge University Press. https://doi.org/10.1017/CBO9781139020879.
O’Brien, Peter C., and Thomas R. Fleming. 1979. “A Multiple
Testing Procedure for Clinical Trials.” Biometrics 35
(3): 549–56. https://doi.org/10.2307/2530245.
Pashley, Nicole E., and Luke W. Miratrix. 2022. “Block What You
Can, Except When You Shouldn’t.” Journal of Educational and
Behavioral Statistics 47 (1): 69–100. https://doi.org/10.3102/10769986211027240.
Pollock, K. H., H. M. Ross-Parker, and R. Mead. 1979. “A Sequence
of Games Useful in Teaching Experimental Design to Agriculture
Students.” The American Statistician 33 (2): 70–76. https://doi.org/10.1080/00031305.1979.10482663.
Reinhart, Alex, Ben Markey, Michael Laudenbach, Kachatad Pantusen,
Ronald Yurko, Gordon Weinberg, and David West Brown. 2025. “Do
LLMs Write Like Humans? Variation in
Grammatical and Rhetorical Styles.” Proceedings of the
National Academy of Sciences 122 (8): e2422455122. https://doi.org/10.1073/pnas.2422455122.
Rosén, Bengt. 1964. “Limit Theorems for Sampling from Finite
Populations.” Arkiv för Matematik 5: 383–424. https://doi.org/10.1007/BF02591138.
Rubin, Donald B. 2008. “Comment: The Design and Analysis of Gold
Standard Randomized Experiments.” Journal of the American
Statistical Association 103 (484): 1350–53. https://doi.org/10.1198/016214508000001011.
Scheffé, Henry. 1953. “A Method for Judging All Contrasts in the
Analysis of Variance.” Biometrika 40: 87–104. https://doi.org/10.2307/2333100.
Semitala, Fred C., Jillian L. Kadota, Allan Musinguzi, Fred Welishe,
Anne Nakitende, Lydia Akello, Lynn Kunihira Tinka, et al. 2024.
“Comparison of 3 Optimized Delivery Strategies for Completion of
Isoniazid-Rifapentine (3HP) for Tuberculosis Prevention
Among People Living with HIV in Uganda: A
Single-Center Randomized Trial.” PLOS Medicine 21 (2):
e1004356. https://doi.org/10.1371/journal.pmed.1004356.
Siegmund, David. 1985. Sequential Analysis: Tests and Confidence
Intervals. Springer. https://doi.org/10.1007/978-1-4757-1862-1.
Wassmer, Gernot, and Werner Brannath. 2016. Group Sequential and
Confirmatory Adaptive Designs in Clinical Trials. Springer. https://doi.org/10.1007/978-3-319-32562-0.
Wu, C. F. Jeff, and Michael Hamada. 2021. Experiments: Planning,
Analysis, and Optimization. John Wiley & Sons. https://doi.org/10.1002/9781119470007.