References

Aalen, Odd O., Ørnulf Borgan, and Håkon K. Gjessing. 2008. Survival and Event History Analysis: A Process Point of View. Springer. https://doi.org/10.1007/978-0-387-68560-1.
Agresti, Alan. 2015. Foundations of Linear and Generalized Linear Models. Wiley.
Arega, Balew, Gashaw Solela, Elias Tewabe, Asnake Agunie, Amanuel Zeleke, Ermiyas Tefera, Abraham Minda, and Yitagesu Getachew. 2024. “Weekends Admitted Adult Medical Patients Have Higher in-Hospital Mortality in Ethiopia: An Implication for Quality Improvement.” PLOS ONE 19 (10). https://doi.org/10.1371/journal.pone.0312538.
Bates, Douglas, Martin Mächler, Ben Bolker, and Steve Walker. 2015. “Fitting Linear Mixed-Effects Models Using lme4.” Journal of Statistical Software 67 (1): 1–48. https://doi.org/10.18637/jss.v067.i01.
Bates, Stephen, Trevor Hastie, and Robert Tibshirani. 2024. “Cross-Validation: What Does It Estimate and How Well Does It Do It?” Journal of the American Statistical Association 119 (546): 1434–45. https://doi.org/10.1080/01621459.2023.2197686.
Belenky, Gregory, Nancy J. Wesensten, David R. Thorne, Maria L. Thomas, Helen C. Sing, Daniel P. Redmond, Michael B. Russo, and Thomas J. Balkin. 2003. “Patterns of Performance Degradation and Restoration During Sleep Restriction and Subsequent Recovery: A Sleep Dose-Response Study.” Journal of Sleep Research 12 (1): 1–12. https://doi.org/10.1046/j.1365-2869.2003.00337.x.
Belkin, Mikhail, Daniel Hsu, Siyuan Ma, and Soumik Mandal. 2019. “Reconciling Modern Machine-Learning Practice and the Classical Bias–Variance Trade-Off.” Proceedings of the National Academy of Sciences 116 (32): 15849–54. https://doi.org/10.1073/pnas.1903070116.
Bengio, Yoshua, and Yves Grandvalet. 2004. “No Unbiased Estimator of the Variance of K-Fold Cross-Validation.” Journal of Machine Learning Research 5: 1089–1105. https://jmlr.csail.mit.edu/papers/v5/grandvalet04a.html.
Berkson, Joseph. 1946. “Limitations of the Application of Fourfold Table Analysis to Hospital Data.” Biometrics Bulletin 2 (3): 47–53. https://doi.org/10.2307/3002000.
Blum, Avrim, Adam Kalai, and John Langford. 1999. “Beating the Hold-Out: Bounds for K-Fold and Progressive Cross-Validation.” In Proceedings of the Twelfth Annual Conference on Computational Learning Theory, 203–8. https://doi.org/10.1145/307400.307439.
Bolles, Robert C. 1962. “The Difference Between Statistical Hypotheses and Scientific Hypotheses.” Psychological Reports 11 (3): 639–45. https://doi.org/10.2466/pr0.1962.11.3.639.
Boyd, Stephen, and Lieven Vandenberghe. 2004. Convex Optimization. Cambridge University Press.
Brandt, Allan M. 2007. The Cigarette Century. Basic Books.
Bringhurst, Robert. 2012. The Elements of Typographic Style. v4.0 ed. Hartley and Marks.
Buja, Andreas, Dianne Cook, Heike Hofmann, Michael Lawrence, Eun-Kyung Lee, Deborah F. Swayne, and Hadley Wickham. 2009. “Statistical Inference for Exploratory Data Analysis and Model Diagnostics.” Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences 367 (1906): 4361–83. https://doi.org/10.1098/rsta.2009.0120.
Button, Katherine S., John P. A. Ioannidis, Claire Mokrysz, Brian A. Nosek, Jonathan Flint, Emma S. J. Robinson, and Marcus R. Munafò. 2013. “Power Failure: Why Small Sample Size Undermines the Reliability of Neuroscience.” Nature Reviews Neuroscience 14: 365–76. https://doi.org/10.1038/nrn3475.
Buuren, Stef van, and Karin Groothuis-Oudshoorn. 2011. mice: Multivariate Imputation by Chained Equations in R.” Journal of Statistical Software 45 (3). https://doi.org/10.18637/jss.v045.i03.
Casella, George, and Roger L. Berger. 2002. Statistical Inference. 2nd ed. Duxbury.
Charig, C. R., D. R. Webb, S. R. Payne, and J. E. A. Wickham. 1986. “Comparison of Treatment of Renal Calculi by Open Surgery, Percutaneous Nephrolithotomy, and Extracorporeal Shockwave Lithotripsy.” BMJ 292: 879–82. https://doi.org/10.1136/bmj.292.6524.879.
Christensen, Ronald. 2011. Plane Answers to Complex Questions. 4th ed. Springer.
Cook, R Dennis. 1993. “Exploring Partial Residual Plots.” Technometrics 35 (4): 351–62. https://doi.org/10.1080/00401706.1993.10485350.
Cook, R Dennis, and Rodney Croos-Dabrera. 1998. “Partial Residual Plots in Generalized Linear Models.” Journal of the American Statistical Association 93 (442): 730–39. https://doi.org/10.1080/01621459.1998.10473725.
Crowder, Martin J. 1978. “Beta-Binomial Anova for Proportions.” Applied Statistics 27 (1): 34–37. https://doi.org/10.2307/2346223.
Daniel, Rhian M., Michael G. Kenward, Simon N. Cousens, and Bianca L. De Stavola. 2012. “Using Causal Diagrams to Guide Analysis in Missing Data Problems.” Statistical Methods in Medical Research 21 (3): 244–56. https://doi.org/10.1177/0962280210394469.
Davison, A. C., and D. V. Hinkley. 1997. Bootstrap Methods and Their Application. Cambridge University Press.
De Neve, Jan, and Thomas A. Gerds. 2020. “On the Interpretation of the Hazard Ratio in Cox Regression.” Biometrical Journal 62 (3): 742–50. https://doi.org/10.1002/bimj.201800255.
Doll, Richard, and A. Bradford Hill. 1954. “The Mortality of Doctors in Relation to Their Smoking Habits.” British Medical Journal 1 (4877): 1451–55. https://doi.org/10.1136/bmj.1.4877.1451.
Dunn, Peter K., and Gordon K. Smyth. 1996. “Randomized Quantile Residuals.” Journal of Computational and Graphical Statistics 5 (3): 236–44. https://doi.org/10.2307/1390802.
Eronen, Markus I, and Laura F Bringmann. 2021. “The Theory Crisis in Psychology: How to Move Forward.” Perspectives on Psychological Science 16 (4): 779–88. https://doi.org/10.1177/1745691620970586.
Fox, John, and Sanford Weisberg. 2018. “Visualizing Fit and Lack of Fit in Complex Regression Models with Predictor Effect Plots and Partial Residuals.” Journal of Statistical Software 87 (9). https://doi.org/10.18637/jss.v087.i09.
Friedman, Jerome H., Trevor Hastie, and Rob Tibshirani. 2010. “Regularization Paths for Generalized Linear Models via Coordinate Descent.” Journal of Statistical Software 33 (1). https://doi.org/10.18637/jss.v033.i01.
Garner, Bryan. 2022. Garner’s Modern English Usage. 5th ed. Oxford University Press.
Gelman, Andrew, Jennifer Hill, and Aki Vehtari. 2021. Regression and Other Stories. Cambridge University Press. https://avehtari.github.io/ROS-Examples/.
Gelman, Andrew, and Eric Loken. 2014. “The Statistical Crisis in Science.” The American Scientist 102 (6): 460–65. https://doi.org/10.1511/2014.111.460.
Gentle, James E. 2017. Matrix Algebra: Theory, Computations, and Applications in Statistics. 2nd ed. Springer. https://doi.org/10.1007/978-3-319-64867-5.
Gomila, R. 2021. “Logistic or Linear? Estimating Causal Effects of Experimental Treatments on Binary Outcomes Using Regression Analysis.” Journal of Experimental Psychology: General 150 (4): 700–709. https://doi.org/10.1037/xge0000920.
Gopen, George, and Judith Swan. 1990. “The Science of Scientific Writing.” American Scientist 78 (6): 550–58. https://www.jstor.org/stable/29774235.
Gorman, J. W., and R. J. Toman. 1966. “Selection of Variables for Fitting Equations to Data.” Technometrics 8 (1): 27–51. https://doi.org/10.2307/1266260.
Gotelli, Nicholas J., and Aaron M. Ellison. 2002. “Biogeography at a Regional Scale: Determinants of Ant Species Density in New England Bogs and Forests.” Ecology 83 (6): 1604–9. https://doi.org/10.1890/0012-9658(2002)083[1604:BAARSD]2.0.CO;2.
Harrell, Frank E., Robert M. Califf, David B. Pryor, Kerry L. Lee, and Robert A. Rosati. 1982. “Evaluating the Yield of Medical Tests.” Journal of the American Medical Association 247 (18): 2543–46. https://doi.org/10.1001/jama.1982.03320430047030 .
Harrison, Stephanie L., Elnara Fazio-Eynullayeva, Deirdre A. Lane, Paula Underhill, and Gregory Y. H. Lip. 2024. “Comorbidities Associated with Mortality in 31,461 Adults with COVID-19 in the United States: A Federated Electronic Medical Record Analysis.” PLOS Medicine 17 (9). https://doi.org/10.1371/journal.pmed.1003321.
Harville, David A. 1997. Matrix Algebra from a Statistician’s Perspective. Springer. https://doi.org/10.1007/b98818.
Hastie, Trevor, and Clive Loader. 1993. “Local Regression: Automatic Kernel Carpentry.” Statistical Science 8 (2): 120–43. https://doi.org/10.1214/ss/1177011002.
Hastie, Trevor, Andrea Montanari, Saharon Rosset, and Ryan J. Tibshirani. 2022. “Surprises in High-Dimensional Ridgeless Least Squares Interpolation.” Annals of Statistics 50 (2): 949–86. https://doi.org/10.1214/21-AOS2133.
Hastie, Trevor, Robert Tibshirani, and Jerome Friedman. 2009. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. 2nd ed. Springer. https://hastie.su.domains/ElemStatLearn/.
Hastie, Trevor, Robert Tibshirani, and Martin Wainright. 2015. Statistical Learning with Sparsity: The Lasso and Generalizations. CRC Press. https://hastie.su.domains/StatLearnSparsity/.
Helgestad, Mette Bach AND Njor, Anne Dorte Lerche AND Larsen. 2024. “Increasing Coverage in Cervical and Colorectal Cancer Screening by Leveraging Attendance at Breast Cancer Screening: A Cluster-Randomised, Crossover Trial.” PLOS Medicine 21 (August). https://doi.org/10.1371/journal.pmed.1004431.
Hernán, Miguel A., and James M. Robins. 2020. Causal Inference: What If. Chapman & Hall/CRC. https://www.hsph.harvard.edu/miguel-hernan/causal-inference-book/.
Hosmer, David W., Stanley Lemeshow, and Rodney X. Sturdivant. 2013. Applied Logistic Regression. 3rd ed. Wiley.
Humpherys, Jeffrey, and Tyler J. Jarvis. 2020. Foundations of Applied Mathematics: Algorithms, Approximation, Optimization. Vol. 2. Society for Industrial and Applied Mathematics.
Humpherys, Jeffrey, Tyler J. Jarvis, and Emily J. Evans. 2017. Foundations of Applied Mathematics: Mathematical Analysis. Vol. 1. Society for Industrial and Applied Mathematics.
Ioannidis, John P. A. 2008. “Why Most Discovered True Associations Are Inflated.” Epidemiology 19 (5): 640–48. https://doi.org/10.1097/EDE.0b013e31818131e7.
Julious, Steven A., and Mark A. Mullee. 1994. “Confounding and Simpson’s Paradox.” BMJ 309: 1480. https://doi.org/10.1136/bmj.309.6967.1480.
Kanstrup, Marie, Laura Singh, Elisabeth Johanna Leehr, Katarina E. Göransson, Sara Ahmed Pihlgren, Lalitha Iyadurai, Oili Dahl, et al. 2024. “A Guided Single Session Intervention to Reduce Intrusive Memories of Work-Related Trauma: A Randomised Controlled Trial with Healthcare Workers in the COVID-19 Pandemic.” BMC Medicine 22 (1): 403. https://doi.org/10.1186/s12916-024-03569-8.
Kirk, David S. 2009. “A Natural Experiment on Residential Change and Recidivism: Lessons from Hurricane Katrina.” American Sociological Review 74 (3): 484–505. https://doi.org/10.1177/000312240907400308.
Kuchibhotla, Arun K., John E. Kolassa, and Todd A. Kuffner. 2022. “Post-Selection Inference.” Annual Review of Statistics and Its Application 9: 505–27. https://doi.org/10.1146/annurev-statistics-100421-044639.
Landwehr, James M., Daryl Pregibon, and Anne C. Shoemaker. 1984. “Graphical Methods for Assessing Logistic Regression Models.” Journal of the American Statistical Association 79 (385): 61–71. https://doi.org/10.1080/01621459.1984.10477062.
Lei, Jing. 2020. “Cross-Validation with Confidence.” Journal of the American Statistical Association 115 (532): 1978–97. https://doi.org/10.1080/01621459.2019.1672556.
Li, Qi, and Jeff Racine. 2003. “Nonparametric Estimation of Distributions with Categorical and Continuous Data.” Journal of Multivariate Analysis 86 (2): 266–92. https://doi.org/10.1016/S0047-259X(02)00025-8.
Lin, Kevin Z., Yixuan Qiu, and Kathryn Roeder. 2024. eSVD-DE: Cohort-Wide Differential Expression in Single-Cell RNA-Seq Data Using Exponential-Family Embeddings.” BMC Bioinformatics 25 (113). https://doi.org/10.1186/s12859-024-05724-7.
Long, J. Scott, and Laurie H. Ervin. 2000. “Using Heteroscedasticity Consistent Standard Errors in the Linear Regression Model.” The American Statistician 54 (3): 217–24. https://doi.org/10.1080/00031305.2000.10474549.
Longcore, Travis, Hannah L. Aldern, John F. Eggers, Steve Flores, Lesly Franco, Eric Hirshfield-Yamanishi, Laina N. Petrinec, Wilson A. Yan, and André M. Barroso. 2015. “Tuning the White Light Spectrum of Light Emitting Diode Lamps to Reduce Attraction of Nocturnal Arthropods.” Philosophical Transactions of the Royal Society 370. https://doi.org/10.1098/rstb.2014.0125.
Loy, Adam. 2021. “Bringing Visual Inference to the Classroom.” Journal of Statistics and Data Science Education 29 (2): 171–82. https://doi.org/10.1080/26939169.2021.1920866.
MacKinnon, James G., and Halbert White. 1985. “Some Heteroskedasticity-Consistent Covariance Matrix Estimators with Improved Finite Sample Properties.” Journal of Econometrics 29 (3): 305–25. https://doi.org/10.1016/0304-4076(85)90158-7.
McCarthy, Daniel, Kai Zhang, Lawrence D. Brown, Richard Berk, Andreas Buja, Edward I. George, and Linda Zhao. 2018. “Calibrated Percentile Double Bootstrap for Robust Linear Regression Inference.” Statistica Sinica 28: 2565–89. https://doi.org/10.5705/ss.202016.0546.
Meehl, Paul E. 1990. “Why Summaries of Research on Psychological Theories Are Often Uninterpretable.” Psychological Reports 66 (1): 195–244. https://doi.org/10.2466/pr0.1990.66.1.195.
Miller, Jane E. 2013. The Chicago Guide to Writing about Multivariate Analysis. 2nd ed. University of Chicago Press.
Mohan, Karthika, and Judea Pearl. 2021. “Graphical Models for Processing Missing Data.” Journal of the American Statistical Association 116 (534): 1023–37. https://doi.org/10.1080/01621459.2021.1874961.
Moher, David, Corinne S. Dulberg, and George A. Wells. 1994. “Statistical Power, Sample Size, and Their Reporting in Randomized Controlled Trials.” JAMA 272 (2): 122–24. https://doi.org/10.1001/jama.1994.03520020048013.
Monin, B., P. J. Sawyer, and M. J. Marquez. 2008. “The Rejection of Moral Rebels: Resenting Those Who Do the Right Thing.” Journal of Personality and Social Psychology 95 (1): 76–93. https://doi.org/10.1037/0022-3514.95.1.76.
Muthukrishna, Michael, and Joseph Henrich. 2019. “A Problem in Theory.” Nature Human Behaviour 3 (February): 229–29. https://doi.org/10.1038/s41562-018-0522-1.
Nelder, John A. 1998. “The Selection of Terms in Response-Surface Models—How Strong Is the Weak-Heredity Principle?” The American Statistician 52 (4): 315–218. https://doi.org/10.1080/00031305.1998.10480588.
Nolan, Deborah, and Sara Stoudt. 2021. Communicating with Data. Oxford University Press.
Pearl, Judea, Madelyn Glymour, and Nicholas P. Jewell. 2016. Causal Inference in Statistics: A Primer. Wiley.
Peixoto, Julio L. 1990. “A Property of Well-Formulated Polynomial Regression Models.” The American Statistician 44 (1): 26–30. https://doi.org/10.1080/00031305.1990.10475687.
Perneger, Thomas V. 1998. “Smoking and Risk of Myocardial Infarction: Statistical and Biological Interactions Should Not Be Confused.” BMJ 317: 1017. https://doi.org/10.1136/bmj.317.7164.1017a.
Platt, John R. 1964. “Strong Inference.” Science 146 (3642): 347–53. https://doi.org/10.1126/science.146.3642.347.
Prescott, Eva, Merete Hippe, Peter Schnohr, Hans Ole Hein, and Jørgen Vestbo. 1998. “Smoking and Risk of Myocardial Infarction in Women and Men: Longitudinal Population Study.” BMJ 316: 1043–47. https://doi.org/10.1136/bmj.316.7137.1043.
Ramsey, Fred L., and Daniel W. Schafer. 2013. The Statistical Sleuth. 3rd ed. Brooks/Cole.
Reinhart, Alex. 2015. Statistics Done Wrong. No Starch Press. https://www.statisticsdonewrong.com/.
Rodell, Fred. 1936. “Goodbye to Law Reviews.” Virginia Law Review 43: 38–45. https://www.refsmmat.com/files/goodbye.pdf.
Rohrer, Julia M., and Ruben C. Arslan. 2021. “Precise Answers to Vague Questions: Issues with Interactions.” Advances in Methods and Practices in Psychological Science 4 (2). https://doi.org/10.1177/2515245921100736.
Rosenbaum, Paul R. 2020. Design of Observational Studies. 2nd ed. Springer. https://doi.org/10.1007/978-3-030-46405-9.
Rosenbusch, Hannes, Anthony M. Evans, and Marcel Zeelenberg. 2022. “The Relative Importance of Joke and Audience Characteristics in Eliciting Amusement.” Psychological Science 33 (9). https://doi.org/10.1177/09567976221098595.
Salomon, Joshua A., Alex Reinhart, Alyssa Bilinski, Eu Jing Chua, Wichada La Motte-Kerr, Minttu M. Rönn, Marissa B. Reitsma, et al. 2021. “The US COVID-19 Trends and Impact Survey: Continuous Real-Time Measurement of COVID-19 Symptoms, Risks, Protective Behaviors, Testing, and Vaccination.” Proceedings of the National Academy of Sciences 118 (51): e2111454118. https://doi.org/10.1073/pnas.2111454118.
Schervish, Mark J. 1995. Theory of Statistics. Springer.
Seber, George A. F., and Alan J. Lee. 2003. Linear Regression Analysis. 2nd ed. Wiley. https://doi.org/10.1002/9780471722199.
Sedlmeier, P., and G. Gigerenzer. 1989. “Do Studies of Statistical Power Have an Effect on the Power of Studies?” Psychological Bulletin 105 (2): 309–16. https://doi.org/10.1037/0033-2909.105.2.309.
Shalizi, Cosma Rohilla. 2024a. “Advanced Data Analysis from an Elementary Point of View.” https://www.stat.cmu.edu/~cshalizi/ADAfaEPoV/.
———. 2024b. “The Truth about Linear Regression.” https://www.stat.cmu.edu/~cshalizi/TALR/.
Silberzahn, R., E. L. Uhlmann, D. P. Martin, P. Anselmi, F. Aust, E. Awtrey, Š. Bahník, et al. 2018. “Many Analysts, One Data Set: Making Transparent How Variations in Analytic Choices Affect Results.” Advances in Methods and Practices in Psychological Science 1 (3): 337–56. https://doi.org/10.1177/2515245917747646.
Smith, Gordon C. S., and Jill P. Pell. 2003. “Parachute Use to Prevent Death and Major Trauma Related to Gravitational Challenge: Systematic Review of Randomised Controlled Trials.” BMJ 327 (7429): 1459–61. https://doi.org/10.1136/bmj.327.7429.1459.
Smith, Jack W., J. E. Everhart, W. C. Dickson, W. C. Knowler, and R. S. Johannes. 1988. “Using the ADAP Learning Algorithm to Forecast the Onset of Diabetes Mellitus.” In Proceedings of the Symposium on Computer Applications in Medical Care, 261–65.
Sollaci, Luciana B., and Mauricio G. Pereira. 2004. “The Introduction, Methods, Results, and Discussion (IMRAD) Structure: A Fifty-Year Survey.” Journal of the Medical Library Association 92 (3): 364–71.
Tang, Jin-Ling, and James A. Dickinson. 1998. “Smoking and Risk of Myocardial Infarction: Studying Relative Risk Is Not Enough.” BMJ 317: 1018. https://doi.org/10.1136/bmj.317.7164.1017a.
Therneau, Terry M., Patricia M. Grambsch, and Thomas R. Fleming. 1990. “Martingale-Based Residuals for Survival Models.” Biometrika 77 (1): 147–60. https://doi.org/10.1093/biomet/77.1.147.
Väisänen, Risto A., and Olli Järvinen. 1977. “Dynamics of Protected Bird Communities in a Finnish Archipelago.” Journal of Animal Ecology 46 (3): 891–908. https://www.jstor.org/stable/3648.
VanderWeele, Tyler J., and Ilya Shpitser. 2011. “A New Criterion for Confounder Selection.” Biometrics 67 (4): 1406–13. https://doi.org/10.1111/j.1541-0420.2011.01619.x.
Wasserman, Larry. 1999. “Estimation of the Causal Effect of a Time-Varying Exposure on the Marginal Mean of a Repeated Binary Outcome: Comment.” Journal of the American Statistical Association 94 (447). https://doi.org/10.1080/01621459.1999.10474171.
Weisberg, Sanford. 2014. Applied Linear Regression. 4th ed. Wiley.
Westreich, Daniel, and Sander Greenland. 2013. “The Table 2 Fallacy: Presenting and Interpreting Confounder and Modifier Coefficients.” American Journal of Epidemiology 177 (4): 292–98. https://doi.org/10.1093/aje/kws412.
Wewe, Jan. 1980. “Violations Force Indiana Airways to Reapply for FAA Authorization.” The Purdue Exponent 96 (54): 1.
White, Halbert. 1980. “A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroskedasticity.” Econometrica 48 (4): 817–38. https://doi.org/10.2307/1912934.
Whitney, P. J. 1978. “Broomrape (Orobanche) Seed Germination Inhibitors from Plant Roots.” Annals of Applied Biology 89 (3): 475–78. https://doi.org/10.1111/j.1744-7348.1978.tb05976.x.
Wickham, Hadley. 2019. Advanced R. 2nd ed. CRC Press. https://adv-r.hadley.nz/.
Wickham, Hadley, Mine Çetinkaya-Rundel, and Garrett Grolemund. 2023. R for Data Science. 2nd ed. O’Reilly. https://r4ds.hadley.nz/.
Williams, D. A. 1987. “Generalized Linear Model Diagnostics Using the Deviance and Single Case Deletions.” Applied Statistics 36 (2): 181–91. https://doi.org/10.2307/2347550.
Woll, Penella J. 1998. “Smoking and Risk of Myocardial Infarction: Smoking Is a Feminist Issue.” BMJ 317: 1018. https://doi.org/10.1136/bmj.317.7164.1017a.
Wood, Simon N. 2017. Generalized Additive Models: An Introduction with R. 2nd ed. CRC Press.
Zhao, Peng, and Bin Yu. 2006. “On Model Selection Consistency of Lasso.” Journal of Machine Learning Research 7 (90): 2541–63. https://www.jmlr.org/papers/v7/zhao06a.html.