Skip to content
National Cancer Institute National Cancer Institute U.S. National Institutes of Health
Division of Cancer Prevention logo
Home Site Map Contact DCP
Programs & Resources

Division of Cancer Prevention Staff

Stuart G. Baker, ScD

Mathematical Statistician
Biometry Research Group
Division of Cancer Prevention
National Cancer Institute
9609 Medical Center Drive, Room 5E638
Rockville, MD 20850
Phone (240) 276-7147
Fax (240) 276-7845
  • Dr. Stuart G. Baker ( develops and applies statistical methods to biological and clinical topics and also writes commentaries on theories of carcinogenesis. Examples include

    1. the concept of "paradigm instability," based on a buried treasure analogy, (Baker et. al. 2010, Baker 2013) for discussing somatic mutation theory versus the tissue organization field theory of carcinogenesis;
    2. the paired availability design for historical controls, which adjusts for different availabilities of treatment in different centers using a principal stratification model independently developed by Permutt and Hebel (1989), Baker and Lindeman (1994), Angrist, et al. (1996);
    3. relative utility curves for evaluating risk prediction via decision analysis (Baker et al, 2009, Baker et al. 2012);
    4. the BK-Plot for illustrating mixtures of probabilities, independently developed for Simpson’s paradox by Tan (1986), Jeon et. al. (1987), and Baker and Kramer (2001);
    5. Swirls and Ripples, a centroid-based analysis that can include “islands” of one class within another class, called Swirls, or the usual curved boundaries, called Ripples (Baker, 2010);
    6. a leave-one-out approach for evaluating surrogate endpoints (Baker et. al. 2012) and a graphical display of the sensitivity of the Prentice Criterion to sample size (Baker 2013);
    7. a biomarker pipeline to develop and evaluate cancer screening tests (Baker 2009);
    8. a latent-class model for the genetic analysis of twin data (Baker et. al. 2005), which represents a major departure from the traditional variance components model;
    9. a method to compare biologically relevant response curves in gene expression experiments with emphasis on heteromorphy, heterochrony, and heterometry (Baker 2014).

    Dr. Baker was the first recipient of the distinguished alum award from the Department of Biostatistics at the Harvard School of Public Health. He is also a fellow of the American Statistical Association

  • Mathematica

Back to top