Program Official

Principal Investigator

Awardee Organization

Yale University
United States

Fiscal Year
Activity Code
Early Stage Investigator Grants (ESI)
Not Applicable
Project End Date

Lung Cancer Screening via Ultrasensitive and Cost-efficient Analysis of Tumor DNA Signatures in Blood

Substantial improvements in cancer survival rates could be achieved by developing better tools to detect occult malignancies at an earlier, more curable stage. Unfortunately, efforts to identify serum protein biomarkers that are sufficiently cancer-specific to be used for screening have found little success. Here, we pursue an alternative strategy based on detection of exceptionally tumor-specific mutant DNA fragments in the circulation of patients with early-stage lung cancer. Because these tumor-derived DNA fragments harbor genetic signatures that would be uncommon in healthy individuals, they hold great promise for screening applications where a high frequency of false-positive results would be unacceptable. However, it is a formidable challenge to create an assay that is able to detect trace quantities of mutant DNA released into the bloodstream from a small, early-stage tumor, without knowing the tumor’s mutation profile beforehand. Additional challenges are posed by economic factors as well as the presence of low-level mutations in the healthy population. To address these challenges, we have assembled a multidisciplinary team with highly complementary expertise from Microsoft Research and from Yale, Rice, and Harvard Universities. In this proposal, we describe innovative solutions in which we apply tools of biochemistry, thermodynamics, machine learning, and biostatistics to develop and validate an ultrasensitive, cost-efficient assay for detecting rare mutant DNA fragments in blood as markers of early-stage lung cancer.


  • Wu LR, Dai P, Wang MX, Chen SX, Cohen EN, Jayachandran G, Zhang JX, Serrano AV, Xie NG, Ueno NT, Reuben JM, Barcenas CH, Zhang DY. Ensemble of nucleic acid absolute quantitation modules for copy number variation detection and RNA profiling. Nature communications. 2022 Apr 4;13(1):1791. PMID: 35379811
  • Zhang K, Pinto A, Cheng LY, Song P, Dai P, Wang M, Rodriguez L, Weller C, Zhang DY. Hairpin Structure Facilitates Multiplex High-Fidelity DNA Amplification in Real-Time Polymerase Chain Reaction. Analytical chemistry. 2022 Jul 12;94(27):9586-9594. Epub 2022 Jun 24. PMID: 35749270
  • Song P, Wu LR, Yan YH, Zhang JX, Chu T, Kwong LN, Patel AA, Zhang DY. Limitations and opportunities of technologies for the analysis of cell-free DNA in cancer diagnostics. Nature biomedical engineering. 2022 Mar;6(3):232-245. Epub 2022 Jan 31. PMID: 35102279
  • Song P, Chen SX, Yan YH, Pinto A, Cheng LY, Dai P, Patel AA, Zhang DY. Selective multiplexed enrichment for the detection and quantitation of low-fraction DNA variants via low-depth sequencing. Nature biomedical engineering. 2021 Jul;5(7):690-701. Epub 2021 May 3. PMID: 33941896
  • Zhang JX, Yordanov B, Gaunt A, Wang MX, Dai P, Chen YJ, Zhang K, Fang JZ, Dalchau N, Li J, Phillips A, Zhang DY. A deep learning model for predicting next-generation sequencing depth from DNA sequence. Nature communications. 2021 Jul 19;12(1):4387. PMID: 34282137
  • Dai P, Wu LR, Chen SX, Wang MX, Cheng LY, Zhang JX, Hao P, Yao W, Zarka J, Issa GC, Kwong L, Zhang DY. Calibration-free NGS quantitation of mutations below 0.01% VAF. Nature communications. 2021 Oct 21;12(1):6123. PMID: 34675197