Phylogenetic ctDNA analysis depicts early stage lung cancer evolution

Authors

Christopher Abbosh

Nicolai J. Birkbak

Gareth A. Wilson

Mariam Jamal-Hanjani

Tudor Constantin

Raheleh Salari

John Le Quesne

David A. Moore

Selvaraju Veeriah

Rachel Rosenthal

Teresa Marafioti

Eser Kirkizlar

Thomas B. K. Watkins

Nicholas McGranahan

Sophia Ward

Luke Martinson

Joan Riley

Francesco Fraioli

Maise Al Bakir

Eva Gronroos

Francisco Zambrana

Raymondo Endozo

Wenya Linda Bi

Fiona M. Fennessy

Nicole Sponer

Diana Johnson

Joanne Laycock

Seema Shafi

Justyna Czyzewska-Khan

Andrew Rowan

Tim Chambers

Nirupa Matthews

Samra Turajlic

Crispin Hiley

Siow Ming Lee

Martin D. Forster

Tanya Ahmad

Mary Falzon

Elaine Borg

David Lawrence

Martin Hayward

Shyam Kolvekar

Nikolaos Panagiotopoulos

Sam M. Janes

Ricky Thakrar

Asia Ahmed

Fiona Blackhall

Yvonne Summers

Rajesh Shah

Leena Joseph

Anne M. Quinn

Phil Crosbie

Babu Naidu

Gary Middleton

Gerald Langman

Simon Trotter

Marianne Nicolson

Hardy Remmen

Keith Kerr

Mahendran Chetty

Lesley Gomersall

Dean A. Fennell

Apostolos Nakas

Sridhar Rathinam

Girija Anand

Sajid Khan

Peter Russell

Veni Ezhil

Babikir Ismail

Melanie Irvin-Sellers

Vineet Prakash

Jason Lester

Malgorzata Kornaszewska

Richard Attanoos

Haydn Adams

Helen Davies

Stefan Dentro

Peter Van Loo

Allan Hackshaw

Jacqui Shaw

Yenting Ngai

Hugo J. W. L. Aerts

Jonas Demeulemeester

Aengus Stewart

Sergio A. Quezada

Karl Peggs

Hisham Bandulet

Nitzan Rosenfeld

Caroline Dive

Javier Herrero

Bernhard Kohli

Allyson Hugues

Helen Lowe

Yelena Nuzhdina

Charles Swanton

Doi

PMID: 28445469 · DOI: 10.1038/nature22364 · Journal: Nature (2017)

TL;DR

Abbosh et al. report the first ctDNA analysis from the TRACERx prospective lung cancer evolution study, profiling the first 100 surgically resected early-stage NSCLC patients with bespoke multiplex-PCR NGS panels (median 18 SNVs per patient) targeting clonal and subclonal variants identified by multi-region exome sequencing. Pre-operative ctDNA was detected in 48% (46/96) of tumors with a stark histology effect: 97% of LUSC versus 19% of LUAD. Post-operatively in a 24-patient longitudinal sub-cohort, ctDNA detected NSCLC relapse in 13/14 (93%) confirmed-relapse cases with a median lead-time of 70 days over imaging, identified adjuvant-chemotherapy resistance in real time, and resolved the subclonal architecture of metastatic seeding via phylogenetic tracking — including a targetable ERBB2 amplification subclone driving relapse in CRUK0004.

Cohort & data

  • Cohort: First 100 patients of the prospective TRACERx NSCLC study (NCT01888601); 96 evaluable for pre-operative ctDNA. Histologies include 58 LUAD, 31 LUSC, and 7 other NSCLC subtypes.
  • Longitudinal sub-cohort: 24 patients with pre- and post-surgical plasma profiling (14 confirmed relapse, 10 relapse-free with median follow-up 775 days).
  • Tumor genomics: 327 spatially separated primary tumor regions plus 4 metastatic biopsies underwent whole-exome sequencing through the TRACERx multi-region exome (M-Seq) pipeline; copy number / purity / ploidy via ASCAT; subclonal clusters via a modified PyClone; phylogenetic trees constructed with CITUP (0.1.0) plus manual reconstruction for relapse/autopsy cases (PMID:28445469).
  • ctDNA assay: Bespoke patient-specific multiplex-PCR NGS panels (Natera assay design pipeline; precursor to Signatera) — amplicon sequencing of clonal + subclonal SNVs, sequenced on Illumina HiSeq 2500 to ~40,000× target depth. Median 18 SNVs per panel pre-operatively; expanded to median 28 SNVs for LUAD panels in the longitudinal phase. Analytical sensitivity >99% at VAF ≥0.1%; specificity 99.6% per SNV.
  • Imaging: Pre-operative PET/CT reviewed for 92/96 patients; primary tumor volume measured on CT using 3D Slicer. Ki67 immunohistochemistry on TMA cores.
  • Post-mortem analysis: One patient (CRUK0063) co-recruited to the PEACE post-mortem study (NCT03004755); 6 metastatic regions sampled within 24 h of death.
  • Validation: Cross-platform validation in 28 patients using the Oncomine Lung cfDNA Assay (ThermoFisher).
  • Data deposition: EGA accessions EGAS00001002247 (primary) and EGAS00001002415 (metastatic).

Key findings

  • Pre-operative ctDNA detection rate is dominated by histology. ≥2 SNVs detected in 46/96 (48%) NSCLCs pre-operatively: 30/31 (97%) LUSC, 5/7 (71%) other NSCLC, 11/58 (19%) LUAD. Stage I subgroup: 16/17 (94%) LUSC vs 5/39 (13%) LUAD (PMID:28445469).
  • Independent predictors of ctDNA detection (multivariable logistic regression): non-adenocarcinoma histology, lymphovascular invasion, and high Ki67 proliferation index. PET FDG-avidity (TBR) predicted ctDNA detection by ROC AUC 0.84 (P<0.001, n=92).
  • Driver mutation status does not predict ctDNA in LUAD. Within the LUAD subgroup, KRAS, EGFR, and TP53 driver events were not associated with ctDNA detection (Extended Data Fig. 3e).
  • Tumor volume linearly predicts plasma clonal VAF. Tumor volume vs mean clonal VAF: Spearman’s ρ = 0.63, P<0.001, n=37; pathological tumor size ρ = 0.405, P=0.005, n=46. Linear model predicts a 10 cm³ primary tumor → mean clonal plasma VAF of 0.1% (95% CI 0.06–0.18%), corresponding to ~302 million malignant cells.
  • Implication for screening sensitivity: A 4 mm low-dose CT-detectable nodule (~0.034 cm³) would correspond to a predicted plasma VAF of 1.8 × 10⁻⁴ % (95% CI 9.8 × 10⁻⁶ to 0.0033%), at or beyond current ctDNA platform detection limits.
  • Subclonal SNV detection scales with subclone volume. Shared subclones detected in 35/57 (61%) vs private subclones in 26/80 (33%); subclone volume (CCF × tumor volume × purity) correlated with subclonal plasma VAF.
  • Post-operative ctDNA detects relapse with a median 70-day lead-time. ≥2 SNVs detected in 13/14 (93%) confirmed-relapse patients prior to or at clinical relapse vs 1/10 (10%) relapse-free patients (CRUK0013). Lead-time range 10–346 days; 4/13 cases had >6-month lead-times. In two cases ctDNA preceded inconclusive CT imaging by 157 (CRUK0004) and 163 days (CRUK0045).
  • ctDNA marks adjuvant chemotherapy resistance in real time. CRUK0080, CRUK0004, and CRUK0062 had detectable ctDNA within 30 days of surgery, with SNV counts increasing through adjuvant chemotherapy — all recurred within 1 year. Conversely, CRUK0013 had 20 SNVs detectable post-op, became undetectable after adjuvant chemoradiotherapy, and remained relapse-free at 688 days.
  • Phylogenetic ctDNA tracks the subclone(s) seeding metastasis. In CRUK0063, post-operative detection of an OR5D18 SNV traced back to a subclone private to primary region 3; metastatic biopsy confirmed this subclone gave rise to the para-vertebral metastasis (and contained an ARID1A stop-gain driver only retrospectively detectable at lower threshold in primary region 3).
  • Targetable relapse-driving subclones can be identified. CRUK0004 relapse was dominated by a subclone harboring an ERBB2 (HER2) amplification (>15 copies, triploid background) — a potentially actionable target in NSCLC.
  • Polyclonal vs monoclonal relapse can be distinguished. Relapses involving subclones from a single phylogenetic branch (CRUK0004, CRUK0063, CRUK0065, CRUK0044) vs multiple branches (CRUK0080, CRUK0062, CRUK0041) were resolvable from plasma.
  • Expanded 103-SNV panel detects relapse 189 days earlier. In the post-mortem case CRUK0063, the expanded panel detected ctDNA at day 151 post-surgery — 189 days before the 19-SNV panel.
  • Plasma VAF of subclonal clusters mirrors phylogenetic distance from the trunk. In CRUK0063, mean VAFs of clusters 11, 8, 12, 9, and 5 reflected their proximity to the clonal cluster on the M-Seq tree, supporting tiered-burden inference from plasma alone.
  • An ancestral subclone with KRAS amplification (>15 copies, triploid background) drove residual lymph-node disease in CRUK0013 that responded to adjuvant chemoradiotherapy.

Genes & alterations

  • KRAS — driver SNVs targeted in LUAD assay-panels; not associated with pre-operative ctDNA detection within LUADs. KRAS amplification (>15 copies) found as an ancestral subclonal event driving lymph-node residual disease in CRUK0013, which responded to adjuvant chemoradiotherapy (PMID:28445469).
  • EGFR — driver SNVs in LUAD; not associated with pre-operative ctDNA detection within the LUAD subset.
  • TP53 — driver SNVs in LUAD; not associated with ctDNA detection within LUADs.
  • ERBB2 — focal amplification (>15 copies, triploid background) on a relapse-dominant subclone identified in plasma for patient CRUK0004; flagged by the authors as a potentially targetable relapse driver in NSCLC.
  • ARID1A — stop-gain driver SNV in the para-vertebral metastasis of CRUK0063, originally below M-Seq calling threshold in primary tumor region 3 from which the metastatic subclone derived.

Clinical implications

  • Post-operative ctDNA as a residual-disease biomarker in early-stage NSCLC, with median 70-day lead-time over CT-confirmed relapse, supports adjuvant trials triggered by molecular-residual-disease detection rather than population-level treatment of all resected stage I–III patients (PMID:28445469).
  • Adjuvant chemotherapy resistance can be detected in plasma within 1–6 months of surgery — patients whose ctDNA fails to clear (or whose SNV burden rises) on adjuvant cisplatin/carboplatin plus vinorelbine recurred within 1 year, while ctDNA clearance after adjuvant therapy was associated with sustained remission (CRUK0013, 688 days relapse-free at publication).
  • Phylogenetic ctDNA tracking can identify targetable subclonal drivers of relapse, exemplified by an ERBB2-amplified subclone in CRUK0004, opening the prospect of relapse-directed (rather than primary-tumor-directed) targeted therapy.
  • Histology must inform ctDNA assay design and clinical interpretation. With LUAD detection rates of only 13–19% for stage I disease using clonal-SNV panels of ~18 markers, LUAD-targeted residual-disease platforms require either expanded panels (28+ SNVs as used in the longitudinal cohort) or orthogonal approaches.
  • Cost remains a constraint: the authors estimate ~$1,750 per patient for primary-tumor exome + bespoke panel synthesis + 5 plasma time points.

Limitations & open questions

  • LUAD sensitivity ceiling. Even the expanded LUAD panel (median 28 SNVs) may underdetect ctDNA-low adenocarcinomas; the volume-VAF model predicts that small (<1 cm³) LUADs lie below current ctDNA detection thresholds.
  • Cohort size for the longitudinal phase is small (n=24): the 93% relapse-detection sensitivity and 70-day lead-time estimates carry wide confidence intervals and require validation in the planned larger TRACERx 421-patient cohort.
  • Sampling bias for relapse tissue. Metastatic biopsy was only available for 4 patients plus the CRUK0063 PEACE autopsy; the inferred dominance of monoclonal vs polyclonal relapse may shift with broader sampling.
  • Subclone resolution depends on M-Seq density. Subclones present below the M-Seq detection threshold in the primary tumor (e.g., the ARID1A stop-gain in CRUK0063) can only be retrospectively validated; the platform cannot prospectively design assays against truly de-novo metastatic mutations.
  • No validation against orthogonal MRD platforms beyond a 28-patient cross-platform comparison with the Oncomine Lung cfDNA hotspot panel.
  • Type I error at scale. The 103-SNV expanded panel for CRUK0063 carried a 10.7% per-time-point risk of a single false-positive SNV call, motivating the ≥2-SNV positivity threshold but illustrating that panel expansion is not free.
  • Generalizability beyond surgically resected early-stage disease to unresectable, oligometastatic, or non-NSCLC settings remains untested in this report.

Citations from this paper used in the wiki

  • “At least two SNVs were detected in ctDNA pre-operatively in 46 of 96 (48%) early-stage NSCLCs… 97% (30/31) of lung squamous cell carcinomas (LUSCs)… compared with 19% (11/58) of lung adenocarcinomas (LUADs).” (Results, p. 2)
  • “Multivariable analysis revealed non-adenocarcinoma histology, the presence of lympho-vascular invasion and high Ki67 proliferation index as independent predictors of ctDNA detection.” (Results, p. 3)
  • “Tumor volume correlated with mean clonal plasma VAF (Spearman’s Rho = 0.63, P<0.001, n=37)… a primary tumor burden of 10cm3 would result in a mean clonal plasma VAF of 0.1%.” (Results, p. 3)
  • “At least 2 SNVs were detected in 13 of 14 (93%) patients with confirmed NSCLC relapse prior to, or at, clinical relapse… the median interval between ctDNA detection and NSCLC relapse confirmed on clinically indicated CT imaging (lead-time) was 70 days (range 10 to 346 days).” (Results, p. 4)
  • “The subclone implicated by ctDNA as driving the relapse in the case of CRUK0004 contained an ERRB2 (HER2) amplification event (>15 copies, triploid background), that may be targetable in NSCLC.” (Results, p. 5)
  • “This ancestral subclone contained a KRAS amplification (>15 copies, triploid background) and was identified as present in primary tumor and sampled lymph-nodes by M-Seq.” (Results, p. 6, CRUK0013)
  • “A limitation to targeted ctDNA profiling is cost, estimated at $1750 per patient for sequencing a single tumor region, synthesis of a patient-specific assay-panel and profiling of five plasma samples.” (Discussion, p. 7)

This page was processed by crosslinker on 2026-05-15.