Integrated Genomic Characterization of Papillary Thyroid Carcinoma
PMID: 25417114 · DOI: 10.1016/j.cell.2014.09.050 · Journal: Cell (2014)
TL;DR
The TCGA Research Network performed multiplatform genomic characterization of 496 papillary thyroid carcinomas (PTCs), the largest such cohort to date. They confirmed the dominance and mutual exclusivity of MAPK-pathway drivers (BRAF V600E, RAS mutations, RET/NTRK/BRAF/ALK fusions), discovered new PTC drivers (EIF1AX, PPM1D, CHEK2 and diverse novel gene fusions), and reduced the fraction of PTC cases without an identified oncogenic driver from ~25% to 3.5%. Combined SNV/CNV/methylation/mRNA/miRNA/RPPA analyses revealed that BRAFV600E-like (BVL) and RAS-like (RL) tumors are fundamentally different in genomic, epigenomic and proteomic profiles, with distinct signaling outputs and differentiation states, supporting a proposed molecular reclassification of thyroid cancer.
Cohort & data
- 496 primary PTCs plus 8 metastatic tumors from non-irradiated patients, accrued through TCGA. Histology breakdown: 324 (69.4%) classical-type (CT), 99 (21.2%) follicular-variant (FV), 35 (7.5%) tall cell variant (TCV), 9 (2.0%) uncommon variants, 29 without histological annotation.
- Aggressive thyroid cancers (poorly differentiated and undifferentiated/anaplastic) were excluded; cohort focuses on [[../cancer_types/THPA.md|papillary thyroid carcinoma (THPA)]].
- Dataset: thca_tcga_pub.
- 402 of 496 tumor/normal pairs had [[../methods/whole-exome-seq.md|whole-exome sequencing]] (mean tumor depth 97.0×, normal 94.9×); 390 tumors were profiled on all major platforms.
- Platforms (Table S1B): exome and whole genome DNA-seq, [[../methods/rna-seq.md|RNA-seq]], [[../methods/mirna-seq.md|miRNA-seq]], [[../methods/affymetrix-snp6.md|SNP6 arrays]], [[../methods/450k-methylation-array.md|Illumina 450k methylation arrays]], and [[../methods/rppa.md|reverse phase protein arrays (RPPA)]].
- Analytical methods include [[../methods/mutsig.md|MutSig]] for SMG calling, [[../methods/gistic.md|GISTIC2]] for focal SCNA, [[../methods/memo.md|MEMo]] for mutual exclusivity, and [[../methods/absolute.md|ABSOLUTE]] for cancer cell fraction estimation; HotNet2 and NBS for network analyses; PathSeq and BioBloom Tools for viral detection.
Key findings
- Low somatic mutation density. Mean 0.41 non-synonymous mutations per Mb (Figures 1A, S1A; Table S3), low relative to most other carcinomas. Mutation density correlated with age (Pearson p=5.2×10⁻¹⁸), risk of recurrence (Kruskal-Wallis p=3.4×10⁻⁴), and MACIS score (Pearson p=4×10⁻¹⁵). After age correction, the association with risk of recurrence persisted (p=9.7×10⁻³); the MACIS association did not (p=0.19).
- APOBEC enrichment in hypermutators. The 10 tumors with highest mutation densities (>1/Mb) were enriched for APOBEC-associated mutations (Mann-Whitney p=4×10⁻⁵), similar to bladder cancer.
- Seven significantly mutated genes (q<0.1). BRAF, NRAS, HRAS, KRAS, EIF1AX, PPM1D, CHEK2 (Figure 1C, Table S2).
- MAPK driver mutual exclusivity. BRAF + NRAS + HRAS + KRAS mutations were mutually exclusive in 300/402 (74.6%) patients (Fisher’s exact p=1.1×10⁻⁵; MEMo-corrected p<0.01).
- BRAF mutations. 248/402 (61.7%) BRAF mutations, mostly V600E; BRAFV600E characterized CT and TCV histologies.
- RAS mutations. 52/402 (12.9%) SSNVs at codons 12/61 of RAS genes; RAS mutations characterized the follicular variant.
- EIF1AX is a novel PTC driver. 6/402 (1.5%) tumors carried EIF1AX mutations (q=5.3×10⁻⁸), near-mutually exclusive with MAPK drivers (Fisher’s exact p=0.013 vs. RAS/BRAF). The single overlapping case carried a clonal KRAS mutation (CCF 100%) with subclonal EIF1AX (CCF 76%) and BRAF (CCF 53%) mutations.
- DNA-repair SMGs. PPM1D and CHEK2 mutations co-occurred with MAPK-pathway drivers. An additional 8 DNA-repair-related mutations across 26 (6.5%) tumors were mutually exclusive (MEMo p=0.24), associated with higher mutation density (Mann-Whitney p=0.022, attenuated after age adjustment) and enriched among high-risk patients (Fisher’s exact p=0.018).
- TERT promoter mutations. 36/384 (9.4%) informative tumors — 27 (7.0%) C228T, 1 (0.3%) C228A, 8 (2.1%) C250T — present across all histological types. Associated with older age, higher MACIS scores (Kruskal-Wallis p=2.6×10⁻⁹ for age, p=1.3×10⁻¹¹ for MACIS), higher risk of recurrence (Fisher’s exact p=7×10⁻⁸), and lower thyroid differentiation score (TDS) (Kruskal-Wallis p=4.2×10⁻⁵). Associations persisted within BRAFV600E tumors.
- Driver clonality. Using [[../methods/absolute.md|ABSOLUTE]], cancer cell fractions (CCF) for BRAF, NRAS, HRAS, KRAS, and EIF1AX driver mutations were uniformly high — all driver mutations are largely clonal (Figure 2D). Targeted validation confirmed 264/265 (99.6%) driver-gene mutations and 49/54 (91%) random mutations (overall validation rate 96%).
- Gene fusions identified in 74/484 (15.3%) cases, mutually exclusive with BRAF/RAS/EIF1AX mutations (Fisher’s exact p=4.9×10⁻⁴³). Fusion-positive tumors associated with younger age (Wilcoxon rank-sum p=0.005) but not with recurrence risk (p=0.55):
- RET fusions: 33/484 (6.8%); 4 novel RET fusions identified that preserved the kinase domain.
- [[../genes/BRAF.md|BRAF]] fusions: 13/484 (2.7%) with diverse partners including SND1/BRAF (3 tumors) and MKRN1/BRAF.
- PAX8/PPARG fusions: 4/484 (0.8%).
- ETV6/NTRK3 and RBPMS/NTRK3 fusions: 6/484 (1.2%) combined.
- THADA fusions: 6/484 (1.2%).
- ALK fusions (incl. EML4/ALK): 4/484 (0.8%).
- FGFR2 fusions: 2 cases; non-recurrent MET and LTK fusions: 2 cases.
- Somatic copy number alterations. SCNAs identified in 135/495 (27.2%) informative tumors and enriched in driver-mutation-negative cases (Fisher’s exact p=4.4×10⁻⁴). Unsupervised clustering of arm-level events defined 4 classes: SCNA-quiet (72.9%), SCNA-22q-del (9.9%, NF2 and CHEK2 lost — 70 tumors with 22q loss, 5 with CHEK2 mutation, 4 cases overlapping, p=0.0035), SCNA-low-1q-amp (14.8%, enriched for TCV p<0.0001 and BRAF p<0.05, with higher MACIS p<0.0001), and SCNA-high (2.4%).
- ‘Dark matter’ reduction. Starting from 402 exome-sequenced cases: SSNVs explained 299 (73.6%); fusions raised this to 358 (89.0%); focal deletions (2 PTEN, 1 BRAF) to 361 (89.8%); arm-level SCNAs in 27 mostly-FV cases brought it to ~96.5%. Only 14 (3.5%) cases remained as ‘dark matter’, down from a historical ~25%. Including additional candidates (APC, ATM, NF1, SPOP, chromatin remodeling genes, potential fusions), putative drivers were identified in 397/402 (98.8%) PTCs.
- BRAFV600E-RAS Score (BRS). A 71-gene expression signature derived from BRAF- vs. RAS-mutant tumors yielded a continuous score (−1 BVL to +1 RL). Used to characterize less-common alterations: all non-V600E BRAF mutations (K601E, splice-site, indels) were RL; all BRAF fusions were BVL; 4/6 EIF1AX mutations were RL; PAX8/PPARG fusions were RL; nearly all RET fusions were weakly BVL; NTRK1/3 and ALK fusions were largely neutral.
- Thyroid Differentiation Score (TDS). 16-gene metric of thyroid metabolism/function genes. Strongly correlated with BRS (Spearman 0.78, p=3.1×10⁻⁸⁰). Within the BRAFV600E cohort, TDS varied widely and correlated with histological grade (Kruskal-Wallis p=4×10⁻⁶), risk (p=2×10⁻⁵), and MACIS (Spearman p=1.3×10⁻⁶), but only weakly with tumor purity (p=1.5×10⁻³).
- Differential signaling. BVL-PTCs (BRAFV600E-driven) had high MAPK / ERK activation (52-gene MEK-inhibition signature correlated with BRS) and increased DUSP gene expression. RL-PTCs had concurrent PI3K/AKT and MAPK activation (latter through c-RAF phosphorylation), and significantly higher phosphorylation of p90RSK, robust TSC2 inhibition (likely mTOR-inducing), BAD phosphorylation, and BCL2 over-expression (anti-apoptotic). TieDIE highlighted RHEB as a contributing factor.
- Molecular classification. Unsupervised clustering of mRNA, miRNA, methylation and protein data produced 5/6/4/4 subtypes respectively, all consistent with a BVL-vs-RL meta-cluster split. A ‘tall cell-like’ mRNA-cluster 5 (largely embedded in miR-cluster 6, Fisher’s exact p=1.3×10⁻³⁶) contained 74% of TCVs, had the highest BRAFV600E mutation frequency (78%), lowest BRS and TDS values, and was associated with more advanced stage and higher risk.
- OncomiRs and tumor suppressor miRs. miR-21 was the most negatively TDS-correlated miR (in both the whole cohort and the BRAFV600E subcohort). miR-146b expression was similarly associated with low BRS/TDS and DNA-methylation variation; miR-204 was lost in low-BRS, low-TDS BVL-PTC sub-clusters. miR-21 and miR-146b are oncomiRs; miR-204 acts as a tumor suppressor.
- Viral pathogens are unlikely contributors. PathSeq and BioBloom Tools identified only 2 HBV-positive and 1 HPV45-positive tumors above the per-million reads thresholds.
Genes & alterations
- [[../genes/BRAF.md|BRAF]] — V600E in 61.7% of tumors (n=248/402); 13/484 (2.7%) BRAF fusions including SND1/BRAF and MKRN1/BRAF. Mutually exclusive with RAS/EIF1AX/fusion-driven cases. BRAFV600E defines the BVL signaling/differentiation class; BRAF fusions also BVL but with diverse activation mechanisms.
- [[../genes/NRAS.md|NRAS]] / [[../genes/HRAS.md|HRAS]] / [[../genes/KRAS.md|KRAS]] — codon 12/61 SSNVs in 52/402 (12.9%) tumors; characterize the follicular variant; drive the RL phenotype with concurrent MAPK/PI3K signaling.
- [[../genes/EIF1AX.md|EIF1AX]] — novel PTC driver, 6/402 (1.5%), q=5.3×10⁻⁸; near-mutually exclusive with MAPK drivers.
- [[../genes/PPM1D.md|PPM1D]] / [[../genes/CHEK2.md|CHEK2]] — DNA-repair-related SMGs, co-occurring with MAPK drivers; CHEK2 mutations associated with 22q loss (p=0.0035).
- TERT — promoter mutations (C228T, C228A, C250T) in 9.4% (36/384) of informative tumors; associated with high-risk, less differentiated, BRAFV600E-positive aggressive PTCs (Kruskal-Wallis p=4.2×10⁻⁵ for TDS).
- [[../genes/RET.md|RET]] — fusions in 6.8% (33/484), most weakly BVL. Four novel kinase-domain-preserving RET fusion partners discovered.
- NTRK1 / [[../genes/NTRK3.md|NTRK3]] — fusions detected; ETV6/NTRK3 and RBPMS/NTRK3 in 1.2% (6/484); fusions were BRS-neutral.
- [[../genes/ALK.md|ALK]] — fusions in 0.8% (4/484), including EML4/ALK, potential targetable alteration.
- PAX8/PPARG — fusions in 0.8% (4/484); RL phenotype.
- PTEN — 2 focal deletions; lost in 5 tumors with 10q23.31 deletions. PI3K/PPARγ pathway mutations (PTEN, AKT1/2, PAX8/PPARG) in 4.5% (18/402).
- PIK3CA — mutations overlap with BRAFV600E in HotNet2 MAPK subnetwork.
- NF2 — within 22q loss region characteristic of SCNA-22q-del FV-enriched subtype.
- TP53 / RB1 / NF1 / [[../genes/NF2.md|NF2]] / MEN1 — tumor suppressor mutations in 15/402 (3.7%).
- APC / ATM / SPOP / KMT2A / ARID1B / KMT2C — additional candidate dark-matter drivers and chromatin-remodeling alterations; 93 mutations across 57 epigenetic regulator genes in 80/402 (20.0%) tumors.
- ZFHX3 (7/402, 1.7%, q=0.79) and BDP1 (5/402, 1.2%, q=0.58) — near-significant putative tumor suppressors.
Clinical implications
- TERT promoter genotyping is supported as a molecular biomarker for identifying high-risk patients within the BRAFV600E cohort, given its strong association with recurrence risk, MACIS score, age, and reduced thyroid differentiation.
- BRAFV600E PTC is heterogeneous. The authors argue that BRAFV600E PTC should not be treated as a homogeneous group in clinical studies; it comprises at least four molecular subtypes with varying differentiation. This may explain mixed reports on the prognostic and predictive value of BRAFV600E alone.
- Targeted therapy opportunities expanded. Confirmation that BRAF and other driver mutations are clonal supports BRAF-inhibitor use in PTC. Novel RET, BRAF (e.g. SND1/BRAF, MKRN1/BRAF) and ALK (including EML4/ALK) fusion partners suggest additional opportunities for kinase-inhibitor therapy.
- Differential signaling has therapeutic implications. BVL-PTCs signal preferentially through MAPK and may respond differently to MEK inhibitors than RL-PTCs, which co-activate MAPK and PI3K/AKT/mTOR. This is consistent with reports of genotype-dependent MEK-inhibitor response (Ho et al., 2013).
- Diagnostic implications. Reducing ‘dark matter’ from 25% to 3.5% expands the panel of driver alterations interrogated by FNA-based molecular testing, supporting more precise patient selection for thyroid surgery and informing the extent of initial surgery (lobectomy vs. total thyroidectomy).
- Proposed reclassification. Authors propose unifying follicular-variant PTC (the RL-PTC group) with follicular carcinomas under a molecular reclassification that better reflects underlying biology, potentially guiding precision-medicine approaches.
- Age as a continuous variable. Strong correlation of mutation density with age supports treating age as a continuous risk variable rather than dichotomizing at 45 years in staging systems.
Limitations & open questions
- The cohort excluded poorly differentiated and undifferentiated/anaplastic thyroid carcinomas, limiting inference about progression to aggressive disease — the authors explicitly note this trade-off.
- Cases of less-common PTC variants (n=9) and 29 without histological annotation reduce statistical power for variant-specific analyses.
- BRAF fusion PCR validation was inconclusive for 3/9 attempted fusions; some novel fusions remain validated only at the sequencing/RNA-seq level.
- The biological significance of the Meth-CpG-Island RL-PTC sub-cluster (associated with higher tumor purity / less lymphocyte infiltration) remains unclear.
- The 27 mostly-FV cases with arm-level SCNAs as putative drivers cannot be assigned a specific driving gene.
- Although the study reduces unexplained PTC cases to 3.5%, the contribution of the candidate dark-matter genes (APC, ATM, NF1, SPOP) requires independent validation.
- The role of PI3K pathway alterations co-occurring with BRAFV600E (e.g. PIK3CA) requires functional validation.
- The framework as outlined applies to PTC as currently defined; whether the BVL/RL meta-cluster framework extends to aggressive thyroid cancers requires separate analysis.
Citations from this paper used in the wiki
- “Here, we describe the genomic landscape of 496 PTCs… extended the set of known PTC driver alterations to include EIF1AX, PPM1D and CHEK2 and diverse gene fusions. These discoveries reduced the fraction of PTC cases with unknown oncogenic driver from 25% to 3.5%.” (Summary)
- “Whole exome DNA sequencing of 402 (of 496) tumor/normal pairs (average depth-of-coverage; 97.0× for tumors, 94.9× for normals) showed a low somatic mutation density (0.41 non-synonymous mutations per Mb, on average)” (p. 4)
- “The 248 (61.7%) BRAF mutations were mostly V600E substitution… Somatic single nucleotide variants (SSNVs) were identified in 52 patients (12.9%) within codons 12 and 61 of RAS genes” (p. 4)
- “We identified TERT promoter mutations in 36 (9.4%) of 384 informative tumors, with 27 (7.0%) C228T, 1 (0.3%) C228A and 8 (2.1%) C250T substitutions… (Kruskal-Wallis test p=2.6×10⁻⁹, and p=1.3×10⁻¹¹, Figure 2B, respectively), and high risk of recurrence (Fisher’s exact test p=7×10⁻⁸, Figure 2A)” (pp. 5-6)
- “We identified both known and novel fusions, including new partners of previously described fusions, in 74 (15.3%) of 484 informative cases… Fusions were mutually exclusive with each other and with BRAF, RAS and EIF1AX mutations (Fisher’s exact test p=4.9×10⁻⁴³)” (p. 6)
- “Including these events and considering arm-level SCNAs as drivers, we have identified putative cancer drivers in 397/402 PTCs (98.8%).” (p. 9)
- “We developed a BRAFV600E-RAS score (BRS) to quantify the extent to which the gene expression profile of a given tumor resembles either the BRAFV600E- or RAS-mutant profiles. Using 391 samples with both exome and RNA sequencing data, we compared BRAFV600E-mutated and RAS-mutated tumors to derive a 71-gene signature.” (p. 9)
- “The TDS and BRS measures were highly correlated across all tumors (Spearman = 0.78, P=3.1×10⁻⁸⁰)” (p. 10)
- “BRAFV600E PTC represents a diverse group of tumors, consisting of at least four molecular subtypes, with variable degrees of thyroid differentiation.” (Discussion, p. 14)
This page was processed by crosslinker on 2026-05-14.