Combined SNPs sequencing and allele specific proteomics capture reveal functional causality underpinning the 2p25 prostate cancer susceptibility locus
PMID: 28927585 · DOI: 10.1038/s41467-025-64005-w · Journal: Nature Communications (2025)
TL;DR
Dong et al. used their high-throughput SNPs-seq screen across 374 prostate-cancer GWAS risk loci to nominate rs4519489 in the 2p25 locus (intron of NOL10) as a functional SNP showing strong allele-specific protein binding. The risk A allele binds the transcription factor USF1 preferentially, upregulating NOL10. Across multiple PRAD cohorts (CPGEA, TCGA PRAD, MSKCC, FHCRC, SU2C, Grasso/U-Michigan, DKFZ, FUSCC), elevated NOL10 expression — and the rs4519489 A/A genotype — track with higher tumor stage, lymph-node metastasis, Gleason score, biochemical recurrence and shorter survival. NOL10 knockdown in DU145, 22Rv1, LNCaP and PC3 cells, in xenografts, and in Ptenpc-/-;Lkb1pc-/- murine organoids reduces proliferation, migration, EMT and tumor growth via cell-cycle (E2F / G2M) gene downregulation. The rs4519489–USF1–NOL10 axis is proposed as a prognostic biomarker and therapeutic target for prostate cancer (PMID:28927585).
Cohort & data
- Disease: Prostate adenocarcinoma (PRAD).
- Discovery / functional cohorts (genotype, eQTL, ChIP-seq, expression, survival):
- CPGEA — Chinese Prostate Cancer Genome and Epigenome Atlas (n≈134 tumors used in eQTL; 136 in CCS analysis; not in cBioPortal).
- prad_tcga — TCGA PRAD (n=497 tumors, 52 normals; 144 staged tumors; n=409 with lymph-node data; n=122 BCR; n=492 in BCR meta-analysis).
- prad_mskcc — Taylor 2010 MSKCC cohort (GSE21034, n=140 in BCR meta-analysis; n=150 in NOL10–EMT/AR signature correlation).
- prad_fhcrc — FHCRC cohort (NOL10 CCS / CCP correlation).
- prad_su2c_2019 — SU2C/PCF Dream Team metastatic CRPC (PNAS 2019, n=266 polyA samples).
- prad_mich — Grasso et al. University of Michigan cohort (GSE35988, used for NOL10/OS Kaplan-Meier and metastasis analysis; n=28 normal / 59 primary / 35 metastasis).
- DKFZ Heidelberg cohort (n=105 in BCR meta-analysis; not in cBioPortal as a registered prad_* study).
- FUSCC (Fudan University Shanghai Cancer Center) — RT-qPCR + Western blot validation in matched tumor/normal pairs.
- GSE62872 (n=160 normal / 264 tumor), GSE10645 (Chandran cohort, n=10 tumor / 21 metastasis), Stockholm CamCap (n=213), NPC, SMMU EU 2017 (n=65) cohorts for cross-cohort NOL10/USF1 expression and EMT/AR signature analysis.
- Cell models: LNCaP, DU145, 22Rv1, PC3, VCaP, RM1 (murine), 293T.
- In vivo / organoid models: Subcutaneous xenografts in male nude mice (DU145, PC3); prostate-cancer organoids derived from Ptenpc-/-;Lkb1pc-/- C57BL/6J mice (subcutaneous-xenograft, prostate-organoid-pten-lkb1).
- Assays / panels: SNPs-seq screen of 374 prostate-cancer risk loci; EMSA and super-shift EMSA; ChIP-seq for H3K27ac/H3K4me1/H3K4me3 in cell lines and CPGEA tissues; ChIP-qPCR / ChIP-AS-qPCR; allele-specific luciferase reporter assays; CRISPRi (dCas9 + sgRNAs targeting rs4519489 enhancer); CRISPR base editing (CBE in PC3 PC3 T/T→A/T; AYBE editor in DU145 A/A→T/T); LC-MS/MS proteomics (PWAS-inspired allele-specific TF capture); RNA-seq; RT-qPCR; Sanger sequencing; IHC (Ki67, E-cadherin, Vimentin); GSEA; IGV; Enhancer Element Locator (EEL) motif analysis; GWAS lead-SNP linkage analysis (PMID:28927585).
Key findings
- Functional SNP nomination at 2p25. SNPs-seq across 374 prostate-cancer risk loci flagged rs4519489 (intron of NOL10) as exhibiting biased allelic protein binding, with the A allele binding more strongly than the T allele. rs4519489 is in strong LD with the GWAS lead SNPs rs9287719 (R²=0.67, D’=0.82) and rs1990613 (R²=0.8, D’=0.98) (PMID:28927585).
- Allele-specific binding and enhancer activity. EMSA in 293T and LNCaP showed stronger A-allele binding (specifically displaced by consensus competitor); luciferase reporters in LNCaP, DU145, 22Rv1, PC3 and VCaP (±DHT) gave significantly higher activity for the A allele. Active enhancer marks (H3K27ac, H3K4me1, H3K4me3) were enriched at the rs4519489 locus in cell lines and in CPGEA tumor/normal prostate tissues; allele-specific ChIP-qPCR in heterozygous tissues showed greater H3K4me1/H3K27ac at the A allele (PMID:28927585).
- eQTL: A allele → higher NOL10 mRNA. In CPGEA the A allele significantly correlated with elevated NOL10 expression (P=0.00056, n=18 AA / 62 AT / 54 TT). sQTL analysis in TCGA PRAD and CPGEA showed no genotype-dependent NOL10 isoform usage (PMID:28927585).
- Direct enhancer–gene causality. CRISPRi against the rs4519489 region reduced NOL10 mRNA in 22Rv1 and PC3 (P=9.79E-5, 4.79E-4). CRISPR base-editing of T→A in PC3 (T/T→A/T) increased NOL10 expression (P=9.06E-3); reverse editing A→T in DU145 reduced NOL10 (PMID:28927585).
- NOL10 is overexpressed in prostate cancer and tracks with severity. NOL10 mRNA is elevated in tumor vs normal across CPGEA, TCGA PRAD, GSE62872 and FUSCC; protein up by Western blot. Higher NOL10 correlates with advanced T stage (TCGA PRAD, P=0.0058), lymph-node metastasis (P=0.045), Gleason score (P=0.015), biochemical recurrence (P=0.036), and shorter overall survival in the Grasso cohort (HR=2.20, 95% CI 1.03–4.69, P=0.034) (PMID:28927585).
- rs4519489 genotype is a prognostic factor. A/A patients in TCGA PRAD have higher BCR rates (P=0.0110) and shorter disease-free / progression-free survival (HR=2.40, P=0.019; HR=1.72, P=0.017). Multivariate Cox: rs4519489 (AA vs AT+TT) HR=13.05 (1.16–147.20), P=0.038 for OS (TCGA). Combined risk-A/A genotype + high NOL10 carries the worst OS / PFS (HR=2.79, P=0.015) (PMID:28927585).
- NOL10 is oncogenic in vitro and in vivo. shRNA / siRNA / sgRNA knockdown of NOL10 in DU145, LNCaP, 22Rv1 and PC3 reduced proliferation (CCK8), colony formation, migration and invasion. NOL10 overexpression in DU145 / 22Rv1 increased all four. Subcutaneous DU145 and PC3 xenografts in nude mice with NOL10-knockdown were significantly smaller (volume P=1.64E-2, 4.68E-3; weight P=3.22E-3, 3.38E-2) and showed lower Ki67, lower Vimentin and higher E-cadherin — consistent with reduced EMT. Ptenpc-/-;Lkb1pc-/- murine organoids showed fewer/smaller organoids on NOL10 knockdown (PMID:28927585).
- NOL10 drives cell-cycle programs. GSEA on TCGA PRAD ranked by NOL10 expression scored E2F targets and G2M checkpoint among the top hallmarks. RNA-seq of NOL10-knockdown LNCaP identified 71 downregulated genes enriched in cell cycle (DLGAP5, MCM4, KIF20B, DIAPH3, SUV39H1, CENPE, GINS2, HMGB3, CDC6 — RT-qPCR validated). A NOL10 cell-cycle signature (CCS) correlated with cell-cycle-progression (CCP) score across CPGEA, TCGA PRAD, SU2C, FHCRC, GSE62872. Higher CCS predicted shorter OS (HR=2.45, P=2.01e-5), RFS (HR=2.36, P=5.34e-5) and MFS (HR=5.77, P=0.0132) and was an independent prognostic factor by multivariate Cox in CPGEA (HR=2.86, 95% CI 1.29–6.34, P=0.010). Fixed-effects meta-analysis of BCR across CPGEA / MSKCC / TCGA / DKFZ: HR=2.49 (1.85–3.35), P=1.81e-9 (PMID:28927585).
- USF1 is the allele-specific TF. Allele-specific PWAS-inspired LC-MS/MS proteomics on rs4519489 oligos identified USF1, TBX3 and TFAP4 as A-allele-preferring binders. Enhancer Element Locator (EEL) ranked USF1 as the top motif at rs4519489. Only USF1 (not TBX3 or TFAP4) showed ChIP-qPCR enrichment at the locus; enrichment was stronger in A/A (LNCaP, VCaP) and A/T (22Rv1) than T/T (PC3) cells. ChIP-AS-qPCR and Flag-USF1 super-shift EMSA confirmed preferential binding to the A allele in LNCaP and 22Rv1; in vivo USF1 ChIP-seq on a heterozygous tissue showed input A:T = 1:5 vs USF1-IP A:T = 11:6 (PMID:28927585).
- USF1 is itself oncogenic and regulates NOL10. USF1 sgRNA-knockout in 22Rv1 and shRNA-knockdown in DU145 / PC3 reduced NOL10 protein and mRNA; USF1 overexpression raised NOL10. USF1 mRNA correlates with NOL10 across CPGEA (R=0.637, P=1.32e-25), TCGA PRAD (R=0.57, P=3.31e-13), GTEx, Stockholm CamCap, SMMU and NPC cohorts. High USF1 tracks with advanced T stage (CPGEA P=0.009), lymph-node metastasis (TCGA P=0.001), Gleason, BCR and shorter PFS. USF1 knockdown phenocopies NOL10 loss in proliferation / colony formation / migration / xenograft growth and Ptenpc-/-;Lkb1pc-/- organoid assays (PMID:28927585).
- Synergistic NOL10 + USF1 effect. Co-overexpression in 22Rv1 superadditively boosted proliferation, colony formation and migration. NOL10HiUSF1Hi patients in TCGA had worse BCR-free (HR=2.44, P=0.003) and metastasis-free (HR=2.35, P=0.001) survival than NOL10LoUSF1Lo. ROC analyses across CPGEA, TCGA, SU2C and DKFZ showed the combined NOL10+USF1 score outperformed either gene alone for 1/3/5/10-year survival prediction (PMID:28927585).
- APOBEC mutagenesis at NOL10. APOBEC-associated mutations are modestly enriched within NOL10 vs the global mutational landscape in CPGEA and TCGA PRAD, suggesting APOBEC3A/APOBEC3B-mediated mutagenesis may contribute to NOL10 dysregulation in addition to the inherited rs4519489 risk allele (PMID:28927585).
Genes & alterations
- NOL10 — eQTL target of rs4519489 at 2p25; intronic risk SNP A allele drives higher NOL10 mRNA via enhanced USF1 binding. Overexpressed in PRAD tumors vs normals across multiple cohorts; high expression correlates with stage, lymph-node metastasis, Gleason, BCR and shorter OS. Knockdown abrogates proliferation, colony formation, migration, invasion, EMT and xenograft growth; overexpression promotes them. Acts via cell-cycle regulators (E2F targets, G2M checkpoint: DLGAP5, MCM4, KIF20B, DIAPH3, SUV39H1, CENPE, GINS2, HMGB3, CDC6) (PMID:28927585).
- USF1 — basic helix-loop-helix transcription factor; preferentially binds the A allele of rs4519489 and transactivates NOL10. Highly co-expressed with NOL10 across cohorts; overexpression in PRAD tracks with advanced stage, lymph-node positivity, Gleason and BCR. Knockdown reduces NOL10 protein/mRNA and phenocopies NOL10 loss in proliferation and xenograft assays. Independently of androgen signaling (PMID:28927585).
- TBX3 — A-allele-preferring binder identified by PWAS-style proteomics and EEL motif analysis at rs4519489, but no ChIP-qPCR enrichment at the locus, so not a confirmed regulator in this paper (PMID:28927585).
- TFAP4 — A-allele-preferring binder identified by proteomics and EEL motif analysis at rs4519489; not enriched by ChIP-qPCR at the locus (PMID:28927585).
- APOBEC3A / APOBEC3B — APOBEC mutational signature is modestly enriched in NOL10 in CPGEA and TCGA PRAD vs the genome-wide background, suggesting ectopic APOBEC mutagenesis as a complementary somatic contributor to NOL10 deregulation (PMID:28927585).
- TMPRSS2 / ERG — cited as canonical somatic prostate-cancer fusion in background; cross-referenced to the authors’ prior 17q12/HNF1B work but not directly assayed here (PMID:28927585).
- STK11 (LKB1) — not somatically profiled; used as a co-deletion partner with Pten in the murine prostate organoid model (Ptenpc-/-;Lkb1pc-/-) for in-vivo NOL10 / USF1 knockdown experiments (PMID:28927585).
Clinical implications
- Prognostic biomarker. rs4519489 A/A genotype is an independent predictor of overall survival in TCGA PRAD (Cox HR=13.05, 95% CI 1.16–147.20, P=0.038), with combined high-NOL10 + A/A status further stratifying OS / PFS. The NOL10 cell-cycle signature is an independent BCR-free-survival predictor in CPGEA (HR=2.86, P=0.010), with a fixed-effects meta-analysis HR of 2.49 across CPGEA / MSKCC / TCGA / DKFZ. Combined NOL10 + USF1 expression outperforms either marker alone in time-dependent ROC for 1–10 year survival (PMID:28927585).
- Therapeutic target hypothesis. The authors propose the rs4519489–USF1–NOL10 axis as a candidate therapeutic target — degrading USF1 or NOL10 could blunt cell-cycle (E2F / G2M) drive in A-allele-carrying tumors. No clinical drug, dose, or trial is reported; this is a preclinical hypothesis only (PMID:28927585).
- Genetic stratification. Because the A allele exists in multi-ancestry GWAS at meaningful frequency and is in strong LD with the GWAS lead SNPs rs9287719 and rs1990613, genotyping at 2p25 is proposed as a way to identify men at higher risk of aggressive disease and to enrich for trials targeting the cell-cycle axis (PMID:28927585).
Limitations & open questions
- Inferential limits of the eQTL / survival associations. All clinical associations are observational across heterogeneous cohorts (CPGEA, TCGA PRAD, MSKCC, FHCRC, SU2C, DKFZ, FUSCC, NPC, SMMU, Stockholm CamCap, GSE10645, GSE21034, GSE35988, GSE62872) with differing endpoints (BCR, PFS, OS, MFS) and confounders. The TCGA PRAD multivariate HR for rs4519489 (13.05) has very wide 95% CI (1.16–147.20), reflecting limited A/A counts (PMID:28927585).
- CPGEA and several validation cohorts are not in cBioPortal. This complicates external reproduction; the FUSCC validation panel is small (8 paired tissues for USF1 RT-qPCR; 5 for NOL10).
- Mechanism beyond cell cycle. While E2F / G2M GSEA enrichment is robust and consistent across LNCaP / DU145 / 22Rv1 / PC3 perturbations, direct USF1 ChIP-seq genome-wide is not reported; downstream NOL10 effectors (whether NOL10 acts via ribosome biogenesis, via specific TFs, or non-canonically) are not pinpointed.
- TBX3 and TFAP4 discrepancy. Both score as A-allele-preferring in proteomics and EEL motif analysis but do not show ChIP-qPCR enrichment at the locus — potentially false positives of the proteomics screen, or context-dependent binding not captured by the antibody/conditions used (PMID:28927585).
- APOBEC contribution is modest and correlative. The enrichment is described as modest in both CPGEA and TCGA; the paper does not separate inherited rs4519489 effects from somatic APOBEC effects on NOL10 expression.
- Therapeutic strategy is undefined. No drug, degrader, or inhibitor is tested for USF1 or NOL10; the “therapeutic target” claim is preclinical and aspirational. Whether USF1 inhibition would have acceptable on-target toxicity (USF1 is a broadly expressed bHLH TF with metabolic and other roles) is open.
- PMID / publication-year mismatch (curatorial note). The PMID assigned in this corpus is 28927585, but the extracted PDF (PMC12508145) reports a 2024-submission / 2025-acceptance Nature Communications article (DOI 10.1038/s41467-025-64005-w). Citation metadata in this page reflects the actual paper text; downstream curators should confirm the PMID against PubMed.
Citations from this paper used in the wiki
- “the A allele of rs4519489 had stronger protein binding than the T allele, with significant biased allelic binding (BAB) scores in different samples” (Fig. 1b).
- “The A allele conferred significantly greater luciferase activity compared to the T allele across multiple prostate cancer cell lines (LNCaP, DU145, 22Rv1, PC3, and VCaP), both in the presence and absence of dihydrotestosterone (DHT) treatment” (Fig. 1e).
- “The analysis linked the aggressive prostate cancer-associated A allele of rs4519489 with higher NOL10 mRNA expression” (CPGEA eQTL, Fig. 1f, P=0.00056).
- “Multivariate analysis confirmed that rs4519489 is an independent prognostic factor for overall survival in the TCGA cohort” (Fig. 2n; HR 13.05, CI 1.16–147.20, P=0.038).
- “patients with the A/A genotype and higher NOL10 levels exhibited markedly poorer overall and progression-free survival” (Fig. 2o,p).
- “GSEA … showed NOL10 expression significantly associated with critical cell cycle pathways, notably E2F targets and G2M checkpoint pathways” (Fig. 4a).
- “we discovered that several TFs, notably USF1, TBX3, and TFAP4, showed specific interactions with the A allele” (Fig. 5b, LC-MS/MS PWAS).
- “ChIP-AS-qPCR targeting rs4519489 demonstrated a notably higher enrichment of USF1 at the A allele compared to the T allele” (Fig. 5d).
- “the ratio of A to T read counts from the Input ChIP-seq sample in one prostate specimen was 1:5, while the ratio in the USF1 ChIP-seq sample shifted to 11:6” (Supplementary Fig. 13k).
- “Co-overexpression of USF1 and NOL10 resulted in a significant enhancement of cell proliferation, colony formation, and migration compared to the individual overexpression of either gene alone” (Fig. 7e–g).
- “Targeting the rs4519489-USF1-NOL10 regulatory axis holds promise for innovative therapeutic strategies aimed at curtailing prostate cancer progression and severity” (Discussion).
This page was processed by crosslinker on 2026-05-15.