Genomic Landscapes of Epstein-Barr Virus in Pulmonary Lymphoepithelioma-Like Carcinoma

J Virol. 2022 Feb 23;96(4):e0169321. doi: 10.1128/JVI.01693-21. Epub 2021 Dec 15.

Abstract

Epstein-Barr virus (EBV) infection is associated with multiple malignancies, including pulmonary lymphoepithelioma-like carcinoma (pLELC), a particular subtype of primary lung cancer. However, the genomic characteristics of EBV related to pLELC remain unclear. Here, we obtained the whole-genome data set of EBV isolated from 78 pLELC patients and 37 healthy controls using EBV-captured sequencing. Compared with the reference genome (NC_007605), a total of 3,995 variations were detected across pLELC-derived EBV sequences, with the mutational hot spots located in latent genes. Combined with 180 published EBV sequences derived from healthy people in Southern China, we performed a genome-wide association study and identified 32 variations significantly related to pLELC (P < 2.56 × 10-05, Bonferroni correction), with the top signal of single nucleotide polymorphism (SNP) coordinate T7327C (OR = 1.22, P = 2.39 × 10-15) locating in the origin of plasmid replication (OriP). The results of population structure analysis of EBV isolates in East Asian showed the EBV strains derived from pLELC were more similar to those from nasopharyngeal carcinoma (NPC) than other EBV-associated diseases. In addition, typical latency type-II infection were recognized for EBV of pLELC at both transcription and methylation levels. Taken together, we defined the global view of EBV genomic profiles in pLELC patients for the first time, providing new insights to deepening our understanding of this rare EBV-associated primary lung carcinoma. IMPORTANCE Pulmonary lymphoepithelioma-like carcinoma (pLELC) is a rare, distinctive subtype of primary lung cancer closely associated with Epstein-Barr virus (EBV) infection. Here, we gave the first overview of pLELC-derived EBV at the level of genome, methylation and transcription. We obtained the EBV sequences data set from 78 primary pLELC patients, and revealed the sequences diversity across EBV genome and detected variability in known immune epitopes. Genome-wide association analysis combining 217 healthy controls identifies significant variations related to the risk of pLELC. Meanwhile, we characterized the integration landscapes of EBV at the genome-wide level. These results provided new insight for understanding EBV's role in pLELC tumorigenesis.

Keywords: DNA integration; Epstein-Barr virus; genetic variation; genome sequencing; pulmonary lymphoepithelioma-like carcinoma.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Asian People
  • Carcinoma, Non-Small-Cell Lung / virology*
  • China
  • DNA Methylation
  • Epitopes, T-Lymphocyte / genetics
  • Epstein-Barr Virus Infections / virology*
  • Genes, Viral / genetics
  • Genetic Variation
  • Genome, Viral / genetics*
  • Genome-Wide Association Study
  • Herpesvirus 4, Human / genetics*
  • Herpesvirus 4, Human / isolation & purification
  • Humans
  • Lung Neoplasms / virology*
  • Virus Integration
  • Virus Latency / genetics

Substances

  • Epitopes, T-Lymphocyte