Cookies Policy
The website need some cookies and similar means to function. If you permit us, we will use those means to collect data on your visits for aggregated statistics to improve our service. Find out More
Accept Reject
  • Menu
Publications

Publications by LIAAD

2017

Large-Scale Uniform Analysis of Cancer Whole Genomes in Multiple Computing Environments

Authors
Yung, CK; O’Connor, BD; Yakneen, S; Zhang, J; Ellrott, K; Kleinheinz, K; Miyoshi, N; Raine, KM; Royo, R; Saksena, GB; Schlesner, M; Shorser, SI; Vazquez, M; Weischenfeldt, J; Yuen, D; Butler, AP; Davis-Dusenbery, BN; Eils, R; Ferretti, V; Grossman, RL; Harismendy, O; Kim, Y; Nakagawa, H; Newhouse, SJ; Torrents, D; Stein, LD; Rodriguez, JB; Boroevich, KA; Boyce, R; Brooks, AN; Buchanan, A; Buchhalter, I; Byrne, NJ; Cafferkey, A; Campbell, PJ; Chen, Z; Cho, S; Choi, W; Clapham, P; De La Vega, FM; Demeulemeester, J; Dow, MT; Dursi, LJ; Eils, J; Farcas, C; Favero, F; Fayzullaev, N; Flicek, P; Fonseca, NA; Gelpi, JL; Getz, G; Gibson, B; Heinold, MC; Hess, JM; Hofmann, O; Hong, JH; Hudson, TJ; Huebschmann, D; Hutter, B; Hutter, CM; Imoto, S; Ivkovic, S; Jeon, S; Jiao, W; Jung, J; Kabbe, R; Kahles, A; Kerssemakers, J; Kim, H; Kim, H; Kim, J; Korbel, JO; Koscher, M; Koures, A; Kovacevic, M; Lawerenz, C; Leshchiner, I; Livitz, DG; Mihaiescu, GL; Mijalkovic, S; Lazic, AM; Miyano, S; Nahal, HK; Nastic, M; Nicholson, J; Ocana, D; Ohi, K; Ohno-Machado, L; Omberg, L; Francis Ouellette, B; Paramasivam, N; Perry, MD; Pihl, TD; Prinz, M; Puiggròs, M; Radovic, P; Rheinbay, E; Rosenberg, MW; Short, C; Sofia, HJ; Spring, J; Struck, AJ; Tiao, G; Tijanic, N; Loo, PV; Vicente, D; Wala, JA; Wang, Z; Werner, J; Williams, A; Woo, Y; Wright, AJ; Xiang, Q;

Publication

Abstract
AbstractThe International Cancer Genome Consortium (ICGC)’s Pan-Cancer Analysis of Whole Genomes (PCAWG) project aimed to categorize somatic and germline variations in both coding and non-coding regions in over 2,800 cancer patients. To provide this dataset to the research working groups for downstream analysis, the PCAWG Technical Working Group marshalled ~800TB of sequencing data from distributed geographical locations; developed portable software for uniform alignment, variant calling, artifact filtering and variant merging; performed the analysis in a geographically and technologically disparate collection of compute environments; and disseminated high-quality validated consensus variants to the working groups. The PCAWG dataset has been mirrored to multiple repositories and can be located using the ICGC Data Portal. The PCAWG workflows are also available as Docker images through Dockstore enabling researchers to replicate our analysis on their own data.

2017

Transcription factor activities enhance markers of drug response in cancer

Authors
Garcia-Alonso, L; Iorio, F; Matchan, A; Fonseca, N; Jaaks, P; Falcone, F; Bignell, G; McDade, SS; Garnett, MJ; Saez-Rodriguez, J;

Publication

Abstract
AbstractTranscriptional dysregulation is a key feature of cancer. Transcription factors (TFs) are the main link between signalling pathways and the transcriptional regulatory machinery of the cell, positioning them as key oncogenic inductors and therefore potential targets of therapeutic intervention. We implemented a computational pipeline to infer TF regulatory activities from basal gene expression and applied it to publicly available and newly generated RNA-seq data from a collection of 1,010 cancer cell lines and 9,250 primary tumors. We show that the predicted TF activities recapitulate known mechanisms of transcriptional dysregulation in cancer and dissect mutant-specific effects in driver genes. Importantly, we show the potential for predicted TF activities to be used as markers of sensitivity to the inhibition of their upstream regulators. Furthermore, combining these inferred activities with existing pharmacogenomic markers significantly improves the stratification of sensitive and resistant cell lines for several compounds. Our approach provides a framework to link driver genomic alterations with transcriptional dysregulation that helps to predict drug sensitivity in cancer and to dissect its mechanistic determinants.

2017

Whole genome and RNA sequencing of 1,220 cancers reveals hundreds of genes deregulated by rearrangement of cis-regulatory elements

Authors
Zhang, Y; Chen, F; Fonseca, NA; He, Y; Fujita, M; Nakagawa, H; Zhang, Z; Brazma, A; Creighton, CJ;

Publication

Abstract
AbstractUsing a dataset of somatic Structural Variants (SVs) in cancers from 2658 patients—1220 with corresponding gene expression data—we identified hundreds of genes for which the nearby presence (within 100kb) of an SV breakpoint was associated with altered expression. For the vast majority of these genes, expression was increased rather than decreased with corresponding SV event. Well-known up-regulated cancer-associated genes impacted by this phenomenon included TERT, MDM2, CDK4, ERBB2, CD274, PDCD1LG2, and IGF2. SVs upstream of TERT involved ~3% of cancer cases and were most frequent in liver-biliary, melanoma, sarcoma, stomach, and kidney cancers. SVs associated with up-regulation of PD1 and PDL1 genes involved ~1% of non-amplified cases. For many genes, SVs were significantly associated with either increased numbers or greater proximity of enhancer regulatory elements near the gene. DNA methylation near the gene promoter was often increased with nearby SV breakpoint, which may involve inactivation of repressor elements.AbbreviationsPCAWGthe Pan-Cancer Analysis of Whole Genomes projectSVStructural Variant

2017

Online resources for PCAWG data exploration, visualization, and discovery

Authors
Goldman, M; Zhang, J; Fonseca, NA; Xiang, Q; Craft, B; Piñeiro-Yáñez, E; O'Connor, B; Bazant, W; Barrera, E; Muñoz, A; Petryszak, R; Füllgrabe, A; Al-Shahrour, F; Keays, M; Haussler, D; Weinstein, J; Huber, W; Valencia, A; Papatheodorou, I; Zhu, J; Ferreti, V; Vazquez, M; PCAWG-12 Working Group,; PCAWG Network,;

Publication

Abstract
The Pan-Cancer Analysis of Whole Genomes (PCAWG) cohort provides a large, uniformly-analyzed, whole-genome dataset. The PCAWG Landing Page (http://docs.icgc.org/pcawg) focuses on four biologist-friendly, publicly-available web tools for exploring this data: The ICGC Data Portal, UCSC Xena, Expression Atlas, and PCAWG-Scout. They enable researchers to dynamically query the complex genomics data, explore tumors' molecular landscapes, and include external information to facilitate interpretation.

2017

Assessing the Gene Regulatory Landscape in 1,188 Human Tumors

Authors
Calabrese, C; Lehmann, K; Urban, L; Liu, F; Erkek, S; Fonseca, N; Kahles, A; Kilpinen-Barrett, LH; Markowski, J; Waszak, S; Korbel, J; Zhang, Z; Brazma, A; Raetsch, G; Schwarz, R; Stegle, O; PCAWG-3,;

Publication

Abstract
Cancer is characterised by somatic genetic variation, but the effect of the majority of non-coding somatic variants and the interface with the germline genome are still unknown. We analysed the whole genome and RNA-seq data from 1,188 human cancer patients as provided by the Pan-cancer Analysis of Whole Genomes (PCAWG) project to map cis expression quantitative trait loci of somatic and germline variation and to uncover the causes of allele-specific expression patterns in human cancers. The availability of the first large-scale dataset with both whole genome and gene expression data enabled us to uncover the effects of the non-coding variation on cancer. In addition to confirming known regulatory effects, we identified novel associations between somatic variation and expression dysregulation, in particular in distal regulatory elements. Finally, we uncovered links between somatic mutational signatures and gene expression changes, including TERT and LMO2, and we explained the inherited risk factors in APOBEC-related mutational processes. This work represents the first large-scale assessment of the effects of both germline and somatic genetic variation on gene expression in cancer and creates a valuable resource cataloguing these effects.

2017

Comprehensive genome and transcriptome analysis reveals genetic basis for gene fusions in cancer

Authors
Fonseca, NA; He, Y; Greger, L; Brazma, A; Zhang, Z; - PCAWG-3,;

Publication

Abstract
Gene fusions are an important class of cancer-driving events with therapeutic and diagnostic values, yet their underlying genetic mechanisms have not been systematically characterized. Here by combining RNA and whole genome DNA sequencing data from 1188 donors across 27 cancer types we obtained a list of 3297 high-confidence tumour-specific gene fusions, 82% of which had structural variant (SV) support and 2372 of which were novel. Such a large collection of RNA and DNA alterations provides the first opportunity to systematically classify the gene fusions at a mechanistic level. While many could be explained by single SVs, numerous fusions involved series of structural rearrangements and thus are composite fusions. We discovered 75 fusions of a novel class of inter-chromosomal composite fusions, termed bridged fusions, in which a third genomic location bridged two different genes. In addition, we identified 522 fusions involving non-coding genes and 157 ORF-retaining fusions, in which the complete open reading frame of one gene was fused to the UTR region of another. Although only a small proportion (5%) of the discovered fusions were recurrent, we found a set of highly recurrent fusion partner genes, which exhibited strong 5' or 3' bias and were significantly enriched for cancer genes. Our findings broaden the view of the gene fusion landscape and reveal the general properties of genetic alterations underlying gene fusions for the first time.

  • 193
  • 430