Federico Agostini
Science for Life Laboratory, Department of Medical Biochemistry and Biophysics, Karolinska Institutet
Stockholm, Sweden
Tomtebodavägen 23a
171 65 Solna, Sweden
I studied biotechnologies in Italy, obtained my PhD in Barcelona and did my first postdoc in London. During the years, I’ve been working with RNA binding proteins and transcription, but now I’m more focused on understanding how the genome is organised inside the nucleus. To do this, I analyse different sources of data and try to make a sense out of them.
selected publications
- NAT COMMUNRADICL-seq identifies general and cell type–specific principles of genome-wide RNA-chromatin interactionsBonetti, Alessandro, Agostini, Federico, Suzuki, Ana Maria, Hashimoto, Kosuke, Pascarella, Giovanni, Gimenez, Juliette, Roos, Leonie, Nash, Alex J, Ghilotti, Marco, Cameron, Christopher J F, Valentine, Matthew, Medvedeva, Yulia A., Noguchi, Shuhei, Agirre, Eneritz, Kashi, Kaori, Samudyata, , Luginbühl, Joachim, Cazzoli, Riccardo, Agrawal, Saumya, Luscombe, Nicholas M., Blanchette, Mathieu, Kasukawa, Takeya, Hoon, Michiel, Arner, Erik, Lenhard, Boris, Plessy, Charles, Castelo-Branco, Gonçalo, Orlando, Valerio, and Carninci, PieroNature Communications 2020
Mammalian genomes encode tens of thousands of noncoding RNAs. Most noncoding transcripts exhibit nuclear localization and several have been shown to play a role in the regulation of gene expression and chromatin remodeling. To investigate the function of such RNAs, methods to massively map the genomic interacting sites of multiple transcripts have been developed; however, these methods have some limitations. Here, we introduce RNA And DNA Interacting Complexes Ligated and sequenced (RADICL-seq), a technology that maps genome-wide RNA–chromatin interactions in intact nuclei. RADICL-seq is a proximity ligation-based methodology that reduces the bias for nascent transcription, while increasing genomic coverage and unique mapping rate efficiency compared with existing methods. RADICL-seq identifies distinct patterns of genome occupancy for different classes of transcripts as well as cell type–specific RNA-chromatin interactions, and highlights the role of transcription in the establishment of chromatin structure.
- BIORXIVIntergenic RNA mainly derives from nascent transcripts of known genesAgostini, Federico, Zagalak, Julian A, Attig, Jan, Ule, Jernej, Luscombe, Nicholas M, Federico, Agostini, Julian, Zagalak, Jan, Attig, Jernej, Ule, and M, Luscombe NicholasbioRxiv 2020
Background: Eukaryotic genomes undergo pervasive transcription, leading to the production of many types of stable and unstable RNAs. Transcription is not restricted to regions with annotated gene features but includes almost any genomic context. Currently, the source and function of most RNAs originating from intergenic regions in the human genome remains unclear. Results: We hypothesised that many intergenic RNA can be ascribed to the presence of as-yet unannotated genes or the 'fuzzy' transcription of known genes that extends beyond the annotated boundaries. To elucidate the contributions of these two sources, we assembled a dataset of >2.5 billion publicly available RNA-seq reads across 5 human cell lines and multiple cellular compartments to annotate transcriptional units in the human genome. About 80% of transcripts from unannotated intergenic regions can be attributed to the fuzzy transcription of existing genes; the remaining transcripts originate mainly from putative long non-coding RNA loci that are rarely spliced. We validated the transcriptional activity of these intergenic RNA using independent measurements, including transcriptional start sites, chromatin signatures, and genomic occupancies of RNA polymerase II in various phosphorylation states. We also analysed the nuclear localisation and sensitivities of intergenic transcripts to nucleases to illustrate that they tend to be rapidly degraded either 'on-chromatin' by XRN2 or 'off-chromatin' by the exosome. Conclusions: We provide a curated atlas of intergenic RNAs that distinguishes between alternative processing of well annotated genes from independent transcriptional units based on the combined analysis of chromatin signatures, nuclear RNA localisation and degradation pathways.
- NAT BIOTECHNOLGPSeq reveals the radial organization of chromatin in the cell nucleusGirelli, Gabriele, Custodio, Joaquin, Kallas, Tomasz, Agostini, Federico, Wernersson, Erik, Spanjaard, Bastiaan, Mota, Ana, Kolbeinsdottir, Solrun, Gelali, Eleni, Crosetto, Nicola, and Bienko, MagdaNature Biotechnology 2020
With the exception of lamina-associated domains, the radial organization of chromatin in mammalian cells remains largely unexplored. Here we describe genomic loci positioning by sequencing (GPSeq), a genome-wide method for inferring distances to the nuclear lamina all along the nuclear radius. GPSeq relies on gradual restriction digestion of chromatin from the nuclear lamina toward the nucleus center, followed by sequencing of the generated cut sites. Using GPSeq, we mapped the radial organization of the human genome at 100-kb resolution, which revealed radial patterns of genomic and epigenomic features and gene expression, as well as A and B subcompartments. By combining radial information with chromosome contact frequencies measured by Hi-C, we substantially improved the accuracy of whole-genome structure modeling. Finally, we charted the radial topography of DNA double-strand breaks, germline variants and cancer mutations and found that they have distinctive radial arrangements in A and B subcompartments. We conclude that GPSeq can reveal fundamental aspects of genome architecture.
- CELLHeteromeric RNP Assembly at LINEs Controls Lineage-Specific RNA ProcessingAttig, Jan, Agostini, Federico, Gooding, Clare, Chakrabarti, Anob M, Singh, Aarti, Haberman, Nejc, Zagalak, Julian A, Emmett, Warren, Smith, Christopher W.J., Luscombe, Nicholas M, and Ule, JernejCell 2018
Long mammalian introns make it challenging for the RNA processing machinery to identify exons accurately. We find that LINE-derived sequences (LINEs) contribute to this selection by recruiting dozens of RNA-binding proteins (RBPs) to introns. This includes MATR3, which promotes binding of PTBP1 to multivalent binding sites within LINEs. Both RBPs repress splicing and 3′ end processing within and around LINEs. Notably, repressive RBPs preferentially bind to evolutionarily young LINEs, which are located far from exons. These RBPs insulate the LINEs and the surrounding intronic regions from RNA processing. Upon evolutionary divergence, changes in RNA motifs within LINEs lead to gradual loss of their insulation. Hence, older LINEs are located closer to exons, are a common source of tissue-specific exons, and increasingly bind to RBPs that enhance RNA processing. Thus, LINEs are hubs for the assembly of repressive RBPs and also contribute to the evolution of new, lineage-specific transcripts in mammals.