Publication Date:
2013-11-26
Description:
RNA structure has critical roles in processes ranging from ligand sensing to the regulation of translation, polyadenylation and splicing. However, a lack of genome-wide in vivo RNA structural data has limited our understanding of how RNA structure regulates gene expression in living cells. Here we present a high-throughput, genome-wide in vivo RNA structure probing method, structure-seq, in which dimethyl sulphate methylation of unprotected adenines and cytosines is identified by next-generation sequencing. Application of this method to Arabidopsis thaliana seedlings yielded the first in vivo genome-wide RNA structure map at nucleotide resolution for any organism, with quantitative structural information across more than 10,000 transcripts. Our analysis reveals a three-nucleotide periodic repeat pattern in the structure of coding regions, as well as a less-structured region immediately upstream of the start codon, and shows that these features are strongly correlated with translation efficiency. We also find patterns of strong and weak secondary structure at sites of alternative polyadenylation, as well as strong secondary structure at 5' splice sites that correlates with unspliced events. Notably, in vivo structures of messenger RNAs annotated for stress responses are poorly predicted in silico, whereas mRNA structures of genes related to cell function maintenance are well predicted. Global comparison of several structural features between these two categories shows that the mRNAs associated with stress responses tend to have more single-strandedness, longer maximal loop length and higher free energy per nucleotide, features that may allow these RNAs to undergo conformational changes in response to environmental conditions. Structure-seq allows the RNA structurome and its biological roles to be interrogated on a genome-wide scale and should be applicable to any organism.〈br /〉〈span class="detail_caption"〉Notes: 〈/span〉Ding, Yiliang -- Tang, Yin -- Kwok, Chun Kit -- Zhang, Yu -- Bevilacqua, Philip C -- Assmann, Sarah M -- England -- Nature. 2014 Jan 30;505(7485):696-700. doi: 10.1038/nature12756. Epub 2013 Nov 24.〈br /〉〈span class="detail_caption"〉Author address: 〈/span〉1] Department of Biology, Pennsylvania State University, University Park, Pennsylvania 16802, USA [2] Department of Chemistry, Pennsylvania State University, University Park, Pennsylvania 16802, USA [3] Center for RNA Molecular Biology, Pennsylvania State University, University Park, Pennsylvania 16802, USA [4]. ; 1] Department of Biology, Pennsylvania State University, University Park, Pennsylvania 16802, USA [2] Center for RNA Molecular Biology, Pennsylvania State University, University Park, Pennsylvania 16802, USA [3] Bioinformatics and Genomics Graduate Program, Pennsylvania State University, University Park, Pennsylvania 16802, USA [4]. ; 1] Department of Chemistry, Pennsylvania State University, University Park, Pennsylvania 16802, USA [2] Center for RNA Molecular Biology, Pennsylvania State University, University Park, Pennsylvania 16802, USA [3]. ; 1] Bioinformatics and Genomics Graduate Program, Pennsylvania State University, University Park, Pennsylvania 16802, USA [2] Department of Statistics, Pennsylvania State University, University Park, Pennsylvania 16802, USA. ; 1] Department of Chemistry, Pennsylvania State University, University Park, Pennsylvania 16802, USA [2] Center for RNA Molecular Biology, Pennsylvania State University, University Park, Pennsylvania 16802, USA [3] Plant Biology Graduate Program, Pennsylvania State University, University Park, Pennsylvania 16802, USA. ; 1] Department of Biology, Pennsylvania State University, University Park, Pennsylvania 16802, USA [2] Center for RNA Molecular Biology, Pennsylvania State University, University Park, Pennsylvania 16802, USA [3] Bioinformatics and Genomics Graduate Program, Pennsylvania State University, University Park, Pennsylvania 16802, USA [4] Plant Biology Graduate Program, Pennsylvania State University, University Park, Pennsylvania 16802, USA.〈br /〉〈span class="detail_caption"〉Record origin:〈/span〉 〈a href="http://www.ncbi.nlm.nih.gov/pubmed/24270811" target="_blank"〉PubMed〈/a〉
Keywords:
Arabidopsis/*genetics
;
Base Sequence
;
Codon, Initiator/genetics
;
Computational Biology
;
Genome, Plant/*genetics
;
Molecular Sequence Data
;
*Nucleic Acid Conformation
;
Phylogeny
;
Polyadenylation/genetics
;
Protein Biosynthesis/genetics
;
RNA Splice Sites/genetics
;
RNA, Messenger/chemistry/genetics/metabolism
;
RNA, Plant/analysis/*chemistry/genetics/*metabolism
;
RNA, Ribosomal, 18S/chemistry/genetics/metabolism
;
*Regulatory Sequences, Ribonucleic Acid/genetics
;
Sequence Analysis, RNA
;
Stress, Physiological/genetics
;
Structure-Activity Relationship
Print ISSN:
0028-0836
Electronic ISSN:
1476-4687
Topics:
Biology
,
Chemistry and Pharmacology
,
Medicine
,
Natural Sciences in General
,
Physics
Permalink