Identification and characterization of novel conserved RNA structures in Drosophila

Research output: Contribution to journalJournal articleResearchpeer-review

Standard

Identification and characterization of novel conserved RNA structures in Drosophila. / Kirsch, Rebecca; Seemann, Stefan E; Ruzzo, Walter L; Cohen, Stephen M; Stadler, Peter F; Gorodkin, Jan.

In: BMC Genomics, Vol. 19, 899, 2018.

Research output: Contribution to journalJournal articleResearchpeer-review

Harvard

Kirsch, R, Seemann, SE, Ruzzo, WL, Cohen, SM, Stadler, PF & Gorodkin, J 2018, 'Identification and characterization of novel conserved RNA structures in Drosophila', BMC Genomics, vol. 19, 899. https://doi.org/10.1186/s12864-018-5234-4

APA

Kirsch, R., Seemann, S. E., Ruzzo, W. L., Cohen, S. M., Stadler, P. F., & Gorodkin, J. (2018). Identification and characterization of novel conserved RNA structures in Drosophila. BMC Genomics, 19, [899]. https://doi.org/10.1186/s12864-018-5234-4

Vancouver

Kirsch R, Seemann SE, Ruzzo WL, Cohen SM, Stadler PF, Gorodkin J. Identification and characterization of novel conserved RNA structures in Drosophila. BMC Genomics. 2018;19. 899. https://doi.org/10.1186/s12864-018-5234-4

Author

Kirsch, Rebecca ; Seemann, Stefan E ; Ruzzo, Walter L ; Cohen, Stephen M ; Stadler, Peter F ; Gorodkin, Jan. / Identification and characterization of novel conserved RNA structures in Drosophila. In: BMC Genomics. 2018 ; Vol. 19.

Bibtex

@article{b47449c315774351be3c643d7c0462a2,
title = "Identification and characterization of novel conserved RNA structures in Drosophila",
abstract = "BACKGROUND: Comparative genomics approaches have facilitated the discovery of many novel non-coding and structured RNAs (ncRNAs). The increasing availability of related genomes now makes it possible to systematically search for compensatory base changes - and thus for conserved secondary structures - even in genomic regions that are poorly alignable in the primary sequence. The wealth of available transcriptome data can add valuable insight into expression and possible function for new ncRNA candidates. Earlier work identifying ncRNAs in Drosophila melanogaster made use of sequence-based alignments and employed a sliding window approach, inevitably biasing identification toward RNAs encoded in the more conserved parts of the genome.RESULTS: To search for conserved RNA structures (CRSs) that may not be highly conserved in sequence and to assess the expression of CRSs, we conducted a genome-wide structural alignment screen of 27 insect genomes including D. melanogaster and integrated this with an extensive set of tiling array data. The structural alignment screen revealed ∼30,000 novel candidate CRSs at an estimated false discovery rate of less than 10%. With more than one quarter of all individual CRS motifs showing sequence identities below 60%, the predicted CRSs largely complement the findings of sliding window approaches applied previously. While a sixth of the CRSs were ubiquitously expressed, we found that most were expressed in specific developmental stages or cell lines. Notably, most statistically significant enrichment of CRSs were observed in pupae, mainly in exons of untranslated regions, promotors, enhancers, and long ncRNAs. Interestingly, cell lines were found to express a different set of CRSs than were found in vivo. Only a small fraction of intergenic CRSs were co-expressed with the adjacent protein coding genes, which suggests that most intergenic CRSs are independent genetic units.CONCLUSIONS: This study provides a more comprehensive view of the ncRNA transcriptome in fly as well as evidence for differential expression of CRSs during development and in cell lines.",
keywords = "CMfinder, Development, Drosophila melanogaster, Gene expression, Non-coding RNA, RNA secondary structure prediction",
author = "Rebecca Kirsch and Seemann, {Stefan E} and Ruzzo, {Walter L} and Cohen, {Stephen M} and Stadler, {Peter F} and Jan Gorodkin",
year = "2018",
doi = "10.1186/s12864-018-5234-4",
language = "English",
volume = "19",
journal = "BMC Genomics",
issn = "1471-2164",
publisher = "BioMed Central Ltd.",

}

RIS

TY - JOUR

T1 - Identification and characterization of novel conserved RNA structures in Drosophila

AU - Kirsch, Rebecca

AU - Seemann, Stefan E

AU - Ruzzo, Walter L

AU - Cohen, Stephen M

AU - Stadler, Peter F

AU - Gorodkin, Jan

PY - 2018

Y1 - 2018

N2 - BACKGROUND: Comparative genomics approaches have facilitated the discovery of many novel non-coding and structured RNAs (ncRNAs). The increasing availability of related genomes now makes it possible to systematically search for compensatory base changes - and thus for conserved secondary structures - even in genomic regions that are poorly alignable in the primary sequence. The wealth of available transcriptome data can add valuable insight into expression and possible function for new ncRNA candidates. Earlier work identifying ncRNAs in Drosophila melanogaster made use of sequence-based alignments and employed a sliding window approach, inevitably biasing identification toward RNAs encoded in the more conserved parts of the genome.RESULTS: To search for conserved RNA structures (CRSs) that may not be highly conserved in sequence and to assess the expression of CRSs, we conducted a genome-wide structural alignment screen of 27 insect genomes including D. melanogaster and integrated this with an extensive set of tiling array data. The structural alignment screen revealed ∼30,000 novel candidate CRSs at an estimated false discovery rate of less than 10%. With more than one quarter of all individual CRS motifs showing sequence identities below 60%, the predicted CRSs largely complement the findings of sliding window approaches applied previously. While a sixth of the CRSs were ubiquitously expressed, we found that most were expressed in specific developmental stages or cell lines. Notably, most statistically significant enrichment of CRSs were observed in pupae, mainly in exons of untranslated regions, promotors, enhancers, and long ncRNAs. Interestingly, cell lines were found to express a different set of CRSs than were found in vivo. Only a small fraction of intergenic CRSs were co-expressed with the adjacent protein coding genes, which suggests that most intergenic CRSs are independent genetic units.CONCLUSIONS: This study provides a more comprehensive view of the ncRNA transcriptome in fly as well as evidence for differential expression of CRSs during development and in cell lines.

AB - BACKGROUND: Comparative genomics approaches have facilitated the discovery of many novel non-coding and structured RNAs (ncRNAs). The increasing availability of related genomes now makes it possible to systematically search for compensatory base changes - and thus for conserved secondary structures - even in genomic regions that are poorly alignable in the primary sequence. The wealth of available transcriptome data can add valuable insight into expression and possible function for new ncRNA candidates. Earlier work identifying ncRNAs in Drosophila melanogaster made use of sequence-based alignments and employed a sliding window approach, inevitably biasing identification toward RNAs encoded in the more conserved parts of the genome.RESULTS: To search for conserved RNA structures (CRSs) that may not be highly conserved in sequence and to assess the expression of CRSs, we conducted a genome-wide structural alignment screen of 27 insect genomes including D. melanogaster and integrated this with an extensive set of tiling array data. The structural alignment screen revealed ∼30,000 novel candidate CRSs at an estimated false discovery rate of less than 10%. With more than one quarter of all individual CRS motifs showing sequence identities below 60%, the predicted CRSs largely complement the findings of sliding window approaches applied previously. While a sixth of the CRSs were ubiquitously expressed, we found that most were expressed in specific developmental stages or cell lines. Notably, most statistically significant enrichment of CRSs were observed in pupae, mainly in exons of untranslated regions, promotors, enhancers, and long ncRNAs. Interestingly, cell lines were found to express a different set of CRSs than were found in vivo. Only a small fraction of intergenic CRSs were co-expressed with the adjacent protein coding genes, which suggests that most intergenic CRSs are independent genetic units.CONCLUSIONS: This study provides a more comprehensive view of the ncRNA transcriptome in fly as well as evidence for differential expression of CRSs during development and in cell lines.

KW - CMfinder

KW - Development

KW - Drosophila melanogaster

KW - Gene expression

KW - Non-coding RNA

KW - RNA secondary structure prediction

UR - https://www.nature.com/articles/s41467-018-08108-7

U2 - 10.1186/s12864-018-5234-4

DO - 10.1186/s12864-018-5234-4

M3 - Journal article

C2 - 30537930

VL - 19

JO - BMC Genomics

JF - BMC Genomics

SN - 1471-2164

M1 - 899

ER -

ID: 210016480