A pilot study of bacterial genes with disrupted ORFs reveals a surprising profusion of protein sequence recoding mediated by ribosomal frameshifting and transcriptional realignment

Virag Sharma, Andrew E. Firth, Ivan Antonov, Olivier Fayet, John F. Atkins, Mark Borodovsky, Pavel V. Baranov

Research output: Contribution to journalArticlepeer-review

Abstract

Bacterial genome annotations contain a number of coding sequences (CDSs) that, in spite of reading frame disruptions, encode a single continuous polypeptide. Such disruptions have different origins: sequencing errors, frameshift, or stop codon mutations, as well as instances of utilization of nontriplet decoding. We have extracted over 1,000 CDSs with annotated disruptions and found that about 75% of them can be clustered into 64 groups based on sequence similarity. Analysis of the clusters revealed deep phylogenetic conservation of open reading frame organization as well as the presence of conserved sequence patterns that indicate likely utilization of the nonstandard decoding mechanisms: programmed ribosomal frameshifting (PRF) and programmed transcriptional realignment (PTR). Further enrichment of these clusters with additional homologous nucleotide sequences revealed over 6,000 candidate genes utilizing PRF or PTR. Analysis of the patterns of conservation apparently associated with nontriplet decoding revealed the presence of both previously characterized frameshift-prone sequences and a few novel ones. Since the starting point of our analysis was a set of genes with already annotated disruptions, it is highly plausible that in this study, we have identified only a fraction of all bacterial genes that utilize PRF or PTR. In addition to the identification of a large number of recoded genes, a surprising observation is that nearly half of them are expressed via PTR - a mechanism that, in contrast to PRF, has not yet received substantial attention. The Author 2011. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved.

Original languageEnglish
Pages (from-to)3195-3211
Number of pages17
JournalMolecular Biology and Evolution
Volume28
Issue number11
DOIs
Publication statusPublished - Nov 2011
Externally publishedYes

Keywords

  • frameshift mutation
  • IS element
  • nonstandard decoding
  • programmed ribosomal frameshifting
  • pseudogene
  • recoding
  • RNA editing
  • transcriptional realignment
  • transcriptional slippage

Fingerprint

Dive into the research topics of 'A pilot study of bacterial genes with disrupted ORFs reveals a surprising profusion of protein sequence recoding mediated by ribosomal frameshifting and transcriptional realignment'. Together they form a unique fingerprint.

Cite this