Increased alignment sensitivity improves the usage of genome alignments for comparative gene annotation

Virag Sharma, Michael Hiller

Research output: Contribution to journalArticlepeer-review

Abstract

Genome alignments provide a powerful basis to transfer gene annotations from a well-annotated reference genome to many other aligned genomes. The completeness of these annotations crucially depends on the sensitivity of the underlying genome alignment. Here, we investigated the impact of the genome alignment parameters and found that parameters with a higher sensitivity allow the detection of thousands of novel alignments between orthologous exons that have been missed before. In particular, comparisons between species separated by an evolutionary distance of >0.75 substitutions per neutral site, like human and other nonplacental vertebrates, benefit from increased sensitivity. To systematically test if increased sensitivity improves comparative gene annotations, we built a multiple alignment of 144 vertebrate genomes and used this alignment to map human genes to the other 143 vertebrates with CESAR. We found that higher alignment sensitivity substantially improves the completeness of comparative gene annotations by adding on average 2382 and 7440 novel exons and 117 and 317 novel genes for mammalian and non-mammalian species, respectively. Our results suggest a more sensitive alignment strategy that should generally be used for genome alignments between distantly-related species. Our 144- vertebrate genome alignment and the comparative gene annotations (https://bds.mpi-cbg.de/hillerlab/ 144VertebrateAlignment CESAR/) are a valuable resource for comparative genomics.

Original languageEnglish
Pages (from-to)8369-8377
Number of pages9
JournalNucleic Acids Research
Volume45
Issue number14
DOIs
Publication statusPublished - 21 Aug 2017
Externally publishedYes

Fingerprint

Dive into the research topics of 'Increased alignment sensitivity improves the usage of genome alignments for comparative gene annotation'. Together they form a unique fingerprint.

Cite this