Skip to main navigation Skip to search Skip to main content

Integrating gene annotation with orthology inference at scale

  • Zoonomia Consortium
  • Max Planck Institute of Molecular Cell Biology and Genetics
  • Max-Planck-Institute for the Physics of Complex Systems
  • Center for Systems Biology Dresden
  • Goethe University Frankfurt
  • Uppsala University
  • University of Massachusetts Medical School
  • Baskin School of Engineering
  • Department of Molecular Biology
  • Texas A&M University
  • University of Maine
  • University of California
  • Genome British Columbia
  • University of East Anglia
  • Pontifícia Universidade Católica do Rio Grande do Sul
  • University of Nevada, Las Vegas
  • University of Nottingham
  • Texas Tech University
  • American Museum of Natural History
  • University of Southern California
  • Fauna Bio Incorporated
  • LOEWE Centre for Translational Biodiversity Genomics
  • Senckenberg Gesellschaft für Naturforschung
  • Institute for Systems Biology
  • University College Dublin
  • Pompeu Fabra University
  • Carnegie Mellon University
  • University of California at San Francisco
  • Gladstone Institutes
  • Smithsonian Institution
  • St. Petersburg National Research University of Information Technologies, Mechanics and Optics (ITMO)
  • George Mason University
  • Yale University
  • ICREA
  • Department of Cell and Developmental Biology
  • Autonomous University of Barcelona
  • University of Bern
  • Lehigh University
  • Pasqual Maragall Foundation
  • Case Western Reserve University
  • Canadian Museum of Nature
  • Harvard University
  • Leibniz Institute for Zoo and Wildlife Research
  • University of Pittsburgh
  • Chan Zuckerberg Biohub
  • Zoological Society of San Diego
  • University of California at San Diego
  • University of California at Santa Cruz
  • University of California at Riverside
  • University of North Carolina at Chapel Hill
  • Karolinska Institutet
  • Allen Institute for Brain Science

Research output: Contribution to journalArticlepeer-review

Abstract

Annotating coding genes and inferring orthologs are two classical challenges in genomics and evolutionary biology that have traditionally been approached separately, limiting scalability. We present TOGA (Tool to infer Orthologs from Genome Alignments), a method that integrates structural gene annotation and orthology inference. TOGA implements a different paradigm to infer orthologous loci, improves ortholog detection and annotation of conserved genes compared with state-of-the-art methods, and handles even highly fragmented assemblies. TOGA scales to hundreds of genomes, which we demonstrate by applying it to 488 placental mammal and 501 bird assemblies, creating the largest comparative gene resources so far. Additionally, TOGA detects gene losses, enables selection screens, and automatically provides a superior measure of mammalian genome quality. TOGA is a powerful and scalable method to annotate and compare genes in the genomic era.

Original languageEnglish
Article numbereabn3107
JournalScience
Volume380
Issue number6643
DOIs
Publication statusPublished - 28 Apr 2023
Externally publishedYes

Fingerprint

Dive into the research topics of 'Integrating gene annotation with orthology inference at scale'. Together they form a unique fingerprint.

Cite this