
ULTRA: A Model Based Tool to Detect Tandem Repeats

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

In biological sequences, tandem repeats consist of tens to hundreds of residues of a repeated pattern, such as atgatgatgatgatg ('atg' repeated), often the result of replication slippage. Over time, these repeats decay so that the original sharp pattern of repetition is somewhat obscured, but even degenerate repeats pose a problem for sequence annotation: when two sequences both contain shared patterns of similar repetition, the result can be a false signal of sequence homology. We describe an implementation of a new hidden Markov model for detecting tandem repeats that shows substantially improved sensitivity to labeling decayed repetitive regions, presents low and reliable false annotation rates across a wide range of sequence composition, and produces scores that follow a stable distribution. On typical genomic sequence, the time and memory requirements of the resulting tool (ULTRA) are competitive with the most heavily used tool for repeat masking (TRF). ULTRA is released under an open source license and lays the groundwork for inclusion of the model in sequence alignment tools and annotation pipelines.
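To make the abstract's definition concrete, the following is a minimal sketch of a naive scan for exact tandem repeats. It is not ULTRA's hidden Markov model, and the function name and the `max_period` and `min_copies` parameters are hypothetical choices made for illustration. It finds the abstract's sharp 'atg' example but reports nothing once a single substitution interrupts the pattern, which is the kind of decayed repeat that a model-based approach is meant to label.

```python
def find_exact_tandem_repeats(seq, max_period=6, min_copies=3):
    """Naive scan for exact tandem repeats (illustration only, not ULTRA's HMM).

    Returns a list of (start, end, unit) tuples with `end` exclusive.
    `max_period` and `min_copies` are hypothetical thresholds.
    """
    seq = seq.lower()
    n = len(seq)
    hits = []
    i = 0
    while i < n:
        best = None
        for period in range(1, max_period + 1):
            # Extend while each residue equals the one `period` positions back.
            j = i + period
            while j < n and seq[j] == seq[j - period]:
                j += 1
            copies = (j - i) // period
            # Keep the longest run; on ties the smaller period wins.
            if copies >= min_copies and (best is None or j - i > best[1] - best[0]):
                best = (i, j, seq[i:i + period])
        if best:
            hits.append(best)
            i = best[1]  # skip past the reported repeat
        else:
            i += 1
    return hits


sharp = find_exact_tandem_repeats("atgatgatgatgatg")
flanked = find_exact_tandem_repeats("ccatgatgatgatgatgtt")
decayed = find_exact_tandem_repeats("atgatgatcatgatg")  # one g -> c substitution
```

Here `sharp` covers the whole sequence with unit 'atg', while `decayed` is empty because neither side of the substitution holds three exact copies. Exact matching of this kind has no notion of substitutions or indels, so it is only useful as a baseline for understanding the problem.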

Original language: English (US)
Title of host publication: ACM-BCB 2018 - Proceedings of the 2018 ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics
Publisher: Association for Computing Machinery, Inc
Pages: 37-46
Number of pages: 10
ISBN (Electronic): 9781450357944
State: Published - Aug 15 2018
Externally published: Yes
Event: 9th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics, ACM-BCB 2018 - Washington, United States
Duration: Aug 29 2018 to Sep 1 2018

Publication series

Name: ACM-BCB 2018 - Proceedings of the 2018 ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics

Conference

Conference: 9th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics, ACM-BCB 2018
Country/Territory: United States
City: Washington
Period: 8/29/18 to 9/1/18

Keywords

  • Annotation error
  • Sequence alignment
  • Tandem repeats

ASJC Scopus subject areas

  • Computer Science Applications
  • Software
  • Health Informatics
  • Biomedical Engineering
