An alignment-free test for recombination
Standard
An alignment-free test for recombination. / Haubold, Bernhard; Krause, Linda; Horn, Thomas; Pfaffelhuber, Peter.
In: BIOINFORMATICS, Vol. 29, No. 24, 15.12.2013, p. 3121-7.Research output: SCORING: Contribution to journal › SCORING: Journal article › Research › peer-review
Harvard
APA
Vancouver
Bibtex
}
RIS
TY - JOUR
T1 - An alignment-free test for recombination
AU - Haubold, Bernhard
AU - Krause, Linda
AU - Horn, Thomas
AU - Pfaffelhuber, Peter
PY - 2013/12/15
Y1 - 2013/12/15
N2 - MOTIVATION: Why recombination? is one of the central questions in biology. This has led to a host of methods for quantifying recombination from sequence data. These methods are usually based on aligned DNA sequences. Here, we propose an efficient alignment-free alternative.RESULTS: Our method is based on the distribution of match lengths, which we look up using enhanced suffix arrays. By eliminating the alignment step, the test becomes fast enough for application to whole bacterial genomes. Using simulations we show that our test has similar power as established tests when applied to long pairs of sequences. When applied to 58 genomes of Escherichia coli, we pick up the strongest recombination signal from a 125 kb horizontal gene transfer engineered 20 years ago.AVAILABILITY AND IMPLEMENTATION: We have implemented our method in the command-line program rush. Its C sources and documentation are available under the GNU General Public License from http://guanine.evolbio.mpg.de/rush/.
AB - MOTIVATION: Why recombination? is one of the central questions in biology. This has led to a host of methods for quantifying recombination from sequence data. These methods are usually based on aligned DNA sequences. Here, we propose an efficient alignment-free alternative.RESULTS: Our method is based on the distribution of match lengths, which we look up using enhanced suffix arrays. By eliminating the alignment step, the test becomes fast enough for application to whole bacterial genomes. Using simulations we show that our test has similar power as established tests when applied to long pairs of sequences. When applied to 58 genomes of Escherichia coli, we pick up the strongest recombination signal from a 125 kb horizontal gene transfer engineered 20 years ago.AVAILABILITY AND IMPLEMENTATION: We have implemented our method in the command-line program rush. Its C sources and documentation are available under the GNU General Public License from http://guanine.evolbio.mpg.de/rush/.
KW - Algorithms
KW - Computational Biology
KW - Computer Simulation
KW - Escherichia coli
KW - Genome, Bacterial
KW - Phylogeny
KW - Recombination, Genetic
KW - Sequence Alignment
KW - Journal Article
KW - Research Support, Non-U.S. Gov't
U2 - 10.1093/bioinformatics/btt550
DO - 10.1093/bioinformatics/btt550
M3 - SCORING: Journal article
C2 - 24064419
VL - 29
SP - 3121
EP - 3127
JO - BIOINFORMATICS
JF - BIOINFORMATICS
SN - 1367-4803
IS - 24
ER -