TY - JOUR
T1 - A note on the incidence of reverse complementary fungal ITS sequences in the public sequence databases and a software tool for their detection and reorientation
AU - Nilsson, R. Henrik
AU - Veldre, Vilmar
AU - Wang, Zheng
AU - Eckart, Martin
AU - Branco, Sara
AU - Hartmann, Martin
AU - Quince, Christopher
AU - Godhe, Anna
AU - Bertrand, Yann
AU - Alfredsson, Johan F.
AU - Larsson, Karl Henrik
AU - Kõljalg, Urmas
AU - Abarenkov, Kessy
N1 - Funding Information:
Acknowledgments: R.H.N. and K.A. gratefully acknowledge support from the Frontiers in Biodiversity Research Centre of Excellence (University of Tartu) and the Fungi in Boreal Forest Soils network. Matt von Konrat and Anders Hagborg are acknowledged for valuable advice on the liverwort data. Two anonymous reviewers are acknowledged for valuable input on the manuscript. The authors declare that they have no conflict of interest. No laboratory experiments were undertaken as a component, or result, of the present study.
PY - 2011/7
Y1 - 2011/7
N2 - Reverse complementary DNA sequences (sequences that are inadvertently cast backward and in which all purines and pyrimidines are transposed) are not uncommon in sequence databases, where they may introduce noise into sequence-based research. We show that about 1% of the public fungal ITS sequences, the most commonly sequenced genetic marker in mycology, are reverse complementary, and we introduce an open-source software solution to automate their detection and reorientation. The Mac OS X/Linux/UNIX software operates on public or private datasets of any size, although some 50 base pairs of the 5.8S gene of the ITS region are needed for the analysis.
AB - Reverse complementary DNA sequences (sequences that are inadvertently cast backward and in which all purines and pyrimidines are transposed) are not uncommon in sequence databases, where they may introduce noise into sequence-based research. We show that about 1% of the public fungal ITS sequences, the most commonly sequenced genetic marker in mycology, are reverse complementary, and we introduce an open-source software solution to automate their detection and reorientation. The Mac OS X/Linux/UNIX software operates on public or private datasets of any size, although some 50 base pairs of the 5.8S gene of the ITS region are needed for the analysis.
KW - DNA barcoding
KW - Environmental sampling
KW - Hidden Markov models
KW - Quality assessment
KW - Sequence identification
UR - http://www.scopus.com/inward/record.url?scp=79959633458&partnerID=8YFLogxK
U2 - 10.1007/s10267-010-0086-z
DO - 10.1007/s10267-010-0086-z
M3 - Article
AN - SCOPUS:79959633458
VL - 52
SP - 278
EP - 282
JO - Mycoscience
JF - Mycoscience
SN - 1340-3540
IS - 4
ER -