Transposable element annotation in non-model species ‐ the benefits of species‐specific repeat libraries using semi-automated EDTA and DeepTE de novo pipelines

Ellen A. Bell, Christopher L. Butler, Claudio Oliveira, Sarah Marburger, Levi Yant, Martin I. Taylor

Research output: Contribution to journalArticlepeer-review

8 Citations (Scopus)
75 Downloads (Pure)

Abstract

Transposable elements (TEs) are significant genomic components which can be detected either through sequence homology against existing databases or de novo, with the latter potentially reducing the risk of underestimating TE abundance. Here, we describe the semi-automated generation of a de novo TE library using the newly developed EDTA pipeline and DeepTE classifier in a non-model teleost (Corydoras fulleri). Using both genomic and transcriptomic data, we assess this de novo pipeline’s performance across four TE based metrics: (i) abundance, (ii) composition, (iii) fragmentation and (iv) age distributions. We then compare the results to those found when using a curated teleost library (Danio rerio). We identify quantitative differences in these metrics and highlight how TE library choice can have major impacts on TE-based estimates in non-model species.
Original languageEnglish
Pages (from-to)823-833
Number of pages11
JournalMolecular Ecology Resources
Volume22
Issue number2
Early online date18 Aug 2021
DOIs
Publication statusPublished - Feb 2022

Cite this