LtrDetector: A tool-suite for detecting long terminal repeat retrotransposons de-novo.

Abstract:

BACKGROUND: Long terminal repeat retrotransposons are the most abundant transposons in plants. They play important roles in alternative splicing, recombination, gene regulation, and defense mechanisms. Large-scale sequencing projects for plant genomes are currently underway. Software tools are important for annotating long terminal repeat retrotransposons in these newly available genomes. However, the available tools are not very sensitive to known elements and perform inconsistently on different genomes. Some are hard to install or obsolete. They may struggle to process large plant genomes. None can be executed in parallel out of the box and very few have features to support visual review of new elements. To overcome these limitations, we developed LtrDetector, which uses techniques inspired by signal-processing.

RESULTS: We compared LtrDetector to LTR_Finder and LTRharvest, the two most successful predecessor tools, on six plant genomes. For each organism, we constructed a ground truth data set based on queries from a consensus sequence database. According to this evaluation, LtrDetector was the most sensitive tool, achieving 16-23% improvement in sensitivity over LTRharvest and 21% improvement over LTR_Finder. All three tools had low false positive rates, with LtrDetector achieving 98.2% precision, in between its two competitors. Overall, LtrDetector provides the best compromise between high sensitivity and low false positive rate while requiring moderate time and utilizing memory available on personal computers.

CONCLUSIONS: LtrDetector uses a novel methodology revolving around k-mer distributions, which allows it to produce high-quality results using relatively lightweight procedures. It is easy to install and use. It is not species specific, performing well using its default parameters on genomes of varying size and repeat content. It is automatically configured for parallel execution and runs efficiently on an ordinary personal computer. It includes a k-mer scores visualization tool to facilitate manual review of the identified elements. These features make LtrDetector an attractive tool for future annotation projects involving long terminal repeat retrotransposons.

journal_name

BMC Genomics

journal_title

BMC genomics

authors

Valencia JD,Girgis HZ

doi

10.1186/s12864-019-5796-9

pub_date

2019-06-03 00:00:00

pages

450

issue

1

issn

1471-2164

pii

10.1186/s12864-019-5796-9

journal_volume

20

pub_type

Journal Article
  • Universal Reference RNA as a standard for microarray experiments.

    abstract:BACKGROUND:Obtaining reliable and reproducible two-color microarray gene expression data is critically important for understanding the biological significance of perturbations made on a cellular system. Microarray design, RNA preparation and labeling, hybridization conditions and data acquisition and analysis are varia...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-5-20

    authors: Novoradovskaya N,Whitfield ML,Basehore LS,Novoradovsky A,Pesich R,Usary J,Karaca M,Wong WK,Aprelikova O,Fero M,Perou CM,Botstein D,Braman J

    update_date:2004-03-09 00:00:00

  • Response of the mosquito protein interaction network to dengue infection.

    abstract:BACKGROUND:Two fifths of the world's population is at risk from dengue. The absence of effective drugs and vaccines leaves vector control as the primary intervention tool. Understanding dengue virus (DENV) host interactions is essential for the development of novel control strategies. The availability of genome sequenc...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-380

    authors: Guo X,Xu Y,Bian G,Pike AD,Xie Y,Xi Z

    update_date:2010-06-16 00:00:00

  • Screening populations for copy number variation using genotyping-by-sequencing: a proof of concept using soybean fast neutron mutants.

    abstract:BACKGROUND:The effective use of mutant populations for reverse genetic screens relies on the population-wide characterization of the induced mutations. Genome- and population-wide characterization of the mutations found in fast neutron populations has been hindered, however, by the wide range of mutations generated and...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-5998-1

    authors: Lemay MA,Torkamaneh D,Rigaill G,Boyle B,Stec AO,Stupar RM,Belzile F

    update_date:2019-08-06 00:00:00

  • Bcheck: a wrapper tool for detecting RNase P RNA genes.

    abstract:BACKGROUND:Effective bioinformatics solutions are needed to tackle challenges posed by industrial-scale genome annotation. We present Bcheck, a wrapper tool which predicts RNase P RNA genes by combining the speed of pattern matching and sensitivity of covariance models. The core of Bcheck is a library of subfamily spec...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-432

    authors: Yusuf D,Marz M,Stadler PF,Hofacker IL

    update_date:2010-07-13 00:00:00

  • Culture-independent genomic characterisation of Candidatus Chlamydia sanzinia, a novel uncultivated bacterium infecting snakes.

    abstract:BACKGROUND:Recent molecular studies have revealed considerably more diversity in the phylum Chlamydiae than was previously thought. Evidence is growing that many of these novel chlamydiae may be important pathogens in humans and animals. A significant barrier to characterising these novel chlamydiae is the requirement ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-3055-x

    authors: Taylor-Brown A,Bachmann NL,Borel N,Polkinghorne A

    update_date:2016-09-05 00:00:00

  • Evaluation of variant identification methods for whole genome sequencing data in dairy cattle.

    abstract:BACKGROUND:Advances in human genomics have allowed unprecedented productivity in terms of algorithms, software, and literature available for translating raw next-generation sequence data into high-quality information. The challenges of variant identification in organisms with lower quality reference genomes are less we...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-948

    authors: Baes CF,Dolezal MA,Koltes JE,Bapst B,Fritz-Waters E,Jansen S,Flury C,Signer-Hasler H,Stricker C,Fernando R,Fries R,Moll J,Garrick DJ,Reecy JM,Gredler B

    update_date:2014-11-01 00:00:00

  • Clustered regulatory elements at nucleosome-depleted regions punctuate a constant nucleosomal landscape in Schizosaccharomyces pombe.

    abstract:BACKGROUND:Nucleosomes facilitate the packaging of the eukaryotic genome and modulate the access of regulators to DNA. A detailed description of the nucleosomal organization under different transcriptional programmes is essential to understand their contribution to genomic regulation. RESULTS:To visualize the dynamics...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-813

    authors: Soriano I,Quintales L,Antequera F

    update_date:2013-11-21 00:00:00

  • In silico prediction and characterization of secondary metabolite biosynthetic gene clusters in the wheat pathogen Zymoseptoria tritici.

    abstract:BACKGROUND:Fungal pathogens of plants produce diverse repertoires of secondary metabolites, which have functions ranging from iron acquisition, defense against immune perturbation, to toxic assaults on the host. The wheat pathogen Zymoseptoria tritici causes Septoria tritici blotch, a foliar disease which is a signific...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-017-3969-y

    authors: Cairns T,Meyer V

    update_date:2017-08-17 00:00:00

  • Whole genome resequencing of the Iranian native dogs and wolves to unravel variome during dog domestication.

    abstract:BACKGROUND:Advances in genome technology have simplified a new comprehension of the genetic and historical processes crucial to rapid phenotypic evolution under domestication. To get new insight into the genetic basis of the dog domestication process, we conducted whole-genome sequence analysis of three wolves and thre...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-020-6619-8

    authors: Amiri Ghanatsaman Z,Wang GD,Asadollahpour Nanaei H,Asadi Fozi M,Peng MS,Esmailizadeh A,Zhang YP

    update_date:2020-03-04 00:00:00

  • Identification and population genetic analyses of copy number variations in six domestic goat breeds and Bezoar ibexes using next-generation sequencing.

    abstract:BACKGROUND:Copy number variations (CNVs) are a major form of genetic variations and are involved in animal domestication and genetic adaptation to local environments. We investigated CNVs in the domestic goat (Capra hircus) using Illumina short-read sequencing data, by comparing our lab data for 38 goats from three Chi...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-020-07267-6

    authors: Guo J,Zhong J,Liu GE,Yang L,Li L,Chen G,Song T,Zhang H

    update_date:2020-11-27 00:00:00

  • InCoB celebrates its tenth anniversary as first joint conference with ISCB-Asia.

    abstract:In 2009 the International Society for Computational Biology (ISCB) started to roll out regional bioinformatics conferences in Africa, Latin America and Asia. The open and competitive bid for the first meeting in Asia (ISCB-Asia) was awarded to Asia-Pacific Bioinformatics Network (APBioNet) which has been running the I...

    journal_title:BMC genomics

    pub_type:

    doi:10.1186/1471-2164-12-S3-S1

    authors: Schönbach C,Tan TW,Kelso J,Rost B,Nathan S,Ranganathan S

    update_date:2011-11-30 00:00:00

  • Molecular mechanisms of an antimicrobial peptide piscidin (Lc-pis) in a parasitic protozoan, Cryptocaryon irritans.

    abstract:BACKGROUND:Cryptocaryon irritans is an obligate parasitic ciliate protozoan that can infect various commercially important mariculture fish species and cause high lethality and economic loss. Current methods of controlling this parasite with chemicals or antibiotics are widely considered to be environmentally harmful. ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-018-4565-5

    authors: Chen R,Mao Y,Wang J,Liu M,Qiao Y,Zheng L,Su Y,Ke Q,Zheng W

    update_date:2018-03-12 00:00:00

  • The developmental transcriptomes of two sea biscuit species with differing larval types.

    abstract:BACKGROUND:Larval developmental patterns are extremely varied both between and within phyla, however the genetic mechanisms leading to this diversification are poorly understood. We assembled and compared the developmental transcriptomes for two sea biscuit species (Echinodermata: Echinoidea) with differing patterns of...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-018-4768-9

    authors: Armstrong AF,Grosberg RK

    update_date:2018-05-18 00:00:00

  • Predicting chemical bioavailability using microarray gene expression data and regression modeling: A tale of three explosive compounds.

    abstract:BACKGROUND:Chemical bioavailability is an important dose metric in environmental risk assessment. Although many approaches have been used to evaluate bioavailability, not a single approach is free from limitations. Previously, we developed a new genomics-based approach that integrated microarray technology and regressi...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-2541-5

    authors: Gong P,Nan X,Barker ND,Boyd RE,Chen Y,Wilkins DE,Johnson DR,Suedel BC,Perkins EJ

    update_date:2016-03-08 00:00:00

  • Comparison of leaf transcriptome in response to Rhizoctonia solani infection between resistant and susceptible rice cultivars.

    abstract:BACKGROUND:Sheath blight (SB), caused by Rhizoctonia solani, is a common rice disease worldwide. Currently, rice cultivars with robust resistance to R. solani are still lacking. To provide theoretic basis for molecular breeding of R. solani-resistant rice cultivars, the changes of transcriptome profiles in response to ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-020-6645-6

    authors: Shi W,Zhao SL,Liu K,Sun YB,Ni ZB,Zhang GY,Tang HS,Zhu JW,Wan BJ,Sun HQ,Dai JY,Sun MF,Yan GH,Wang AM,Zhu GY

    update_date:2020-03-19 00:00:00

  • Applicability of DNA pools on 500 K SNP microarrays for cost-effective initial screens in genomewide association studies.

    abstract:BACKGROUND:Genetic influences underpinning complex traits are thought to involve multiple quantitative trait loci (QTLs) of small effect size. Detection of such QTL associations requires systematic screening of large numbers of DNA markers within large sample populations. Using pooled DNA on SNP microarrays to screen f...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-8-214

    authors: Docherty SJ,Butcher LM,Schalkwyk LC,Plomin R

    update_date:2007-07-04 00:00:00

  • High-throughput cis-regulatory element discovery in the vector mosquito Aedes aegypti.

    abstract:BACKGROUND:Despite substantial progress in mosquito genomic and genetic research, few cis-regulatory elements (CREs), DNA sequences that control gene expression, have been identified in mosquitoes or other non-model insects. Formaldehyde-assisted isolation of regulatory elements paired with DNA sequencing, FAIRE-seq, i...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-2468-x

    authors: Behura SK,Sarro J,Li P,Mysore K,Severson DW,Emrich SJ,Duman-Scheel M

    update_date:2016-05-10 00:00:00

  • The venom composition of the parasitic wasp Chelonus inanitus resolved by combined expressed sequence tags analysis and proteomic approach.

    abstract:BACKGROUND:Parasitic wasps constitute one of the largest group of venomous animals. Although some physiological effects of their venoms are well documented, relatively little is known at the molecular level on the protein composition of these secretions. To identify the majority of the venom proteins of the endoparasit...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-693

    authors: Vincent B,Kaeslin M,Roth T,Heller M,Poulain J,Cousserans F,Schaller J,Poirié M,Lanzrein B,Drezen JM,Moreau SJ

    update_date:2010-12-07 00:00:00

  • A memory-efficient algorithm to obtain splicing graphs and de novo expression estimates from de Bruijn graphs of RNA-Seq data.

    abstract:BACKGROUND:The recent advance of high-throughput sequencing makes it feasible to study entire transcriptomes through the application of de novo sequence assembly algorithms. While a popular strategy is to first construct an intermediate de Bruijn graph structure to represent the transcriptome, an additional step is nee...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-S5-S6

    authors: Sze SH,Tarone AM

    update_date:2014-01-01 00:00:00

  • Altered patterns of gene duplication and differential gene gain and loss in fungal pathogens.

    abstract:BACKGROUND:Duplication, followed by fixation or random loss of novel genes, contributes to genome evolution. Particular outcomes of duplication events are possibly associated with pathogenic life histories in fungi. To date, differential gene gain and loss have not been studied at genomic scales in fungal pathogens, de...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-147

    authors: Powell AJ,Conant GC,Brown DE,Carbone I,Dean RA

    update_date:2008-03-28 00:00:00

  • Analysis and functional annotation of expressed sequence tags (ESTs) from multiple tissues of oil palm (Elaeis guineensis Jacq.).

    abstract:BACKGROUND:Oil palm is the second largest source of edible oil which contributes to approximately 20% of the world's production of oils and fats. In order to understand the molecular biology involved in in vitro propagation, flowering, efficient utilization of nitrogen sources and root diseases, we have initiated an ex...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-8-381

    authors: Ho CL,Kwan YY,Choi MC,Tee SS,Ng WH,Lim KA,Lee YP,Ooi SE,Lee WW,Tee JM,Tan SH,Kulaveerasingam H,Alwee SS,Abdullah MO

    update_date:2007-10-22 00:00:00

  • Next generation sequencing analysis reveals a relationship between rDNA unit diversity and locus number in Nicotiana diploids.

    abstract:BACKGROUND:Tandemly arranged nuclear ribosomal DNA (rDNA), encoding 18S, 5.8S and 26S ribosomal RNA (rRNA), exhibit concerted evolution, a pattern thought to result from the homogenisation of rDNA arrays. However rDNA homogeneity at the single nucleotide polymorphism (SNP) level has not been detailed in organisms with ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-722

    authors: Matyášek R,Renny-Byfield S,Fulneček J,Macas J,Grandbastien MA,Nichols R,Leitch A,Kovařík A

    update_date:2012-12-23 00:00:00

  • STATc is a key regulator of the transcriptional response to hyperosmotic shock.

    abstract:BACKGROUND:Dictyostelium discoideum is frequently subjected to environmental changes in its natural habitat, the forest soil. In order to survive, the organism had to develop effective mechanisms to sense and respond to such changes. When cells are faced with a hypertonic environment a complex response is triggered. It...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-8-123

    authors: Na J,Tunggal B,Eichinger L

    update_date:2007-05-21 00:00:00

  • Exploring hierarchical and overlapping modular structure in the yeast protein interaction network.

    abstract:BACKGROUND:Developing effective strategies to reveal modular structures in protein interaction networks is crucial for better understanding of molecular mechanisms of underlying biological processes. In this paper, we propose a new density-based algorithm (ADHOC) for clustering vertices of a protein interaction network...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-S4-S17

    authors: Liu C,Li J,Zhao Y

    update_date:2010-12-02 00:00:00

  • Efficient depletion of ribosomal RNA for RNA sequencing in planarians.

    abstract:BACKGROUND:The astounding regenerative abilities of planarian flatworms prompt steadily growing interest in examining their molecular foundation. Planarian regeneration was found to require hundreds of genes and is hence a complex process. Thus, RNA interference followed by transcriptome-wide gene expression analysis b...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-6292-y

    authors: Kim IV,Ross EJ,Dietrich S,Döring K,Sánchez Alvarado A,Kuhn CD

    update_date:2019-11-29 00:00:00

  • Analysis of 4,664 high-quality sequence-finished poplar full-length cDNA clones and their utility for the discovery of genes responding to insect feeding.

    abstract:BACKGROUND:The genus Populus includes poplars, aspens and cottonwoods, which will be collectively referred to as poplars hereafter unless otherwise specified. Poplars are the dominant tree species in many forest ecosystems in the Northern Hemisphere and are of substantial economic value in plantation forestry. Poplar h...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-57

    authors: Ralph SG,Chun HJ,Cooper D,Kirkpatrick R,Kolosova N,Gunter L,Tuskan GA,Douglas CJ,Holt RA,Jones SJ,Marra MA,Bohlmann J

    update_date:2008-01-29 00:00:00

  • Comparative genomics of Eucalyptus and Corymbia reveals low rates of genome structural rearrangement.

    abstract:BACKGROUND:Previous studies suggest genome structure is largely conserved between Eucalyptus species. However, it is unknown if this conservation extends to more divergent eucalypt taxa. We performed comparative genomics between the eucalypt genera Eucalyptus and Corymbia. Our results will facilitate transfer of genomi...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-017-3782-7

    authors: Butler JB,Vaillancourt RE,Potts BM,Lee DJ,King GJ,Baten A,Shepherd M,Freeman JS

    update_date:2017-05-22 00:00:00

  • Stenotrophomonas comparative genomics reveals genes and functions that differentiate beneficial and pathogenic bacteria.

    abstract:BACKGROUND:In recent years, the number of human infections caused by opportunistic pathogens has increased dramatically. Plant rhizospheres are one of the most typical natural reservoirs for these pathogens but they also represent a great source for beneficial microbes with potential for biotechnological applications. ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-482

    authors: Alavi P,Starcher MR,Thallinger GG,Zachow C,Müller H,Berg G

    update_date:2014-06-18 00:00:00

  • ArachnoServer: a database of protein toxins from spiders.

    abstract:BACKGROUND:Venomous animals incapacitate their prey using complex venoms that can contain hundreds of unique protein toxins. The realisation that many of these toxins may have pharmaceutical and insecticidal potential due to their remarkable potency and selectivity against target receptors has led to an explosion in th...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-10-375

    authors: Wood DL,Miljenović T,Cai S,Raven RJ,Kaas Q,Escoubas P,Herzig V,Wilson D,King GF

    update_date:2009-08-13 00:00:00

  • Identification of genes associated with shell color in the black-lipped pearl oyster, Pinctada margaritifera.

    abstract:BACKGROUND:Color polymorphism in the nacre of pteriomorphian bivalves is of great interest for the pearl culture industry. The nacreous layer of the Polynesian black-lipped pearl oyster Pinctada margaritifera exhibits a large array of color variation among individuals including reflections of blue, green, yellow and pi...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1776-x

    authors: Lemer S,Saulnier D,Gueguen Y,Planes S

    update_date:2015-08-01 00:00:00