Abstract:
BACKGROUND: Long terminal repeat retrotransposons are the most abundant transposons in plants. They play important roles in alternative splicing, recombination, gene regulation, and defense mechanisms. Large-scale sequencing projects for plant genomes are currently underway. Software tools are important for annotating long terminal repeat retrotransposons in these newly available genomes. However, the available tools are not very sensitive to known elements and perform inconsistently on different genomes. Some are hard to install or obsolete. They may struggle to process large plant genomes. None can be executed in parallel out of the box and very few have features to support visual review of new elements. To overcome these limitations, we developed LtrDetector, which uses techniques inspired by signal processing.
RESULTS: We compared LtrDetector to LTR_Finder and LTRharvest, the two most successful predecessor tools, on six plant genomes. For each organism, we constructed a ground truth data set based on queries from a consensus sequence database. According to this evaluation, LtrDetector was the most sensitive tool, achieving 16-23% improvement in sensitivity over LTRharvest and 21% improvement over LTR_Finder. All three tools had low false positive rates, with LtrDetector achieving 98.2% precision, in between its two competitors. Overall, LtrDetector provides the best compromise between high sensitivity and low false positive rate while requiring moderate time and utilizing memory available on personal computers.
CONCLUSIONS: LtrDetector uses a novel methodology revolving around k-mer distributions, which allows it to produce high-quality results using relatively lightweight procedures. It is easy to install and use. It is not species specific, performing well using its default parameters on genomes of varying size and repeat content. It is automatically configured for parallel execution and runs efficiently on an ordinary personal computer. It includes a k-mer scores visualization tool to facilitate manual review of the identified elements. These features make LtrDetector an attractive tool for future annotation projects involving long terminal repeat retrotransposons.
journal_name: BMC Genomics
journal_title: BMC genomics
authors: Valencia JD, Girgis HZ
doi: 10.1186/s12864-019-5796-9
subject: Has Abstract
pub_date: 2019-06-03 00:00:00
pages: 450
issue: 1
issn: 1471-2164
pii: 10.1186/s12864-019-5796-9
journal_volume: 20
pub_type: Journal article