Abstract:
BACKGROUND:Repeat masking is an important step in the EST analysis pipeline. For new species, genomic knowledge is scarce and good repeat libraries are typically unavailable. In these cases it is common practice to mask against known repeats from other species (i.e., model organisms). There are few studies that investigate the effectiveness of this approach, or attempt to evaluate the different methods for identifying and masking repeats. RESULTS:Using zebrafish and medaka as example organisms, we show that accurate repeat masking is an important factor for obtaining a high quality clustering. Furthermore, we show that masking with standard repeat libraries based on curated genomic information from other species has little or no positive effect on the quality of the resulting EST clustering. Library based repeat masking which often constitutes a computational bottleneck in the EST analysis pipeline can therefore be reduced to species specific repeat libraries, or perhaps eliminated entirely. In contrast, substantially improved results can be achived by applying a repeat library derived from a partial reference clustering (e.g., from mapping sequences against a partially sequenced genome). CONCLUSION:Of the methods explored, we find that the best EST clustering is achieved after masking with repeat libraries that are species specific. In the absence of such libraries, library-less masking gives results superior to the current practice of using cross-species, genome-based libraries.
journal_name
BMC Genomicsjournal_title
BMC genomicsauthors
Malde K,Jonassen Idoi
10.1186/1471-2164-9-23subject
Has Abstractpub_date
2008-01-18 00:00:00pages
23issn
1471-2164pii
1471-2164-9-23journal_volume
9pub_type
杂志文章相关文献
BMC GENOMICS文献大全abstract:BACKGROUND:Salmonella Typhimurium is frequently isolated from foodborne infection cases in Hong Kong, but the lack of genome sequences has hindered in-depth epidemiological and phylogenetic studies. In this study, we sought to reconstruct the phylogenetic relationship and investigate the distribution and mutation patte...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-015-1900-y
更新日期:2015-09-14 00:00:00
abstract:BACKGROUND:Aquaporins (AQPs), as members of the major intrinsic protein (MIP) superfamily, facilitated the permeation of water and other solutes and are involved in multiple biological processes. AQP family exists in almost all living organisms and is highly diversified in vertebrates in both classification and functio...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-020-06942-y
更新日期:2020-07-29 00:00:00
abstract:BACKGROUND:Cytochrome P450s (CYPs) in animals fall into two categories: those that synthesize or metabolize endogenous molecules and those that interact with exogenous chemicals from the diet or the environment. The latter form a critical component of detoxification systems. RESULTS:Data mining and manual curation of ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-10-169
更新日期:2009-04-21 00:00:00
abstract:BACKGROUND:Dinoflagellates are unicellular marine and freshwater eukaryotes. They possess large nuclear genomes (1.5-245 gigabases) and produce structurally unique and biologically active polyketide secondary metabolites. Although polyketide biosynthesis is well studied in terrestrial and freshwater organisms, only rec...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-015-2195-8
更新日期:2015-11-14 00:00:00
abstract:BACKGROUND:The molecular mechanisms that determine the organism's response to a variety of doses and modalities of stress factors are not well understood. RESULTS:We studied effects of ionizing radiation (144, 360 and 864 Gy), entomopathogenic fungus (10 and 100 CFU), starvation (16 h), and cold shock (+4, 0 and -4°C)...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-16-S13-S8
更新日期:2015-01-01 00:00:00
abstract:BACKGROUND:Evolution of novel protein-coding genes is the bedrock of adaptive evolution. Recently, we identified six protein-coding genes with similar signal sequence from Schistosoma japonicum egg stage mRNA using signal sequence trap (SST). To find the mechanism underlying the origination of these genes with similar ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-13-260
更新日期:2012-06-20 00:00:00
abstract:BACKGROUND:Bloodstream malaria parasites require Ca++ for their development, but the sites and mechanisms of Ca++ utilization are not well understood. We hypothesized that there may be differences in Ca++ uptake or utilization by genetically distinct lines of P. falciparum. These differences, if identified, may provide...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-018-5418-y
更新日期:2019-01-16 00:00:00
abstract:BACKGROUND:The identification of cell type-specific genes (markers) is an essential step for the deconvolution of the cellular fractions, primarily, from the gene expression data of a bulk sample. However, the genes with significant changes identified by pair-wise comparisons cannot indeed represent the specificity of ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-020-06888-1
更新日期:2020-09-23 00:00:00
abstract::After publication of [1], the authors were informed by John A. Rhodes of a counterexample to Theorem 11 of [1]. ...
journal_title:BMC genomics
pub_type: 杂志文章,已发布勘误
doi:10.1186/s12864-020-6540-1
更新日期:2020-02-10 00:00:00
abstract:BACKGROUND:Streptococcus agalactiae (group B Streptococcus; GBS) is a significant bacterial pathogen of neonates and an emerging pathogen of adults. Though transcriptional regulators are abundantly encoded on the GBS genome, their role in GBS pathogenesis is poorly understood. The mtaR gene encodes a putative LysR-type...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-9-607
更新日期:2008-12-16 00:00:00
abstract:BACKGROUND:Rare genetic variation in the human population is a major source of pathophysiological variability and has been implicated in a host of complex phenotypes and diseases. Finding disease-related genes harboring disparate functional rare variants requires sequencing of many individuals across many genomic regio...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-13-683
更新日期:2012-12-06 00:00:00
abstract:BACKGROUND:To date, oil-rich plants are the main source of biodiesel products. Because concerns have been voiced about the impact of oil-crop cultivation on the price of food commodities, the interest in oil plants not used for food production and amenable to cultivation on non-agricultural land has soared. As a non-fo...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-11-462
更新日期:2010-08-06 00:00:00
abstract:BACKGROUND:Sorghum [Sorghum bicolor (L.) Moench] is an important cereal crop for dryland areas in the United States and for small-holder farmers in Africa. Natural variation of sorghum grain composition (protein, fat, and starch) between accessions can be used for crop improvement, but the genetic controls are still un...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-3403-x
更新日期:2017-01-05 00:00:00
abstract:BACKGROUND:Advances in whole genome profiling have revolutionized the cancer research field, but at the same time have raised new bioinformatics challenges. For next generation sequencing (NGS), these include data storage, computational costs, sequence processing and alignment, delineating appropriate statistical measu...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-13-S6-S14
更新日期:2012-01-01 00:00:00
abstract:BACKGROUND:The sRNAs of bacterial pathogens are known to be involved in various cellular roles including environmental adaptation as well as regulation of virulence and pathogenicity. It is expected that sRNAs may also have similar functions for Burkholderia pseudomallei, a soil bacterium that can adapt to diverse envi...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-13-S7-S13
更新日期:2012-01-01 00:00:00
abstract:BACKGROUND:The identification of genetic markers associated with complex traits that are expensive to record such as feed intake or feed efficiency would allow these traits to be included in selection programs. To identify large-effect QTL, we performed a series of genome-wide association studies and functional analyse...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-1004
更新日期:2014-11-20 00:00:00
abstract:BACKGROUND:Spiders (Order Araneae) are essential predators in every terrestrial ecosystem largely because they have evolved potent arsenals of silk and venom. Spider silks are high performance materials made almost entirely of proteins, and thus represent an ideal system for investigating genome level evolution of nove...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-365
更新日期:2014-05-23 00:00:00
abstract:BACKGROUND:Gonadotropin releasing hormone (GnRH) is responsible for stimulation of gonadotropic hormone (GtH) in the hypothalamus-pituitary-gonadal axis (HPG). The regulatory mechanisms responsible for brain specificity make the promoter attractive for in silico analysis and reporter gene studies in zebrafish (Danio re...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-3-25
更新日期:2002-08-21 00:00:00
abstract:BACKGROUND:The numerous classes of repeats often impede the assembly of genome sequences from the short reads provided by new sequencing technologies. We demonstrate a simple and rapid means to ascertain the repeat structure and total size of a bacterial or archaeal genome without the need for assembly by directly anal...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-14-537
更新日期:2013-08-08 00:00:00
abstract:BACKGROUND:The understanding of the biological function, regulation, and cellular interactions of the yeast genome and proteome, along with the high conservation in gene function found between yeast genes and their human homologues, has allowed for Saccharomyces cerevisiae to be used as a model organism to deduce biolo...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-10-524
更新日期:2009-11-15 00:00:00
abstract:BACKGROUND:The steady-state behaviour of gene regulatory networks (GRNs) can provide crucial evidence for detecting disease-causing genes. However, monitoring the dynamics of GRNs is particularly difficult because biological data only reflects a snapshot of the dynamical behaviour of the living organism. Also most GRN ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-10-S3-S26
更新日期:2009-12-03 00:00:00
abstract:BACKGROUND:Cadmium (Cd) is a severe detrimental environmental pollutant. To adapt to Cd-induced deleterious effects, plants have evolved sophisticated defence mechanisms. In this study, a genome-wide transcriptome analysis was performed to identify the mechanisms of Cd tolerance using two barley genotypes with distinct...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-611
更新日期:2014-07-19 00:00:00
abstract:BACKGROUND:RNA-seq and microarray are the two popular methods employed for genome-wide transcriptome profiling. Current comparison studies have shown that transcriptome quantified by these two methods correlated well. However, none of them have addressed if they complement each other, considering the strengths and the ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-13-629
更新日期:2012-11-15 00:00:00
abstract:BACKGROUND:Genetic populations provide the basis for a wide range of genetic and genomic studies and have been widely used in genetic mapping, gene discovery and genomics-assisted breeding. Chromosome segment substitution lines (CSSLs) are the most powerful tools for the detection and precise mapping of quantitative tr...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-11-656
更新日期:2010-11-24 00:00:00
abstract:BACKGROUND:Gene targeting by homology-directed repair (HDR) can precisely edit the genome and is a versatile tool for biomedical research. However, the efficiency of HDR-based modification is still low in many model organisms including zebrafish. Recently, long single-stranded DNA (lssDNA) molecules have been developed...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-020-6493-4
更新日期:2020-01-21 00:00:00
abstract:BACKGROUND:Adult neurogenesis, which is the continual production of new neurons in the mature brain, demonstrates the strikingly plastic nature of the nervous system. Adult neural stem cells and their neural precursors, collectively referred to as neural progenitor cells (NPCs), are present in the subgranular zone (SGZ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-206
更新日期:2014-03-19 00:00:00
abstract:BACKGROUND:Staphylococcus aureus or MRSA (Methicillin Resistant S. aureus), is an acquired pathogen and the primary cause of nosocomial infections worldwide. In S. aureus, teichoic acid is an essential component of the cell wall, and its biosynthesis is not yet well characterized. Studies in Bacillus subtilis have disc...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-7-74
更新日期:2006-04-05 00:00:00
abstract:BACKGROUND:Cotton (Gossypium spp.) is commonly grouped into eight diploid genomic groups and an allotetraploid genomic group, AD. The mitochondrial genomes supply new information to understand both the evolution process and the mechanism of cytoplasmic male sterility. Based on previously released mitochondrial genomes ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-017-4282-5
更新日期:2017-11-13 00:00:00
abstract:BACKGROUND:Genes, RNAs, and proteins play important roles during germline development. However, the functions of non-coding RNAs (ncRNAs) on germline development remain unclear in avian species. Recent high-throughput techniques have identified several classes of ncRNAs, including micro RNAs (miRNAs), small-interfering...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-757
更新日期:2014-09-04 00:00:00
abstract:BACKGROUND:Citrus blight is a very important progressive decline disease of commercial citrus. The etiology is unknown, although the disease can be transmitted by root grafts, suggesting a viral etiology. Diagnosis is made by demonstrating physical blockage of xylem cells that prevents the movement of water. This test ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-019-6339-0
更新日期:2019-12-11 00:00:00