Abstract:
BACKGROUND: The approaches for shotgun-based sequencing of vertebrate genomes are now well-established, and have resulted in the generation of numerous draft whole-genome sequence assemblies. In contrast, the process of refining those assemblies to improve contiguity and increase accuracy (known as 'sequence finishing') remains tedious, labor-intensive, and expensive. As a result, the vast majority of vertebrate genome sequences generated to date remain at a draft stage.
RESULTS: To date, our genome sequencing efforts have focused on comparative studies of targeted genomic regions, requiring sequence finishing of large blocks of orthologous sequence (average size 0.5-2 Mb) from various subsets of 75 vertebrates. This experience has provided a unique opportunity to compare the relative effort required to finish shotgun-generated genome sequence assemblies from different species, which we report here. Importantly, we found that the sequence assemblies generated for the same orthologous regions from various vertebrates show substantial variation with respect to misassemblies and, in particular, the frequency and characteristics of sequence gaps. As a consequence, the work required to finish different species' sequences varied greatly. Application of the same standardized methods for finishing provided a novel opportunity to "assay" characteristics of genome sequences among many vertebrate species. It is important to note that many of the problems we have encountered during sequence finishing reflect unique architectural features of a particular vertebrate's genome, which in some cases may have important functional and/or evolutionary implications. Finally, based on our analyses, we have been able to improve our procedures to overcome some of these problems and to increase the overall efficiency of the sequence-finishing process, although significant challenges still remain.
CONCLUSION: Our findings have important implications for the eventual finishing of the draft whole-genome sequences that have now been generated for a large number of vertebrates.
journal_name: BMC Genomics
journal_title: BMC genomics
authors: Blakesley RW, Hansen NF, Gupta J, McDowell JC, Maskeri B, Barnabas BB, Brooks SY, Coleman H, Haghighi P, Ho SL, Schandler K, Stantripop S, Vogt JL, Thomas PJ, NISC Comparative Sequencing Program, Bouffard GG, Green ED
doi: 10.1186/1471-2164-11-21
subject: Has Abstract
pub_date: 2010-01-11 00:00:00
pages: 21
issn: 1471-2164
pii: 1471-2164-11-21
journal_volume: 11
pub_type: Journal Article
Related literature
BMC Genomics literature collection
abstract:BACKGROUND:Sub-optimal developmental diets often have adverse effects on long-term fitness and health. One hypothesis is that such effects are caused by mismatches between the developmental and adult environment, and may be mediated by persistent changes in gene expression. However, there are few experimental tests of ...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/s12864-017-3968-z
update_date: 2017-08-22 00:00:00
abstract:BACKGROUND:Although second generation sequencing (2GS) technologies allow re-sequencing of previously gold-standard-sequenced genomes, whole genome shotgun sequencing and de novo assembly of large and complex eukaryotic genomes is still difficult. Availability of a genome-wide physical map is therefore still a prerequi...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/1471-2164-12-247
update_date: 2011-05-19 00:00:00
abstract:BACKGROUND:Root-knot nematodes are sedentary endoparasites that can infect more than 3000 plant species. Root-knot nematodes cause an estimated $100 billion annual loss worldwide. For successful establishment of the root-knot nematode in its host plant, it causes dramatic morphological and physiological changes in plan...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/1471-2164-12-220
update_date: 2011-05-10 00:00:00
abstract:BACKGROUND:Tumor genomes are often highly heterogeneous, consisting of genomes from multiple subclonal types. Complete characterization of all subclonal types is a fundamental need in tumor genome analysis. With the advancement of next-generation sequencing, computational methods have recently been developed to infer t...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/1471-2164-16-S2-S1
update_date: 2015-01-01 00:00:00
abstract:BACKGROUND:Adenosine to inosine (A-to-I) RNA-editing is an essential post-transcriptional mechanism that occurs in numerous sites in the human transcriptome, mainly within Alu repeats. It has been shown to have consistent levels of editing across individuals in a few targets in the human brain and altered in several hu...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/1471-2164-11-608
update_date: 2010-10-28 00:00:00
abstract:BACKGROUND:Mitochondrial genomic sequences are known to be variable. Comparative analyses of mitochondrial genomes can reveal the nature and extent of their variation. RESULTS:Draft mitochondrial genomes of 16 Tremella fuciformis isolates (TF01-TF16) were assembled from Illumina and PacBio sequencing data. Mitochondri...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/s12864-020-06846-x
update_date: 2020-06-24 00:00:00
abstract:BACKGROUND:Alternative splicing (AS) is an important regulatory mechanism that greatly contributes to eukaryotic transcriptome diversity. A substantial amount of evidence has demonstrated that AS complexity is relevant to eukaryotic evolution, development, adaptation, and complexity. In this study, six teosinte and ten...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/s12864-015-1582-5
update_date: 2015-05-08 00:00:00
abstract:BACKGROUND:The ability to rapidly map millions of oligonucleotide fragments to a reference genome is crucial to many high throughput genomic technologies. RESULTS:We propose an intuitive and efficient algorithm, titled extreme MApping of OligoNucleotide (xMAN), to rapidly map millions of oligonucleotide fragments to a...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/1471-2164-9-S1-S20
update_date: 2008-01-01 00:00:00
abstract:BACKGROUND:Transposable elements are selfish genetic sequences which only occasionally provide useful functions to their host species. In addition, models of mobile element evolution assume a second type of selfishness: elements of different families do not cooperate, but they independently fight for their survival in ...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/1471-2164-9-219
update_date: 2008-05-14 00:00:00
abstract:BACKGROUND:Calcineurin B-like protein (CBL)-interacting protein kinases (CIPKs) are the primary components of calcium sensors, and play crucial roles in plant developmental processes, hormone signaling transduction, and in the response to exogenous stresses. RESULTS:In this study, 48 CIPK genes (SsCIPKs) were identifi...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/s12864-020-07264-9
update_date: 2020-12-07 00:00:00
abstract:BACKGROUND:Atlantic salmon have been subject to domestication for approximately ten generations, beginning in the early 1970s. This process of artificial selection will have created various genetic differences between wild and farmed stocks. Each year, hundreds of thousands of farmed fish escape into the wild. These es...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/1471-2164-15-884
update_date: 2014-10-09 00:00:00
abstract:BACKGROUND:Cotton (Gossypium spp.) is commonly grouped into eight diploid genomic groups and an allotetraploid genomic group, AD. The mitochondrial genomes supply new information to understand both the evolution process and the mechanism of cytoplasmic male sterility. Based on previously released mitochondrial genomes ...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/s12864-017-4282-5
update_date: 2017-11-13 00:00:00
abstract:BACKGROUND:Reads assignment to taxonomic units is a key step in microbiome analysis pipelines. To date, accurate taxonomy annotation of 16S reads, particularly at species rank, is still challenging due to the short size of read sequences and differently curated classification databases. The close phylogenetic relations...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/s12864-019-5914-8
update_date: 2019-07-08 00:00:00
abstract:BACKGROUND:Ammopiptanthus mongolicus (Maxim. ex Kom.) Cheng f., an evergreen broadleaf legume shrub, is distributed in Mid-Asia where the temperature can be as low as -30°C during the winter. Although A. mongolicus is an ideal model to study the plant response to cold stress, insufficient genomic resources for this spe...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/1471-2164-14-488
update_date: 2013-07-18 00:00:00
abstract:BACKGROUND:In plants carotenoids play an important role in the photosynthetic process and photo-oxidative protection, and are the substrate for the synthesis of abscisic acid and strigolactones. In addition to their protective role as antioxidants and precursors of vitamin A, in wheat carotenoids are important as they ...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/s12864-016-3395-6
update_date: 2017-01-31 00:00:00
abstract:BACKGROUND:Matrix attachment regions (MAR) are the sites on genomic DNA that interact with the nuclear matrix. There is increasing evidence for the involvement of MAR in regulation of gene expression. The unsuitability of experimental detection of MAR for genome-wide analyses has led to the development of computational...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/1471-2164-8-418
update_date: 2007-11-15 00:00:00
abstract:BACKGROUND:Mango fruits contain a broad spectrum of phenolic compounds which impart potential health benefits; their biosynthesis is catalysed by enzymes in the phenylpropanoid-flavonoid (PF) pathway. The aim of this study was to reveal the variability in genes involved in the PF pathway in three different mango variet...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/s12864-015-1784-x
update_date: 2015-07-30 00:00:00
abstract:BACKGROUND:The origin and importance of exon-intron architecture comprises one of the remaining mysteries of gene evolution. Several studies have investigated the variations of intron length, GC content, ordinal position in a gene and divergence. However, there is little study about the structural variation of exons an...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/1471-2164-10-47
update_date: 2009-01-24 00:00:00
abstract:BACKGROUND:Chronic Allograft Nephropathy (CAN) is a clinical entity of progressive kidney transplant injury. The defining histology is tubular atrophy with interstitial fibrosis (IFTA). Using a meta-analysis of microarrays from 84 kidney transplant biopsies, we revealed growth factor and integrin adhesion molecule path...
journal_title:BMC genomics
pub_type: Journal Article, Meta-Analysis
doi:10.1186/1471-2164-14-275
update_date: 2013-04-23 00:00:00
abstract:BACKGROUND:Gene ontology analysis using the microarray database generated in a previous study by this laboratory was used to further evaluate how maternal dietary supplementation with pyridoxine combined with different sources of selenium (Se) affected global gene expression of expanded porcine blastocysts. Data were g...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/s12864-018-5237-1
update_date: 2018-11-21 00:00:00
abstract:BACKGROUND:Bacteria belonging to the Rhodococcus genus play an important role in the degradation of many contaminants, including methylbenzenes. These bacteria, widely distributed in the environment, are known to be a powerhouse of numerous degradation functions, due to their ability to metabolize a wide range of organ...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/s12864-018-4965-6
update_date: 2018-08-06 00:00:00
abstract:BACKGROUND:Hepatitis C virus (HCV) is a rapidly evolving RNA virus that has been classified into seven genotypes. All HCV genotypes cause chronic hepatitis, which ultimately leads to liver diseases such as cirrhosis. The genotypes are unevenly distributed across the globe, with genotypes 1 and 3 being the most prevalen...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/s12864-016-2575-8
update_date: 2016-03-17 00:00:00
abstract:BACKGROUND:Miniature inverted repeat transposable element (MITE) is one type of transposable element (TE), which is largely found in eukaryotic genomes and involved in a wide variety of biological events. However, only few MITEs were proved to be currently active and their physiological function remains largely unknown...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/1471-2164-13-135
update_date: 2012-04-13 00:00:00
abstract:BACKGROUND:Senegalese sole (Solea senegalensis) and common sole (S. solea) are two economically and evolutionary important flatfish species both in fisheries and aquaculture. Although some genomic resources and tools were recently described in these species, further sequencing efforts are required to establish a comple...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/1471-2164-15-952
update_date: 2014-11-03 00:00:00
abstract:BACKGROUND:The obligate intracellular parasite Toxoplasma gondii establishes a life-long chronic infection within any warm-blooded host. After ingestion of an encysted parasite, T. gondii disseminates throughout the body as a rapidly replicating form during acute infection. Over time and after stimulation of the host i...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/1471-2164-15-806
update_date: 2014-09-20 00:00:00
abstract:BACKGROUND:Following the association of Cronobacter spp. to several publicized fatal outbreaks in neonatal intensive care units of meningitis and necrotising enterocolitis, the World Health Organization (WHO) in 2004 requested the establishment of a molecular typing scheme to enable the international control of the org...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/1471-2164-15-1121
update_date: 2014-12-16 00:00:00
abstract:BACKGROUND:Obesity affects quality of life and life expectancy and is associated with cardiovascular disorders, cancer, diabetes, reproductive disorders in women, prostate diseases in men, and congenital anomalies in children. The use of single nucleotide polymorphism (SNP) markers of diseases and drug responses (i.e.,...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/1471-2164-16-S13-S5
update_date: 2015-01-01 00:00:00
abstract:BACKGROUND:Endogenous α-synuclein (α-Syn) is involved in many pathophysiological processes in the secondary injury stage after acute spinal cord injury (SCI), and the mechanism governing these functions has not been thoroughly elucidated to date. This research aims to characterize the effect of α-Syn knockdown on trans...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/s12864-019-6244-6
update_date: 2019-11-14 00:00:00
abstract:BACKGROUND:The MYB superfamily is one of the most abundant transcription factor (TF) families in plants. MYB proteins include highly conserved N-terminal MYB repeats (1R, R2R3, 3R, and atypical) and various C-terminal sequences that confer extensive functions. However, the functions of most MYB genes are unknown, and h...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/s12864-015-1216-y
update_date: 2015-01-23 00:00:00
abstract:BACKGROUND:The phylum Platyhelminthes (flatworms) contains an important group of bilaterian organisms responsible for many debilitating and chronic infectious diseases of human and animal populations inhabiting the planet today. In addition to their biomedical and veterinary relevance, some platyhelminths are also freq...
journal_title:BMC genomics
pub_type: Journal Article
doi:10.1186/1471-2164-14-462
update_date: 2013-07-09 00:00:00