Effort required to finish shotgun-generated genome sequences differs significantly among vertebrates.

Abstract:

BACKGROUND: The approaches for shotgun-based sequencing of vertebrate genomes are now well-established, and have resulted in the generation of numerous draft whole-genome sequence assemblies. In contrast, the process of refining those assemblies to improve contiguity and increase accuracy (known as 'sequence finishing') remains tedious, labor-intensive, and expensive. As a result, the vast majority of vertebrate genome sequences generated to date remain at a draft stage.
RESULTS: To date, our genome sequencing efforts have focused on comparative studies of targeted genomic regions, requiring sequence finishing of large blocks of orthologous sequence (average size 0.5-2 Mb) from various subsets of 75 vertebrates. This experience has provided a unique opportunity to compare the relative effort required to finish shotgun-generated genome sequence assemblies from different species, which we report here. Importantly, we found that the sequence assemblies generated for the same orthologous regions from various vertebrates show substantial variation with respect to misassemblies and, in particular, the frequency and characteristics of sequence gaps. As a consequence, the work required to finish different species' sequences varied greatly. Application of the same standardized methods for finishing provided a novel opportunity to "assay" characteristics of genome sequences among many vertebrate species. It is important to note that many of the problems we have encountered during sequence finishing reflect unique architectural features of a particular vertebrate's genome, which in some cases may have important functional and/or evolutionary implications. Finally, based on our analyses, we have been able to improve our procedures to overcome some of these problems and to increase the overall efficiency of the sequence-finishing process, although significant challenges still remain.
CONCLUSION: Our findings have important implications for the eventual finishing of the draft whole-genome sequences that have now been generated for a large number of vertebrates.

journal_name

BMC Genomics

journal_title

BMC Genomics

authors

Blakesley RW,Hansen NF,Gupta J,McDowell JC,Maskeri B,Barnabas BB,Brooks SY,Coleman H,Haghighi P,Ho SL,Schandler K,Stantripop S,Vogt JL,Thomas PJ,NISC Comparative Sequencing Program,Bouffard GG,Green ED

doi

10.1186/1471-2164-11-21

subject

Has Abstract

pub_date

2010-01-11 00:00:00

pages

21

issn

1471-2164

pii

1471-2164-11-21

journal_volume

11

pub_type

Journal Article
  • Relating past and present diet to phenotypic and transcriptomic variation in the fruit fly.

    abstract:BACKGROUND:Sub-optimal developmental diets often have adverse effects on long-term fitness and health. One hypothesis is that such effects are caused by mismatches between the developmental and adult environment, and may be mediated by persistent changes in gene expression. However, there are few experimental tests of ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-017-3968-z

    authors: May CM,Zwaan BJ

    update_date:2017-08-22 00:00:00

  • BAC library resources for map-based cloning and physical map construction in barley (Hordeum vulgare L.).

    abstract:BACKGROUND:Although second generation sequencing (2GS) technologies allow re-sequencing of previously gold-standard-sequenced genomes, whole genome shotgun sequencing and de novo assembly of large and complex eukaryotic genomes is still difficult. Availability of a genome-wide physical map is therefore still a prerequi...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-12-247

    authors: Schulte D,Ariyadasa R,Shi B,Fleury D,Saski C,Atkins M,deJong P,Wu CC,Graner A,Langridge P,Stein N

    update_date:2011-05-19 00:00:00

  • Analysis of gene expression in soybean (Glycine max) roots in response to the root knot nematode Meloidogyne incognita using microarrays and KEGG pathways.

    abstract:BACKGROUND:Root-knot nematodes are sedentary endoparasites that can infect more than 3000 plant species. Root-knot nematodes cause an estimated $100 billion annual loss worldwide. For successful establishment of the root-knot nematode in its host plant, it causes dramatic morphological and physiological changes in plan...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-12-220

    authors: Ibrahim HM,Hosseini P,Alkharouf NW,Hussein EH,Gamal El-Din Ael K,Aly MA,Matthews BF

    update_date:2011-05-10 00:00:00

  • MixClone: a mixture model for inferring tumor subclonal populations.

    abstract:BACKGROUND:Tumor genomes are often highly heterogeneous, consisting of genomes from multiple subclonal types. Complete characterization of all subclonal types is a fundamental need in tumor genome analysis. With the advancement of next-generation sequencing, computational methods have recently been developed to infer t...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-16-S2-S1

    authors: Li Y,Xie X

    update_date:2015-01-01 00:00:00

  • Consistent levels of A-to-I RNA editing across individuals in coding sequences and non-conserved Alu repeats.

    abstract:BACKGROUND:Adenosine to inosine (A-to-I) RNA-editing is an essential post-transcriptional mechanism that occurs in numerous sites in the human transcriptome, mainly within Alu repeats. It has been shown to have consistent levels of editing across individuals in a few targets in the human brain and altered in several hu...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-608

    authors: Greenberger S,Levanon EY,Paz-Yaacov N,Barzilai A,Safran M,Osenberg S,Amariglio N,Rechavi G,Eisenberg E

    update_date:2010-10-28 00:00:00

  • Intra-specific comparison of mitochondrial genomes reveals host gene fragment exchange via intron mobility in Tremella fuciformis.

    abstract:BACKGROUND:Mitochondrial genomic sequences are known to be variable. Comparative analyses of mitochondrial genomes can reveal the nature and extent of their variation. RESULTS:Draft mitochondrial genomes of 16 Tremella fuciformis isolates (TF01-TF16) were assembled from Illumina and PacBio sequencing data. Mitochondri...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-020-06846-x

    authors: Deng Y,Zhang X,Xie B,Lin L,Hsiang T,Lin X,Lin Y,Zhang X,Ma Y,Miao W,Ming R

    update_date:2020-06-24 00:00:00

  • Comparative transcriptomics uncovers alternative splicing changes and signatures of selection from maize improvement.

    abstract:BACKGROUND:Alternative splicing (AS) is an important regulatory mechanism that greatly contributes to eukaryotic transcriptome diversity. A substantial amount of evidence has demonstrated that AS complexity is relevant to eukaryotic evolution, development, adaptation, and complexity. In this study, six teosinte and ten...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1582-5

    authors: Huang J,Gao Y,Jia H,Liu L,Zhang D,Zhang Z

    update_date:2015-05-08 00:00:00

  • xMAN: extreme MApping of OligoNucleotides.

    abstract:BACKGROUND:The ability to rapidly map millions of oligonucleotide fragments to a reference genome is crucial to many high throughput genomic technologies. RESULTS:We propose an intuitive and efficient algorithm, titled extreme MApping of OligoNucleotide (xMAN), to rapidly map millions of oligonucleotide fragments to a...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-S1-S20

    authors: Li W,Carroll JS,Brown M,Liu XS

    update_date:2008-01-01 00:00:00

  • How Athila retrotransposons survive in the Arabidopsis genome.

    abstract:BACKGROUND:Transposable elements are selfish genetic sequences which only occasionally provide useful functions to their host species. In addition, models of mobile element evolution assume a second type of selfishness: elements of different families do not cooperate, but they independently fight for their survival in ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-219

    authors: Marco A,Marín I

    update_date:2008-05-14 00:00:00

  • New insights into the evolution and functional divergence of the CIPK gene family in Saccharum.

    abstract:BACKGROUND:Calcineurin B-like protein (CBL)-interacting protein kinases (CIPKs) are the primary components of calcium sensors, and play crucial roles in plant developmental processes, hormone signaling transduction, and in the response to exogenous stresses. RESULTS:In this study, 48 CIPK genes (SsCIPKs) were identifi...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-020-07264-9

    authors: Su W,Ren Y,Wang D,Huang L,Fu X,Ling H,Su Y,Huang N,Tang H,Xu L,Que Y

    update_date:2020-12-07 00:00:00

  • A comparison of gene transcription profiles of domesticated and wild Atlantic salmon (Salmo salar L.) at early life stages, reared under controlled conditions.

    abstract:BACKGROUND:Atlantic salmon have been subject to domestication for approximately ten generations, beginning in the early 1970s. This process of artificial selection will have created various genetic differences between wild and farmed stocks. Each year, hundreds of thousands of farmed fish escape into the wild. These es...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-884

    authors: Bicskei B,Bron JE,Glover KA,Taggart JB

    update_date:2014-10-09 00:00:00

  • Rapid evolutionary divergence of diploid and allotetraploid Gossypium mitochondrial genomes.

    abstract:BACKGROUND:Cotton (Gossypium spp.) is commonly grouped into eight diploid genomic groups and an allotetraploid genomic group, AD. The mitochondrial genomes supply new information to understand both the evolution process and the mechanism of cytoplasmic male sterility. Based on previously released mitochondrial genomes ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-017-4282-5

    authors: Chen Z,Nie H,Wang Y,Pei H,Li S,Zhang L,Hua J

    update_date:2017-11-13 00:00:00

  • DAIRYdb: a manually curated reference database for improved taxonomy annotation of 16S rRNA gene sequences from dairy products.

    abstract:BACKGROUND:Reads assignment to taxonomic units is a key step in microbiome analysis pipelines. To date, accurate taxonomy annotation of 16S reads, particularly at species rank, is still challenging due to the short size of read sequences and differently curated classification databases. The close phylogenetic relations...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-5914-8

    authors: Meola M,Rifa E,Shani N,Delbès C,Berthoud H,Chassard C

    update_date:2019-07-08 00:00:00

  • De novo sequencing and transcriptome analysis of the desert shrub, Ammopiptanthus mongolicus, during cold acclimation using Illumina/Solexa.

    abstract:BACKGROUND:Ammopiptanthus mongolicus (Maxim. ex Kom.) Cheng f., an evergreen broadleaf legume shrub, is distributed in Mid-Asia where the temperature can be as low as -30°C during the winter. Although A. mongolicus is an ideal model to study the plant response to cold stress, insufficient genomic resources for this spe...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-488

    authors: Pang T,Ye CY,Xia X,Yin W

    update_date:2013-07-18 00:00:00

  • The carotenoid biosynthetic and catabolic genes in wheat and their association with yellow pigments.

    abstract:BACKGROUND:In plants carotenoids play an important role in the photosynthetic process and photo-oxidative protection, and are the substrate for the synthesis of abscisic acid and strigolactones. In addition to their protective role as antioxidants and precursors of vitamin A, in wheat carotenoids are important as they ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-3395-6

    authors: Colasuonno P,Lozito ML,Marcotuli I,Nigro D,Giancaspro A,Mangini G,De Vita P,Mastrangelo AM,Pecchioni N,Houston K,Simeone R,Gadaleta A,Blanco A

    update_date:2017-01-31 00:00:00

  • Association of the matrix attachment region recognition signature with coding regions in Caenorhabditis elegans.

    abstract:BACKGROUND:Matrix attachment regions (MAR) are the sites on genomic DNA that interact with the nuclear matrix. There is increasing evidence for the involvement of MAR in regulation of gene expression. The unsuitability of experimental detection of MAR for genome-wide analyses has led to the development of computational...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-8-418

    authors: Anthony A,Blaxter M

    update_date:2007-11-15 00:00:00

  • Sequence diversity and differential expression of major phenylpropanoid-flavonoid biosynthetic genes among three mango varieties.

    abstract:BACKGROUND:Mango fruits contain a broad spectrum of phenolic compounds which impart potential health benefits; their biosynthesis is catalysed by enzymes in the phenylpropanoid-flavonoid (PF) pathway. The aim of this study was to reveal the variability in genes involved in the PF pathway in three different mango variet...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1784-x

    authors: Hoang VL,Innes DJ,Shaw PN,Monteith GR,Gidley MJ,Dietzgen RG

    update_date:2015-07-30 00:00:00

  • Patterns of exon-intron architecture variation of genes in eukaryotic genomes.

    abstract:BACKGROUND:The origin and importance of exon-intron architecture comprises one of the remaining mysteries of gene evolution. Several studies have investigated the variations of intron length, GC content, ordinal position in a gene and divergence. However, there is little study about the structural variation of exons an...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-10-47

    authors: Zhu L,Zhang Y,Zhang W,Yang S,Chen JQ,Tian D

    update_date:2009-01-24 00:00:00

  • Genomic meta-analysis of growth factor and integrin pathways in chronic kidney transplant injury.

    abstract:BACKGROUND:Chronic Allograft Nephropathy (CAN) is a clinical entity of progressive kidney transplant injury. The defining histology is tubular atrophy with interstitial fibrosis (IFTA). Using a meta-analysis of microarrays from 84 kidney transplant biopsies, we revealed growth factor and integrin adhesion molecule path...

    journal_title:BMC genomics

    pub_type: Journal Article,Meta-Analysis

    doi:10.1186/1471-2164-14-275

    authors: Dosanjh A,Robison E,Mondala T,Head SR,Salomon DR,Kurian SM

    update_date:2013-04-23 00:00:00

  • Gene ontology analysis of expanded porcine blastocysts from gilts fed organic or inorganic selenium combined with pyridoxine.

    abstract:BACKGROUND:Gene ontology analysis using the microarray database generated in a previous study by this laboratory was used to further evaluate how maternal dietary supplementation with pyridoxine combined with different sources of selenium (Se) affected global gene expression of expanded porcine blastocysts. Data were g...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-018-5237-1

    authors: Dalto DB,Tsoi S,Dyck MK,Matte JJ

    update_date:2018-11-21 00:00:00

  • Genome-based analysis for the identification of genes involved in o-xylene degradation in Rhodococcus opacus R7.

    abstract:BACKGROUND:Bacteria belonging to the Rhodococcus genus play an important role in the degradation of many contaminants, including methylbenzenes. These bacteria, widely distributed in the environment, are known to be a powerhouse of numerous degradation functions, due to their ability to metabolize a wide range of organ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-018-4965-6

    authors: Di Canito A,Zampolli J,Orro A,D'Ursi P,Milanesi L,Sello G,Steinbüchel A,Di Gennaro P

    update_date:2018-08-06 00:00:00

  • A method for near full-length amplification and sequencing for six hepatitis C virus genotypes.

    abstract:BACKGROUND:Hepatitis C virus (HCV) is a rapidly evolving RNA virus that has been classified into seven genotypes. All HCV genotypes cause chronic hepatitis, which ultimately leads to liver diseases such as cirrhosis. The genotypes are unevenly distributed across the globe, with genotypes 1 and 3 being the most prevalen...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-2575-8

    authors: Bull RA,Eltahla AA,Rodrigo C,Koekkoek SM,Walker M,Pirozyan MR,Betz-Stablein B,Toepfer A,Laird M,Oh S,Heiner C,Maher L,Schinkel J,Lloyd AR,Luciani F

    update_date:2016-03-17 00:00:00

  • A Gaijin-like miniature inverted repeat transposable element is mobilized in rice during cell differentiation.

    abstract:BACKGROUND:Miniature inverted repeat transposable element (MITE) is one type of transposable element (TE), which is largely found in eukaryotic genomes and involved in a wide variety of biological events. However, only few MITEs were proved to be currently active and their physiological function remains largely unknown...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-135

    authors: Dong HT,Zhang L,Zheng KL,Yao HG,Chen J,Yu FC,Yu XX,Mao BZ,Zhao D,Yao J,Li DB

    update_date:2012-04-13 00:00:00

  • De novo assembly, characterization and functional annotation of Senegalese sole (Solea senegalensis) and common sole (Solea solea) transcriptomes: integration in a database and design of a microarray.

    abstract:BACKGROUND:Senegalese sole (Solea senegalensis) and common sole (S. solea) are two economically and evolutionary important flatfish species both in fisheries and aquaculture. Although some genomic resources and tools were recently described in these species, further sequencing efforts are required to establish a comple...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-952

    authors: Benzekri H,Armesto P,Cousin X,Rovira M,Crespo D,Merlo MA,Mazurais D,Bautista R,Guerrero-Fernández D,Fernandez-Pozo N,Ponce M,Infante C,Zambonino JL,Nidelet S,Gut M,Rebordinos L,Planas JV,Bégout ML,Claros MG,Manchado

    update_date:2014-11-03 00:00:00

  • Dual transcriptional profiling of mice and Toxoplasma gondii during acute and chronic infection.

    abstract:BACKGROUND:The obligate intracellular parasite Toxoplasma gondii establishes a life-long chronic infection within any warm-blooded host. After ingestion of an encysted parasite, T. gondii disseminates throughout the body as a rapidly replicating form during acute infection. Over time and after stimulation of the host i...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-806

    authors: Pittman KJ,Aliota MT,Knoll LJ

    update_date:2014-09-20 00:00:00

  • Cronobacter, the emergent bacterial pathogen Enterobacter sakazakii comes of age; MLST and whole genome sequence analysis.

    abstract:BACKGROUND:Following the association of Cronobacter spp. to several publicized fatal outbreaks in neonatal intensive care units of meningitis and necrotising enterocolitis, the World Health Organization (WHO) in 2004 requested the establishment of a molecular typing scheme to enable the international control of the org...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-1121

    authors: Forsythe SJ,Dickins B,Jolley KA

    update_date:2014-12-16 00:00:00

  • Obesity-related known and candidate SNP markers can significantly change affinity of TATA-binding protein for human gene promoters.

    abstract:BACKGROUND:Obesity affects quality of life and life expectancy and is associated with cardiovascular disorders, cancer, diabetes, reproductive disorders in women, prostate diseases in men, and congenital anomalies in children. The use of single nucleotide polymorphism (SNP) markers of diseases and drug responses (i.e.,...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-16-S13-S5

    authors: Arkova OV,Ponomarenko MP,Rasskazov DA,Drachkova IA,Arshinova TV,Ponomarenko PM,Savinkova LK,Kolchanov NA

    update_date:2015-01-01 00:00:00

  • Transcriptomic analysis of α-synuclein knockdown after T3 spinal cord injury in rats.

    abstract:BACKGROUND:Endogenous α-synuclein (α-Syn) is involved in many pathophysiological processes in the secondary injury stage after acute spinal cord injury (SCI), and the mechanism governing these functions has not been thoroughly elucidated to date. This research aims to characterize the effect of α-Syn knockdown on trans...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-6244-6

    authors: Zeng H,Yu BF,Liu N,Yang YY,Xing HY,Liu XX,Zhou MW

    update_date:2019-11-14 00:00:00

  • Genome-wide analysis of the R2R3-MYB transcription factor genes in Chinese cabbage (Brassica rapa ssp. pekinensis) reveals their stress and hormone responsive patterns.

    abstract:BACKGROUND:The MYB superfamily is one of the most abundant transcription factor (TF) families in plants. MYB proteins include highly conserved N-terminal MYB repeats (1R, R2R3, 3R, and atypical) and various C-terminal sequences that confer extensive functions. However, the functions of most MYB genes are unknown, and h...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1216-y

    authors: Wang Z,Tang J,Hu R,Wu P,Hou XL,Song XM,Xiong AS

    update_date:2015-01-23 00:00:00

  • Cytosine methylation is a conserved epigenetic feature found throughout the phylum Platyhelminthes.

    abstract:BACKGROUND:The phylum Platyhelminthes (flatworms) contains an important group of bilaterian organisms responsible for many debilitating and chronic infectious diseases of human and animal populations inhabiting the planet today. In addition to their biomedical and veterinary relevance, some platyhelminths are also freq...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-462

    authors: Geyer KK,Chalmers IW,Mackintosh N,Hirst JE,Geoghegan R,Badets M,Brophy PM,Brehm K,Hoffmann KF

    update_date:2013-07-09 00:00:00