Abstract:
BACKGROUND:Most published genome sequences are drafts, and most are dominated by computational gene prediction. Draft genomes typically incorporate considerable sequence data that are not assigned to chromosomes, and predicted genes without quality confidence measures. The current Actinidia chinensis (kiwifruit) 'Hongyang' draft genome has 164 Mb of sequences unassigned to pseudo-chromosomes, and omissions have been identified in the gene models. RESULTS:A second genome of an A. chinensis (genotype Red5) was fully sequenced. This new sequence resulted in a 554.0 Mb assembly with all but 6 Mb assigned to pseudo-chromosomes. Pseudo-chromosomal comparisons showed a considerable number of translocation events have occurred following a whole genome duplication (WGD) event some consistent with centromeric Robertsonian-like translocations. RNA sequencing data from 12 tissues and ab initio analysis informed a genome-wide manual annotation, using the WebApollo tool. In total, 33,044 gene loci represented by 33,123 isoforms were identified, named and tagged for quality of evidential support. Of these 3114 (9.4%) were identical to a protein within 'Hongyang' The Kiwifruit Information Resource (KIR v2). Some proportion of the differences will be varietal polymorphisms. However, as most computationally predicted Red5 models required manual re-annotation this proportion is expected to be small. The quality of the new gene models was tested by fully sequencing 550 cloned 'Hort16A' cDNAs and comparing with the predicted protein models for Red5 and both the original 'Hongyang' assembly and the revised annotation from KIR v2. Only 48.9% and 63.5% of the cDNAs had a match with 90% identity or better to the original and revised 'Hongyang' annotation, respectively, compared with 90.9% to the Red5 models. CONCLUSIONS:Our study highlights the need to take a cautious approach to draft genomes and computationally predicted genes. Our use of the manual annotation tool WebApollo facilitated manual checking and correction of gene models enabling improvement of computational prediction. This utility was especially relevant for certain types of gene families such as the EXPANSIN like genes. Finally, this high quality gene set will supply the kiwifruit and general plant community with a new tool for genomics and other comparative analysis.
journal_name
BMC Genomicsjournal_title
BMC genomicsauthors
Pilkington SM,Crowhurst R,Hilario E,Nardozza S,Fraser L,Peng Y,Gunaseelan K,Simpson R,Tahir J,Deroles SC,Templeton K,Luo Z,Davy M,Cheng C,McNeilage M,Scaglione D,Liu Y,Zhang Q,Datson P,De Silva N,Gardiner SE,Basdoi
10.1186/s12864-018-4656-3subject
Has Abstractpub_date
2018-04-16 00:00:00pages
257issue
1issn
1471-2164pii
10.1186/s12864-018-4656-3journal_volume
19pub_type
杂志文章相关文献
BMC GENOMICS文献大全abstract:BACKGROUND:Evolution of novel protein-coding genes is the bedrock of adaptive evolution. Recently, we identified six protein-coding genes with similar signal sequence from Schistosoma japonicum egg stage mRNA using signal sequence trap (SST). To find the mechanism underlying the origination of these genes with similar ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-13-260
更新日期:2012-06-20 00:00:00
abstract:BACKGROUND:The completed sequence of the Anopheles gambiae genome has enabled genome-wide analyses of gene expression and regulation in this principal vector of human malaria. These investigations have created a demand for efficient methods of cataloguing and analyzing the large quantities of data that have been produc...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-7-116
更新日期:2006-05-17 00:00:00
abstract:BACKGROUND:Much of the morphological diversity in eukaryotes results from differential regulation of gene expression in which transcription factors (TFs) play a central role. The nematode Caenorhabditis elegans is an established model organism for the study of the roles of TFs in controlling the spatiotemporal pattern ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-9-399
更新日期:2008-08-27 00:00:00
abstract:BACKGROUND:Systems biology and functional genomics require genome-wide datasets and resources. Complete sets of cloned open reading frames (ORFs) have been made for about a dozen bacterial species and allow researchers to express and study complete proteomes in a high-throughput fashion. RESULTS:We have constructed an...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-11-470
更新日期:2010-08-11 00:00:00
abstract:BACKGROUND:Sequencing data from The Cancer Genome Atlas (TGCA), the International Cancer Genome Consortium and other research institutes have revealed the presence of genetic alterations in several tumor types, including gastric cancer. These data have been combined into a catalog of significantly mutated genes for eac...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-3166-4
更新日期:2016-10-26 00:00:00
abstract:BACKGROUND:Changes in gene regulation are thought to be crucial for the adaptation of organisms to their environment. Transcriptome analyses can be used to identify candidate genes for ecological adaptation, but can be complicated by variation in gene expression between tissues, sexes, or individuals. Here we use high-...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-13-654
更新日期:2012-11-21 00:00:00
abstract:BACKGROUND:Bacteriophages are the most abundant and diverse entities in the biosphere, and this diversity is driven by constant predator-prey evolutionary dynamics and horizontal gene transfer. Phage genome sequences are under-sampled and therefore present an untapped and uncharacterized source of genetic diversity, ty...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-019-5647-8
更新日期:2019-05-09 00:00:00
abstract:BACKGROUND:Adenosine to inosine (A-to-I) RNA-editing is an essential post-transcriptional mechanism that occurs in numerous sites in the human transcriptome, mainly within Alu repeats. It has been shown to have consistent levels of editing across individuals in a few targets in the human brain and altered in several hu...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-11-608
更新日期:2010-10-28 00:00:00
abstract:BACKGROUND:The ballan wrasse (Labrus bergylta) belongs to a large teleost family containing more than 600 species showing several unique evolutionary traits such as lack of stomach and hermaphroditism. Agastric fish are found throughout the teleost phylogeny, in quite diverse and unrelated lineages, indicating stomach ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-018-4570-8
更新日期:2018-03-06 00:00:00
abstract:BACKGROUND:Current technologies in next-generation sequencing are offering high throughput reads at low costs, but still suffer from various sequencing errors. Although pyro- and ion semiconductor sequencing both have the advantage of delivering long and high quality reads, problems might occur when sequencing homopoly...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-018-4544-x
更新日期:2018-02-21 00:00:00
abstract:BACKGROUND:Although prokaryotic gene transcription has been studied over decades, many aspects of the process remain poorly understood. Particularly, recent studies have revealed that transcriptomes in many prokaryotes are far more complex than previously thought. Genes in an operon are often alternatively and dynamica...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-14-520
更新日期:2013-07-30 00:00:00
abstract:BACKGROUND:The Pregnancy-associated glycoproteins (PAGs) belong to a large family of aspartic peptidases expressed exclusively in the placenta of species in the Artiodactyla order. In cattle, the PAG gene family is comprised of at least 22 transcribed genes, as well as some variants. Phylogenetic analyses have shown th...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-10-185
更新日期:2009-04-24 00:00:00
abstract:BACKGROUND:An underlying tenet of the epigenetic code hypothesis is the existence of protein domains that can recognize various chromatin structures. To date, two major candidates have emerged: (i) the bromodomain, which can recognize certain acetylation marks and (ii) the chromodomain, which can recognize certain meth...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-7-6
更新日期:2006-01-16 00:00:00
abstract:BACKGROUND:Nonspecific hybridization is currently a major concern with microarray technology. One of most effective approaches to estimating nonspecific hybridizations in oligonucleotide microarrays is the utilization of mismatch probes; however, this approach has not been used for longer oligonucleotide probes. RESUL...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-9-491
更新日期:2008-10-17 00:00:00
abstract:BACKGROUND:Oligonucleotide microarray-based comparative genomic hybridization (CGH) offers an attractive possible route for the rapid and cost-effective genome-wide discovery of deletion mutations. CGH typically involves comparison of the hybridization intensities of genomic DNA samples with microarray chip representat...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-224
更新日期:2014-03-22 00:00:00
abstract:BACKGROUND:Uranium (U) is a naturally occurring radionuclide that has been found in the aquatic environment due to anthropogenic activities. Exposure to U may pose risk to aquatic organisms due to its radiological and chemical toxicity. The present study aimed to characterize the chemical toxicity of U in Atlantic salm...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-694
更新日期:2014-08-20 00:00:00
abstract:BACKGROUND:The golden birdwing butterfly (Troides aeacus formosanus) is a rarely observed species in Taiwan. Recently, a typical symptom of nuclear polyhedrosis was found in reared T. aeacus larvae. From the previous Kimura-2 parameter (K-2-P) analysis based on the nucleotide sequence of three genes in this isolate, po...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-019-5713-2
更新日期:2019-05-27 00:00:00
abstract:BACKGROUND:Interpreting gene expression profiles obtained from heterogeneous samples can be difficult because bulk gene expression measures are not resolved to individual cell populations. We have recently devised Population-Specific Expression Analysis (PSEA), a statistical method that identifies individual cell types...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-13-610
更新日期:2012-11-12 00:00:00
abstract:BACKGROUND:Poxviruses constitute one of the largest and most complex animal virus families known. The notorious smallpox disease has been eradicated and the virus contained, but its simian sister, monkeypox is an emerging, untreatable infectious disease, killing 1 to 10 % of its human victims. In the case of poxviruses...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-2826-8
更新日期:2016-08-31 00:00:00
abstract:BACKGROUND:Moraxella catarrhalis is an important pathogen that often causes otitis media in children, a disease that is not currently vaccine preventable. Asymptomatic colonisation of the human upper respiratory tract is common and lack of clearance by the immune system is likely due to the emergence of seroresistant g...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-015-2104-1
更新日期:2015-10-24 00:00:00
abstract:BACKGROUND:There is strong but mostly circumstantial evidence that genetic factors modulate the severity of influenza infection in humans. Using genetically diverse but fully inbred strains of mice it has been shown that host sequence variants have a strong influence on the severity of influenza A disease progression. ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-13-411
更新日期:2012-08-20 00:00:00
abstract:BACKGROUND:Lipopolysaccharide (LPS) from Gram-negative bacteria cause innate immune responses in animals and plants. The molecules involved in LPS signaling in animals are well studied, whereas those in plants are not yet as well documented. Recently, we identified Arabidopsis AtLBR-2, which binds to LPS from Pseudomon...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-017-4372-4
更新日期:2017-12-29 00:00:00
abstract:BACKGROUND:The completion of the Plasmodium falciparum genome represents a milestone in malaria research. The genome sequence allows for the development of genome-wide approaches such as microarray and proteomics that will greatly facilitate our understanding of the parasite biology and accelerate new drug and vaccine ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-8-255
更新日期:2007-07-27 00:00:00
abstract:BACKGROUND:Color polymorphism in the nacre of pteriomorphian bivalves is of great interest for the pearl culture industry. The nacreous layer of the Polynesian black-lipped pearl oyster Pinctada margaritifera exhibits a large array of color variation among individuals including reflections of blue, green, yellow and pi...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-015-1776-x
更新日期:2015-08-01 00:00:00
abstract:BACKGROUND:Aspartic proteases are known to play an important role in the biology of nematode parasitism. This role is best characterised in blood-feeding nematodes, where they digest haemoglobin, but they are also likely to play important roles in the biology of nematode parasites that do not feed on blood. In the pres...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-10-611
更新日期:2009-12-16 00:00:00
abstract:BACKGROUND:Magnaporthe oryzae (anamorph Pyricularia oryzae) is the causal agent of blast disease of Poaceae crops and their wild relatives. To understand the genetic mechanisms that drive host specialization of M. oryzae, we carried out whole genome resequencing of four M. oryzae isolates from rice (Oryza sativa), one ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-2690-6
更新日期:2016-05-18 00:00:00
abstract:BACKGROUND:Genome-scale functional genomic screens across large cell line panels provide a rich resource for discovering tumor vulnerabilities that can lead to the next generation of targeted therapies. Their data analysis typically has focused on identifying genes whose knockdown enhances response in various pre-defin...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-2807-y
更新日期:2016-06-13 00:00:00
abstract:BACKGROUND:Campylobacter is the leading cause of foodborne diarrhoeal illness in humans and is mostly acquired from consumption or handling of contaminated poultry meat. In the absence of effective licensed vaccines and inhibitors, selection for chickens with increased resistance to Campylobacter could potentially redu...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-2612-7
更新日期:2016-04-18 00:00:00
abstract:BACKGROUND:Copy number variants (CNVs) have been shown to increase risk for physical anomalies, developmental, psychiatric and medical disorders. Some of them have been associated with changes in weight, height, and other physical traits. As most studies have been performed on children and young people, these effects o...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-018-5292-7
更新日期:2018-12-04 00:00:00
abstract:BACKGROUND:Cadmium (Cd) is a serious heavy metal (HM) soil pollutant. To alleviate or even eliminate HM pollution in soil, environmental-friendly methods are applied. One is that special plants are cultivated to absorb the HM in the contaminated soil. As an excellent economical plant with ornamental value and sound ada...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-019-6152-9
更新日期:2019-11-20 00:00:00