A manually annotated Actinidia chinensis var. chinensis (kiwifruit) genome highlights the challenges associated with draft genomes and gene prediction in plants.

Abstract:

BACKGROUND:Most published genome sequences are drafts, and most are dominated by computational gene prediction. Draft genomes typically incorporate considerable sequence data that are not assigned to chromosomes, and predicted genes without quality confidence measures. The current Actinidia chinensis (kiwifruit) 'Hongyang' draft genome has 164 Mb of sequences unassigned to pseudo-chromosomes, and omissions have been identified in the gene models. RESULTS:A second genome of an A. chinensis (genotype Red5) was fully sequenced. This new sequence resulted in a 554.0 Mb assembly with all but 6 Mb assigned to pseudo-chromosomes. Pseudo-chromosomal comparisons showed a considerable number of translocation events have occurred following a whole genome duplication (WGD) event some consistent with centromeric Robertsonian-like translocations. RNA sequencing data from 12 tissues and ab initio analysis informed a genome-wide manual annotation, using the WebApollo tool. In total, 33,044 gene loci represented by 33,123 isoforms were identified, named and tagged for quality of evidential support. Of these 3114 (9.4%) were identical to a protein within 'Hongyang' The Kiwifruit Information Resource (KIR v2). Some proportion of the differences will be varietal polymorphisms. However, as most computationally predicted Red5 models required manual re-annotation this proportion is expected to be small. The quality of the new gene models was tested by fully sequencing 550 cloned 'Hort16A' cDNAs and comparing with the predicted protein models for Red5 and both the original 'Hongyang' assembly and the revised annotation from KIR v2. Only 48.9% and 63.5% of the cDNAs had a match with 90% identity or better to the original and revised 'Hongyang' annotation, respectively, compared with 90.9% to the Red5 models. CONCLUSIONS:Our study highlights the need to take a cautious approach to draft genomes and computationally predicted genes. Our use of the manual annotation tool WebApollo facilitated manual checking and correction of gene models enabling improvement of computational prediction. This utility was especially relevant for certain types of gene families such as the EXPANSIN like genes. Finally, this high quality gene set will supply the kiwifruit and general plant community with a new tool for genomics and other comparative analysis.

journal_name

BMC Genomics

journal_title

BMC genomics

authors

Pilkington SM,Crowhurst R,Hilario E,Nardozza S,Fraser L,Peng Y,Gunaseelan K,Simpson R,Tahir J,Deroles SC,Templeton K,Luo Z,Davy M,Cheng C,McNeilage M,Scaglione D,Liu Y,Zhang Q,Datson P,De Silva N,Gardiner SE,Bas

doi

10.1186/s12864-018-4656-3

subject

Has Abstract

pub_date

2018-04-16 00:00:00

pages

257

issue

1

issn

1471-2164

pii

10.1186/s12864-018-4656-3

journal_volume

19

pub_type

杂志文章
  • Origin of a novel protein-coding gene family with similar signal sequence in Schistosoma japonicum.

    abstract:BACKGROUND:Evolution of novel protein-coding genes is the bedrock of adaptive evolution. Recently, we identified six protein-coding genes with similar signal sequence from Schistosoma japonicum egg stage mRNA using signal sequence trap (SST). To find the mechanism underlying the origination of these genes with similar ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-260

    authors: Mbanefo EC,Chuanxin Y,Kikuchi M,Shuaibu MN,Boamah D,Kirinoki M,Hayashi N,Chigusa Y,Osada Y,Hamano S,Hirayama K

    更新日期:2012-06-20 00:00:00

  • angaGEDUCI: Anopheles gambiae gene expression database with integrated comparative algorithms for identifying conserved DNA motifs in promoter sequences.

    abstract:BACKGROUND:The completed sequence of the Anopheles gambiae genome has enabled genome-wide analyses of gene expression and regulation in this principal vector of human malaria. These investigations have created a demand for efficient methods of cataloguing and analyzing the large quantities of data that have been produc...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-7-116

    authors: Dissanayake SN,Marinotti O,Ribeiro JM,James AA

    更新日期:2006-05-17 00:00:00

  • Comparative analysis of function and interaction of transcription factors in nematodes: extensive conservation of orthology coupled to rapid sequence evolution.

    abstract:BACKGROUND:Much of the morphological diversity in eukaryotes results from differential regulation of gene expression in which transcription factors (TFs) play a central role. The nematode Caenorhabditis elegans is an established model organism for the study of the roles of TFs in controlling the spatiotemporal pattern ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-9-399

    authors: Haerty W,Artieri C,Khezri N,Singh RS,Gupta BP

    更新日期:2008-08-27 00:00:00

  • The Escherichia coli K-12 ORFeome: a resource for comparative molecular microbiology.

    abstract:BACKGROUND:Systems biology and functional genomics require genome-wide datasets and resources. Complete sets of cloned open reading frames (ORFs) have been made for about a dozen bacterial species and allow researchers to express and study complete proteomes in a high-throughput fashion. RESULTS:We have constructed an...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-470

    authors: Rajagopala SV,Yamamoto N,Zweifel AE,Nakamichi T,Huang HK,Mendez-Rios JD,Franca-Koh J,Boorgula MP,Fujita K,Suzuki K,Hu JC,Wanner BL,Mori H,Uetz P

    更新日期:2010-08-11 00:00:00

  • Comparison between two amplicon-based sequencing panels of different scales in the detection of somatic mutations associated with gastric cancer.

    abstract:BACKGROUND:Sequencing data from The Cancer Genome Atlas (TGCA), the International Cancer Genome Consortium and other research institutes have revealed the presence of genetic alterations in several tumor types, including gastric cancer. These data have been combined into a catalog of significantly mutated genes for eac...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-3166-4

    authors: Hirotsu Y,Kojima Y,Okimoto K,Amemiya K,Mochizuki H,Omata M

    更新日期:2016-10-26 00:00:00

  • Population and sex differences in Drosophila melanogaster brain gene expression.

    abstract:BACKGROUND:Changes in gene regulation are thought to be crucial for the adaptation of organisms to their environment. Transcriptome analyses can be used to identify candidate genes for ecological adaptation, but can be complicated by variation in gene expression between tissues, sexes, or individuals. Here we use high-...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-654

    authors: Catalán A,Hutter S,Parsch J

    更新日期:2012-11-21 00:00:00

  • Staphylococci phages display vast genomic diversity and evolutionary relationships.

    abstract:BACKGROUND:Bacteriophages are the most abundant and diverse entities in the biosphere, and this diversity is driven by constant predator-prey evolutionary dynamics and horizontal gene transfer. Phage genome sequences are under-sampled and therefore present an untapped and uncharacterized source of genetic diversity, ty...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-5647-8

    authors: Oliveira H,Sampaio M,Melo LDR,Dias O,Pope WH,Hatfull GF,Azeredo J

    更新日期:2019-05-09 00:00:00

  • Consistent levels of A-to-I RNA editing across individuals in coding sequences and non-conserved Alu repeats.

    abstract:BACKGROUND:Adenosine to inosine (A-to-I) RNA-editing is an essential post-transcriptional mechanism that occurs in numerous sites in the human transcriptome, mainly within Alu repeats. It has been shown to have consistent levels of editing across individuals in a few targets in the human brain and altered in several hu...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-608

    authors: Greenberger S,Levanon EY,Paz-Yaacov N,Barzilai A,Safran M,Osenberg S,Amariglio N,Rechavi G,Eisenberg E

    更新日期:2010-10-28 00:00:00

  • Loss of stomach, loss of appetite? Sequencing of the ballan wrasse (Labrus bergylta) genome and intestinal transcriptomic profiling illuminate the evolution of loss of stomach function in fish.

    abstract:BACKGROUND:The ballan wrasse (Labrus bergylta) belongs to a large teleost family containing more than 600 species showing several unique evolutionary traits such as lack of stomach and hermaphroditism. Agastric fish are found throughout the teleost phylogeny, in quite diverse and unrelated lineages, indicating stomach ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-4570-8

    authors: Lie KK,Tørresen OK,Solbakken MH,Rønnestad I,Tooming-Klunderud A,Nederbragt AJ,Jentoft S,Sæle Ø

    更新日期:2018-03-06 00:00:00

  • Analytical parameters and validation of homopolymer detection in a pyrosequencing-based next generation sequencing system.

    abstract:BACKGROUND:Current technologies in next-generation sequencing are offering high throughput reads at low costs, but still suffer from various sequencing errors. Although pyro- and ion semiconductor sequencing both have the advantage of delivering long and high quality reads, problems might occur when sequencing homopoly...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-4544-x

    authors: Ivády G,Madar L,Dzsudzsák E,Koczok K,Kappelmayer J,Krulisova V,Macek M Jr,Horváth A,Balogh I

    更新日期:2018-02-21 00:00:00

  • Directional RNA-seq reveals highly complex condition-dependent transcriptomes in E. coli K12 through accurate full-length transcripts assembling.

    abstract:BACKGROUND:Although prokaryotic gene transcription has been studied over decades, many aspects of the process remain poorly understood. Particularly, recent studies have revealed that transcriptomes in many prokaryotes are far more complex than previously thought. Genes in an operon are often alternatively and dynamica...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-520

    authors: Li S,Dong X,Su Z

    更新日期:2013-07-30 00:00:00

  • Characterization of the bovine pregnancy-associated glycoprotein gene family--analysis of gene sequences, regulatory regions within the promoter and expression of selected genes.

    abstract:BACKGROUND:The Pregnancy-associated glycoproteins (PAGs) belong to a large family of aspartic peptidases expressed exclusively in the placenta of species in the Artiodactyla order. In cattle, the PAG gene family is comprised of at least 22 transcribed genes, as well as some variants. Phylogenetic analyses have shown th...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-10-185

    authors: Telugu BP,Walker AM,Green JA

    更新日期:2009-04-24 00:00:00

  • The Epc-N domain: a predicted protein-protein interaction domain found in select chromatin associated proteins.

    abstract:BACKGROUND:An underlying tenet of the epigenetic code hypothesis is the existence of protein domains that can recognize various chromatin structures. To date, two major candidates have emerged: (i) the bromodomain, which can recognize certain acetylation marks and (ii) the chromodomain, which can recognize certain meth...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-7-6

    authors: Perry J

    更新日期:2006-01-16 00:00:00

  • Design and analysis of mismatch probes for long oligonucleotide microarrays.

    abstract:BACKGROUND:Nonspecific hybridization is currently a major concern with microarray technology. One of most effective approaches to estimating nonspecific hybridizations in oligonucleotide microarrays is the utilization of mismatch probes; however, this approach has not been used for longer oligonucleotide probes. RESUL...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-9-491

    authors: Deng Y,He Z,Van Nostrand JD,Zhou J

    更新日期:2008-10-17 00:00:00

  • Microarray-based ultra-high resolution discovery of genomic deletion mutations.

    abstract:BACKGROUND:Oligonucleotide microarray-based comparative genomic hybridization (CGH) offers an attractive possible route for the rapid and cost-effective genome-wide discovery of deletion mutations. CGH typically involves comparison of the hybridization intensities of genomic DNA samples with microarray chip representat...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-224

    authors: Belfield EJ,Brown C,Gan X,Jiang C,Baban D,Mithani A,Mott R,Ragoussis J,Harberd NP

    更新日期:2014-03-22 00:00:00

  • Hepatic transcriptomic profiling reveals early toxicological mechanisms of uranium in Atlantic salmon (Salmo salar).

    abstract:BACKGROUND:Uranium (U) is a naturally occurring radionuclide that has been found in the aquatic environment due to anthropogenic activities. Exposure to U may pose risk to aquatic organisms due to its radiological and chemical toxicity. The present study aimed to characterize the chemical toxicity of U in Atlantic salm...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-694

    authors: Song Y,Salbu B,Teien HC,Sørlie Heier L,Rosseland BO,Høgåsen T,Tollefsen KE

    更新日期:2014-08-20 00:00:00

  • Genomic sequencing of Troides aeacus nucleopolyhedrovirus (TraeNPV) from golden birdwing larvae (Troides aeacus formosanus) to reveal defective Autographa californica NPV genomic features.

    abstract:BACKGROUND:The golden birdwing butterfly (Troides aeacus formosanus) is a rarely observed species in Taiwan. Recently, a typical symptom of nuclear polyhedrosis was found in reared T. aeacus larvae. From the previous Kimura-2 parameter (K-2-P) analysis based on the nucleotide sequence of three genes in this isolate, po...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-5713-2

    authors: Huang YF,Chen TH,Chang ZT,Wang TC,Lee SJ,Kim JC,Kim JS,Chiu KP,Nai YS

    更新日期:2019-05-27 00:00:00

  • Cell population-specific expression analysis of human cerebellum.

    abstract:BACKGROUND:Interpreting gene expression profiles obtained from heterogeneous samples can be difficult because bulk gene expression measures are not resolved to individual cell populations. We have recently devised Population-Specific Expression Analysis (PSEA), a statistical method that identifies individual cell types...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-610

    authors: Kuhn A,Kumar A,Beilina A,Dillman A,Cookson MR,Singleton AB

    更新日期:2012-11-12 00:00:00

  • Finishing monkeypox genomes from short reads: assembly analysis and a neural network method.

    abstract:BACKGROUND:Poxviruses constitute one of the largest and most complex animal virus families known. The notorious smallpox disease has been eradicated and the virus contained, but its simian sister, monkeypox is an emerging, untreatable infectious disease, killing 1 to 10 % of its human victims. In the case of poxviruses...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-2826-8

    authors: Zhao K,Wohlhueter RM,Li Y

    更新日期:2016-08-31 00:00:00

  • Novel Moraxella catarrhalis prophages display hyperconserved non-structural genes despite their genomic diversity.

    abstract:BACKGROUND:Moraxella catarrhalis is an important pathogen that often causes otitis media in children, a disease that is not currently vaccine preventable. Asymptomatic colonisation of the human upper respiratory tract is common and lack of clearance by the immune system is likely due to the emergence of seroresistant g...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-2104-1

    authors: Ariff A,Wise MJ,Kahler CM,Tay CY,Peters F,Perkins TT,Chang BJ

    更新日期:2015-10-24 00:00:00

  • Distinct gene loci control the host response to influenza H1N1 virus infection in a time-dependent manner.

    abstract:BACKGROUND:There is strong but mostly circumstantial evidence that genetic factors modulate the severity of influenza infection in humans. Using genetically diverse but fully inbred strains of mice it has been shown that host sequence variants have a strong influence on the severity of influenza A disease progression. ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-411

    authors: Nedelko T,Kollmus H,Klawonn F,Spijker S,Lu L,Heßman M,Alberts R,Williams RW,Schughart K

    更新日期:2012-08-20 00:00:00

  • Transcriptome analysis reveals key roles of AtLBR-2 in LPS-induced defense responses in plants.

    abstract:BACKGROUND:Lipopolysaccharide (LPS) from Gram-negative bacteria cause innate immune responses in animals and plants. The molecules involved in LPS signaling in animals are well studied, whereas those in plants are not yet as well documented. Recently, we identified Arabidopsis AtLBR-2, which binds to LPS from Pseudomon...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-017-4372-4

    authors: Iizasa S,Iizasa E,Watanabe K,Nagano Y

    更新日期:2017-12-29 00:00:00

  • cDNA sequences reveal considerable gene prediction inaccuracy in the Plasmodium falciparum genome.

    abstract:BACKGROUND:The completion of the Plasmodium falciparum genome represents a milestone in malaria research. The genome sequence allows for the development of genome-wide approaches such as microarray and proteomics that will greatly facilitate our understanding of the parasite biology and accelerate new drug and vaccine ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-8-255

    authors: Lu F,Jiang H,Ding J,Mu J,Valenzuela JG,Ribeiro JM,Su XZ

    更新日期:2007-07-27 00:00:00

  • Identification of genes associated with shell color in the black-lipped pearl oyster, Pinctada margaritifera.

    abstract:BACKGROUND:Color polymorphism in the nacre of pteriomorphian bivalves is of great interest for the pearl culture industry. The nacreous layer of the Polynesian black-lipped pearl oyster Pinctada margaritifera exhibits a large array of color variation among individuals including reflections of blue, green, yellow and pi...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1776-x

    authors: Lemer S,Saulnier D,Gueguen Y,Planes S

    更新日期:2015-08-01 00:00:00

  • Identification of novel aspartic proteases from Strongyloides ratti and characterisation of their evolutionary relationships, stage-specific expression and molecular structure.

    abstract:BACKGROUND:Aspartic proteases are known to play an important role in the biology of nematode parasitism. This role is best characterised in blood-feeding nematodes, where they digest haemoglobin, but they are also likely to play important roles in the biology of nematode parasites that do not feed on blood. In the pres...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-10-611

    authors: Mello LV,O'Meara H,Rigden DJ,Paterson S

    更新日期:2009-12-16 00:00:00

  • Host specialization of the blast fungus Magnaporthe oryzae is associated with dynamic gain and loss of genes linked to transposable elements.

    abstract:BACKGROUND:Magnaporthe oryzae (anamorph Pyricularia oryzae) is the causal agent of blast disease of Poaceae crops and their wild relatives. To understand the genetic mechanisms that drive host specialization of M. oryzae, we carried out whole genome resequencing of four M. oryzae isolates from rice (Oryza sativa), one ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-2690-6

    authors: Yoshida K,Saunders DG,Mitsuoka C,Natsume S,Kosugi S,Saitoh H,Inoue Y,Chuma I,Tosa Y,Cano LM,Kamoun S,Terauchi R

    更新日期:2016-05-18 00:00:00

  • Outlier analysis of functional genomic profiles enriches for oncology targets and enables precision medicine.

    abstract:BACKGROUND:Genome-scale functional genomic screens across large cell line panels provide a rich resource for discovering tumor vulnerabilities that can lead to the next generation of targeted therapies. Their data analysis typically has focused on identifying genes whose knockdown enhances response in various pre-defin...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-2807-y

    authors: Zhu Z,Ihle NT,Rejto PA,Zarrinkar PP

    更新日期:2016-06-13 00:00:00

  • The genomic architecture of resistance to Campylobacter jejuni intestinal colonisation in chickens.

    abstract:BACKGROUND:Campylobacter is the leading cause of foodborne diarrhoeal illness in humans and is mostly acquired from consumption or handling of contaminated poultry meat. In the absence of effective licensed vaccines and inhibitors, selection for chickens with increased resistance to Campylobacter could potentially redu...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-2612-7

    authors: Psifidi A,Fife M,Howell J,Matika O,van Diemen PM,Kuo R,Smith J,Hocking PM,Salmon N,Jones MA,Hume DA,Banos G,Stevens MP,Kaiser P

    更新日期:2016-04-18 00:00:00

  • Effects of pathogenic CNVs on physical traits in participants of the UK Biobank.

    abstract:BACKGROUND:Copy number variants (CNVs) have been shown to increase risk for physical anomalies, developmental, psychiatric and medical disorders. Some of them have been associated with changes in weight, height, and other physical traits. As most studies have been performed on children and young people, these effects o...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-5292-7

    authors: Owen D,Bracher-Smith M,Kendall KM,Rees E,Einon M,Escott-Price V,Owen MJ,O'Donovan MC,Kirov G

    更新日期:2018-12-04 00:00:00

  • Transcriptomic analysis of Verbena bonariensis roots in response to cadmium stress.

    abstract:BACKGROUND:Cadmium (Cd) is a serious heavy metal (HM) soil pollutant. To alleviate or even eliminate HM pollution in soil, environmental-friendly methods are applied. One is that special plants are cultivated to absorb the HM in the contaminated soil. As an excellent economical plant with ornamental value and sound ada...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-6152-9

    authors: Wang MQ,Bai ZY,Xiao YF,Li Y,Liu QL,Zhang L,Pan YZ,Jiang BB,Zhang F

    更新日期:2019-11-20 00:00:00