Identification of "pathologs" (disease-related genes) from the RIKEN mouse cDNA dataset using human curation plus FACTS, a new biological information extraction system.

Abstract:

BACKGROUND:A major goal in the post-genomic era is to identify and characterise disease susceptibility genes and to apply this knowledge to disease prevention and treatment. Rodents and humans have remarkably similar genomes and share closely related biochemical, physiological and pathological pathways. In this work we utilised the latest information on the mouse transcriptome as revealed by the RIKEN FANTOM2 project to identify novel human disease-related candidate genes. We define a new term "patholog" to mean a homolog of a human disease-related gene encoding a product (transcript, anti-sense or protein) potentially relevant to disease. Rather than just focus on Mendelian inheritance, we applied the analysis to all potential pathologs regardless of their inheritance pattern. RESULTS:Bioinformatic analysis and human curation of 60,770 RIKEN full-length mouse cDNA clones produced 2,578 sequences that showed similarity (70-85% identity) to known human-disease genes. Using a newly developed biological information extraction and annotation tool (FACTS) in parallel with human expert analysis of 17,051 MEDLINE scientific abstracts we identified 182 novel potential pathologs. Of these, 36 were identified by computational tools only, 49 by human expert analysis only and 97 by both methods. These pathologs were related to neoplastic (53%), hereditary (24%), immunological (5%), cardio-vascular (4%), or other (14%), disorders. CONCLUSIONS:Large scale genome projects continue to produce a vast amount of data with potential application to the study of human disease. For this potential to be realised we need intelligent strategies for data categorisation and the ability to link sequence data with relevant literature. This paper demonstrates the power of combining human expert annotation with FACTS, a newly developed bioinformatics tool, to identify novel pathologs from within large-scale mouse transcript datasets.

journal_name

BMC Genomics

journal_title

BMC genomics

authors

Silva DG,Schönbach C,Brusic V,Socha LA,Nagashima T,Petrovsky N

doi

10.1186/1471-2164-5-28

keywords:

subject

Has Abstract

pub_date

2004-04-29 00:00:00

pages

28

issue

1

issn

1471-2164

pii

1471-2164-5-28

journal_volume

5

pub_type

杂志文章
  • Correction to: An Arabidopsis introgression zone studied at high spatio-temporal resolution: interglacial and multiple genetic contact exemplified using whole nuclear and plastid genomes.

    abstract::ᅟ: Upon publication of the original article [1], the authors had flagged that there was an error in Fig. 1c, as the key in this figure was displaying incorrectly. The colours had not displayed in the key in the final published article, and instead appear as plain white. ...

    journal_title:BMC genomics

    pub_type: 杂志文章,已发布勘误

    doi:10.1186/s12864-018-4614-0

    authors: Hohmann N,Koch MA

    更新日期:2018-04-11 00:00:00

  • Transcriptomic and proteomic analyses of a new cytoplasmic male sterile line with a wild Gossypium bickii genetic background.

    abstract:BACKGROUND:Cotton is an important fiber crop but has serious heterosis effects, and cytoplasmic male sterility (CMS) is the major cause of heterosis in plants. However, to the best of our knowledge, no studies have investigated CMS Yamian A in cotton with the genetic background of Australian wild Gossypium bickii. Conj...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-020-07261-y

    authors: Zhao H,Wang J,Qu Y,Peng R,Magwanga RO,Liu F,Huang J

    更新日期:2020-12-02 00:00:00

  • Characterization of a novel chicken muscle disorder through differential gene expression and pathway analysis using RNA-sequencing.

    abstract:BACKGROUND:Improvements in poultry production within the past 50 years have led to increased muscle yield and growth rate, which may be contributing to an increased rate and development of new muscle disorders in chickens. Previously reported muscle disorders and conditions are generally associated with poor meat quali...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1623-0

    authors: Mutryn MF,Brannick EM,Fu W,Lee WR,Abasht B

    更新日期:2015-05-21 00:00:00

  • Genome-wide association study of eating and cooking qualities in different subpopulations of rice (Oryza sativa L.).

    abstract:BACKGROUND:Starch and protein are two major components of polished rice, and the amylose and protein contents affect eating and cooking qualities (ECQs). In the present study, genome-wide association study with high-quality re-sequencing data was performed for 10 ECQs in a panel of 227 non-glutinous rice accessions and...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-3000-z

    authors: Xu F,Bao J,He Q,Park YJ

    更新日期:2016-08-20 00:00:00

  • A transcriptome approach towards understanding the development of ripening capacity in 'Bartlett' pears (Pyrus communis L.).

    abstract:BACKGROUND:The capacity of European pear fruit (Pyrus communis L.) to ripen after harvest develops during the final stages of growth on the tree. The objective of this study was to characterize changes in 'Bartlett' pear fruit physico-chemical properties and transcription profiles during fruit maturation leading to att...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1939-9

    authors: Nham NT,de Freitas ST,Macnish AJ,Carr KM,Kietikul T,Guilatco AJ,Jiang CZ,Zakharov F,Mitcham EJ

    更新日期:2015-10-09 00:00:00

  • Stringent comparative sequence analysis reveals SOX10 as a putative inhibitor of glial cell differentiation.

    abstract:BACKGROUND:The transcription factor SOX10 is essential for all stages of Schwann cell development including myelination. SOX10 cooperates with other transcription factors to activate the expression of key myelin genes in Schwann cells and is therefore a context-dependent, pro-myelination transcription factor. As such, ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-3167-3

    authors: Gopinath C,Law WD,Rodríguez-Molina JF,Prasad AB,Song L,Crawford GE,Mullikin JC,Svaren J,Antonellis A

    更新日期:2016-11-07 00:00:00

  • Biclustering of transcriptome sequencing data reveals human tissue-specific circular RNAs.

    abstract:BACKGROUND:Emerging evidence has been experimentally confirmed the tissue-specific expression of circRNAs (circRNAs). Global identification of human tissue-specific circRNAs is crucial for the functionality study, which facilitates the discovery of circRNAs for potential diagnostic biomarkers. RESULTS:In this study, c...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-017-4335-9

    authors: Liu YC,Chiu YJ,Li JR,Sun CH,Liu CC,Huang HD

    更新日期:2018-01-19 00:00:00

  • Origin and fate of pseudogenes in Hemiascomycetes: a comparative analysis.

    abstract:BACKGROUND:Pseudogenes are ubiquitous genetic elements that derive from functional genes after mutational inactivation. Characterization of pseudogenes is important to understand genome dynamics and evolution, and its significance increases when several genomes of related organisms can be compared. Among yeasts, only t...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-260

    authors: Lafontaine I,Dujon B

    更新日期:2010-04-22 00:00:00

  • Whole genome sequence analysis of the TALLYHO/Jng mouse.

    abstract:BACKGROUND:The TALLYHO/Jng (TH) mouse is a polygenic model for obesity and type 2 diabetes first described in the literature in 2001. The origin of the TH strain is an outbred colony of the Theiler Original strain and mice derived from this source were selectively bred for male hyperglycemia establishing an inbred stra...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-3245-6

    authors: Denvir J,Boskovic G,Fan J,Primerano DA,Parkman JK,Kim JH

    更新日期:2016-11-11 00:00:00

  • Microarray profiling for differential gene expression in PMSG-hCG stimulated preovulatory ovarian follicles of Chinese Taihu and Large White sows.

    abstract:BACKGROUND:The Chinese Taihu is one of the most prolific pig breeds in the world, which farrows at least five more piglets per litter than Western pig breeds partly due to a greater ovulation rate. Variation of ovulation rate maybe associated with the differences in the transcriptome of Chinese Taihu and Large White ov...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-12-111

    authors: Sun X,Mei S,Tao H,Wang G,Su L,Jiang S,Deng C,Xiong Y,Li F

    更新日期:2011-02-16 00:00:00

  • Replicate exome-sequencing in a multiple-generation family: improved interpretation of next-generation sequencing data.

    abstract:BACKGROUND:Whole-exome sequencing (WES) is rapidly evolving into a tool of choice for rapid, and inexpensive identification of molecular genetic lesions within targeted regions of the human genome. While biases in WES coverage of nucleotides in targeted regions are recognized, it is not well understood how repetition o...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-2107-y

    authors: Cherukuri PF,Maduro V,Fuentes-Fajardo KV,Lam K,NISC Comparative Sequencing Program.,Adams DR,Tifft CJ,Mullikin JC,Gahl WA,Boerkoel CF

    更新日期:2015-11-25 00:00:00

  • Integrated "omics" profiling indicates that miRNAs are modulators of the ontogenetic venom composition shift in the Central American rattlesnake, Crotalus simus simus.

    abstract:BACKGROUND:Understanding the processes that drive the evolution of snake venom is a topic of great research interest in molecular and evolutionary toxinology. Recent studies suggest that ontogenetic changes in venom composition are genetically controlled rather than environmentally induced. However, the molecular mecha...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-234

    authors: Durban J,Pérez A,Sanz L,Gómez A,Bonilla F,Rodríguez S,Chacón D,Sasa M,Angulo Y,Gutiérrez JM,Calvete JJ

    更新日期:2013-04-10 00:00:00

  • Gene silencing pathways found in the green alga Volvox carteri reveal insights into evolution and origins of small RNA systems in plants.

    abstract:BACKGROUND:Volvox carteri (V. carteri) is a multicellular green alga used as model system for the evolution of multicellularity. So far, the contribution of small RNA pathways to these phenomena is not understood. Thus, we have sequenced V. carteri Argonaute 3 (VcAGO3)-associated small RNAs from different developmental...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-3202-4

    authors: Dueck A,Evers M,Henz SR,Unger K,Eichner N,Merkl R,Berezikov E,Engelmann JC,Weigel D,Wenzl S,Meister G

    更新日期:2016-11-02 00:00:00

  • Bioinformatics and DNA-extraction strategies to reliably detect genetic variants from FFPE breast tissue samples.

    abstract:BACKGROUND:Archived formalin fixed paraffin embedded (FFPE) samples are valuable clinical resources to examine clinically relevant morphology features and also to study genetic changes. However, DNA quality and quantity of FFPE samples are often sub-optimal, and resulting NGS-based genetics variant detections are prone...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-6056-8

    authors: Bhagwate AV,Liu Y,Winham SJ,McDonough SJ,Stallings-Mann ML,Heinzen EP,Davila JI,Vierkant RA,Hoskin TL,Frost M,Carter JM,Radisky DC,Cunningham JM,Degnim AC,Wang C

    更新日期:2019-09-02 00:00:00

  • Positive correlation between gene coexpression and positional clustering in the zebrafish genome.

    abstract:BACKGROUND:Co-expressing genes tend to cluster in eukaryotic genomes. This paper analyzes correlation between the proximity of eukaryotic genes and their transcriptional expression pattern in the zebrafish (Danio rerio) genome using available microarray data and gene annotation. RESULTS:The analyses show that neighbou...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-10-42

    authors: Ng YK,Wu W,Zhang L

    更新日期:2009-01-22 00:00:00

  • Genome-wide analysis of the R2R3-MYB transcription factor genes in Chinese cabbage (Brassica rapa ssp. pekinensis) reveals their stress and hormone responsive patterns.

    abstract:BACKGROUND:The MYB superfamily is one of the most abundant transcription factor (TF) families in plants. MYB proteins include highly conserved N-terminal MYB repeats (1R, R2R3, 3R, and atypical) and various C-terminal sequences that confer extensive functions. However, the functions of most MYB genes are unknown, and h...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1216-y

    authors: Wang Z,Tang J,Hu R,Wu P,Hou XL,Song XM,Xiong AS

    更新日期:2015-01-23 00:00:00

  • A platform independent RNA-Seq protocol for the detection of transcriptome complexity.

    abstract:BACKGROUND:Recent studies have demonstrated an unexpected complexity of transcription in eukaryotes. The majority of the genome is transcribed and only a little fraction of these transcripts is annotated as protein coding genes and their splice variants. Indeed, most transcripts are the result of antisense, overlapping...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-855

    authors: Calabrese C,Mangiulli M,Manzari C,Paluscio AM,Caratozzolo MF,Marzano F,Kurelac I,D'Erchia AM,D'Elia D,Licciulli F,Liuni S,Picardi E,Attimonelli M,Gasparre G,Porcelli AM,Pesole G,Sbisà E,Tullo A

    更新日期:2013-12-05 00:00:00

  • Comparative mitogenomic analysis of the superfamily Pentatomoidea (Insecta: Hemiptera: Heteroptera) and phylogenetic implications.

    abstract:BACKGROUND:Insect mitochondrial genomes (mitogenomes) are the most extensively used genetic marker for evolutionary and population genetics studies of insects. The Pentatomoidea superfamily is economically important and the largest superfamily within Pentatomomorpha with over 7,000 species. To better understand the div...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1679-x

    authors: Yuan ML,Zhang QL,Guo ZL,Wang J,Shen YY

    更新日期:2015-06-16 00:00:00

  • The complete mitochondrial genomes for three Toxocara species of human and animal health significance.

    abstract:BACKGROUND:Studying mitochondrial (mt) genomics has important implications for various fundamental areas, including mt biochemistry, physiology and molecular biology. In addition, mt genome sequences have provided useful markers for investigating population genetic structures, systematics and phylogenetics of organisms...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-9-224

    authors: Li MW,Lin RQ,Song HQ,Wu XY,Zhu XQ

    更新日期:2008-05-16 00:00:00

  • Metabolic modeling and analysis of the metabolic switch in Streptomyces coelicolor.

    abstract:BACKGROUND:The transition from exponential to stationary phase in Streptomyces coelicolor is accompanied by a major metabolic switch and results in a strong activation of secondary metabolism. Here we have explored the underlying reorganization of the metabolome by combining computational predictions based on constrain...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-202

    authors: Alam MT,Merlo ME,STREAM Consortium.,Hodgson DA,Wellington EM,Takano E,Breitling R

    更新日期:2010-03-26 00:00:00

  • "Integrative genomic analysis of the bioprospection of regulators and accessory enzymes associated with cellulose degradation in a filamentous fungus (Trichoderma harzianum)".

    abstract:BACKGROUND:Unveiling fungal genome structure and function reveals the potential biotechnological use of fungi. Trichoderma harzianum is a powerful CAZyme-producing fungus. We studied the genomic regions in T. harzianum IOC3844 containing CAZyme genes, transcription factors and transporters. RESULTS:We used bioinformat...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-020-07158-w

    authors: Ferreira Filho JA,Horta MAC,Dos Santos CA,Almeida DA,Murad NF,Mendes JS,Sforça DA,Silva CBC,Crucello A,de Souza AP

    更新日期:2020-11-02 00:00:00

  • Discovery and profiling of small RNAs responsive to stress conditions in the plant pathogen Pectobacterium atrosepticum.

    abstract:BACKGROUND:Small RNAs (sRNAs) have emerged as important regulatory molecules and have been studied in several bacteria. However, to date, there have been no whole-transcriptome studies on sRNAs in any of the Soft Rot Enterobacteriaceae (SRE) group of pathogens. Although the main ecological niches for these pathogens ar...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-2376-0

    authors: Kwenda S,Gorshkov V,Ramesh AM,Naidoo S,Rubagotti E,Birch PR,Moleleki LN

    更新日期:2016-01-12 00:00:00

  • Discovering large conserved functional components in global network alignment by graph matching.

    abstract:BACKGROUND:Aligning protein-protein interaction (PPI) networks is very important to discover the functionally conserved sub-structures between different species. In recent years, the global PPI network alignment problem has been extensively studied aiming at finding the one-to-one alignment with the maximum matching sc...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-5027-9

    authors: Zhu Y,Li Y,Liu J,Qin L,Yu JX

    更新日期:2018-09-24 00:00:00

  • Transcriptomic analyses of Aedes aegypti cultured cells and ex vivo midguts in response to an excess or deficiency of heme: a quest for transcriptionally-regulated heme transporters.

    abstract:BACKGROUND:Aedes aegypti is the principle vector of many arboviruses, including dengue virus and Zika virus, which are transmitted when an infected female mosquito takes a blood meal in order to initiate vitellogenesis. During blood digestion, ~ 10 mM heme-iron is ingested into the midgut lumen. While heme acts as both...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-020-06981-5

    authors: Eggleston H,Adelman ZN

    更新日期:2020-08-31 00:00:00

  • "Does replication groups scoring reduce false positive rate in SNP interaction discovery? Response".

    abstract:BACKGROUND:The genomewide evaluation of genetic epistasis is a computationally demanding task, and a current challenge in Genetics. HFCC (Hypothesis-Free Clinical Cloning) is one of the methods that have been suggested for genomewide epistasis analysis. In order to perform an exhaustive search of epistasis, HFCC has im...

    journal_title:BMC genomics

    pub_type: 评论,杂志文章

    doi:10.1186/1471-2164-11-403

    authors: Gayán J,González-Pérez A,Ruiz A

    更新日期:2010-06-24 00:00:00

  • Tests for differential gene expression using weights in oligonucleotide microarray experiments.

    abstract:BACKGROUND:Microarray data analysts commonly filter out genes based on a number of ad hoc criteria prior to any high-level statistical analysis. Such ad hoc approaches could lead to conflicting conclusions with no clear guidance as to which method is most likely to be reproducible. Furthermore, the number of tests perf...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-7-33

    authors: Hu P,Beyene J,Greenwood CM

    更新日期:2006-02-22 00:00:00

  • Semantic integration of gene expression analysis tools and data sources using software connectors.

    abstract:BACKGROUND:The study and analysis of gene expression measurements is the primary focus of functional genomics. Once expression data is available, biologists are faced with the task of extracting (new) knowledge associated to the underlying biological phenomenon. Most often, in order to perform this task, biologists exe...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-S6-S2

    authors: Miyazaki FA,Guardia GD,Vêncio RZ,de Farias CR

    更新日期:2013-10-25 00:00:00

  • Transcriptome profiling provides insights into dormancy release during cold storage of Lilium pumilum.

    abstract:BACKGROUND:Bulbs of the ornamental flower Lilium pumilum enter a period of dormancy after flowering in spring, and require exposure to cold for a period of time in order to release dormancy. Previous studies focused mainly on anatomical, physiological and biochemical changes during dormancy release. There are no dorman...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-4536-x

    authors: Wang W,Su X,Tian Z,Liu Y,Zhou Y,He M

    更新日期:2018-03-14 00:00:00

  • Telomere length de novo assembly of all 7 chromosomes and mitogenome sequencing of the model entomopathogenic fungus, Metarhizium brunneum, by means of a novel assembly pipeline.

    abstract:BACKGROUND:More accurate and complete reference genomes have improved understanding of gene function, biology, and evolutionary mechanisms. Hybrid genome assembly approaches leverage benefits of both long, relatively error-prone reads from third-generation sequencing technologies and short, accurate reads from second-g...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-021-07390-y

    authors: Saud Z,Kortsinoglou AM,Kouvelis VN,Butt TM

    更新日期:2021-01-28 00:00:00

  • Motif depletion in bacteriophages infecting hosts with CRISPR systems.

    abstract:BACKGROUND:CRISPR is a microbial immune system likely to be involved in host-parasite coevolution. It functions using target sequences encoded by the bacterial genome, which interfere with invading nucleic acids using a homology-dependent system. The system also requires protospacer associated motifs (PAMs), short moti...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-663

    authors: Kupczok A,Bollback JP

    更新日期:2014-08-08 00:00:00