Reproducible probe-level analysis of the Affymetrix Exon 1.0 ST array with R/Bioconductor.

Abstract:

The presence of different transcripts of a gene across samples can be analysed by whole-transcriptome microarrays. Reproducing results from published microarray data represents a challenge owing to the vast amounts of data and the large variety of preprocessing and filtering steps used before the actual analysis is carried out. To guarantee a firm basis for methodological development where results with new methods are compared with previous results, it is crucial to ensure that all analyses are completely reproducible for other researchers. We here give a detailed workflow on how to perform reproducible analysis of the GeneChip® Human Exon 1.0 ST Array at probe and probeset level solely in R/Bioconductor, choosing packages based on their simplicity of use. To exemplify the use of the proposed workflow, we analyse differential splicing and differential gene expression in a publicly available dataset using various statistical methods. We believe this study will provide other researchers with an easy way of accessing gene expression data at different annotation levels and with the sufficient details needed for developing their own tools for reproducible analysis of the GeneChip® Human Exon 1.0 ST Array.

journal_name

Brief Bioinform

authors

Rodrigo-Domingo M,Waagepetersen R,Bødker JS,Falgreen S,Kjeldsen MK,Johnsen HE,Dybkær K,Bøgsted M

doi

10.1093/bib/bbt011

subject

Has Abstract

pub_date

2014-07-01 00:00:00

pages

519-33

issue

4

eissn

1477-4054

issn

1467-5463

pii

bbt011

journal_volume

15

pub_type

Journal Article
  • CeRNASeek: an R package for identification and analysis of ceRNA regulation.

    abstract::Competitive endogenous RNA (ceRNA) represents a novel layer of gene regulation that controls both physiological and pathological processes. However, there is still lack of computational tools for quickly identifying ceRNA regulation. To address this problem, we presented an R-package, CeRNASeek, which allows identifyi...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbaa048

    authors: Zhang M,Jin X,Li J,Tian Y,Wang Q,Li X,Xu J,Li Y,Li X

    update_date:2020-05-04 00:00:00

  • A computing platform to map ecological metabolism by integrating functional mapping and the metabolic theory of ecology.

    abstract::Whole-organism metabolic rate co-varies allometrically with body mass, and is also affected by temperature through different biochemical mechanisms. Here we implement a computational platform to map specific quantitative trait loci (QTLs) that govern the dependence of metabolic rate on size and temperature. The model ...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbv116

    authors: Yan Q,Zhu X,Jiang L,Ye M,Sun L,Terblanche JS,Wu R

    update_date:2017-01-01 00:00:00

  • Extended application of genomic selection to screen multiomics data for prognostic signatures of prostate cancer.

    abstract::Prognostic tests using expression profiles of several dozen genes help provide treatment choices for prostate cancer (PCa). However, these tests require improvement to meet the clinical need for resolving overtreatment, which continues to be a pervasive problem in PCa management. Genomic selection (GS) methodology, wh...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbaa197

    authors: Li R,Wang S,Cui Y,Qu H,Chater JM,Zhang L,Wei J,Wang M,Xu Y,Yu L,Lu J,Feng Y,Zhou R,Huang Y,Ma R,Zhu J,Zhong W,Jia Z

    update_date:2020-09-08 00:00:00

  • The mechanistic, diagnostic and therapeutic novel nucleic acids for hepatocellular carcinoma emerging in past score years.

    abstract::Despite The Central Dogma states the destiny of gene as 'DNA makes RNA and RNA makes protein', the nucleic acids not only store and transmit genetic information but also, surprisingly, join in intracellular vital movement as a regulator of gene expression. Bioinformatics has contributed to knowledge for a series of em...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbaa023

    authors: Zhang S,Zhou Y,Wang Y,Wang Z,Xiao Q,Zhang Y,Lou Y,Qiu Y,Zhu F

    update_date:2020-04-06 00:00:00

  • Computational aspects of host-parasite phylogenies.

    abstract::Computational aspects of host-parasite phylogenies form part of a set of general associations between areas and organisms, hosts and parasites, and species and genes. The problem is not new and the commonalities of exploring vicariance biogeography (organisms tracking areas) and host-parasite co-speciation (parasites ...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article, Review

    doi:10.1093/bib/5.4.339

    authors: Stevens J

    update_date:2004-12-01 00:00:00

  • Improving structure-based virtual screening performance via learning from scoring function components.

    abstract::Scoring functions (SFs) based on complex machine learning (ML) algorithms have gradually emerged as a promising alternative to overcome the weaknesses of classical SFs. However, extensive efforts have been devoted to the development of SFs based on new protein-ligand interaction representations and advanced alternativ...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbaa094

    authors: Xiong GL,Ye WL,Shen C,Lu AP,Hou TJ,Cao DS

    update_date:2020-06-04 00:00:00

  • An open-pollinated design for mapping imprinting genes in natural populations.

    abstract::With the increasing recognition of its role in trait and disease development, it is crucial to account for genetic imprinting to illustrate the genetic architecture of complex traits. Genetic mapping can be innovated to test and estimate effects of genetic imprinting in a segregating population derived from experiment...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbu019

    authors: Sun L,Zhu X,Bo W,Xu F,Cheng T,Zhang Q,Wu R

    update_date:2015-05-01 00:00:00

  • Tools for the functional interpretation of metabolomic experiments.

    abstract::The so-called 'omics' approaches used in modern biology aim at massively characterizing the molecular repertories of living systems at different levels. Metabolomics is one of the last additions to the 'omics' family and it deals with the characterization of the set of metabolites in a given biological system. As meta...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbs055

    authors: Chagoyen M,Pazos F

    update_date:2013-11-01 00:00:00

  • Empirical comparison and analysis of web-based cell-penetrating peptide prediction tools.

    abstract::Cell-penetrating peptides (CPPs) facilitate the delivery of therapeutically relevant molecules, including DNA, proteins and oligonucleotides, into cells both in vitro and in vivo. This unique ability explores the possibility of CPPs as therapeutic delivery and its potential applications in clinical therapy. Over the l...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bby124

    authors: Su R,Hu J,Zou Q,Manavalan B,Wei L

    update_date:2020-03-23 00:00:00

  • Elucidating the editome: bioinformatics approaches for RNA editing detection.

    abstract::RNA editing is a widespread co/posttranscriptional mechanism affecting primary RNAs by specific nucleotide modifications, which plays relevant roles in molecular processes including regulation of gene expression and/or the processing of noncoding RNAs. In recent years, the detection of editing sites has been improved ...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article, Review

    doi:10.1093/bib/bbx129

    authors: Diroma MA,Ciaccia L,Pesole G,Picardi E

    update_date:2019-03-22 00:00:00

  • Conceptual framework and pilot study to benchmark phylogenomic databases based on reference gene trees.

    abstract::Phylogenomic databases provide orthology predictions for species with fully sequenced genomes. Although the goal seems well-defined, the content of these databases differs greatly. Seven ortholog databases (Ensembl Compara, eggNOG, HOGENOM, InParanoid, OMA, OrthoDB, Panther) were compared on the basis of reference tre...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbr034

    authors: Boeckmann B,Robinson-Rechavi M,Xenarios I,Dessimoz C

    update_date:2011-09-01 00:00:00

  • Unraveling chloroplast transcriptomes with ChloroSeq, an organelle RNA-Seq bioinformatics pipeline.

    abstract::Online sequence repositories are teeming with RNA sequencing (RNA-Seq) data from a wide range of eukaryotes. Although most of these data sets contain large numbers of organelle-derived reads, researchers tend to ignore these data, focusing instead on the nuclear-derived transcripts. Consequently, GenBank contains mass...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbw088

    authors: Smith DR,Sanitá Lima M

    update_date:2017-11-01 00:00:00

  • MloDisDB: a manually curated database of the relations between membraneless organelles and diseases.

    abstract::Cells are compartmentalized by numerous membrane-bounded organelles and membraneless organelles (MLOs) to ensure temporal and spatial regulation of various biological processes. A number of MLOs, such as nucleoli, nuclear speckles and stress granules, exist as liquid droplets within the cells and arise from the conden...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbaa271

    authors: Hou C,Xie H,Fu Y,Ma Y,Li T

    update_date:2020-10-30 00:00:00

  • A feature-based approach to predict hot spots in protein-DNA binding interfaces.

    abstract::DNA-binding hot spot residues of proteins are dominant and fundamental interface residues that contribute most of the binding free energy of protein-DNA interfaces. As experimental methods for identifying hot spots are expensive and time consuming, computational approaches are urgently required in predicting hot spots...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbz037

    authors: Zhang S,Zhao L,Zheng CH,Xia J

    update_date:2020-05-21 00:00:00

  • Characteristics and evolution of the ecosystem of software tools supporting research in molecular biology.

    abstract::Daily work in molecular biology presently depends on a large number of computational tools. An in-depth, large-scale study of that 'ecosystem' of Web tools, its characteristics, interconnectivity, patterns of usage/citation, temporal evolution and rate of decay is crucial for understanding the forces that shape it and...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bby001

    authors: Pazos F,Chagoyen M

    update_date:2019-07-19 00:00:00

  • Towards scaling elementary flux mode computation.

    abstract::While elementary flux mode (EFM) analysis is now recognized as a cornerstone computational technique for cellular pathway analysis and engineering, EFM application to genome-scale models remains computationally prohibitive. This article provides a review of aspects of EFM computation that elucidates bottlenecks in sca...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbz094

    authors: Ullah E,Yosafshahi M,Hassoun S

    update_date:2020-12-01 00:00:00

  • LiBis: an ultrasensitive alignment augmentation for low-input bisulfite sequencing.

    abstract::The cell-free DNA (cfDNA) methylation profile in liquid biopsy has been utilized to diagnose early-stage disease and estimate therapy response. However, typical clinical procedures are capable of purifying only very small amounts of cfDNA. Whole-genome bisulfite sequencing (WGBS) is the gold standard for measuring DNA...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbaa332

    authors: Yin Y,Li J,Li J,Lee M,Zhao S,Guo L,Li J,Zhang M,Huang Y,Li XN,Deng Z,Sun D

    update_date:2020-12-15 00:00:00

  • In silico signaling modeling to understand cancer pathways and treatment responses.

    abstract::Precision medicine has changed thinking in cancer therapy, highlighting a better understanding of the individual clinical interventions. But what role do the drivers and pathways identified from pan-cancer genome analysis play in the tumor? In this letter, we will highlight the importance of in silico modeling in prec...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbz033

    authors: Kunz M,Jeromin J,Fuchs M,Christoph J,Veronesi G,Flentje M,Nietzer S,Dandekar G,Dandekar T

    update_date:2020-05-21 00:00:00

  • The computational challenges of applying comparative-based computational methods to whole genomes.

    abstract::The explosion in genomic sequence available in public databases has resulted in an unprecedented opportunity for computational whole genome analyses. A number of promising comparative-based approaches have been developed for gene finding, regulatory element discovery and other purposes, and it is clear that these tool...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/3.1.18

    authors: Dubchak I,Pachter L

    update_date:2002-03-01 00:00:00

  • SurvivalMeth: a web server to investigate the effect of DNA methylation-related functional elements on prognosis.

    abstract::Aberrant DNA methylation is a fundamental characterization of epigenetics for carcinogenesis. Abnormality of DNA methylation-related functional elements (DMFEs) may lead to dysfunction of regulatory genes in the progression of cancers, contributing to prognosis of many cancers. There is an urgent need to construct a t...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbaa162

    authors: Zhang C,Zhao N,Zhang X,Xiao J,Li J,Lv D,Zhou W,Li Y,Xu J,Li X

    update_date:2020-08-11 00:00:00

  • Closing the gap between formats for storing layout information in systems biology.

    abstract::The understanding of complex biological networks often relies on both a dedicated layout and a topology. Currently, there are three major competing layout-aware systems biology formats, but there are no software tools or software libraries supporting all of them. This complicates the management of molecular network la...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbz067

    authors: Hoksza D,Gawron P,Ostaszewski M,Hasenauer J,Schneider R

    update_date:2020-07-15 00:00:00

  • Comparison of haplotype-based tests for detecting gene-environment interactions with rare variants.

    abstract::Dissecting the genetic mechanism underlying a complex disease hinges on discovering gene-environment interactions (GXE). However, detecting GXE is a challenging problem especially when the genetic variants under study are rare. Haplotype-based tests have several advantages over the so-called collapsing tests for detec...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbz031

    authors: Papachristou C,Biswas S

    update_date:2020-05-21 00:00:00

  • TrimNet: learning molecular representation from triplet messages for biomedicine.

    abstract:MOTIVATION:Computational methods accelerate drug discovery and play an important role in biomedicine, such as molecular property prediction and compound-protein interaction (CPI) identification. A key challenge is to learn useful molecular representation. In the early years, molecular properties are mainly calculated b...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbaa266

    authors: Li P,Li Y,Hsieh CY,Zhang S,Liu X,Liu H,Song S,Yao X

    update_date:2020-11-04 00:00:00

  • Are dropout imputation methods for scRNA-seq effective for scHi-C data?

    abstract::The prevalence of dropout events is a serious problem for single-cell Hi-C (scHiC) data due to insufficient sequencing depth and data coverage, which brings difficulties in downstream studies such as clustering and structural analysis. Complicating things further is the fact that dropouts are confounded with structura...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbaa289

    authors: Han C,Xie Q,Lin S

    update_date:2020-11-17 00:00:00

  • GenoPheno: cataloging large-scale phenotypic and next-generation sequencing data within human datasets.

    abstract::Precision medicine promises to revolutionize treatment, shifting therapeutic approaches from the classical one-size-fits-all to those more tailored to the patient's individual genomic profile, lifestyle and environmental exposures. Yet, to advance precision medicine's main objective-ensuring the optimum diagnosis, tre...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbaa033

    authors: Gutiérrez-Sacristán A,De Niz C,Kothari C,Kong SW,Mandl KD,Avillach P

    update_date:2021-01-18 00:00:00

  • Iteratively reweighted LASSO for mapping multiple quantitative trait loci.

    abstract::The iteratively reweighted least square (IRLS) method is mostly identical to maximum likelihood (ML) method in terms of parameter estimation and power of quantitative trait locus (QTL) detection. But the IRLS is greatly superior to ML in terms of computing speed and the robustness of parameter estimation. In conjuncti...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbs062

    authors: Liu Y,Yang T,Li H,Yang R

    update_date:2014-01-01 00:00:00

  • Probe mapping across multiple microarray platforms.

    abstract::Access to gene expression data has become increasingly common in recent years; however, analysis has become more difficult as it is often desirable to integrate data from different platforms. Probe mapping across microarray platforms is the first and most crucial step for data integration. In this article, we systemat...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbr076

    authors: Allen JD,Wang S,Chen M,Girard L,Minna JD,Xie Y,Xiao G

    update_date:2012-09-01 00:00:00

  • Small noncoding RNA discovery and profiling with sRNAtools based on high-throughput sequencing.

    abstract::Small noncoding RNAs (sRNA/sncRNAs) are generated from different genomic loci and play important roles in biological processes, such as cell proliferation and the regulation of gene expression. Next-generation sequencing (NGS) has provided an unprecedented opportunity to discover and quantify diverse kinds of sncRNA, ...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbz151

    authors: Liu Q,Ding C,Lang X,Guo G,Chen J,Su X

    update_date:2021-01-18 00:00:00

  • Molecular dynamics simulations for genetic interpretation in protein coding regions: where we are, where to go and when.

    abstract::The increasing ease with which massive genetic information can be obtained from patients or healthy individuals has stimulated the development of interpretive bioinformatics tools as aids in clinical practice. Most such tools analyze evolutionary information and simple physical-chemical properties to predict whether r...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbz146

    authors: Galano-Frutos JJ,García-Cebollada H,Sancho J

    update_date:2021-01-18 00:00:00

  • Bioinformatics tools and challenges in structural analysis of lipidomics MS/MS data.

    abstract::Lipidomics, the systematic study of the lipid composition of a cell or tissue, is an invaluable complement to knowledge gained by genomics and proteomics research. Mass spectrometry provides a means to detect hundreds of lipids in parallel, and this includes low abundance species of lipids. Nevertheless, frequently oc...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbs030

    authors: Hartler J,Tharakan R,Köfeler HC,Graham DR,Thallinger GG

    update_date:2013-05-01 00:00:00