SMaSH: Sample matching using SNPs in humans.

Abstract:

BACKGROUND:Inadvertent sample swaps are a real threat to data quality in any medium to large scale omics studies. While matches between samples from the same individual can in principle be identified from a few well characterized single nucleotide polymorphisms (SNPs), omics data types often only provide low to moderate coverage, thus requiring integration of evidence from a large number of SNPs to determine if two samples derive from the same individual or not. METHODS:We select about six thousand SNPs in the human genome and develop a Bayesian framework that is able to robustly identify sample matches between next generation sequencing data sets. RESULTS:We validate our approach on a variety of data sets. Most importantly, we show that our approach can establish identity between different omics data types such as Exome, RNA-Seq, and MethylCap-Seq. We demonstrate how identity detection degrades with sample quality and read coverage, but show that twenty million reads of a fairly low quality RNA-Seq sample are still sufficient for reliable sample identification. CONCLUSION:Our tool, SMASH, is able to identify sample mismatches in next generation sequencing data sets between different sequencing modalities and for low quality sequencing data.

journal_name

BMC Genomics

journal_title

BMC genomics

authors

Westphal M,Frankhouser D,Sonzone C,Shields PG,Yan P,Bundschuh R

doi

10.1186/s12864-019-6332-7

subject

Has Abstract

pub_date

2019-12-30 00:00:00

pages

1001

issue

Suppl 12

issn

1471-2164

pii

10.1186/s12864-019-6332-7

journal_volume

20

pub_type

杂志文章
  • The Escherichia coli K-12 ORFeome: a resource for comparative molecular microbiology.

    abstract:BACKGROUND:Systems biology and functional genomics require genome-wide datasets and resources. Complete sets of cloned open reading frames (ORFs) have been made for about a dozen bacterial species and allow researchers to express and study complete proteomes in a high-throughput fashion. RESULTS:We have constructed an...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-470

    authors: Rajagopala SV,Yamamoto N,Zweifel AE,Nakamichi T,Huang HK,Mendez-Rios JD,Franca-Koh J,Boorgula MP,Fujita K,Suzuki K,Hu JC,Wanner BL,Mori H,Uetz P

    更新日期:2010-08-11 00:00:00

  • A modified sequence capture approach allowing standard and methylation analyses of the same enriched genomic DNA sample.

    abstract:BACKGROUND:Bread wheat has a large complex genome that makes whole genome resequencing costly. Therefore, genome complexity reduction techniques such as sequence capture make re-sequencing cost effective. With a high-quality draft wheat genome now available it is possible to design capture probe sets and to use them to...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-4640-y

    authors: Olohan L,Gardiner LJ,Lucaci A,Steuernagel B,Wulff B,Kenny J,Hall N,Hall A

    更新日期:2018-04-13 00:00:00

  • Characterization of SR3 reveals abundance of non-LTR retrotransposons of the RTE clade in the genome of the human blood fluke, Schistosoma mansoni.

    abstract:BACKGROUND:It is becoming apparent that perhaps as much as half of the genome of the human blood fluke Schistosoma mansoni is constituted of mobile genetic element-related sequences. Non-long terminal repeat (LTR) retrotransposons, related to the LINE elements of mammals, comprise much of this repetitive component of t...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-6-154

    authors: Laha T,Kewgrai N,Loukas A,Brindley PJ

    更新日期:2005-11-04 00:00:00

  • Genome-wide identification of novel intergenic enhancer-like elements: implications in the regulation of transcription in Plasmodium falciparum.

    abstract:BACKGROUND:The molecular mechanisms of transcriptional regulation are poorly understood in Plasmodium falciparum. In addition, most of the genes in Plasmodium falciparum are transcriptionally poised and only a handful of cis-regulatory elements are known to operate in transcriptional regulation. Here, we employed an ep...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-017-4052-4

    authors: Ubhe S,Rawat M,Verma S,Anamika K,Karmodiya K

    更新日期:2017-08-23 00:00:00

  • PLAIDOH: a novel method for functional prediction of long non-coding RNAs identifies cancer-specific LncRNA activities.

    abstract:BACKGROUND:Long non-coding RNAs (lncRNAs) exhibit remarkable cell-type specificity and disease association. LncRNA's functional versatility includes epigenetic modification, nuclear domain organization, transcriptional control, regulation of RNA splicing and translation, and modulation of protein activity. However, mos...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-5497-4

    authors: Pyfrom SC,Luo H,Payton JE

    更新日期:2019-02-15 00:00:00

  • Unravelling the complex trait of harvest index in rapeseed (Brassica napus L.) with association mapping.

    abstract:BACKGROUND:Harvest index (HI), the ratio of grain yield to total biomass, is considered as a measure of biological success in partitioning assimilated photosynthate to the harvestable product. While crop production can be dramatically improved by increasing HI, the underlying molecular genetic mechanism of HI in rapese...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1607-0

    authors: Luo X,Ma C,Yue Y,Hu K,Li Y,Duan Z,Wu M,Tu J,Shen J,Yi B,Fu T

    更新日期:2015-05-12 00:00:00

  • Genome and Transcriptome sequence of Finger millet (Eleusine coracana (L.) Gaertn.) provides insights into drought tolerance and nutraceutical properties.

    abstract:BACKGROUND:Finger millet (Eleusine coracana (L.) Gaertn.) is an important staple food crop widely grown in Africa and South Asia. Among the millets, finger millet has high amount of calcium, methionine, tryptophan, fiber, and sulphur containing amino acids. In addition, it has C4 photosynthetic carbon assimilation mech...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-017-3850-z

    authors: Hittalmani S,Mahesh HB,Shirke MD,Biradar H,Uday G,Aruna YR,Lohithaswa HC,Mohanrao A

    更新日期:2017-06-15 00:00:00

  • Heterologous oligonucleotide microarrays for transcriptomics in a non-model species; a proof-of-concept study of drought stress in Musa.

    abstract:BACKGROUND:'Systems-wide' approaches such as microarray RNA-profiling are ideally suited to the study of the complex overlapping responses of plants to biotic and abiotic stresses. However, commercial microarrays are only available for a limited number of plant species and development costs are so substantial as to be ...

    journal_title:BMC genomics

    pub_type: 杂志文章,meta分析

    doi:10.1186/1471-2164-10-436

    authors: Davey MW,Graham NS,Vanholme B,Swennen R,May ST,Keulemans J

    更新日期:2009-09-16 00:00:00

  • miR-27b shapes the presynaptic transcriptome and influences neurotransmission by silencing the polycomb group protein Bmi1.

    abstract:BACKGROUND:MicroRNAs (miRNAs) are short non-coding RNAs that are emerging as important post-transcriptional regulators of neuronal and synaptic development. The precise impact of miRNAs on presynaptic function and neurotransmission remains, however, poorly understood. RESULTS:Here, we identify miR-27b-an abundant neur...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-3139-7

    authors: Poon VY,Gu M,Ji F,VanDongen AM,Fivaz M

    更新日期:2016-10-04 00:00:00

  • Identification and functional analysis of early gene expression induced by circadian light-resetting in Drosophila.

    abstract:BACKGROUND:The environmental light-dark cycle is the dominant cue that maintains 24-h biological rhythms in multicellular organisms. In Drosophila, light entrainment is mediated by the photosensitive protein CRYPTOCHROME, but the role and extent of transcription regulation in light resetting of the dipteran clock is ye...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1787-7

    authors: Adewoye AB,Kyriacou CP,Tauber E

    更新日期:2015-08-01 00:00:00

  • Celiac disease T-cell epitopes from gamma-gliadins: immunoreactivity depends on the genome of origin, transcript frequency, and flanking protein variation.

    abstract:BACKGROUND:Celiac disease (CD) is caused by an uncontrolled immune response to gluten, a heterogeneous mixture of wheat storage proteins. The CD-toxicity of these proteins and their derived peptides is depending on the presence of specific T-cell epitopes (9-mer peptides; CD epitopes) that mediate the stimulation of HL...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-277

    authors: Salentijn EM,Mitea DC,Goryunova SV,van der Meer IM,Padioleau I,Gilissen LJ,Koning F,Smulders MJ

    更新日期:2012-06-22 00:00:00

  • Systems perspectives on erythromycin biosynthesis by comparative genomic and transcriptomic analyses of S. erythraea E3 and NRRL23338 strains.

    abstract:BACKGROUND:S. erythraea is a Gram-positive filamentous bacterium used for the industrial-scale production of erythromycin A which is of high clinical importance. In this work, we sequenced the whole genome of a high-producing strain (E3) obtained by random mutagenesis and screening from the wild-type strain NRRL23338, ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-523

    authors: Li YY,Chang X,Yu WB,Li H,Ye ZQ,Yu H,Liu BH,Zhang Y,Zhang SL,Ye BC,Li YX

    更新日期:2013-07-31 00:00:00

  • Oocyte-somatic cells interactions, lessons from evolution.

    abstract:BACKGROUND:Despite the known importance of somatic cells for oocyte developmental competence acquisition, the overall mechanisms underlying the acquisition of full developmental competence are far from being understood, especially in non-mammalian species. The present work aimed at identifying key molecular signals fro...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-560

    authors: Charlier C,Montfort J,Chabrol O,Brisard D,Nguyen T,Le Cam A,Richard-Parpaillon L,Moreews F,Pontarotti P,Uzbekova S,Chesnel F,Bobe J

    更新日期:2012-10-19 00:00:00

  • The alcohol dehydrogenase gene family in sugarcane and its involvement in cold stress regulation.

    abstract:BACKGROUND:Alcohol dehydrogenases (ADHs) in plants are encoded by a multigene family. ADHs participate in growth, development, and adaptation in many plant species, but the evolution and function of the ADH gene family in sugarcane is still unclear. RESULTS:In the present study, 151 ADH genes from 17 species including...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-020-06929-9

    authors: Su W,Ren Y,Wang D,Su Y,Feng J,Zhang C,Tang H,Xu L,Muhammad K,Que Y

    更新日期:2020-07-29 00:00:00

  • Dual activation of pathways regulated by steroid receptors and peptide growth factors in primary prostate cancer revealed by Factor Analysis of microarray data.

    abstract:BACKGROUND:We use an approach based on Factor Analysis to analyze datasets generated for transcriptional profiling. The method groups samples into biologically relevant categories, and enables the identification of genes and pathways most significantly associated to each phenotypic group, while allowing for the partici...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-6-109

    authors: Lozano JJ,Soler M,Bermudo R,Abia D,Fernandez PL,Thomson TM,Ortiz AR

    更新日期:2005-08-17 00:00:00

  • Transcriptome dynamics of Arabidopsis thaliana root penetration by the oomycete pathogen Phytophthora parasitica.

    abstract:BACKGROUND:Oomycetes are a group of filamentous microorganisms that includes both animal and plant pathogens and causes major agricultural losses. Phytophthora species can infect most crops and plants from natural ecosystems. Despite their tremendous economic and ecologic importance, few effective methods exist for lim...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-538

    authors: Attard A,Evangelisti E,Kebdani-Minet N,Panabières F,Deleury E,Maggio C,Ponchet M,Gourgues M

    更新日期:2014-06-29 00:00:00

  • σ54-dependent regulome in Desulfovibrio vulgaris Hildenborough.

    abstract:BACKGROUND:The σ(54) subunit controls a unique class of promoters in bacteria. Such promoters, without exception, require enhancer binding proteins (EBPs) for transcription initiation. Desulfovibrio vulgaris Hildenborough, a model bacterium for sulfate reduction studies, has a high number of EBPs, more than most sequen...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-2176-y

    authors: Kazakov AE,Rajeev L,Chen A,Luning EG,Dubchak I,Mukhopadhyay A,Novichkov PS

    更新日期:2015-11-10 00:00:00

  • Highly expressed captured genes and cross-kingdom domains present in Helitrons create novel diversity in Pleurotus ostreatus and other fungi.

    abstract:BACKGROUND:Helitrons are class-II eukaryotic transposons that transpose via a rolling circle mechanism. Due to their ability to capture and mobilize gene fragments, they play an important role in the evolution of their host genomes. We have used a bioinformatics approach for the identification of helitrons in two Pleur...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-1071

    authors: Castanera R,Pérez G,López L,Sancho R,Santoyo F,Alfaro M,Gabaldón T,Pisabarro AG,Oguiza JA,Ramírez L

    更新日期:2014-12-05 00:00:00

  • Comparative DNA methylome analysis of endometrial carcinoma reveals complex and distinct deregulation of cancer promoters and enhancers.

    abstract:BACKGROUND:Aberrant DNA methylation is a hallmark of many cancers. Classically there are two types of endometrial cancer, endometrioid adenocarcinoma (EAC), or Type I, and uterine papillary serous carcinoma (UPSC), or Type II. However, the whole genome DNA methylation changes in these two classical types of endometrial...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-868

    authors: Zhang B,Xing X,Li J,Lowdon RF,Zhou Y,Lin N,Zhang B,Sundaram V,Chiappinelli KB,Hagemann IS,Mutch DG,Goodfellow PJ,Wang T

    更新日期:2014-10-06 00:00:00

  • How to evaluate performance of prediction methods? Measures and their interpretation in variation effect analysis.

    abstract:BACKGROUND:Prediction methods are increasingly used in biosciences to forecast diverse features and characteristics. Binary two-state classifiers are the most common applications. They are usually based on machine learning approaches. For the end user it is often problematic to evaluate the true performance and applica...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-S4-S2

    authors: Vihinen M

    更新日期:2012-06-18 00:00:00

  • Generation of a reference transcriptome for evaluating rainbow trout responses to various stressors.

    abstract:BACKGROUND:Fish under intensive culture conditions are exposed to a variety of acute and chronic stressors, including high rearing densities, sub-optimal water quality, and severe thermal fluctuations. Such stressors are inherent in aquaculture production and can induce physiological responses with adverse effects on t...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-12-626

    authors: Sánchez CC,Weber GM,Gao G,Cleveland BM,Yao J,Rexroad CE 3rd

    更新日期:2011-12-21 00:00:00

  • CEG: a database of essential gene clusters.

    abstract:BACKGROUND:Essential genes are indispensable for the survival of living entities. They are the cornerstones of synthetic biology, and are potential candidate targets for antimicrobial and vaccine design. DESCRIPTION:Here we describe the Cluster of Essential Genes (CEG) database, which contains clusters of orthologous ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-769

    authors: Ye YN,Hua ZG,Huang J,Rao N,Guo FB

    更新日期:2013-11-09 00:00:00

  • An improved approach for the segmentation of starch granules in microscopic images.

    abstract:BACKGROUND:Starches are the main storage polysaccharides in plants and are distributed widely throughout plants including seeds, roots, tubers, leaves, stems and so on. Currently, microscopic observation is one of the most important ways to investigate and analyze the structure of starches. The position, shape, and siz...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-S2-S13

    authors: Guo S,Tang J,Deng Y,Xia Q

    更新日期:2010-11-02 00:00:00

  • Variant detection and runs of homozygosity in next generation sequencing data elucidate the genetic background of Lundehund syndrome.

    abstract:BACKGROUND:The Lundehund is a highly specialized breed characterized by a unique flexibility of the joints and polydactyly in all four limbs. The extremely small population size and high inbreeding has promoted a high frequency of diseased dogs affected by the Lundehund syndrome (LS), a severe gastro-enteropathic disea...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-2844-6

    authors: Metzger J,Pfahler S,Distl O

    更新日期:2016-08-02 00:00:00

  • A manually annotated Actinidia chinensis var. chinensis (kiwifruit) genome highlights the challenges associated with draft genomes and gene prediction in plants.

    abstract:BACKGROUND:Most published genome sequences are drafts, and most are dominated by computational gene prediction. Draft genomes typically incorporate considerable sequence data that are not assigned to chromosomes, and predicted genes without quality confidence measures. The current Actinidia chinensis (kiwifruit) 'Hongy...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-4656-3

    authors: Pilkington SM,Crowhurst R,Hilario E,Nardozza S,Fraser L,Peng Y,Gunaseelan K,Simpson R,Tahir J,Deroles SC,Templeton K,Luo Z,Davy M,Cheng C,McNeilage M,Scaglione D,Liu Y,Zhang Q,Datson P,De Silva N,Gardiner SE,Bas

    更新日期:2018-04-16 00:00:00

  • Genome-wide association and prediction of direct genomic breeding values for composition of fatty acids in Angus beef cattle.

    abstract:BACKGROUND:As consumers continue to request food products that have health advantages, it will be important for the livestock industry to supply a product that meet these demands. One such nutrient is fatty acids, which have been implicated as playing a role in cardiovascular disease. Therefore, the objective of this s...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-730

    authors: Saatchi M,Garrick DJ,Tait RG Jr,Mayes MS,Drewnoski M,Schoonmaker J,Diaz C,Beitz DC,Reecy JM

    更新日期:2013-10-25 00:00:00

  • Muscle regeneration in dystrophin-deficient mdx mice studied by gene expression profiling.

    abstract:BACKGROUND:Duchenne muscular dystrophy (DMD), caused by mutations in the dystrophin gene, is lethal. In contrast, dystrophin-deficient mdx mice recover due to effective regeneration of affected muscle tissue. To characterize the molecular processes associated with regeneration, we compared gene expression levels in hin...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-6-98

    authors: Turk R,Sterrenburg E,de Meijer EJ,van Ommen GJ,den Dunnen JT,'t Hoen PA

    更新日期:2005-07-13 00:00:00

  • Transcriptomic analysis of flower induction for long-day pitaya by supplementary lighting in short-day winter season.

    abstract:BACKGROUND:Pitayas are currently attracting considerable interest as a tropical fruit with numerous health benefits. However, as a long-day plant, pitaya plants cannot flower in the winter season from November to April in Hainan, China. To harvest pitayas with high economic value in the winter season, it is necessary t...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-020-6726-6

    authors: Xiong R,Liu C,Xu M,Wei SS,Huang JQ,Tang H

    更新日期:2020-04-29 00:00:00

  • Transcriptomic analyses reveal physiological changes in sweet orange roots affected by citrus blight.

    abstract:BACKGROUND:Citrus blight is a very important progressive decline disease of commercial citrus. The etiology is unknown, although the disease can be transmitted by root grafts, suggesting a viral etiology. Diagnosis is made by demonstrating physical blockage of xylem cells that prevents the movement of water. This test ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-6339-0

    authors: Fu S,Shao J,Roy A,Brlansky RH,Zhou C,Hartung JS

    更新日期:2019-12-11 00:00:00

  • Systems genetics analysis of body weight and energy metabolism traits in Drosophila melanogaster.

    abstract:BACKGROUND:Obesity and phenotypic traits associated with this condition exhibit significant heritability in natural populations of most organisms. While a number of genes and genetic pathways have been implicated to play a role in obesity associated traits, the genetic architecture that underlies the natural variation ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-297

    authors: Jumbo-Lucioni P,Ayroles JF,Chambers MM,Jordan KW,Leips J,Mackay TF,De Luca M

    更新日期:2010-05-11 00:00:00