Genotype calling in tetraploid species from bi-allelic marker data using mixture models.

Abstract:

BACKGROUND: Automated genotype calling in tetraploid species was until recently not possible, which hampered genetic analysis. Modern genotyping assays often produce two signals, one for each allele of a bi-allelic marker. While ample software is available to obtain genotypes (homozygous for either allele, or heterozygous) for diploid species from these signals, such software is not available for tetraploid species which may be scored as five alternative genotypes (aaaa, baaa, bbaa, bbba and bbbb; nulliplex to quadruplex).

RESULTS: We present a novel algorithm, implemented in the R package fitTetra, to assign genotypes for bi-allelic markers to tetraploid samples from genotyping assays that produce intensity signals for both alleles. The algorithm is based on the fitting of several mixture models with five components, one for each of the five possible genotypes. The models have different numbers of parameters specifying the relation between the five component means, and some of them impose a constraint on the mixing proportions to conform to Hardy-Weinberg equilibrium (HWE) ratios. The software rejects markers that do not allow a reliable genotyping for the majority of the samples, and it assigns a missing score to samples that cannot be scored into one of the five possible genotypes with sufficient confidence.

CONCLUSIONS: We have validated the software with data of a collection of 224 potato varieties assayed with an Illumina GoldenGate™ 384 SNP array and shown that all SNPs with informative ratio distributions are fitted. Almost all fitted models appear to be correct based on visual inspection and comparison with diploid samples. When the collection of potato varieties is analyzed as if it were a population, almost all markers seem to be in Hardy-Weinberg equilibrium. The R package fitTetra is freely available under the GNU Public License from http://www.plantbreeding.wur.nl/UK/software_fitTetra.html and as Additional files with this article.
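The two ideas in the abstract, HWE-constrained mixing proportions and a missing score for low-confidence samples, can be illustrated with a minimal Python sketch. This is not the fitTetra implementation (which is an R package and fits its models to the data). It assumes that the HWE ratios for the five dosage classes follow a binomial expansion with four draws, that each component is a normal distribution on a signal ratio with a shared standard deviation, and that the component means are already known. The function names, the means, the standard deviation, and the 0.95 posterior threshold are all hypothetical values chosen for illustration.

```python
from math import comb, exp, pi, sqrt

# The five tetraploid genotypes, ordered by dosage of allele b (0 to 4).
GENOTYPES = ["aaaa", "baaa", "bbaa", "bbba", "bbbb"]


def hwe_mixing_proportions(q):
    """Expected genotype frequencies for dosages 0..4 of allele b.

    Assumption: under HWE the dosage is binomial with 4 draws and
    allele frequency q (no double reduction).
    """
    return [comb(4, k) * q**k * (1 - q) ** (4 - k) for k in range(5)]


def normal_pdf(x, mean, sd):
    """Density of a normal distribution at x."""
    return exp(-0.5 * ((x - mean) / sd) ** 2) / (sd * sqrt(2 * pi))


def call_genotype(ratio, means, sd, weights, min_posterior=0.95):
    """Assign one of the five genotypes, or None (missing score).

    ratio:   observed signal ratio for one sample (hypothetical scale 0..1)
    means:   five component means, one per genotype
    sd:      shared component standard deviation (an assumption)
    weights: five mixing proportions, e.g. from hwe_mixing_proportions
    """
    weighted = [w * normal_pdf(ratio, m, sd) for w, m in zip(weights, means)]
    total = sum(weighted)
    if total == 0.0:
        return None  # ratio is far from every component
    posteriors = [value / total for value in weighted]
    best = max(range(5), key=posteriors.__getitem__)
    # Samples without a sufficiently dominant component stay unscored.
    return GENOTYPES[best] if posteriors[best] >= min_posterior else None


if __name__ == "__main__":
    weights = hwe_mixing_proportions(0.5)
    means = [0.0, 0.25, 0.5, 0.75, 1.0]  # illustrative, not fitted values
    print(weights)
    print(call_genotype(0.26, means, 0.04, weights))
    print(call_genotype(0.375, means, 0.04, weights))
```

In this sketch a ratio close to one component mean receives that genotype, while a ratio halfway between two means has no dominant posterior and is returned as missing. A real fit would also estimate the means, variances, and proportions from the data and compare several candidate models, as the abstract describes.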

journal_name

BMC Bioinformatics

authors

Voorrips RE, Gort G, Vosman B

doi

10.1186/1471-2105-12-172

pub_date

2011-05-19

pages

172

issn

1471-2105

pii

1471-2105-12-172

journal_volume

12

pub_type

Journal article
