The impact of contaminants on the accuracy of genome skimming and the effectiveness of exclusion read filters.

Abstract:

The ability to detect the identity of a sample obtained from its environment is a cornerstone of molecular ecological research. Thanks to the falling price of shotgun sequencing, genome skimming, the acquisition of short reads spread across the genome at low coverage, is emerging as an alternative to traditional barcoding. By obtaining far more data across the whole genome, skimming has the promise to increase the precision of sample identification beyond traditional barcoding while keeping the costs manageable. While methods for assembly-free sample identification based on genome skims are now available, little is known about how these methods react to the presence of DNA from organisms other than the target species. In this paper, we show that the accuracy of distances computed between a pair of genome skims based on k-mer similarity can degrade dramatically if the skims include contaminant reads; i.e., any reads originating from other organisms. We establish a theoretical model of the impact of contamination. We then suggest and evaluate a solution to the contamination problem: Query reads in a genome skim against an extensive database of possible contaminants (e.g., all microbial organisms) and filter out any read that matches. We evaluate the effectiveness of this strategy when implemented using Kraken-II, in detailed analyses. Our results show substantial improvements in accuracy as a result of filtering but also point to limitations, including a need for relatively close matches in the contaminant database.
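
The effect described in the abstract can be illustrated with a toy sketch. It is not the authors' method: the distance below is a Mash-style transform of the Jaccard index between k-mer sets, the filter is an exact k-mer lookup standing in for a Kraken-II query, and every name and parameter (K, shred, filter_reads, demo, the genome sizes, the number of mutations) is a hypothetical choice made for illustration. The simulation assumes random sequences, contamination in only one of the two skims, and a contaminant that is present exactly in the database.

```python
import math
import random

K = 21  # k-mer length; an illustrative choice, not taken from the paper


def kmers(seq, k=K):
    """Return the set of all k-mers of a sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def skim_kmers(reads, k=K):
    """Union of the k-mer sets of all reads in a skim."""
    out = set()
    for read in reads:
        out |= kmers(read, k)
    return out


def jaccard(a, b):
    """Jaccard index of two sets (0.0 if both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def distance(reads_a, reads_b, k=K):
    """Mash-style distance derived from the Jaccard index of two skims."""
    j = jaccard(skim_kmers(reads_a, k), skim_kmers(reads_b, k))
    if j == 0.0:
        return 1.0
    return -math.log(2 * j / (1 + j)) / k


def filter_reads(reads, contaminant_db, k=K):
    """Keep only reads that share no k-mer with the contaminant database."""
    return [r for r in reads if kmers(r, k).isdisjoint(contaminant_db)]


def shred(genome, read_len=100, step=50):
    """Cut a genome into overlapping, error-free reads."""
    return [genome[i:i + read_len]
            for i in range(0, len(genome) - read_len + 1, step)]


def demo(seed=0):
    """Return (clean, contaminated, filtered) distances for simulated skims."""
    rng = random.Random(seed)
    genome_a = "".join(rng.choice("ACGT") for _ in range(2000))
    # Genome B: genome A with 40 substitutions (2% divergence).
    genome_b = list(genome_a)
    for pos in rng.sample(range(len(genome_b)), 40):
        genome_b[pos] = rng.choice([c for c in "ACGT" if c != genome_b[pos]])
    genome_b = "".join(genome_b)
    # Unrelated contaminant genome, added to skim A only.
    contaminant = "".join(rng.choice("ACGT") for _ in range(1000))

    reads_a, reads_b = shred(genome_a), shred(genome_b)
    contaminated_a = reads_a + shred(contaminant)
    cleaned_a = filter_reads(contaminated_a, kmers(contaminant))
    return (distance(reads_a, reads_b),
            distance(contaminated_a, reads_b),
            distance(cleaned_a, reads_b))
```

Calling demo() returns three distances. Under the assumptions above, the contaminated distance is larger than the clean one, and filtering recovers the clean value. That recovery depends on the contaminant being in the database exactly, which is the favourable case; the abstract notes that filtering is less effective when the database holds only distant relatives of the contaminant.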

journal_name

Mol Ecol Resour

authors

Rachtman E,Balaban M,Bafna V,Mirarab S

doi

10.1111/1755-0998.13135

subject

Has Abstract

pub_date

2020-05-01 00:00:00

issue

3

eissn

1755-098X

issn

1755-0998

journal_volume

20

pub_type

Journal Article
  • Characterization and analysis of a de novo transcriptome from the pygmy grasshopper Tetrix japonica.

    abstract::The pygmy grasshopper Tetrix japonica is a common insect distributed throughout the world, and it has the potential for use in studies of body colour polymorphism, genomics and the biology of Tetrigoidea (Insecta: Orthoptera). However, limited biological information is available for this insect. Here, we conducted a d...

    journal_title:Molecular ecology resources

    pub_type: Journal Article

    doi:10.1111/1755-0998.12553

    authors: Qiu Z,Liu F,Lu H,Huang Y

    update_date:2017-05-01 00:00:00

  • Development of microsatellite markers for the wood cricket, Nemobius sylvestris (Orthoptera: Gryllidae).

    abstract::Thirty-four novel microsatellite markers developed for wood cricket (Nemobius sylvestris) were tested and optimized. Twenty-five microsatellite loci were polymorphic, exhibiting between two and nine alleles. Observed heterozygosities ranged from 0.038 to 0.925. The microsatellites were also tested in a species belongi...

    journal_title:Molecular ecology resources

    pub_type: Journal Article

    doi:10.1111/j.1755-0998.2008.02269.x

    authors: Vanhala T,Cottrell J

    update_date:2008-11-01 00:00:00

  • Applicability of RAD-tag genotyping for interfamilial comparisons: empirical data from two cetaceans.

    abstract::Restriction-site-associated DNA tag (RAD-tag) sequencing has become a popular approach to generate thousands of SNPs used to address diverse questions in population genomics. Comparatively, the suitability of RAD-tag genotyping to address evolutionary questions across divergent species has been the subject of only a f...

    journal_title:Molecular ecology resources

    pub_type: Journal Article

    doi:10.1111/1755-0998.12206

    authors: Viricel A,Pante E,Dabin W,Simon-Bouhet B

    update_date:2014-05-01 00:00:00

  • Isolation and characterization of 11 polymerase chain reaction primers for microsatellite loci for the Chilean marine isopod Excirolana hirsuticauda.

    abstract::The Chilean isopod Excirolana hirsuticauda is a marine benthic brooder with wide distributional range and low potential for long-distance dispersal. Eleven microsatellite markers were developed for E. hirsuticauda using enriched libraries. Characterization of those loci in 35 individuals from Playa Blanca beach showed...

    journal_title:Molecular ecology resources

    pub_type: Journal Article

    doi:10.1111/j.1755-0998.2008.02186.x

    authors: Haye PA,Marchant S

    update_date:2008-09-01 00:00:00

  • Rapid identification of thousands of copperhead snake (Agkistrodon contortrix) microsatellite loci from modest amounts of 454 shotgun genome sequence.

    abstract::Optimal integration of next-generation sequencing into mainstream research requires re-evaluation of how problems can be reasonably overcome and what questions can be asked. One potential application is the rapid acquisition of genomic information to identify microsatellite loci for evolutionary, population genetic an...

    journal_title:Molecular ecology resources

    pub_type: Journal Article

    doi:10.1111/j.1755-0998.2009.02750.x

    authors: Castoe TA,Poole AW,Gu W,Jason de Koning AP,Daza JM,Smith EN,Pollock DD

    update_date:2010-03-01 00:00:00

  • A new version of PRT software for sibling groups reconstruction with comments regarding several issues in the sibling reconstruction problem.

    abstract::Pedigree reconstruction using genotypic markers has become an important tool for the study of natural populations. The nonstandard nature of the underlying statistical problems has led to the necessity of developing specialized statistical and computational methods. In this article, a new version of pedigree reconstru...

    journal_title:Molecular ecology resources

    pub_type: Journal Article

    doi:10.1111/j.1755-0998.2011.03061.x

    authors: Almudevar A,Anderson EC

    update_date:2012-01-01 00:00:00

  • Skyline-plot methods for estimating demographic history from nucleotide sequences.

    abstract::Estimation of demographic history from nucleotide sequences represents an important component of many studies in molecular ecology. For example, knowledge of a population's history can allow us to test hypotheses about the impact of climatic and anthropogenic factors. In the past, demographic analysis was typically li...

    journal_title:Molecular ecology resources

    pub_type: Journal Article, Review

    doi:10.1111/j.1755-0998.2011.02988.x

    authors: Ho SY,Shapiro B

    update_date:2011-05-01 00:00:00

  • Multiplex 16S rRNA haplotype-specific PCR, a rapid and convenient method for fish species identification: an application to West African Clupeiform larvae.

    abstract::A multiplex haplotype-specific polymerase chain reaction (MHS-PCR) method was developed, which identified seven Clupeiform species living in the tropical Eastern Atlantic region: Sardinella aurita, Sardinella maderensis, Ethmalosa fimbriata, Sardina pilchardus, Engraulis encrasicholus, Pellonula leonensis and Ilisha a...

    journal_title:Molecular ecology resources

    pub_type: Journal Article

    doi:10.1111/j.1755-0998.2009.02776.x

    authors: Durand JD,Diatta MA,Diop K,Trape S

    update_date:2010-05-01 00:00:00

  • PERMANENT GENETIC RESOURCES: Development of microsatellite markers of the Mexican understorey palm Chamaedorea elegans, cross-species genotyping, and amplification in congeners.

    abstract::With striking morphological diversity and adaptability, Chamaedorea palms constitute an ecologically and economically important understorey component of Neotropical forests. Nine loci developed for Chamaedorea elegans evaluated in three Veracruz populations resulted in a large number of alleles (8-18), and high expect...

    journal_title:Molecular ecology resources

    pub_type: Journal Article

    doi:10.1111/j.1471-8286.2007.01942.x

    authors: Cibrián-Jaramillo A,Hahn WJ,Desalle R

    update_date:2008-03-01 00:00:00

  • Isolation and characterization of nine microsatellite loci in an ant-tended treehopper Publilia concava.

    abstract::Publilia concava is an eastern North American membracid commonly occurring in large but spatially patchy aggregations, primarily on the host plant Solidago altissima. Like other myrmecophiles, P. concava provides sugary excretions to ants in return for the various protective, competitive or even sanitary benefits that...

    journal_title:Molecular ecology resources

    pub_type: Journal Article

    doi:10.1111/j.1755-0998.2009.02598.x

    authors: Chhatre V,Morales MA,Abbot P

    update_date:2009-07-01 00:00:00

  • Spider-specific probe set for ultraconserved elements offers new perspectives on the evolutionary history of spiders (Arachnida, Araneae).

    abstract::Phylogenomic methods have proven useful for resolving deep nodes and recalcitrant groups in the spider tree of life. Across arachnids, transcriptomic approaches may generate thousands of loci, and target-capture methods, using the previously designed arachnid-specific probe set, can target a maximum of about 1,000 loc...

    journal_title:Molecular ecology resources

    pub_type: Journal Article

    doi:10.1111/1755-0998.13099

    authors: Kulkarni S,Wood H,Lloyd M,Hormiga G

    update_date:2020-01-01 00:00:00

  • Isolation and characterization of microsatellites in the bird-pollinated, autohexaploid, Eremophila glabra ssp. glabra (R.Br. (Ostenf.)) (Myoporaceae), an Australian endemic plant.

    abstract::Thirty-eight microsatellite loci were developed for the bird pollinated, autohexaploid, Eremophila glabra ssp. glabra. A genomic library was screened with dinucleotide and trinucleotide sequence repeats. Polymorphism ranged from one to 21 alleles per locus. Twenty-four loci exhibited null alleles, based on patterns of...

    journal_title:Molecular ecology resources

    pub_type: Journal Article

    doi:10.1111/j.1755-0998.2009.02618.x

    authors: Elliott CP

    update_date:2009-07-01 00:00:00

  • DNA metabarcoding multiplexing and validation of data accuracy for diet assessment: application to omnivorous diet.

    abstract::Ecological understanding of the role of consumer-resource interactions in natural food webs is limited by the difficulty of accurately and efficiently determining the complex variety of food types animals have eaten in the field. We developed a method based on DNA metabarcoding multiplexing and next-generation sequenc...

    journal_title:Molecular ecology resources

    pub_type: Journal Article

    doi:10.1111/1755-0998.12188

    authors: De Barba M,Miquel C,Boyer F,Mercier C,Rioux D,Coissac E,Taberlet P

    update_date:2014-03-01 00:00:00

  • A hierarchical Bayesian Beta regression approach to study the effects of geographical genetic structure and spatial autocorrelation on species distribution range shifts.

    abstract::Global climate change (GCC) may be causing distribution range shifts in many organisms worldwide. Multiple efforts are currently focused on the development of models to better predict distribution range shifts due to GCC. We addressed this issue by including intraspecific genetic structure and spatial autocorrelation ...

    journal_title:Molecular ecology resources

    pub_type: Journal Article

    doi:10.1111/1755-0998.13024

    authors: Martínez-Minaya J,Conesa D,Fortin MJ,Alonso-Blanco C,Picó FX,Marcer A

    update_date:2019-07-01 00:00:00

  • Ten new microsatellite markers for the buttonwood mangrove (Conocarpus erectus L., Combretaceae).

    abstract::We present 10 microsatellite markers for the buttonwood mangrove, Conocarpus erectus, a wide-range mangrove associate species. Polymorphism was assessed among individuals from six different populations along the Pacific Coast of Mexico and Costa Rica, as well as in two individuals from the Yucatan Peninsula in the Atl...

    journal_title:Molecular ecology resources

    pub_type: Journal Article

    doi:10.1111/j.1755-0998.2008.02088.x

    authors: Nettel A,Dodd RS,Cid-Becerra JA,DE LA Rosa-Velez J

    update_date:2008-07-01 00:00:00

  • Optimized construction of microsatellite-enriched libraries.

    abstract::The construction of microsatellite-enriched libraries is an indispensable tool to search for molecular markers as complete genome sequences are still not available for the majority of species of interest. Numerous protocols are available in the literature for the construction of these libraries; however, sometimes the...

    journal_title:Molecular ecology resources

    pub_type: Journal Article

    doi:10.1111/j.1755-0998.2009.02802.x

    authors: Techen N,Arias RS,Glynn NC,Pan Z,Khan IA,Scheffler BE

    update_date:2010-05-01 00:00:00

  • Permanent genetic resources added to molecular ecology resources database 1 October 2012-30 November 2012.

    abstract::This article documents the addition of 153 microsatellite marker loci to the Molecular Ecology Resources Database. Loci were developed for the following species: Brassica oleracea, Brycon amazonicus, Dimorphandra wilsonii, Eupallasella percnurus, Helleborus foetidus, Ipomoea purpurea, Phrynops geoffroanus, Prochilodus...

    journal_title:Molecular ecology resources

    pub_type: Journal Article

    doi:10.1111/1755-0998.12061

    authors: Molecular Ecology Resources Primer Development Consortium.,Aksoy S,Almeida-Val VM,Azevedo VC,Baucom R,Bazaga P,Beheregaray LB,Bennetzen JL,Brassaloti RA,Burgess TI,Caccone A,Chang SM,Ciampi AY,Ciancaleoni S,Clímaco GT,Cloue

    update_date:2013-03-01 00:00:00

  • Identification of molecular markers for DNA barcoding in the Aphidiinae (Hym. Braconidae).

    abstract::Reliable identification of Aphidiinae species (Braconidae) is a prerequisite for conducting studies on aphid-parasitoid interactions at the community level. However, morphological identification of Aphidiinae species remains problematic even for specialists and is almost impossible with larval stages. Here, we compare...

    journal_title:Molecular ecology resources

    pub_type: Journal Article

    doi:10.1111/j.1755-0998.2011.03083.x

    authors: Derocles SA,LE Ralec A,Plantegenest M,Chaubet B,Cruaud C,Cruaud A,Rasplus JY

    update_date:2012-03-01 00:00:00

  • Strategies for complete plastid genome sequencing.

    abstract::Plastid sequencing is an essential tool in the study of plant evolution. This high-copy organelle is one of the most technically accessible regions of the genome, and its sequence conservation makes it a valuable region for comparative genome evolution, phylogenetic analysis and population studies. Here, we discuss re...

    journal_title:Molecular ecology resources

    pub_type: Journal Article, Review

    doi:10.1111/1755-0998.12626

    authors: Twyford AD,Ness RW

    update_date:2017-09-01 00:00:00

  • PERMANENT GENETIC RESOURCES: Consensus primers of cyp73 genes discriminate willow species and hybrids (Salix, Salicaceae).

    abstract::Consensus primers, based on exon sequences of the cyp73 gene family coding for cinnamate 4-hydroxylase (C4H) of the lignin biosynthesis pathway, were designed for the tetraploid willow species Salix alba and Salix fragilis. Diagnostic alleles at species level were observed among introns of three cyp73 genes and allowe...

    journal_title:Molecular ecology resources

    pub_type: Journal Article

    doi:10.1111/j.1471-8286.2007.01991.x

    authors: Trung le Q,VAN Puyvelde K,Triest L

    update_date:2008-03-01 00:00:00

  • Microsatellite markers for mungbean developed from sequence database.

    abstract::A novel set of microsatellite markers for mungbean [Vigna radiata (L.) Wilczek] was developed from the public sequence database. Seventy-eight primers were designed and evaluated for polymorphism among 22 cultivated accessions. Eight polymorphic loci detected two to three alleles per locus with an average of 2.25. The...

    journal_title:Molecular ecology resources

    pub_type: Journal Article

    doi:10.1111/j.1755-0998.2009.02655.x

    authors: Seehalak W,Somta P,Sommanas W,Srinives P

    update_date:2009-05-01 00:00:00

  • Leader of the pack: faecal pellet deposition order impacts PCR amplification in wombats.

    abstract::DNA sourced from faeces is notoriously less reliable than that from tissue. Hence, understanding whether faecal pellet quality varies within faecal piles may be important for sample selection. We hypothesized that the order in which faecal pellets are deposited may influence microsatellite polymerase chain reaction (P...

    journal_title:Molecular ecology resources

    pub_type: Journal Article

    doi:10.1111/j.1755-0998.2009.02582.x

    authors: Walker FM,Horsup A,Taylor AC

    update_date:2009-05-01 00:00:00

  • Isolation and characterization of microsatellite loci for the Potentilla core group (Rosaceae) using 454 sequencing.

    abstract::Microsatellites are valuable markers for the analysis of genetic diversity, linkage mapping or genotyping. The limited availability of microsatellites for the genus Potentilla (Rosaceae) stipulated the isolation of markers from a representative (Potentilla pusilla Host) of the Potentilla core group that constitutes th...

    journal_title:Molecular ecology resources

    pub_type: Journal Article

    doi:10.1111/j.1755-0998.2012.03134.x

    authors: Dobeš Ch,Scheffknecht S

    update_date:2012-07-01 00:00:00

  • Genomic Resources Notes Accepted 1 June 2015-31 July 2015.

    abstract::This article documents the public availability of (i) microbiomes in diet and gut of larvae from the dipteran Dilophus febrilis using massive parallel sequencing, (ii) SNP and SSR discovery and characterization in the transcriptome of the Atlantic mackerel (Scomber scombrus, L) and (iii) assembled transcriptome for an...

    journal_title:Molecular ecology resources

    pub_type: Journal Article

    doi:10.1111/1755-0998.12454

    authors: Genomic Resources Development Consortium.,Álvarez P,Arthofer W,Coelho MM,Conklin D,Estonba A,Grosso AR,Helyar SJ,Langa J,Machado MP,Montes I,Pinho J,Rief A,Schartl M,Schlick-Steiner BC,Seeber J,Steiner FM,Vilas C

    update_date:2015-11-01 00:00:00

  • A robust, cost-effective method for DNA, RNA and protein co-extraction from soil, other complex microbiomes and pure cultures.

    abstract::The soil microbiome is inherently complex with high biological diversity, and spatial heterogeneity typically occurring on the submillimetre scale. To study the microbial ecology of soils, and other microbiomes, biomolecules, that is, nucleic acids and proteins, must be efficiently and reliably co-recovered from the s...

    journal_title:Molecular ecology resources

    pub_type: Journal Article

    doi:10.1111/1755-0998.12979

    authors: Thorn CE,Bergesch C,Joyce A,Sambrano G,McDonnell K,Brennan F,Heyer R,Benndorf D,Abram F

    update_date:2019-03-01 00:00:00

  • Multiscale resistant kernel surfaces derived from inferred gene flow: An application with vernal pool breeding salamanders.

    abstract::The importance of assessing spatial data at multiple scales when modelling species-environment relationships has been highlighted by several empirical studies. However, no landscape genetics studies have optimized landscape resistance surfaces by evaluating relevant spatial predictors at multiple spatial scales. Here,...

    journal_title:Molecular ecology resources

    pub_type: Journal Article

    doi:10.1111/1755-0998.13089

    authors: Winiarski KJ,Peterman WE,Whiteley AR,McGarigal K

    update_date:2020-01-01 00:00:00

  • Developing a series of conservative anchor markers and their application to phylogenomics of Laurasiatherian mammals.

    abstract::The availability of numerous universal markers and suitable phylogenetic analysis methods are both very important for phylogenomics inference. Based on PCR amplification, a total of 122 markers, which were amplified in 19 representative species, were developed for Laurasiatherian phylogenomics. Subsequently, we illust...

    journal_title:Molecular ecology resources

    pub_type: Journal Article

    doi:10.1111/j.1755-0998.2010.02903.x

    authors: Zhou X,Xu S,Zhang P,Yang G

    update_date:2011-01-01 00:00:00

  • Cross-species amplification of 92 microsatellites of Medicago truncatula.

    abstract::Medicago species are important genetic sources for forage crops and nitrogen sources for various ecosystems. The ongoing genome sequencing of the model legume, Medicago truncatula, provides a wealth of genetic markers potentially useful for characterizing the population genetic structure and evolutionary history, and ...

    journal_title:Molecular ecology resources

    pub_type: Journal Article

    doi:10.1111/j.1755-0998.2009.02730.x

    authors: Chu HJ,Yan J,Hu Y,Wang HC,Li JQ

    update_date:2010-01-01 00:00:00

  • Diurnal variation in opsin expression and common housekeeping genes necessitates comprehensive normalization methods for quantitative real-time PCR analyses.

    abstract::To determine the visual sensitivities of an organism of interest, quantitative reverse transcription-polymerase chain reaction (qRT-PCR) is often used to quantify expression of the light-sensitive opsins in the retina. While qRT-PCR is an affordable, high-throughput method for measuring expression, it comes with inher...

    journal_title:Molecular ecology resources

    pub_type: Journal Article

    doi:10.1111/1755-0998.13062

    authors: Yourick MR,Sandkam BA,Gammerdinger WJ,Escobar-Camacho D,Nandamuri SP,Clark FE,Joyce B,Conte MA,Kocher TD,Carleton KL

    update_date:2019-11-01 00:00:00

  • Extracting DNA from 'jaws': high yield and quality from archived tiger shark (Galeocerdo cuvier) skeletal material.

    abstract::Archived specimens are highly valuable sources of DNA for retrospective genetic/genomic analysis. However, often limited effort has been made to evaluate and optimize extraction methods, which may be crucial for downstream applications. Here, we assessed and optimized the usefulness of abundant archived skeletal mater...

    journal_title:Molecular ecology resources

    pub_type: Journal Article

    doi:10.1111/1755-0998.12580

    authors: Nielsen EE,Morgan JAT,Maher SL,Edson J,Gauthier M,Pepperell J,Holmes BJ,Bennett MB,Ovenden JR

    update_date:2017-05-01 00:00:00