The effect of 16S rRNA region choice on bacterial community metabarcoding results.

Abstract:

In this work, we compare the resolution of the V2-V3 and V3-V4 16S rRNA regions for estimating microbial community diversity from paired-end Illumina MiSeq reads, and show that the fragment spanning the V2 and V3 regions has higher resolution for lower-rank taxa (genera and species). It allows more precise distance-based clustering of reads into species-level OTUs. Statistically convergent estimates of the diversity of major species (defined as those that together are covered by 95% of reads) can be achieved at sample sizes of 10000 to 15000 reads. The relative error of the Shannon index estimate under this condition is below 4%.
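The quantities named in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the function names (`shannon_index`, `major_taxa`, `subsample_relative_error`) and the toy input are hypothetical, the natural logarithm is assumed for the Shannon index, and random subsampling without replacement is assumed as the way to probe a given read depth.

```python
import math
import random
from collections import Counter


def shannon_index(counts):
    """Shannon index H = -sum(p_i * ln p_i) over taxa with non-zero counts."""
    counts = list(counts)
    total = sum(counts)
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)


def major_taxa(counts, coverage=0.95):
    """Most abundant taxa that together cover `coverage` of all reads.

    `counts` maps a taxon label to its read count (hypothetical input format).
    """
    total = sum(counts.values())
    kept, covered = [], 0
    for taxon, c in sorted(counts.items(), key=lambda kv: kv[1], reverse=True):
        if covered >= coverage * total:
            break
        kept.append(taxon)
        covered += c
    return kept


def subsample_relative_error(reads, depth, seed=0):
    """Relative error of H at `depth` reads versus H computed on all reads.

    `reads` is a list with one taxon label per read; subsampling is done
    without replacement (an assumption, not taken from the source).
    """
    rng = random.Random(seed)
    full = shannon_index(Counter(reads).values())
    sub = shannon_index(Counter(rng.sample(reads, depth)).values())
    return abs(sub - full) / full
```

With real data, one would call `subsample_relative_error` at increasing depths and check where the error settles below a chosen threshold, such as the 4% figure quoted above.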

journal_name

Sci Data

journal_title

Scientific data

authors

Bukin YS,Galachyants YP,Morozov IV,Bukin SV,Zakharenko AS,Zemskaya TI

doi

10.1038/sdata.2019.7

subject

Has Abstract

pub_date

2019-02-05 00:00:00

pages

190007

issn

2052-4463

pii

sdata20197

journal_volume

6

pub_type

  • A global compendium of human Crimean-Congo haemorrhagic fever virus occurrence.

    abstract::In order to map global disease risk, a geographic database of human Crimean-Congo haemorrhagic fever virus (CCHFV) occurrence was produced by surveying peer-reviewed literature and case reports, as well as informal online sources. Here we present this database, comprising occurrence data linked to geographic point or ...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/sdata.2015.16

    authors: Messina JP,Pigott DM,Duda KA,Brownstein JS,Myers MF,George DB,Hay SI

    updated:2015-04-14 00:00:00

  • Genome-wide identification of accessible chromatin regions in bumblebee by ATAC-seq.

    abstract::Bumblebees (Hymenoptera: Apidae) are important pollinating insects that play pivotal roles in crop production and natural ecosystem services. Although protein-coding genes in bumblebees have been extensively annotated, regulatory sequences of the genome, such as promoters and enhancers, have been poorly annotated. To ...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-020-00713-w

    authors: Zhao X,Su L,Xu W,Schaack S,Sun C

    updated:2020-10-26 00:00:00

  • A lake data set for the Tibetan Plateau from the 1960s, 2005, and 2014.

    abstract::Long-term datasets of number and size of lakes over the Tibetan Plateau (TP) are among the most critical components for better understanding the interactions among the cryosphere, hydrosphere, and atmosphere at regional and global scales. Due to the harsh environment and the scarcity of data over the TP, data accumula...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/sdata.2016.39

    authors: Wan W,Long D,Hong Y,Ma Y,Yuan Y,Xiao P,Duan H,Han Z,Gu X

    updated:2016-06-21 00:00:00

  • Gene-gene and gene-environment interaction data for platinum-based chemotherapy in non-small cell lung cancer.

    abstract::Gene-gene (GXG) and gene-environment (GXE) interactions play important roles in pharmacogenetics study. Simultaneously incorporating multiple single nucleotide polymorphisms (SNPs) and clinical factors is needed to explore the association of their interactions with drug response and toxicity phenotypes. We genotyped 5...

    journal_title:Scientific data

    pub_type:

    doi:10.1038/sdata.2018.284

    authors: Wang LY,Cui JJ,Liu JY,Guo AX,Zhao ZY,Liu YZ,Wu JC,Li M,Hu CP,Gao Y,Zhou HH,Yin JY

    updated:2018-12-11 00:00:00

  • Publisher Correction: The Scales Project, a cross-national dataset on the interpretation of thermal perception scales.

    abstract::An amendment to this paper has been published and can be accessed via a link at the top of the paper. ...

    journal_title:Scientific data

    pub_type: Journal Article, Published Erratum

    doi:10.1038/s41597-019-0348-3

    authors: Schweiker M,Abdul-Zahra A,André M,Al-Atrash F,Al-Khatri H,Alprianti RR,Alsaad H,Amin R,Ampatzi E,Arsano AY,Azadeh M,Azar E,Bahareh B,Batagarawa A,Becker S,Buonocore C,Cao B,Choi JH,Chun C,Daanen H,Damiati SA,Dan

    updated:2020-01-06 00:00:00

  • Multi-year whole-blood transcriptome data for the study of onset and progression of Parkinson's Disease.

    abstract::Parkinson's disease (PD) is an age-related, chronic and progressive neurodegenerative disorder characterized by a loss of multifocal neurons, resulting in both non-motor and motor symptoms. While several genetic and environmental contributory risk factors have been identified, more exact methods for diagnosing and ass...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-019-0022-9

    authors: Valentine MNZ,Hashimoto K,Fukuhara T,Saiki S,Ishikawa KI,Hattori N,Carninci P

    updated:2019-04-05 00:00:00

  • dbPSP 2.0, an updated database of protein phosphorylation sites in prokaryotes.

    abstract::In prokaryotes, protein phosphorylation plays a critical role in regulating a broad spectrum of biological processes and occurs mainly on various amino acids, including serine (S), threonine (T), tyrosine (Y), arginine (R), aspartic acid (D), histidine (H) and cysteine (C) residues of protein substrates. Through liter...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-020-0506-7

    authors: Shi Y,Zhang Y,Lin S,Wang C,Zhou J,Peng D,Xue Y

    updated:2020-05-29 00:00:00

  • A comprehensive collection of systems biology data characterizing the host response to viral infection.

    abstract::The Systems Biology for Infectious Diseases Research program was established by the U.S. National Institute of Allergy and Infectious Diseases to investigate host-pathogen interactions at a systems level. This program generated 47 transcriptomic and proteomic datasets from 30 studies that investigate in vivo and in vi...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/sdata.2014.33

    authors: Aevermann BD,Pickett BE,Kumar S,Klem EB,Agnihothram S,Askovich PS,Bankhead A 3rd,Bolles M,Carter V,Chang J,Clauss TR,Dash P,Diercks AH,Eisfeld AJ,Ellis A,Fan S,Ferris MT,Gralinski LE,Green RR,Gritsenko MA,Hatta M

    updated:2014-10-14 00:00:00

  • Serial scanning electron microscopy of anti-PKHD1L1 immuno-gold labeled mouse hair cell stereocilia bundles.

    abstract::Serial electron microscopy techniques have proven to be a powerful tool in biology. Unfortunately, the data sets they generate lack robust and accurate automated segmentation algorithms. In this data descriptor publication, we introduce a serial focused ion beam scanning electron microscopy (FIB-SEM) dataset consistin...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-020-0509-4

    authors: Ivanchenko MV,Cicconet M,Jandal HA,Wu X,Corey DP,Indzhykulian AA

    updated:2020-06-17 00:00:00

  • An annotated fluorescence image dataset for training nuclear segmentation methods.

    abstract::Fully-automated nuclear image segmentation is the prerequisite to ensure statistically significant, quantitative analyses of tissue preparations,applied in digital pathology or quantitative microscopy. The design of segmentation methods that work independently of the tissue type or preparation is complex, due to varia...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-020-00608-w

    authors: Kromp F,Bozsaky E,Rifatbegovic F,Fischer L,Ambros M,Berneder M,Weiss T,Lazic D,Dörr W,Hanbury A,Beiske K,Ambros PF,Ambros IM,Taschner-Mandl S

    updated:2020-08-11 00:00:00

  • Transcriptome dataset of human corneal endothelium based on ribosomal RNA-depleted RNA-Seq data.

    abstract::The corneal endothelium maintains corneal transparency; consequently, damage to this endothelium by a number of pathological conditions results in severe vision loss. Publicly available expression databases of human tissues are useful for investigating the pathogenesis of diseases and for developing new therapeutic mo...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-020-00754-1

    authors: Tokuda Y,Okumura N,Komori Y,Hanada N,Tashiro K,Koizumi N,Nakano M

    updated:2020-11-20 00:00:00

  • A multi-species repository of social networks.

    abstract::Social network analysis is an invaluable tool to understand the patterns, evolution, and consequences of sociality. Comparative studies over a range of social systems across multiple taxonomic groups are particularly valuable. Such studies however require quantitative social association or interaction data across mult...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-019-0056-z

    authors: Sah P,Méndez JD,Bansal S

    updated:2019-04-29 00:00:00

  • A high-throughput drug combination screen of targeted small molecule inhibitors in cancer cell lines.

    abstract::While there is a high interest in drug combinations in cancer therapy, openly accessible datasets for drug combination responses are sparse. Here we present a dataset comprising 171 pairwise combinations of 19 individual drugs targeting signal transduction mechanisms across eight cancer cell lines, where the effect of...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-019-0255-7

    authors: Flobak Å,Niederdorfer B,Nakstad VT,Thommesen L,Klinkenberg G,Lægreid A

    updated:2019-10-29 00:00:00

  • Harmonised LUCAS in-situ land cover and use database for field surveys from 2006 to 2018 in the European Union.

    abstract::Accurately characterizing land surface changes with Earth Observation requires geo-located ground truth. In the European Union (EU), a tri-annual surveyed sample of land cover and land use has been collected since 2006 under the Land Use/Cover Area frame Survey (LUCAS). A total of 1351293 observations at 651780 unique...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-020-00675-z

    authors: d'Andrimont R,Yordanov M,Martinez-Sanchez L,Eiselt B,Palmieri A,Dominici P,Gallego J,Reuter HI,Joebges C,Lemoine G,van der Velde M

    updated:2020-10-16 00:00:00

  • De novo transcriptomes of 14 gammarid individuals for proteogenomic analysis of seven taxonomic groups.

    abstract::Gammarids are amphipods found worldwide distributed in fresh and marine waters. They play an important role in aquatic ecosystems and are well established sentinel species in ecotoxicology. In this study, we sequenced the transcriptomes of a male individual and a female individual for seven different taxonomic groups ...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-019-0192-5

    authors: Cogne Y,Degli-Esposti D,Pible O,Gouveia D,François A,Bouchez O,Eché C,Ford A,Geffard O,Armengaud J,Chaumot A,Almunia C

    updated:2019-09-27 00:00:00

  • Author Correction: Hybrid de novo whole-genome assembly and annotation of the model tapeworm Hymenolepis diminuta.

    abstract::An amendment to this paper has been published and can be accessed via a link at the top of the paper. ...

    journal_title:Scientific data

    pub_type: Journal Article, Published Erratum

    doi:10.1038/s41597-020-0394-x

    authors: Nowak RM,Jastrzębski JP,Kuśmirek W,Sałamatin R,Rydzanicz M,Sobczyk-Kopcioł A,Sulima-Celińska A,Paukszto Ł,Makowczenko KG,Płoski R,Tkach VV,Basałaj K,Młocicki D

    updated:2020-02-10 00:00:00

  • Multiple-data-based monthly geopotential model set LDCmgm90.

    abstract::While the GRACE (Gravity Recovery and Climate Experiment) satellite mission is of great significance in understanding various branches of Earth sciences, the quality of GRACE monthly products can be unsatisfactory due to strong longitudinal stripe-pattern errors and other flaws. Based on corrected GRACE Mascon (mass c...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-019-0239-7

    authors: Chen W,Luo J,Ray J,Yu N,Li JC

    updated:2019-10-23 00:00:00

  • Complementary proteomics strategies capture an ataxin-1 interactome in Neuro-2a cells.

    abstract::Ataxin-1 mutation, arising from a polyglutamine (polyQ) tract expansion, is the underlying genetic cause of the late-onset neurodegenerative disease Spinocerebellar ataxia type 1 (SCA1). To identify protein partners of polyQ-ataxin-1 in neuronal cells under control or stress conditions, here we report our complementar...

    journal_title:Scientific data

    pub_type:

    doi:10.1038/sdata.2018.262

    authors: Zhang S,Williamson NA,Bogoyevitch MA

    updated:2018-11-20 00:00:00

  • Viruses of the Nahant Collection, characterization of 251 marine Vibrionaceae viruses.

    abstract::Viruses are highly discriminating in their interactions with host cells and are thought to play a major role in maintaining diversity of environmental microbes. However, large-scale ecological and genomic studies of co-occurring virus-host pairs, required to characterize the mechanistic and genomic foundations of viru...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/sdata.2018.114

    authors: Kauffman KM,Brown JM,Sharma RS,VanInsberghe D,Elsherbini J,Polz M,Kelly L

    updated:2018-07-03 00:00:00

  • High resolution multi-facies realizations of sedimentary reservoir and aquifer analogs.

    abstract::Geological structures are by nature inaccessible to direct observation. This can cause difficulties in applications where a spatially explicit representation of such structures is required, in particular when modelling fluid migration in geological formations. An increasing trend in recent years has been to use analog...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/sdata.2015.33

    authors: Bayer P,Comunian A,Höyng D,Mariethoz G

    updated:2015-07-07 00:00:00

  • 7 Tesla MRI of the ex vivo human brain at 100 micron resolution.

    abstract::We present an ultra-high resolution MRI dataset of an ex vivo human brain specimen. The brain specimen was donated by a 58-year-old woman who had no history of neurological disease and died of non-neurological causes. After fixation in 10% formalin, the specimen was imaged on a 7 Tesla MRI scanner at 100 µm isotropic ...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-019-0254-8

    authors: Edlow BL,Mareyam A,Horn A,Polimeni JR,Witzel T,Tisdall MD,Augustinack JC,Stockmann JP,Diamond BR,Stevens A,Tirrell LS,Folkerth RD,Wald LL,Fischl B,van der Kouwe A

    updated:2019-10-30 00:00:00

  • Whole genome characterization of sequence diversity of 15,220 Icelanders.

    abstract::Understanding of sequence diversity is the cornerstone of analysis of genetic disorders, population genetics, and evolutionary biology. Here, we present an update of our sequencing set to 15,220 Icelanders who we sequenced to an average genome-wide coverage of 34X. We identified 39,020,168 autosomal variants passing G...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/sdata.2017.115

    authors: Jónsson H,Sulem P,Kehr B,Kristmundsdottir S,Zink F,Hjartarson E,Hardarson MT,Hjorleifsson KE,Eggertsson HP,Gudjonsson SA,Ward LD,Arnadottir GA,Helgason EA,Helgason H,Gylfason A,Jonasdottir A,Jonasdottir A,Rafnar T,Bes

    updated:2017-09-21 00:00:00

  • Two-colour serial femtosecond crystallography dataset from gadoteridol-derivatized lysozyme for MAD phasing.

    abstract::We provide a detailed description of a gadoteridol-derivatized lysozyme (gadolinium lysozyme) two-colour serial femtosecond crystallography (SFX) dataset for multiple wavelength anomalous dispersion (MAD) structure determination. The data was collected at the Spring-8 Angstrom Compact free-electron LAser (SACLA) facil...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/sdata.2017.188

    authors: Gorel A,Motomura K,Fukuzawa H,Doak RB,Grünbein ML,Hilpert M,Inoue I,Kloos M,Nass Kovács G,Nango E,Nass K,Roome CM,Shoeman RL,Tanaka R,Tono K,Foucar L,Joti Y,Yabashi M,Iwata S,Ueda K,Barends TRM,Schlichting I

    updated:2017-12-12 00:00:00

  • Data for training and testing radiation detection algorithms in an urban environment.

    abstract::The detection, identification, and localization of illicit nuclear materials in urban environments is of utmost importance for national security. Most often, the process of performing these operations consists of a team of trained individuals equipped with radiation detection devices that have built-in algorithms to a...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-020-00672-2

    authors: Ghawaly JM Jr,Nicholson AD,Peplow DE,Anderson-Cook CM,Myers KL,Archer DE,Willis MJ,Quiter BJ

    updated:2020-10-05 00:00:00

  • Transcriptomic profiling of 39 commonly-used neuroblastoma cell lines.

    abstract::Neuroblastoma cell lines are an important and cost-effective model used to study oncogenic drivers of the disease. While many of these cell lines have been previously characterized with SNP, methylation, and/or mRNA expression microarrays, there has not been an effort to comprehensively sequence these cell lines. Here...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/sdata.2017.33

    authors: Harenza JL,Diamond MA,Adams RN,Song MM,Davidson HL,Hart LS,Dent MH,Fortina P,Reynolds CP,Maris JM

    updated:2017-03-28 00:00:00

  • The systematic identification of cytoskeletal genes required for Drosophila melanogaster muscle maintenance.

    abstract::Animal muscles must maintain their function and structure while bearing substantial mechanical loads. How muscles withstand persistent mechanical strain is presently not well understood. Understanding the mechanisms by which tissues maintain their complex architecture is a key goal of cell biology. This dataset repres...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/sdata.2014.2

    authors: Perkins AD,Lee MJ,Tanentzapf G

    updated:2014-03-11 00:00:00

  • Erratum: Genomes and phenomes of a population of outbred rats and its progenitors.

    abstract::[This corrects the article DOI: 10.1038/sdata.2014.11.]. ...

    journal_title:Scientific data

    pub_type: Published Erratum

    doi:10.1038/sdata.2014.16

    authors: Baud A,Guryev V,Hummel O,Johannesson M,Rat Genome Sequencing and Mapping Consortium.,Flint J

    updated:2014-07-08 00:00:00

  • Human pluripotent stem cell derived HLC transcriptome data enables molecular dissection of hepatogenesis.

    abstract::Induced pluripotent stem cells (iPSCs) and human embryonic stem cells (hESCs) differentiated into hepatocyte-like cells (HLCs) provide a defined and renewable source of cells for drug screening, toxicology and regenerative medicine. We previously reprogrammed human fetal foreskin fibroblast cells (HFF1) into iPSCs emp...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/sdata.2018.35

    authors: Wruck W,Adjaye J

    updated:2018-03-13 00:00:00

  • Nationwide registry of sepsis patients in Japan focused on disseminated intravascular coagulation 2011-2013.

    abstract::Sepsis is a syndrome with physiologic, pathologic, and biochemical abnormalities induced by infection. Sepsis can induce the dysregulation of systemic coagulation and fibrinolytic systems, resulting in disseminated intravascular coagulation (DIC), which is associated with a high mortality rate. Although there is no in...

    journal_title:Scientific data

    pub_type:

    doi:10.1038/sdata.2018.243

    authors: Hayakawa M,Yamakawa K,Saito S,Uchino S,Kudo D,Iizuka Y,Sanui M,Takimoto K,Mayumi T

    updated:2018-12-11 00:00:00

  • An annual time series of weekly size-resolved aerosol properties in the megacity of Metro Manila, Philippines.

    abstract::Size-resolved aerosol samples were collected in Metro Manila between July 2018 and October 2019. Two Micro-Orifice Uniform Deposit Impactors (MOUDI) were deployed at Manila Observatory in Quezon City, Metro Manila with samples collected on a weekly basis for water-soluble speciation and mass quantification. Additional...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-020-0466-y

    authors: Stahl C,Cruz MT,Bañaga PA,Betito G,Braun RA,Aghdam MA,Cambaliza MO,Lorenzo GR,MacDonald AB,Pabroa PC,Yee JR,Simpas JB,Sorooshian A

    updated:2020-04-29 00:00:00