Abstract:
:In this work, we compare the resolution of V2-V3 and V3-V4 16S rRNA regions for the purposes of estimating microbial community diversity using paired-end Illumina MiSeq reads, and show that the fragment, including V2 and V3 regions, has higher resolution for lower-rank taxa (genera and species). It allows for a more precise distance-based clustering of reads into species-level OTUs. Statistically convergent estimates of the diversity of major species (defined as those that together are covered by 95% of reads) can be achieved at the sample sizes of 10000 to 15000 reads. The relative error of the Shannon index estimate for this condition is lower than 4%.
journal_name
Sci Datajournal_title
Scientific dataauthors
Bukin YS,Galachyants YP,Morozov IV,Bukin SV,Zakharenko AS,Zemskaya TIdoi
10.1038/sdata.2019.7subject
Has Abstractpub_date
2019-02-05 00:00:00pages
190007issn
2052-4463pii
sdata20197journal_volume
6pub_type
相关文献
Scientific Data文献大全abstract::In order to map global disease risk, a geographic database of human Crimean-Congo haemorrhagic fever virus (CCHFV) occurrence was produced by surveying peer-reviewed literature and case reports, as well as informal online sources. Here we present this database, comprising occurrence data linked to geographic point or ...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2015.16
更新日期:2015-04-14 00:00:00
abstract::Bumblebees (Hymenoptera: Apidae) are important pollinating insects that play pivotal roles in crop production and natural ecosystem services. Although protein-coding genes in bumblebees have been extensively annotated, regulatory sequences of the genome, such as promoters and enhancers, have been poorly annotated. To ...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-020-00713-w
更新日期:2020-10-26 00:00:00
abstract::Long-term datasets of number and size of lakes over the Tibetan Plateau (TP) are among the most critical components for better understanding the interactions among the cryosphere, hydrosphere, and atmosphere at regional and global scales. Due to the harsh environment and the scarcity of data over the TP, data accumula...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2016.39
更新日期:2016-06-21 00:00:00
abstract::Gene-gene (GXG) and gene-environment (GXE) interactions play important roles in pharmacogenetics study. Simultaneously incorporating multiple single nucleotide polymorphisms (SNPs) and clinical factors is needed to explore the association of their interactions with drug response and toxicity phenotypes. We genotyped 5...
journal_title:Scientific data
pub_type:
doi:10.1038/sdata.2018.284
更新日期:2018-12-11 00:00:00
abstract::An amendment to this paper has been published and can be accessed via a link at the top of the paper. ...
journal_title:Scientific data
pub_type: 杂志文章,已发布勘误
doi:10.1038/s41597-019-0348-3
更新日期:2020-01-06 00:00:00
abstract::Parkinson's disease (PD) is an age-related, chronic and progressive neurodegenerative disorder characterized by a loss of multifocal neurons, resulting in both non-motor and motor symptoms. While several genetic and environmental contributory risk factors have been identified, more exact methods for diagnosing and ass...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-019-0022-9
更新日期:2019-04-05 00:00:00
abstract::In prokaryotes, protein phosphorylation plays a critical role in regulating a broad spectrum of biological processes and occurs mainly on various amino acids, including serine (S), threonine (T), tyrosine (Y), arginine (R), aspartic acid (D), histidine (H) and cysteine (C) residues of protein substrates. Through liter...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-020-0506-7
更新日期:2020-05-29 00:00:00
abstract::The Systems Biology for Infectious Diseases Research program was established by the U.S. National Institute of Allergy and Infectious Diseases to investigate host-pathogen interactions at a systems level. This program generated 47 transcriptomic and proteomic datasets from 30 studies that investigate in vivo and in vi...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2014.33
更新日期:2014-10-14 00:00:00
abstract::Serial electron microscopy techniques have proven to be a powerful tool in biology. Unfortunately, the data sets they generate lack robust and accurate automated segmentation algorithms. In this data descriptor publication, we introduce a serial focused ion beam scanning electron microscopy (FIB-SEM) dataset consistin...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-020-0509-4
更新日期:2020-06-17 00:00:00
abstract::Fully-automated nuclear image segmentation is the prerequisite to ensure statistically significant, quantitative analyses of tissue preparations,applied in digital pathology or quantitative microscopy. The design of segmentation methods that work independently of the tissue type or preparation is complex, due to varia...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-020-00608-w
更新日期:2020-08-11 00:00:00
abstract::The corneal endothelium maintains corneal transparency; consequently, damage to this endothelium by a number of pathological conditions results in severe vision loss. Publicly available expression databases of human tissues are useful for investigating the pathogenesis of diseases and for developing new therapeutic mo...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-020-00754-1
更新日期:2020-11-20 00:00:00
abstract::Social network analysis is an invaluable tool to understand the patterns, evolution, and consequences of sociality. Comparative studies over a range of social systems across multiple taxonomic groups are particularly valuable. Such studies however require quantitative social association or interaction data across mult...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-019-0056-z
更新日期:2019-04-29 00:00:00
abstract::While there is a high interest in drug combinations in cancer therapy, openly accessible datasets for drug combination responses are sparse. Here we present a dataset comprising 171 pairwise combinations of 19 individual drugs targeting signal transduction mechanisms across eight cancer cell lines, where the effect of...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-019-0255-7
更新日期:2019-10-29 00:00:00
abstract::Accurately characterizing land surface changes with Earth Observation requires geo-located ground truth. In the European Union (EU), a tri-annual surveyed sample of land cover and land use has been collected since 2006 under the Land Use/Cover Area frame Survey (LUCAS). A total of 1351293 observations at 651780 unique...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-020-00675-z
更新日期:2020-10-16 00:00:00
abstract::Gammarids are amphipods found worldwide distributed in fresh and marine waters. They play an important role in aquatic ecosystems and are well established sentinel species in ecotoxicology. In this study, we sequenced the transcriptomes of a male individual and a female individual for seven different taxonomic groups ...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-019-0192-5
更新日期:2019-09-27 00:00:00
abstract::An amendment to this paper has been published and can be accessed via a link at the top of the paper. ...
journal_title:Scientific data
pub_type: 杂志文章,已发布勘误
doi:10.1038/s41597-020-0394-x
更新日期:2020-02-10 00:00:00
abstract::While the GRACE (Gravity Recovery and Climate Experiment) satellite mission is of great significance in understanding various branches of Earth sciences, the quality of GRACE monthly products can be unsatisfactory due to strong longitudinal stripe-pattern errors and other flaws. Based on corrected GRACE Mascon (mass c...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-019-0239-7
更新日期:2019-10-23 00:00:00
abstract::Ataxin-1 mutation, arising from a polyglutamine (polyQ) tract expansion, is the underlying genetic cause of the late-onset neurodegenerative disease Spinocerebellar ataxia type 1 (SCA1). To identify protein partners of polyQ-ataxin-1 in neuronal cells under control or stress conditions, here we report our complementar...
journal_title:Scientific data
pub_type:
doi:10.1038/sdata.2018.262
更新日期:2018-11-20 00:00:00
abstract::Viruses are highly discriminating in their interactions with host cells and are thought to play a major role in maintaining diversity of environmental microbes. However, large-scale ecological and genomic studies of co-occurring virus-host pairs, required to characterize the mechanistic and genomic foundations of viru...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2018.114
更新日期:2018-07-03 00:00:00
abstract::Geological structures are by nature inaccessible to direct observation. This can cause difficulties in applications where a spatially explicit representation of such structures is required, in particular when modelling fluid migration in geological formations. An increasing trend in recent years has been to use analog...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2015.33
更新日期:2015-07-07 00:00:00
abstract::We present an ultra-high resolution MRI dataset of an ex vivo human brain specimen. The brain specimen was donated by a 58-year-old woman who had no history of neurological disease and died of non-neurological causes. After fixation in 10% formalin, the specimen was imaged on a 7 Tesla MRI scanner at 100 µm isotropic ...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-019-0254-8
更新日期:2019-10-30 00:00:00
abstract::Understanding of sequence diversity is the cornerstone of analysis of genetic disorders, population genetics, and evolutionary biology. Here, we present an update of our sequencing set to 15,220 Icelanders who we sequenced to an average genome-wide coverage of 34X. We identified 39,020,168 autosomal variants passing G...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2017.115
更新日期:2017-09-21 00:00:00
abstract::We provide a detailed description of a gadoteridol-derivatized lysozyme (gadolinium lysozyme) two-colour serial femtosecond crystallography (SFX) dataset for multiple wavelength anomalous dispersion (MAD) structure determination. The data was collected at the Spring-8 Angstrom Compact free-electron LAser (SACLA) facil...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2017.188
更新日期:2017-12-12 00:00:00
abstract::The detection, identification, and localization of illicit nuclear materials in urban environments is of utmost importance for national security. Most often, the process of performing these operations consists of a team of trained individuals equipped with radiation detection devices that have built-in algorithms to a...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-020-00672-2
更新日期:2020-10-05 00:00:00
abstract::Neuroblastoma cell lines are an important and cost-effective model used to study oncogenic drivers of the disease. While many of these cell lines have been previously characterized with SNP, methylation, and/or mRNA expression microarrays, there has not been an effort to comprehensively sequence these cell lines. Here...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2017.33
更新日期:2017-03-28 00:00:00
abstract::Animal muscles must maintain their function and structure while bearing substantial mechanical loads. How muscles withstand persistent mechanical strain is presently not well understood. Understanding the mechanisms by which tissues maintain their complex architecture is a key goal of cell biology. This dataset repres...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2014.2
更新日期:2014-03-11 00:00:00
abstract::[This corrects the article DOI: 10.1038/sdata.2014.11.]. ...
journal_title:Scientific data
pub_type: 已发布勘误
doi:10.1038/sdata.2014.16
更新日期:2014-07-08 00:00:00
abstract::Induced pluripotent stem cells (iPSCs) and human embryonic stem cells (hESCs) differentiated into hepatocyte-like cells (HLCs) provide a defined and renewable source of cells for drug screening, toxicology and regenerative medicine. We previously reprogrammed human fetal foreskin fibroblast cells (HFF1) into iPSCs emp...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2018.35
更新日期:2018-03-13 00:00:00
abstract::Sepsis is a syndrome with physiologic, pathologic, and biochemical abnormalities induced by infection. Sepsis can induce the dysregulation of systemic coagulation and fibrinolytic systems, resulting in disseminated intravascular coagulation (DIC), which is associated with a high mortality rate. Although there is no in...
journal_title:Scientific data
pub_type:
doi:10.1038/sdata.2018.243
更新日期:2018-12-11 00:00:00
abstract::Size-resolved aerosol samples were collected in Metro Manila between July 2018 and October 2019. Two Micro-Orifice Uniform Deposit Impactors (MOUDI) were deployed at Manila Observatory in Quezon City, Metro Manila with samples collected on a weekly basis for water-soluble speciation and mass quantification. Additional...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-020-0466-y
更新日期:2020-04-29 00:00:00